feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

P0.6: gcov infrastructure
P1: extract_structure output expansion (11 new feature fields)
P2: Confusion group rule engine (8 pairs + contradiction + backtrack)
P3: 4-factor confidence calculation + quality gate update
P4: 33+2 COBOL program type test samples (22 files, 7 categories)
P5: parametrized/ test data generation engine
P6: japanese_data.py lookup tables
P7-10: Type-specific test suites (~159 parametrized tests)
P11: Full classification pipeline (classify_program) + orchestrator integration
P12: Documentation (module-interfaces, test-plan v3.0, coverage-matrix)

Architecture decisions:
- classification_pipeline/ merged into hina/pipeline/
- parametrized/ as independent module
- japanese_data.py as root-level file
- hina/__all__ exports only classify_program()

Co-Authored-By: Claude <noreply@anthropic.com>
hangshuo652
2026-06-19 23:51:55 +08:00
parent 63b5284715
commit bc1d56d1a4
129 changed files with 19378 additions and 261 deletions
@@ -0,0 +1,44 @@
* ==== TYPE: MT03 MATCHING(N:1) ====
* FEATURE: N master records match 1 transaction record
* BRANCHES: 6, DECISIONS: 3
IDENTIFICATION DIVISION.
PROGRAM-ID. MT03.
ENVIRONMENT DIVISION.
INPUT-OUTPUT SECTION.
FILE-CONTROL.
SELECT FILE-M ASSIGN TO 'FILEM.DAT'.
SELECT FILE-T ASSIGN TO 'FILET.DAT'.
DATA DIVISION.
FILE SECTION.
FD FILE-M.
01 REC-M PIC X(80).
FD FILE-T.
01 REC-T PIC X(80).
WORKING-STORAGE SECTION.
01 WS-KEY-M PIC X(10).
01 WS-KEY-T PIC X(10).
01 WS-M-EOF PIC X VALUE 'N'.
01 WS-T-EOF PIC X VALUE 'N'.
01 WS-COUNT PIC 9(4) VALUE 0.
PROCEDURE DIVISION.
MAIN-PROCEDURE.
OPEN INPUT FILE-M FILE-T.
READ FILE-M INTO WS-KEY-M *> ASSUMES KEY = REC-M(1:10)
AT END MOVE 'Y' TO WS-M-EOF.
READ FILE-T INTO WS-KEY-T *> ASSUMES KEY = REC-T(1:10)
AT END MOVE 'Y' TO WS-T-EOF.
PERFORM UNTIL WS-M-EOF = 'Y' OR WS-T-EOF = 'Y'
IF WS-KEY-M = WS-KEY-T
ADD 1 TO WS-COUNT
DISPLAY 'MATCHED: ' REC-M(1:50)
READ FILE-M INTO WS-KEY-M AT END MOVE 'Y' TO WS-M-EOF
ELSE IF WS-KEY-M < WS-KEY-T
READ FILE-M INTO WS-KEY-M AT END MOVE 'Y' TO WS-M-EOF
ELSE
DISPLAY 'TRAN ONLY: ' REC-T(1:50)
READ FILE-T INTO WS-KEY-T AT END MOVE 'Y' TO WS-T-EOF
END-IF END-IF
END-PERFORM.
DISPLAY 'TOTAL MATCHED: ' WS-COUNT.
CLOSE FILE-M FILE-T.
STOP RUN.
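
For readers less familiar with COBOL, the control flow of the MT03 sample can be sketched in Python. This is only an illustration, not part of the commit: the function name `match_n_to_1` and the constant `KEY_LEN` are hypothetical, and the sketch assumes both inputs are sorted by key and that the key is the first 10 characters of each record (the width of WS-KEY-M and WS-KEY-T). It mirrors the branch order of the listing, so a transaction is reported as TRAN ONLY whenever the master key moves past it, even if earlier masters matched it.

```python
from typing import Iterable, Iterator, Tuple

# Assumption: the key is the first 10 characters of each record,
# matching the PIC X(10) width of WS-KEY-M / WS-KEY-T.
KEY_LEN = 10


def match_n_to_1(
    masters: Iterable[str], transactions: Iterable[str]
) -> Iterator[Tuple[str, str]]:
    """Sequentially match sorted master records against sorted transactions.

    Yields ("MATCHED", master_record) or ("TRAN ONLY", transaction_record),
    following the same three-way branch as the COBOL PERFORM loop.
    """
    m_iter, t_iter = iter(masters), iter(transactions)
    m = next(m_iter, None)  # None plays the role of the EOF flag
    t = next(t_iter, None)
    while m is not None and t is not None:
        m_key, t_key = m[:KEY_LEN], t[:KEY_LEN]
        if m_key == t_key:
            yield ("MATCHED", m)
            m = next(m_iter, None)  # keep the transaction: more masters may match
        elif m_key < t_key:
            m = next(m_iter, None)  # master with no transaction: skip silently
        else:
            yield ("TRAN ONLY", t)
            t = next(t_iter, None)


masters = ["A000000001 m1", "A000000001 m2", "B000000002 m3"]
transactions = ["A000000001 t1", "C000000003 t2"]
results = list(match_n_to_1(masters, transactions))
total_matched = sum(1 for kind, _ in results if kind == "MATCHED")
```

Advancing only the master side on a match is what makes the matching N:1, since several consecutive master records can pair with the same transaction key.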