feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark
P0.6: gcov infrastructure P1: extract_structure output expansion (11 new feature fields) P2: Confusion group rule engine (8 pairs + contradiction + backtrack) P3: 4-factor confidence calculation + quality gate update P4: 33+2 COBOL program type test samples (22 files, 7 categories) P5: parametrized/ test data generation engine P6: japanese_data.py lookup tables P7-10: Type-specific test suites (~159 parametrized tests) P11: Full classification pipeline (classify_program) + orchestrator integration P12: Documentation (module-interfaces, test-plan v3.0, coverage-matrix) Architecture decisions: - classification_pipeline/ merged to hina/pipeline/ - parametrized/ as independent module - japanese_data.py as root-level file - hina/__all__ only exports classify_program() Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
@@ -0,0 +1,39 @@
|
||||
* ==== TYPE: VL01 VALIDATION(WITH DUP) ====
|
||||
* FEATURE: Duplicate detection with WS-PREV-KEY
|
||||
* BRANCHES: 4, DECISIONS: 2
|
||||
IDENTIFICATION DIVISION.
|
||||
PROGRAM-ID. VL01.
|
||||
ENVIRONMENT DIVISION.
|
||||
INPUT-OUTPUT SECTION.
|
||||
FILE-CONTROL.
|
||||
SELECT IN-FILE ASSIGN TO 'INDATA.DAT'.
|
||||
DATA DIVISION.
|
||||
FILE SECTION.
|
||||
FD IN-FILE.
|
||||
01 IN-REC PIC X(80).
|
||||
WORKING-STORAGE SECTION.
|
||||
01 WS-KEY PIC X(10).
|
||||
01 WS-PREV-KEY PIC X(10) VALUE SPACES.
|
||||
01 WS-EOF PIC X VALUE 'N'.
|
||||
01 WS-DUP-FLAG PIC X VALUE 'N'.
|
||||
01 WS-REC-COUNT PIC 9(4) VALUE 0.
|
||||
01 WS-DUP-COUNT PIC 9(4) VALUE 0.
|
||||
PROCEDURE DIVISION.
|
||||
MAIN-PROCEDURE.
|
||||
OPEN INPUT IN-FILE.
|
||||
READ IN-FILE AT END MOVE 'Y' TO WS-EOF.
|
||||
PERFORM UNTIL WS-EOF = 'Y'
|
||||
ADD 1 TO WS-REC-COUNT
|
||||
IF WS-KEY = WS-PREV-KEY
|
||||
ADD 1 TO WS-DUP-COUNT
|
||||
MOVE 'Y' TO WS-DUP-FLAG
|
||||
DISPLAY 'DUP: ' IN-REC(1:50)
|
||||
ELSE
|
||||
MOVE WS-KEY TO WS-PREV-KEY
|
||||
DISPLAY 'NEW: ' IN-REC(1:50)
|
||||
END-IF
|
||||
READ IN-FILE AT END MOVE 'Y' TO WS-EOF
|
||||
END-PERFORM.
|
||||
DISPLAY 'TOTAL: ' WS-REC-COUNT ' DUPS: ' WS-DUP-COUNT.
|
||||
CLOSE IN-FILE.
|
||||
STOP RUN.
|
||||
@@ -0,0 +1,28 @@
|
||||
* ==== TYPE: VL02 VALIDATION(NO DUP) ====
|
||||
* FEATURE: No duplicate detection
|
||||
* BRANCHES: 2, DECISIONS: 1
|
||||
IDENTIFICATION DIVISION.
|
||||
PROGRAM-ID. VL02.
|
||||
ENVIRONMENT DIVISION.
|
||||
INPUT-OUTPUT SECTION.
|
||||
FILE-CONTROL.
|
||||
SELECT IN-FILE ASSIGN TO 'INDATA.DAT'.
|
||||
DATA DIVISION.
|
||||
FILE SECTION.
|
||||
FD IN-FILE.
|
||||
01 IN-REC PIC X(80).
|
||||
WORKING-STORAGE SECTION.
|
||||
01 WS-EOF PIC X VALUE 'N'.
|
||||
01 WS-COUNT PIC 9(4) VALUE 0.
|
||||
PROCEDURE DIVISION.
|
||||
MAIN-PROCEDURE.
|
||||
OPEN INPUT IN-FILE.
|
||||
READ IN-FILE AT END MOVE 'Y' TO WS-EOF.
|
||||
PERFORM UNTIL WS-EOF = 'Y'
|
||||
ADD 1 TO WS-COUNT
|
||||
DISPLAY 'VALID: ' IN-REC(1:50)
|
||||
READ IN-FILE AT END MOVE 'Y' TO WS-EOF
|
||||
END-PERFORM.
|
||||
DISPLAY 'VALID RECORDS: ' WS-COUNT.
|
||||
CLOSE IN-FILE.
|
||||
STOP RUN.
|
||||
Reference in New Issue
Block a user