Commit Graph

4 Commits

Author SHA1 Message Date
hangshuo652 7fb9304212 merge local cobol_testgen improvements into v3 shared modules
- cond.py: SQLCODE/SQLSTATE handling, alphanumeric >/< boundary fix
- output.py: termination tracking, db_input support, _is_field_assigned filter
- coverage.py: mark_from_gcov, THRU support, KeyError protection
- gcov.py: new file (dependency for coverage.py)
- grammar.lark: multi-segment PIC support
- read.py: SQL INCLUDE resolution, DECLARE TABLE parsing, * comment fix
- core.py: SQL parsing, blocked_names, keyword list
- design.py: multi-sentinel, THRU ranges, PERFORM VARYING last iteration
- __init__.py: local main() + v3 API functions, guarded imports

All 6 ZAN programs verified passing through v3 pipeline
2026-06-23 22:38:17 +08:00
NB-076 097f5449da fix: 溢出截断 + flatfile字段路由 + 多E2E验证
1. _make_numeric_value截断保护
   PIC 9(3)字段值超过999时截断(之前不截断)
2. flatfile.py字段路由
   write_all_files按FD分配字段值到对应的文件
3. 端到端运行验证:
   01-matching-1-1: PASS (8匹配/9不匹配)
   03-matching-N-1: PASS (COPYBOOK正常解析)
   10-divide-50: 程序自身OPEN逻辑问题
   34-sort-anomaly: PARTIAL(异常测试用例部分通过)

Co-Authored-By: Claude <noreply@anthropic.com>
2026-06-22 13:59:54 +08:00
NB-076 6e69dff7a4 fix: 3 bugs confirmed and repaired from honest audit
Bug #1: AND compound branch-body MOVE not propagated (HIGH)
  Root cause: ELSE on same line as false_body, rest of line lost after
  self.advance(). Fix: reinsert ELSE body text same as ELSE IF does.
  Result: MOVE 'Y'/'N' TO WS-FLAG correctly propagated, all 3 paths
  verified (A<=10/B<20=F, A>10/B<20=T, A>10/B>=20=F).

Bug #2: Performance — path explosion (25 IFs = 47s, 10000 records)
  Root cause: BrSeq inner loop combined all paths before capping.
  Fix: early break at _MAX_PATHS in the combo loop.
  + _MAX_PATHS reduced from 10000 to 500.
  Result: 47s/10000rec -> 0.2s/27rec (235x improvement)

Bug #3: COPY+REDEFINES parse failure (test-only)
  Root cause: test code called parse_data_division on full source
  instead of extract_data_division first. Fixed.
  Real pipeline (extract_structure -> generate_data) was never affected.

Co-Authored-By: Claude <noreply@anthropic.com>
2026-06-22 11:36:33 +08:00
hangshuo652 bc1d56d1a4 feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark
P0.6: gcov infrastructure
P1: extract_structure output expansion (11 new feature fields)
P2: Confusion group rule engine (8 pairs + contradiction + backtrack)
P3: 4-factor confidence calculation + quality gate update
P4: 33+2 COBOL program type test samples (22 files, 7 categories)
P5: parametrized/ test data generation engine
P6: japanese_data.py lookup tables
P7-10: Type-specific test suites (~159 parametrized tests)
P11: Full classification pipeline (classify_program) + orchestrator integration
P12: Documentation (module-interfaces, test-plan v3.0, coverage-matrix)

Architecture decisions:
- classification_pipeline/ merged to hina/pipeline/
- parametrized/ as independent module
- japanese_data.py as root-level file
- hina/__all__ only exports classify_program()

Co-Authored-By: Claude <noreply@anthropic.com>
2026-06-19 23:51:55 +08:00