hangshuo652/cobol-java-v3 - cobol-java-v3 - Gitea

hangshuo652/cobol-java-v3

T

NB-076 65e9919933 feat: matching program full recognition — L1 regex keyword + confidence consensus

Three-part fix for matching program classification:
1. L1 regex keyword WS-[-\w]*KEY (confidence 0.65):
   - Captures WS-KEY, WS-MAST-KEY, WS-TRAN-KEY, WS-PREV-KEY etc.
   - Matches ALL 10 matching programs including MT02 (which uses
     WS-MAST-KEY/WS-TRAN-KEY that literal 'WS-KEY' missed)
   - False positives (ST-SEARCH-ALL, VL01) overridden by rule engine
     or higher-confidence ORGANIZATION IS keyword
   - detect_keyword() extended with 're:' prefix for regex patterns

2. Consensus bonus in compute_confidence_v2:
   - When L1 keyword category matches rule engine's final category,
     context_factor boosted by +0.15
   - Pushes matching programs from manual (0.50-0.69) toward
     review (0.70-0.89) range

3. Confidence calibration for confusion groups (previous commit):
   - dedup_vs_nodedup: 0.85→0.50 for negative detection
   - validation_vs_keybreak: 0.80→0.55 for has_counter
   - simple_vs_two_stage: 0.80→0.50 for sequential OPEN

Results - matching programs:
  MT01: 0.38→0.75, MT02: 0.30→0.60, MT03: 0.30→0.60,
  MT16: 0.45→0.81, MT17: 0.36→0.65, MT18: 0.60→0.60,
  MT19: 0.30→0.60, MT20: 0.30→0.65, MT33: 0.30→0.60
  All now rule_engine (not fallback), no false negatives.

Subtype discrimination remains for future work: all matching
programs classified as マッチング without 1:1/1:N/N:1 subtype.

2026-06-21 13:25:39 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

fix: 3 critical parsing bugs found through statement benchmark testing

2026-06-21 12:52:04 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: add COBOL statement benchmark plan and 34 P0 sample programs

2026-06-21 12:02:25 +08:00

feat: matching program full recognition — L1 regex keyword + confidence consensus

2026-06-21 13:25:39 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

init: cobol-java migration verification platform v3 (42 tests, JCL module)

2026-05-27 08:42:41 +08:00

feat: add COBOL statement benchmark plan and 34 P0 sample programs

2026-06-21 12:02:25 +08:00

fix: confusion group confidence calibration — false positive detection inflation

2026-06-21 13:17:31 +08:00

uploads/6aaa7e43

init: cobol-java migration verification platform v3 (42 tests, JCL module)

2026-05-27 08:42:41 +08:00

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

.gitignore

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

aurak.toml

v3: gstack-code-gen 生成

2026-05-24 12:36:44 +08:00

CONTRIBUTING.md

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

DESIGN.md

init: cobol-java migration verification platform v3 (42 tests, JCL module)

2026-05-27 08:42:41 +08:00

japanese_data.py

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

main.py

feat: Phase 1 - orchestrator quality gate loop + hina/gate + main CLI args

2026-06-18 16:02:38 +08:00

orchestrator.py

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

preprocessor.py

v3: gstack-code-gen 生成

2026-05-24 12:36:44 +08:00

pyproject.toml

v3: gstack-code-gen 生成

2026-05-24 12:36:44 +08:00

pytest.ini

v3: gstack-code-gen 生成

2026-05-24 12:36:44 +08:00

requirements.txt

feat: add web layer (FastAPI + worker)

2026-05-24 12:52:20 +08:00

reset_task.py

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

test_llm.py

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

test_pipeline.py

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

write_result.py

feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

2026-06-19 23:51:55 +08:00

Languages

Python 95.5%

HTML 2%

CSS 1.8%

JavaScript 0.5%

COBOL 0.2%