feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

P0.6: gcov infrastructure P1: extract_structure output expansion (11 new feature fields) P2: Confusion group rule engine (8 pairs + contradiction + backtrack) P3: 4-factor confidence calculation + quality gate update P4: 33+2 COBOL program type test samples (22 files, 7 categories) P5: parametrized/ test data generation engine P6: japanese_data.py lookup tables P7-10: Type-specific test suites (~159 parametrized tests) P11: Full classification pipeline (classify_program) + orchestrator integration P12: Documentation (module-interfaces, test-plan v3.0, coverage-matrix) Architecture decisions: - classification_pipeline/ merged to hina/pipeline/ - parametrized/ as independent module - japanese_data.py as root-level file - hina/__all__ only exports classify_program() Co-Authored-By: Claude <noreply@anthropic.com>
2026-06-19 23:51:55 +08:00
parent 63b5284715
commit bc1d56d1a4
129 changed files with 19378 additions and 261 deletions
@@ -59,8 +59,17 @@ class JclExecutor:
                elif name in ("VALIDOUT", "REJECT", "REPORTERR", "CALCOUT", "STMT", "SUMMARY"):
                    env_out[name] = str(path)

-        input_path = env_in.get(list(env_in.keys())[0], "")
-        output_path = env_out.get(list(env_out.keys())[0], str(self.root_dir / "data" / "work" / f"{step.step_name.lower()}_out.bin"))
+        # 创建空输入文件（如果无 input DD）
+        if env_in:
+            input_path = env_in.get(next(iter(env_in.keys())), "")
+        else:
+            import tempfile
+            empty = Path(tempfile.mkstemp(suffix=".bin")[1])
+            empty.write_bytes(b"")
+            input_path = str(empty)
+        output_path = env_out.get(next(iter(env_out.keys()), ""), str(self.root_dir / "data" / "work" / f"{step.step_name.lower()}_out.bin"))
+        # 确保输出目录存在
+        Path(output_path).parent.mkdir(parents=True, exist_ok=True)

        run = runner.run(build.artifact_path, input_path, output_path)
        self.results[step.step_name] = {