feat: Phase 2 complete — 13 Phases of COBOL type classification and test benchmark

P0.6: gcov infrastructure
P1: extract_structure output expansion (11 new feature fields)
P2: Confusion group rule engine (8 pairs + contradiction + backtrack)
P3: 4-factor confidence calculation + quality gate update
P4: 33+2 COBOL program type test samples (22 files, 7 categories)
P5: parametrized/ test data generation engine
P6: japanese_data.py lookup tables
P7-10: Type-specific test suites (~159 parametrized tests)
P11: Full classification pipeline (classify_program) + orchestrator integration
P12: Documentation (module-interfaces, test-plan v3.0, coverage-matrix)

Architecture decisions:
- classification_pipeline/ merged to hina/pipeline/
- parametrized/ as independent module
- japanese_data.py as root-level file
- hina/__all__ only exports classify_program()

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
hangshuo652
2026-06-19 23:51:55 +08:00
parent 63b5284715
commit bc1d56d1a4
129 changed files with 19378 additions and 261 deletions
+22
View File
@@ -0,0 +1,22 @@
"""LLM 智能体包
公开 API:
LLMClient — LLM API 客户端(含缓存 + 重试)
Agent1Parser — COPYBOOK → FieldTree
Agent2Data — FieldTree → TestSuite(测试数据设计)
Agent3Diagnostic — FieldResult → 诊断建议文本
"""
from __future__ import annotations
from .llm import LLMClient
from .agent1_parser import Agent1Parser
from .agent2_data import Agent2Data
from .agent3_diagnostic import Agent3Diagnostic
__all__ = [
"LLMClient", # class
"Agent1Parser", # class
"Agent2Data", # class
"Agent3Diagnostic", # class
]
+6 -1
View File
@@ -15,7 +15,12 @@ class LLMClient:
def _get(self, k):
p = self.dir / f"{k}.json"
return json.loads(p.read_text())["response"] if p.exists() else None
if not p.exists():
return None
try:
return json.loads(p.read_text())["response"]
except (json.JSONDecodeError, KeyError):
return None
def _set(self, k, v):
(self.dir / f"{k}.json").write_text(json.dumps({"response": v}))