chrysopedia/backend/pipeline/quality
jlightner 4854dad086 feat: Ran manual chat evaluation against live endpoint, documented qual…
- ".gsd/milestones/M025/slices/S09/S09-QUALITY-REPORT.md"
- "backend/pipeline/quality/results/chat_eval_baseline.json"

GSD-Task: S09/T03
2026-04-04 14:50:44 +00:00
..
fixtures test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par… 2026-04-04 14:43:52 +00:00
results feat: Ran manual chat evaluation against live endpoint, documented qual… 2026-04-04 14:50:44 +00:00
__init__.py feat: Created PromptVariantGenerator (LLM-powered prompt mutation) and… 2026-04-01 09:08:01 +00:00
__main__.py test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par… 2026-04-04 14:43:52 +00:00
chat_eval.py test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par… 2026-04-04 14:43:52 +00:00
chat_scorer.py test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par… 2026-04-04 14:43:52 +00:00
fitness.py test: Built pipeline.quality package with FitnessRunner (9 tests, 4 cat… 2026-04-01 08:45:05 +00:00
optimizer.py optimize: Stage 5 synthesis prompt — round 0 winner (0.95→1.0 composite) 2026-04-01 10:15:24 +00:00
scorer.py feat: Generalized OptimizationLoop to stages 2-5 with per-stage fixture… 2026-04-01 09:24:42 +00:00
variant_generator.py feat: Added STAGE_CONFIGS registry (stages 2-5) with per-stage rubrics,… 2026-04-01 09:20:24 +00:00
voice_dial.py feat: Added VoiceDial class with 3-band prompt modification and ScoreRu… 2026-04-01 08:57:07 +00:00