chrysopedia

History

jlightner 4854dad086 feat: Ran manual chat evaluation against live endpoint, documented qual… - ".gsd/milestones/M025/slices/S09/S09-QUALITY-REPORT.md" - "backend/pipeline/quality/results/chat_eval_baseline.json" GSD-Task: S09/T03		2026-04-04 14:50:44 +00:00
..
fixtures	test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par…	2026-04-04 14:43:52 +00:00
results	feat: Ran manual chat evaluation against live endpoint, documented qual…	2026-04-04 14:50:44 +00:00
__init__.py	feat: Created PromptVariantGenerator (LLM-powered prompt mutation) and…	2026-04-01 09:08:01 +00:00
__main__.py	test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par…	2026-04-04 14:43:52 +00:00
chat_eval.py	test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par…	2026-04-04 14:43:52 +00:00
chat_scorer.py	test: Created chat-specific LLM-as-judge scorer (5 dimensions), SSE-par…	2026-04-04 14:43:52 +00:00
fitness.py	test: Built pipeline.quality package with FitnessRunner (9 tests, 4 cat…	2026-04-01 08:45:05 +00:00
optimizer.py	optimize: Stage 5 synthesis prompt — round 0 winner (0.95→1.0 composite)	2026-04-01 10:15:24 +00:00
scorer.py	feat: Generalized OptimizationLoop to stages 2-5 with per-stage fixture…	2026-04-01 09:24:42 +00:00
variant_generator.py	feat: Added STAGE_CONFIGS registry (stages 2-5) with per-stage rubrics,…	2026-04-01 09:20:24 +00:00
voice_dial.py	feat: Added VoiceDial class with 3-band prompt modification and ScoreRu…	2026-04-01 08:57:07 +00:00