Commit graph

6 commits

Author SHA1 Message Date
jlightner
0b0ca598b4 feat: Log LLM response token usage (prompt/completion/total, content_len, finish_reason)
2026-03-30 06:15:24 +00:00
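A minimal sketch of what this kind of usage logging typically looks like against an OpenAI-compatible /chat/completions response; the "usage" and "finish_reason" field names follow that response schema, while the logger name, function name, and call site are assumptions, not the repository's actual code:

```python
# Hypothetical sketch: log token usage from an OpenAI-compatible chat response.
import logging

logger = logging.getLogger("pipeline.llm_client")  # logger name is an assumption

def log_usage(response: dict) -> None:
    usage = response.get("usage", {})
    choice = (response.get("choices") or [{}])[0]
    content = (choice.get("message") or {}).get("content") or ""
    logger.info(
        "LLM response: prompt_tokens=%s completion_tokens=%s total_tokens=%s "
        "content_len=%d finish_reason=%s",
        usage.get("prompt_tokens"),
        usage.get("completion_tokens"),
        usage.get("total_tokens"),
        len(content),
        choice.get("finish_reason"),
    )
```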
jlightner
cf759f3739 fix: Add max_tokens=16384 to LLM requests (OpenWebUI defaults to 1000, truncating pipeline JSON)
2026-03-30 04:08:29 +00:00
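A sketch of the fix under the assumption that requests go to an OpenAI-compatible endpoint (which is what OpenWebUI exposes): only max_tokens=16384 and the 1000-token default come from the commit message; the URL, model, and payload layout are illustrative.

```python
# Hypothetical sketch: pass max_tokens explicitly so the server default
# cannot truncate the JSON the pipeline expects back.
import requests

def complete(base_url: str, api_key: str, model: str, messages: list[dict]) -> dict:
    payload = {
        "model": model,
        "messages": messages,
        "max_tokens": 16384,  # without this, responses were cut off mid-JSON
    }
    resp = requests.post(
        f"{base_url}/chat/completions",
        json=payload,
        headers={"Authorization": f"Bearer {api_key}"},
        timeout=300,
    )
    resp.raise_for_status()
    return resp.json()
```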
jlightner
4aa4b08a7f feat: Per-stage LLM model routing with thinking modality and think-tag stripping
- Added 8 per-stage config fields: llm_stage{2-5}_model and llm_stage{2-5}_modality
- LLMClient.complete() accepts modality ('chat'/'thinking') and model_override
- Thinking modality: appends JSON instructions to system prompt, strips <think> tags
- strip_think_tags() handles multiline, multiple blocks, and edge cases
- Pipeline stages 2-5 read per-stage config and pass to LLM client
- Updated .env.example with per-stage model/modality documentation
- All 59 tests pass including new think-tag stripping test
2026-03-30 02:12:14 +00:00
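The commit names strip_think_tags() and describes its behavior (multiline, multiple blocks); the regex below is a plausible implementation of that behavior, not the repository's actual code. It removes the <think>...</think> blocks that "thinking" models emit before the JSON payload the pipeline parses.

```python
# Hypothetical sketch of strip_think_tags(): drop all <think>...</think> blocks.
import re

_THINK_RE = re.compile(r"<think>.*?</think>", re.DOTALL | re.IGNORECASE)

def strip_think_tags(text: str) -> str:
    return _THINK_RE.sub("", text).strip()

# Example: the reasoning block is stripped, leaving only the JSON answer.
assert strip_think_tags("<think>\nplanning...\n</think>\n{\"ok\": true}") == '{"ok": true}'
```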
jlightner
5c46d1e922 feat: Created sync EmbeddingClient, QdrantManager with idempotent colle…
- "backend/pipeline/embedding_client.py"
- "backend/pipeline/qdrant_client.py"
- "backend/pipeline/stages.py"

GSD-Task: S03/T03
2026-03-29 22:39:04 +00:00
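A sketch of idempotent collection creation with the qdrant-client library, assuming that is what QdrantManager in backend/pipeline/qdrant_client.py does; the class shape, vector size, and distance metric below are assumptions, only the qdrant_client API itself is real.

```python
# Hypothetical sketch: create the Qdrant collection only if it does not exist yet,
# so the setup step can run safely on every worker start.
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, VectorParams

class QdrantManager:
    def __init__(self, url: str, collection: str, vector_size: int):
        self.client = QdrantClient(url=url)
        self.collection = collection
        self.vector_size = vector_size

    def ensure_collection(self) -> None:
        existing = {c.name for c in self.client.get_collections().collections}
        if self.collection not in existing:
            self.client.create_collection(
                collection_name=self.collection,
                vectors_config=VectorParams(size=self.vector_size, distance=Distance.COSINE),
            )
```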
jlightner
b5635a09db feat: Created 4 prompt templates and implemented 5 Celery tasks (stages…
- "prompts/stage2_segmentation.txt"
- "prompts/stage3_extraction.txt"
- "prompts/stage4_classification.txt"
- "prompts/stage5_synthesis.txt"
- "backend/pipeline/stages.py"
- "backend/requirements.txt"

GSD-Task: S03/T02
2026-03-29 22:36:06 +00:00
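An illustrative shape for one of the five Celery tasks, showing only how a stage task might load its prompt template from prompts/; the broker URL, task name, and return shape are assumptions — the prompt filename and the stage/task structure come from the commit message.

```python
# Hypothetical sketch: a Celery task that reads its stage prompt template.
from pathlib import Path
from celery import Celery

app = Celery("pipeline", broker="redis://localhost:6379/0")  # broker is an assumption
PROMPTS_DIR = Path("prompts")

def load_prompt(name: str) -> str:
    """Read one of the stage prompt templates shipped in prompts/."""
    return (PROMPTS_DIR / name).read_text(encoding="utf-8")

@app.task(name="pipeline.stage2_segmentation")
def stage2_segmentation(document_id: str, text: str) -> dict:
    system_prompt = load_prompt("stage2_segmentation.txt")
    # The real task would call the project's LLM client here; this sketch only
    # shows the template wiring this commit introduces.
    return {"document_id": document_id, "system_prompt_len": len(system_prompt)}
```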
jlightner
12cc86aef9 chore: Extended Settings with 12 LLM/embedding/Qdrant config fields, cr…
- "backend/config.py"
- "backend/worker.py"
- "backend/pipeline/schemas.py"
- "backend/pipeline/llm_client.py"
- "backend/requirements.txt"
- "backend/pipeline/__init__.py"
- "backend/pipeline/stages.py"

GSD-Task: S03/T01
2026-03-29 22:30:31 +00:00
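A sketch of how LLM/embedding/Qdrant settings are commonly declared with pydantic-settings in a backend/config.py like this one; every field name and default below is an assumption — the commit only says 12 such fields were added.

```python
# Hypothetical sketch of the extended Settings class (field names assumed).
from pydantic_settings import BaseSettings, SettingsConfigDict

class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env")

    # LLM backend (OpenAI-compatible endpoint, e.g. OpenWebUI)
    llm_base_url: str = "http://localhost:3000/api"
    llm_api_key: str = ""
    llm_model: str = "llama3.1"

    # Embeddings
    embedding_base_url: str = "http://localhost:3000/api"
    embedding_model: str = "nomic-embed-text"
    embedding_dim: int = 768

    # Qdrant
    qdrant_url: str = "http://localhost:6333"
    qdrant_collection: str = "documents"

settings = Settings()
```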