- Add 3 columns to query_history: highlight_prompt, highlight_response, highlight_time_ms - HistoryService.update_highlights() updates existing row after batch LLM call - ChunkHighlightService measures timing, captures prompt and structured JSON response - SSE completed event includes history_id for frontend to pass back - Frontend captures historyId, passes as ?history_id= query param in batch POST - Highlight time tracked separately (excluded from total_time_ms) - All 153 tests pass (108 backend + 45 frontend) |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| chunk_highlight_service.py | ||
| embedding_client.py | ||
| highlight_cache.py | ||
| history_service.py | ||
| llm_client.py | ||
| prompt_service.py | ||
| query_decomposer.py | ||
| rag.py | ||
| relevance_filter.py | ||