legco_ai_assistant

History

Woody 2aca18d30e docs: add vLLM structured output fix plan - Diagnose: vLLM ignores OpenAI-native response_format, causing NoneType error - Diagnose: legacy fallback prompt lacks JSON instruction → empty questions - Plan: use vLLM-native guided_json via extra_body instead of with_structured_output - Plan: update _SEED_DECOMPOSE with JSON format instruction - Plan: add diagnostic logging (exc_info, method, schema preview) wip: temporary function_calling switch for vLLM (to be replaced by guided_json)		2026-04-29 16:42:23 +08:00
..
__init__.py	feat: Phase 1.1 project setup with config, database, and models	2026-04-22 16:13:52 +08:00
chunk_highlight_service.py	feat: track highlight generation prompt, response, and timing in history (Phase 5.5)	2026-04-29 11:18:21 +08:00
embedding_client.py	feat(backend): add embedding client and update LLM client	2026-04-23 13:26:43 +08:00
highlight_cache.py	feat: add SQLite highlight cache service (Phase 5.4.3)	2026-04-29 09:26:20 +08:00
history_service.py	feat: track highlight generation prompt, response, and timing in history (Phase 5.5)	2026-04-29 11:18:21 +08:00
llm_client.py	docs: add vLLM structured output fix plan	2026-04-29 16:42:23 +08:00
prompt_service.py	feat(prompts): integrate filter_per_subq with PromptService, fix seed bugs, restructure UI	2026-04-27 11:14:27 +08:00
query_decomposer.py	feat: structured LLM output for decompose + citation fuzzy matching (Phase 5)	2026-04-28 15:39:17 +08:00
rag.py	feat: structured LLM output for decompose + citation fuzzy matching (Phase 5)	2026-04-28 15:39:17 +08:00
relevance_filter.py	fix(relevance): tolerate LLM score count mismatches via padding instead of discarding	2026-04-27 14:31:18 +08:00