- Diagnose: vLLM ignores OpenAI-native response_format, causing NoneType error - Diagnose: legacy fallback prompt lacks JSON instruction → empty questions - Plan: use vLLM-native guided_json via extra_body instead of with_structured_output - Plan: update _SEED_DECOMPOSE with JSON format instruction - Plan: add diagnostic logging (exc_info, method, schema preview) wip: temporary function_calling switch for vLLM (to be replaced by guided_json) |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| chunk_highlight_service.py | ||
| embedding_client.py | ||
| highlight_cache.py | ||
| history_service.py | ||
| llm_client.py | ||
| prompt_service.py | ||
| query_decomposer.py | ||
| rag.py | ||
| relevance_filter.py | ||