Add retrieve_per_subquestion() that queries ChromaDB independently per sub-question instead of joining all sub-qs into one query string. Add filter_per_subquestion() that evaluates each chunk against its own originating sub-question in a single LLM call with a redesigned grouped prompt. Add generate_response_per_subquestion() that produces markdown sections per sub-question with grouped sources and {context_sections} template support. All existing methods preserved for backward compatibility.
Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>
|
||
|---|---|---|
| .. | ||
| __init__.py | ||
| embedding_client.py | ||
| history_service.py | ||
| llm_client.py | ||
| prompt_service.py | ||
| query_decomposer.py | ||
| rag.py | ||
| relevance_filter.py | ||