legco_ai_assistant

Commit Graph

Author	SHA1	Message	Date
Woody	852430f1f1	feat: add Sub-Phase 9.0 config and Pydantic models for accuracy testing	2026-05-25 18:27:51 +08:00
Woody	552b4964bf	fix: change default OpenRouter STT model to google/chirp-3. google/gemini-3.1-flash-lite is not an STT model; chirp-3 is one of the 8 supported OpenRouter STT models. Ultraworked with Sisyphus Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 13:33:33 +08:00
Woody	39525a2344	feat: add ASR provider config, abstraction layer, and OpenRouter provider Add ASR_PROVIDER env var (dashscope|openrouter), OPENROUTER_API_KEY, and ASR_OPENROUTER_MODEL to Settings. Create ASRProvider ABC with DashScopeASRProvider (wraps existing OpenAI-based DashScope calls via run_in_executor) and OpenRouterASRProvider (httpx + tenacity retry for batch STT). Add tenacity>=8.0.0 dependency. Realtime WebSocket stays DashScope-only. Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 09:47:30 +08:00
Woody	ef10b937cf	feat: Sub-Phase 8.0 — config & enums for Q&A-pair chunking strategy Backend: - Add 6 Q&A chunking config fields to Settings (default_chunking_strategy, qa_vision_enabled, qa_max_chunk_tokens, qa_structure_model, qa_include_internal_refs, qa_cache_vision_results) - Define ChunkingStrategyType Literal + VALID_CHUNKING_STRATEGIES frozenset - Add strategy field to IngestResponse (default token, non-breaking) - Add IngestRequest model with strategy param - Update .env.example with new env vars Frontend: - Add ChunkingStrategy type ('token' | 'question') - Extend IngestResponse, DocumentInfo, ChunkInfo with Q&A fields Tests: - test_qa_chunking_config_defaults — all defaults verified - test_qa_chunking_config_from_env — env var overrides verified Plan fix: renamed qa_verification_model → qa_structure_model to match LLM-first architecture	2026-05-15 12:01:28 +08:00
Woody	534559b2e0	feat: Phase 7.1 — highlight prompt template + sequential citation [N] + highlightTerms parser - Backend: add ==term== highlighting instruction to _SEED_GENERATE_PER_SUBQ - Frontend: replaceFilename output with sequential [1] [2] [3] numbering - Frontend: add highlightTerms() to convert ==term== to <mark> HTML - Tests: 39 citation+highlight tests pass (28 updated + 11 new) - Fix: QueryInput partialText styling and disabled state	2026-05-15 10:46:55 +08:00
Woody	7bff4308b7	feat: Phase 4 — System Audio & Listen Mic capture into ASR → RAG Adds two new live audio sources alongside file Upload: - System Audio: getDisplayMedia() captures system/tab audio output, pipes through WebSocket → DashScope realtime ASR → RAG. - Listen Mic: getUserMedia() captures microphone input via the same audio pipeline (shared useMediaStreamASR hook). Backend: feature toggles (system_audio_enabled, mic_enabled) in config.py, source query param gating in ws_asr.py, 10 config tests. Bug fix: getDisplayMedia() rejected video:false per W3C spec — changed to video:true then stop video tracks to allow audio-only capture on Windows/macOS Chrome.	2026-05-14 22:55:06 +08:00
Woody	b05c361fbd	revert: remove Phase 3 YouTube proxy — all 7 sub-phases Reverts commits `284028b` through `b4096d6`. Phase 4 (System Audio Capture) will replace the YouTube use case with a more versatile getDisplayMedia approach. Removed: YouTube router, HLS proxy, YouTubeService, YouTubeInput, YouTubeVideoPlayer, useYouTubeASR hook, all Phase 3 tests, hls.js dep, YouTube config fields, YouTube README/plan sections. Modified files restored to pre-Phase-3 state: LTTPage (no source toggle), api.ts (no YouTube extract), types (no YouTube types), config.py (no youtube fields), main.py (no YouTube router), requirements.txt (no yt-dlp), .env.example (no YouTube vars), package.json (no hls.js). Relevant Phase 2 code preserved: ws_asr.py (unchanged), useVideoASR, VideoPlayer, VideoUpload, QueryInput, Full Transcript.	2026-05-09 21:07:21 +08:00
Woody	284028bb1f	feat: Phase 3.1 + 3.2 — YouTube config infra and URL extraction Phase 3.1 — Configuration & Infrastructure: - Add youtube_proxy_enabled, yt_dlp_timeout, yt_dlp_cache_ttl config fields - Add yt-dlp and hls.js dependencies - Create models/youtube.py (request/response schemas) - Create service stubs (youtube_service, hls_proxy) - Create router stub and register in main.py - 11 config tests Phase 3.2 — YouTube URL Extraction: - yt-dlp wrapper with async extraction (run_in_executor) - Format selection: ≤480p video-only + highest-bitrate audio (VOD) - Combined format fallback: same URL for live streams - In-memory URL cache: 5min TTL live, 30min VOD - lru_cache singleton service for cache persistence - Error handling: DownloadError → 200 with error field - 18 extract tests, 82/82 total pass (zero regressions) Real-URL verified: VOD (5bF3tkO5jAA) 24 formats, Live (fN9uYWCjQaw) 6 HLS	2026-05-09 15:53:04 +08:00
Woody	9934749d2b	feat: Phase 2.1 config + infrastructure and 2.2 video upload backend - Add DashScope ASR and video upload config fields to Settings - Create Pydantic models (video.py, asr.py) - Create VideoService with validation, save, serve, delete - Create ASR client stub with float32_to_s16le utility - Implement POST /api/v1/video/upload with streaming validation - Implement GET /api/v1/video/{video_id} with FileResponse - Create WebSocket ASR endpoint stub - Register new routers in main.py - Update .env.example and requirements.txt - Add reference examples for DashScope integration - 8 tests passing (3 config + 5 video upload)	2026-05-06 13:08:19 +08:00
Woody	76c3bec2ab	feat: configurable SubQuestions via Step 1.2 system prompt page - Split 'Step 1: Query Decomposition' into Step 1.1 (prompt template) and Step 1.2 (format config with description + max_length) - Add create_subquestions_model() and parse_decompose_format() to decompose.py - QueryDecomposer reads decompose_format from DB, creates dynamic Pydantic model at runtime - PromptEditor renders Step 1.2 as textarea (description) + number input (max_length 1-5) - Graceful fallback to static SubQuestions when decompose_format unavailable	2026-05-04 17:22:14 +08:00
Woody	40b338d3ca	chore: gitignore .research, switch to flash, tighten sub-questions Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-04 16:38:58 +08:00
Woody	73ae621f3b	feat: add Deepseek config fields and DI wiring (Phase 6) Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-04 14:58:39 +08:00
Woody	3ab6fd102a	fix: use vLLM-native guided_json for structured output vLLM servers support JSON schema enforcement via extra_body (guided_json or structured_outputs), not OpenAI's response_format protocol. LangChain's with_structured_output(method='json_schema') sends response_format which vLLM ignores, causing NoneType not iterable parsing errors. - vLLM path: direct OpenAI SDK call with extra_body={guided_json|structured_outputs} - OpenRouter path: unchanged with_structured_output(method='json_schema') - Try new 'structured_outputs' format first, fall back to legacy 'guided_json' - Update _SEED_DECOMPOSE with explicit JSON array instruction - Add diagnostic logging: exc_info=True, schema preview, prompt template preview - Add logging in _parse_legacy_json for fallback failure debugging	2026-04-29 16:49:14 +08:00
Woody	41f59b396f	feat: track highlight generation prompt, response, and timing in history (Phase 5.5) - Add 3 columns to query_history: highlight_prompt, highlight_response, highlight_time_ms - HistoryService.update_highlights() updates existing row after batch LLM call - ChunkHighlightService measures timing, captures prompt and structured JSON response - SSE completed event includes history_id for frontend to pass back - Frontend captures historyId, passes as ?history_id= query param in batch POST - Highlight time tracked separately (excluded from total_time_ms) - All 153 tests pass (108 backend + 45 frontend)	2026-04-29 11:18:21 +08:00
Woody	4c56e81872	feat(prompts): enforce bullet-point output in generate template Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 16:42:55 +08:00
Woody	f2115ae563	feat: structured LLM output for decompose + citation fuzzy matching (Phase 5) Phase 5.1 — Structured LLM output for query decomposition: - Add SubQuestions Pydantic model with sub_question, keywords, rationale - Add LLMClient.complete_structured() using langchain with_structured_output - Update QueryDecomposer with structured output path + legacy json.loads fallback - Update SQLite seed templates: add subq+citation labeling requirement - Add tests: structured output, subquestions model validation, logging Phase 5.2 — Citation format alignment and fallback links: - Add document_id to SourceMetadata (backend + frontend types) - Rewrite citationParser.ts with fuzzy matching and fallback document links - Add RAGDatabasePage auto-expand from ?document= URL param - Tighten generate_per_subq seed prompt: 'Copy exact bracket labels shown' - Add citation parser tests for fuzzy match and fallback link scenarios - Defer: DOCX/TXT PDF generation → Phase 5.3 (fallback links sufficient)	2026-04-28 15:39:17 +08:00
Woody	711be3dfde	feat(llm): add VLLM_ENGINE env flag for provider-specific extra_body format	2026-04-28 13:30:27 +08:00
Woody	d444c99c23	feat(config): log resolved llm and embedding model names on startup Add INFO log in get_settings() to print the actual model names after merging .env and class defaults. Confirms pydantic-settings priority: env values override class defaults as expected. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-27 15:11:36 +08:00
Woody	3b868a0133	feat(prompts): integrate filter_per_subq with PromptService, fix seed bugs, restructure UI Break the hardcoded per-sub-q filter prompt into 3 editable PromptService templates (filter_intro, filter_section, filter_outro) with placeholders for the for-loop iteration pattern. Refactor RelevanceFilter._build_per_subq_prompt() to compose them at runtime, falling back to built-in defaults when PromptService is unavailable. Fix two latent bugs from Package 4: - generate_per_subq was called by rag.py but never added to _VALID_STEPS or DB seed (would ValueError at runtime) - _SEED_GENERATE placeholder mismatch: flat generate_response() expects {question}/{context} but Package 4 changed it to {context_sections}. Restored flat template; generate_per_subq now holds {context_sections}. Add database backfill migration in seed_default_profiles() to INSERT OR IGNORE missing steps into existing profile rows, ensuring all 7 steps exist on restart. Restructure System Prompts UI: remove unused flat filter/generate steps, replace with Step 2.1-2.3 (filter_intro/section/outro) and Step 3 (generate_per_subq). Update PlaceholderDocs with {context_sections}, {subq_idx}, {subq_question}. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-27 11:14:27 +08:00
Woody	0ecae11bf8	feat(db): update history schema and generate prompt template for Package 4 Add chunks_retrieved_per_subq_count and chunks_filtered_per_subq_count columns to query_history table with safe ALTER TABLE migration. Replace generate template {question}/{context} placeholders with {context_sections} for per-sub-question organized context sections. Update Phase 3 test assertions to match new template and schema shapes. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-26 23:28:28 +08:00
Woody	475306f2b1	feat(history): Phase 3.5 — Query History backend (service, API, timing, XML capture)	2026-04-25 22:59:53 +08:00
Woody	e49a68b0bd	feat(prompts): Phase 3.2 — Prompt Backend (CRUD service, REST API, 33 tests) - PromptService (services/prompt_service.py): full CRUD for 3 profiles A/B/C with seed template reset, validation, and sqlite3.Row access - REST API (routers/prompts.py): 6 endpoints on /api/v1/prompts - Pydantic models (models/prompts.py): 6 schemas - DI wiring (dependencies.py): get_prompt_service() - App registration (main.py): prompts router - Mock fixture (conftest.py): mock_prompt_service - Tests: test_phase3_prompt_service.py (22) + test_phase3_prompts_router.py (11) - 162/166 total pass, 4 skipped, 0 fail	2026-04-25 21:11:17 +08:00
Woody	f4b404f27d	feat(db): Phase 3.1 — SQLite infrastructure (prompts.db + history.db) - Add sqlite_db.py with dual-DB connection factories (WAL mode, foreign keys) - init_prompts_db() creates system_prompt_profiles + system_prompts tables - init_history_db() creates query_history table + created_at index - seed_default_profiles() inserts 3 profiles (A/B/C) x 3 steps each - All 3 profiles start with identical seed templates; Profile A active - Add prompts_db_path + history_db_path to config (./data/ default) - Startup init in main.py creates data/ dir, inits both DBs, seeds profiles - Add PROMPTS_DB_PATH + HISTORY_DB_PATH to .env.example - Add data/ to .gitignore - 17 new tests in test_phase3_sqlite_db.py (all passing)	2026-04-25 20:29:29 +08:00
Woody	8c84062996	feat(backend): add PDF page extractor and chunk PDF storage config New pdf_extractor.py with extract_page_as_pdf() and extract_pages_as_pdf() for extracting individual PDF pages as separate files. Adds document_chunk_path setting to config and document_chunk/ to .gitignore. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 10:52:57 +08:00
Woody	5dcb71369c	fix(backend): add embed_query method to EmbeddingFunctionWrapper for ChromaDB query ChromaDB 1.5.8 calls embed_query() during collection.query(), but the wrapper only implemented __call__ (used by collection.add()). Added embed_query() as alias and refactored to shared _embed() method. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 10:15:08 +08:00
Woody	c6abe5c335	fix(backend): add name() method to EmbeddingFunctionWrapper for ChromaDB 1.5.8 ChromaDB 1.5.8 requires embedding functions to implement the name() method from the EmbeddingFunction protocol. Without this, collection.get() fails with AttributeError. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 19:02:41 +08:00
Woody	74cb8b83d5	feat(backend): migrate LLM client to OpenAI SDK with thinking control - Replace httpx with openai.AsyncOpenAI - Add llm_enable_thinking config (default False) - Add _build_extra_body() for Qwen3.5 thinking mode control - Use chat_template_kwargs for vLLM/SGLang compatibility Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 14:10:26 +08:00
Woody	4cf930dc59	feat(backend): add dependency injection and update main entry point Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 13:27:30 +08:00
Woody	b93fc2e05b	chore(backend): update config, env template, and pytest settings Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 13:26:08 +08:00
Woody	c9f330d57e	fix(backend): wrap embedding function for ChromaDB 0.4.22 compatibility - Add _EmbeddingFunctionWrapper class with __call__(self, input) signature - Use ThreadPoolExecutor to run async embed in isolated thread with fresh event loop - Fixes asyncio.run() cannot be called from a running event loop Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 13:25:08 +08:00
Woody	3712397d64	feat: Phase 1.1 project setup with config, database, and models - Add requirements.txt with all dependencies - Add .env.example with required environment variables - Add Pydantic Settings (config.py) with .env loading - Add ChromaDB persistent client (database.py) - Add Pydantic schemas (ingest.py) for request/response - Add FastAPI main.py with CORS middleware - Add package __init__.py files - Add tests: test_phase1_config.py, test_phase1_database.py - All 5 tests pass	2026-04-22 16:13:52 +08:00

31 Commits
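
Note on commit c9f330d57e (with the follow-up 5dcb71369c): the message describes wrapping an async embedding call so that a synchronous caller can use it even when an event loop is already running. The sketch below is one plausible reading of that description, not the repository's actual code. `EmbeddingFunctionWrapper` here is a simplified stand-in, `fake_embed` is a hypothetical placeholder for the real embedding client, and no ChromaDB import is involved.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor


async def fake_embed(texts):
    """Hypothetical async embedder; stands in for a real embedding API call."""
    await asyncio.sleep(0)
    return [[float(len(t))] for t in texts]


class EmbeddingFunctionWrapper:
    """Sync facade over an async embed function (illustrative sketch only)."""

    def __init__(self, embed_async):
        self._embed_async = embed_async

    def _embed(self, input):
        # asyncio.run() raises RuntimeError when called from a thread that
        # already has a running event loop, so run it in a worker thread,
        # where it creates and closes its own fresh loop.
        with ThreadPoolExecutor(max_workers=1) as pool:
            future = pool.submit(lambda: asyncio.run(self._embed_async(input)))
            return future.result()

    def __call__(self, input):
        # Sync entry point, e.g. for adding documents.
        return self._embed(input)

    def embed_query(self, input):
        # Alias sharing the same _embed() path, as the later commit describes.
        return self._embed(input)


async def main():
    # Called from inside a running loop, where a bare asyncio.run() would fail.
    wrapper = EmbeddingFunctionWrapper(fake_embed)
    return wrapper(["ab", "cde"]), wrapper.embed_query(["x"])


vectors, query_vectors = asyncio.run(main())
```

The worker thread blocks the caller until the embedding finishes, so this trades concurrency for compatibility with a synchronous interface.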