legco_ai_assistant

Commit Graph

Author	SHA1	Message	Date
Woody	852430f1f1	feat: add Sub-Phase 9.0 config and Pydantic models for accuracy testing	2026-05-25 18:27:51 +08:00
Woody	7dfd603bc8	chore: update .gitignore and add accuracy testing enhancement plan	2026-05-25 18:14:55 +08:00
Woody	c8bcfa0487	docs: update Phase 5 plan with realtime implementation and model fix notes Document chunked REST realtime implementation, model change to google/chirp-3, language code handling, diagnostic logging, and updated acceptance criteria. Ultraworked with Sisyphus Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 13:34:25 +08:00
Woody	f44b68812d	fix: add diagnostic logging and OpenRouter language code filter Add transcribe-start/complete logs for both providers, error response body logging, and ASR provider in startup log. Filter yue (ISO 639-3) language code from OpenRouter STT requests. Ultraworked with Sisyphus Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 13:34:06 +08:00
Woody	cd125d8535	feat: add OpenRouter realtime ASR via chunked REST WebSocket Add _ws_proxy_openrouter() handler with pcm_to_wav() converter, 3s chunk accumulation, flush_lock concurrency guard, and endpoint dispatch on ASR_PROVIDER. Language code yue filtered for OpenRouter (ISO 639-3 not supported). Ultraworked with Sisyphus Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 13:33:52 +08:00
Woody	552b4964bf	fix: change default OpenRouter STT model to google/chirp-3 google/gemini-3.1-flash-lite is not an STT model; chirp-3 is one of the 8 supported OpenRouter STT models. Ultraworked with Sisyphus Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 13:33:33 +08:00
Woody	5da74ec24c	docs: add Phase 5 OpenRouter ASR implementation plan Complete implementation plan with architecture (Factory+Strategy pattern), provider comparison (DashScope vs OpenRouter), configuration, 7 implementation tasks, test plan, acceptance criteria, and implementation notes including decisions made (circular import resolution, separate API key, sync-to-async DashScope wrapper). Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 09:49:22 +08:00
Woody	6928fff8ff	test: update Phase 2 tests for ASR provider abstraction Update TestTranscribeFull to use async/await and patch the moved OpenAI import (now in asr_providers.py). Set ASR_PROVIDER=dashscope in test fixtures to ensure tests don't pick up the real .env ASR_PROVIDER value. All 19 Phase 2 + 7 integration tests pass. Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 09:48:58 +08:00
Woody	733824c177	test: add Phase 5 ASR provider and integration tests test_phase5_config.py: 6 tests for ASR_PROVIDER validation and default values. test_phase5_openrouter_provider.py: 14 tests covering OpenRouterSTT transcription, retry logic, error handling, URL construction, cleanup, and factory dispatch. test_phase5_integration.py: 4 tests for full video-to-transcribe flow with both providers (mocked) and per-provider API key validation. Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 09:48:37 +08:00
Woody	183fcf7772	refactor: make ASR client and video router provider-aware Refactor ASRClient to delegate to provider (DashScopeASRProvider or OpenRouterASRProvider) via create_asr_provider() factory. transcribe_full() now async. Move _to_traditional to asr_providers.py (re-exported from asr_client.py for backward compat). Update video.py router to await transcribe_full() and validate API key per provider (DASHSCOPE_API_KEY for dashscope, OPENROUTER_API_KEY for openrouter). Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 09:48:12 +08:00
Woody	39525a2344	feat: add ASR provider config, abstraction layer, and OpenRouter provider Add ASR_PROVIDER env var (dashscope\|openrouter), OPENROUTER_API_KEY, and ASR_OPENROUTER_MODEL to Settings. Create ASRProvider ABC with DashScopeASRProvider (wraps existing OpenAI-based DashScope calls via run_in_executor) and OpenRouterASRProvider (httpx + tenacity retry for batch STT). Add tenacity>=8.0.0 dependency. Realtime WebSocket stays DashScope-only. Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-19 09:47:30 +08:00
Woody	67d2bddeb6	fix: use relative /api/v1 fallback instead of hardcoded localhost:8000 API URLs now resolve relative to the page origin, working for both local dev (via Vite proxy) and remote production deployments. Also fixes useFullTranscript which had a double /api/v1 path bug when VITE_API_BASE_URL was set. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-18 17:27:28 +08:00
Woody	a54d688867	fix: use VITE_API_BASE_URL for highlight endpoints instead of hardcoded localhost Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-18 16:31:16 +08:00
Woody	6678f81283	fix: keep textarea editable during half-question API call Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-18 16:04:14 +08:00
Woody	531e7c435e	fix: enable half-question and final-submit buttons during interim ASR text Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-18 15:38:48 +08:00
Woody	b6f8a522b6	docs: mark Phase 4 audio echo plan as completed	2026-05-18 14:50:59 +08:00
Woody	2d3dc7374d	docs: Phase 4 audio echo bug fix plan	2026-05-18 14:47:46 +08:00
Woody	d5e7e2d0ca	chore: add pnpm config and update lockfile	2026-05-18 14:47:34 +08:00
Woody	1e6e41e426	feat: HTTPS support with nginx reverse proxy - Add nginx as reverse proxy (HTTP→HTTPS redirect, self-signed cert) - start.sh entrypoint: generates SSL cert, starts nginx + uvicorn - Single-stage Dockerfile (no separate frontend build stage) - Expose ports 80 and 443 in docker-compose - Update README port references for HTTPS	2026-05-18 14:47:22 +08:00
Woody	0445fdba19	fix: UUID fallback for non-secure HTTP contexts crypto.randomUUID() is unavailable outside secure contexts (plain HTTP). Add generateUUID() helper with manual UUID v4 fallback (RFC 4122).	2026-05-18 14:47:07 +08:00
Woody	821159a198	Merge branch 'RAG-workflow'	2026-05-18 14:42:00 +08:00
Woody	e00bb8853d	Merge branch 'Highlight-Response'	2026-05-18 14:11:04 +08:00
Woody	82cc3a1d02	feat: question-based chunking strategy selector in RAG Database Add ChunkingStrategy type ('token' \| 'question') and wire it through the ingest pipeline. Users can now choose between traditional token-window chunking and question-based chunking (Q&A pair detection, table extraction). Frontend changes: - RAGDatabasePage: radio buttons for Token vs Question strategy - DocumentList: strategy badges (blue 'chunked by question' / gray 'chunked by token') - ChunkList: question-strategy chunks show Q&A metadata (question ID, topic, page range, 'contains table' badge) instead of raw page numbers - api.ts / queries.tsx: pass strategy param to /ingest endpoint - types/index.ts: new ChunkingStrategy type, new fields on ChunkInfo, DocumentInfo, IngestResponse	2026-05-18 14:10:51 +08:00
Woody	80af17a255	fix: mute audio output during System Audio and Mic capture to prevent echo Insert a zero-gain GainNode between ScriptProcessorNode and audioContext.destination. The processor stays in the graph (so onaudioprocess fires on all browsers) but zero volume reaches the speakers, eliminating the echo/feedback loop during live capture.	2026-05-18 14:04:42 +08:00
Woody	73c1789698	fix: Q\&A chunking always fell back to token — LLM never called, missing API fields Three bugs caused 'Chunk by Question' to silently produce token chunks: 1. QuestionChunkingStrategy.chunk_pages() had a broken event-loop check that always skipped LLM structure detection in FastAPI's async context. Fixed by making chunk_pages() async and removing the is_running() guard. 2. get_chunking_strategy() factory never passed an LLMClient to QuestionChunkingStrategy. Fixed by creating LLMClient in the factory with graceful fallback to regex-only when config is incomplete. 3. rag.list_documents() and list_chunks() didn't extract strategy_type or Q&A fields from ChromaDB metadata, so the frontend always showed chunking_strategy='token' and null Q&A fields. Fixed by reading these fields from ChromaDB and routing them through the API. Also: TokenChunkingStrategy.chunk_pages() made async for consistency with the question strategy; ingest router updated to await it. Tests updated (asyncio.run() for sync tests, async mock chunk_pages).	2026-05-15 14:46:45 +08:00
Woody	f637ab10a5	Merge branch 'RAG-workflow'	2026-05-15 13:35:54 +08:00
Woody	9bef65de7b	test: Sub-Phase 8.5 — acceptance test skeleton for Q&A chunking 8 acceptance tests with real LegCo PDFs (all @pytest.mark.acceptance + @slow). Tests are skip()'d — run manually when real LLM is available: pytest app/test/acceptance/test_acceptance_phase8_qa_chunking.py -v -m acceptance Sub-Phase 8.6 (polish/edge cases) deferred — remaining items are O1-O4 format handling, [如被追問] nested Q&A, vision loading state. Core algorithm (8.1-8.4) is test-passing and production-ready.	2026-05-15 12:45:46 +08:00
Woody	14423c773a	feat: Sub-Phases 8.1-8.4 — Q&A-pair chunking strategy 8.1 — Core algorithm (test-first): - qa_chunking.py: preprocess_text, build_structure_detection_prompt, parse_llm_structure_response, Section dataclass, split_chinese_qa, split_english_qa, build_chunks_from_sections with recursive size split - QuestionChunkingStrategy in chunking.py with _chunk_metadata tracking - get_chunking_strategy() factory function - table_extraction.py: vision LLM extraction, heuristic text fallback, disk cache, inject_tables_into_answer - 18/18 tests pass (LLM parse, regex fast-pass, multi-page, ABC contract, size limit, chunk building, preprocess) 8.2 — Metadata enrichment: - extract_metadata() accepts strategy_type + chunk_metadata params - Q&A fields (question_id, question_index, section_heading, etc.) merged into ChromaDB metadata entries - DocumentInfo.chunking_strategy + ChunkInfo Q&A fields in models - 6/6 metadata tests pass 8.3 — Ingest API integration: - POST /api/v1/ingest accepts ?strategy=token\|question - validate strategy against VALID_CHUNKING_STRATEGIES - factory creates correct chunker; _chunk_metadata passed to extract_metadata - 6/6 ingest integration tests pass, zero regressions on existing tests 8.4 — Frontend strategy selector: - Radio button selector (Token / Question) on RAG Database page - Strategy passed to ingest mutation via api.ts - DocumentList: strategy badge (gray/blue) - ChunkList: Q&A display with question_id, question_text, page range, table badge - tsc --noEmit clean, vite build successful	2026-05-15 12:44:04 +08:00
Woody	c8a9c857f7	Merge branch 'Highlight-Response'	2026-05-15 12:05:17 +08:00
Woody	62db325f02	fix: add rehype-raw to ReactMarkdown so ==term== <mark> HTML renders Without rehype-raw, ReactMarkdown escaped the raw <mark> HTML injected by highlightTerms(), showing literal tags instead of yellow highlights. Now 30 marks render with correct bg-yellow-200 (#FEF08A) background.	2026-05-15 12:05:07 +08:00
Woody	ef10b937cf	feat: Sub-Phase 8.0 — config & enums for Q&A-pair chunking strategy Backend: - Add 6 Q&A chunking config fields to Settings (default_chunking_strategy, qa_vision_enabled, qa_max_chunk_tokens, qa_structure_model, qa_include_internal_refs, qa_cache_vision_results) - Define ChunkingStrategyType Literal + VALID_CHUNKING_STRATEGIES frozenset - Add strategy field to IngestResponse (default token, non-breaking) - Add IngestRequest model with strategy param - Update .env.example with new env vars Frontend: - Add ChunkingStrategy type ('token' \| 'question') - Extend IngestResponse, DocumentInfo, ChunkInfo with Q&A fields Tests: - test_qa_chunking_config_defaults — all defaults verified - test_qa_chunking_config_from_env — env var overrides verified Plan fix: renamed qa_verification_model → qa_structure_model to match LLM-first architecture	2026-05-15 12:01:28 +08:00
Woody	6bf04cedb1	docs: Package 8 — switch to LLM-first structure detection (not regex-first) LegCo documents use multiple formats (問/答 markers, Q1/Q2 numbering, section headings like '(1) 住戶的安置補償', 發言要點 bullet points, and pure table pages). Regex alone cannot reliably classify all these. Changes: - Primary detection: LLM call identifies ALL section types in one pass (qa, narrative, speaking_notes, table, toc, heading_only) - Regex: downgraded to optional fast-pass optimization for known patterns - Architecture diagram, algorithm detail, risks, and test plan all updated - Single model handles structure detection + table extraction + verification	2026-05-15 11:34:24 +08:00
Woody	29b4713f22	Merge branch 'Highlight-Response'	2026-05-15 11:23:02 +08:00
Woody	322caf1cc0	docs: Package 8 — add vLLM vision compatibility risk and smoke test to plan - New risk: vLLM may not support Qwen3.5-35B-A3B vision API depending on version - Dependencies: added vLLM compatibility note with smoke test snippet - Heuristic fallback (Option B) works regardless of OpenRouter or vLLM - qa_vision_enabled toggle provides escape hatch	2026-05-15 11:20:20 +08:00
Woody	16fbb107f4	Merge branch 'Ref-doc-highlight-bug'	2026-05-15 11:11:21 +08:00
Woody	dbae9411c6	docs: Package 8 enhancement plan — Q&A-pair chunking strategy with vision table extraction - New QuestionChunkingStrategy splits by 問/答 and Q1/Q2 boundaries - Vision-based table-to-markdown using existing Qwen3.5-35B-A3B (native vision model) - Strategy selector UI on RAG Database page (token vs question) - Hybrid approach: regex primary split + LLM verification for edge cases - Single-model architecture — no separate vision API needed - 6 sub-phases with test-first delivery, 7 new files, 15+ modified files	2026-05-15 11:10:36 +08:00
Woody	787c6b1692	fix: vLLM highlight batch failure — replace guided_json with response_format + add debug logging Root cause: guided_json removed in vLLM v0.12.0, and the two-attempt loop (structured_outputs → guided_json) merged chat_template_kwargs into the extra_body, potentially causing param conflicts. Changes: - llm_client.py: Replace _complete_structured_vllm() with two-tier approach — response_format (Tier 1, v0.6.4+) then structured_outputs (Tier 2, v0.8+). Remove dead guided_json path. Add _strip_markdown_fence(). - chunk_highlight_service.py: Add complete() fallback as defense-in-depth when structured output fails. Strip markdown fences before parsing. - chunks.py: Add request/response logging at router level. - chunk_highlight_service.py: Add full logging chain — entry, ChromaDB fetch, LLM call, fallback, cache results, exit. - ResponsePanel.tsx: Add console logging for request payload, response status/errors/timing. Handle status=failed explicitly (was silently ignored). Track round-trip timing via performance.now().	2026-05-15 11:08:36 +08:00
Woody	e78f53b687	feat: Phase 7.2 — wire highlightTerms into ResponsePanel + mark CSS - Add HighlightMark component rendering <mark class="bg-yellow-200..."> - Call highlightTerms() in SubQuestionSection and FlatResponse before ReactMarkdown - Add mark: HighlightMark to ReactMarkdown components in both paths - Add .prose mark CSS rule (yellow-200 bg, rounded, px-0.5) - Tests: 56/56 pass (citation + highlight + ResponsePanel)	2026-05-15 10:51:08 +08:00
Woody	534559b2e0	feat: Phase 7.1 — highlight prompt template + sequential citation [N] + highlightTerms parser - Backend: add ==term== highlighting instruction to _SEED_GENERATE_PER_SUBQ - Frontend: replaceFilename output with sequential [1] [2] [3] numbering - Frontend: add highlightTerms() to convert ==term== to <mark> HTML - Tests: 39 citation+highlight tests pass (28 updated + 11 new) - Fix: QueryInput partialText styling and disabled state	2026-05-15 10:46:55 +08:00
Woody	c3392989dc	docs: vLLM highlight failure fix plan — confirmed guided_json removed in v0.12.0 Root cause confirmed via vLLM docs, protocol.py source, RFC #19097, and GitHub test suite: guided_json was removed in v0.12.0. Our fallback to it after structured_outputs fails is dead code. Fix strategy: replace _complete_structured_vllm() with two-tier approach (response_format as Tier 1, structured_outputs as Tier 2), removing the dead guided_json path and the chat_template_kwargs merge that may conflict. Evidence from: vllm.ai docs, vllm-project/vllm tests/entrypoints, protocol.py to_sampling_params(), PRs #7654 #9530 #15627, RFC #19097	2026-05-15 10:13:07 +08:00
Woody	53ebafc401	docs: sync plan files with actual implementation — Phase 4 complete	2026-05-15 10:00:45 +08:00
Woody	8370f49631	docs: Package 7 — switch compact citations to sequential [1] [2] [3] numbering	2026-05-15 09:58:07 +08:00
Woody	29d2920b32	docs: Package 7 enhancement plan — response highlighting & compact citations	2026-05-15 09:53:15 +08:00
Woody	d69c180544	feat: Phase 4.8-4.9 — integration tests, acceptance tests, docs, and polish	2026-05-15 09:51:45 +08:00
Woody	1e8773469e	Merge branch 'Phase4-dev'	2026-05-14 23:29:42 +08:00
Woody	624df8cf9a	fix: no text displayed during mic capture DashScope realtime ASR sends utterance-completed (final) events without incremental deltas. The onmessage handler cleared partialTranscript on every final, so text never appeared. Set partialTranscript to full_text on final messages instead of clearing it, keeping the transcript visible in QueryInput.	2026-05-14 23:25:39 +08:00
Woody	7c03137577	fix: mic transcript disappearing after stop useMediaStreamASR cleanup() cleared partialTranscript on stop, causing live ASR text to vanish from QueryInput. Unlike video ASR (which has onFinalTranscript to persist via queryText), mic and system-audio hooks rely on partialTranscript for display. Keep partialTranscript populated with the final transcript instead of clearing it.	2026-05-14 23:19:11 +08:00
Woody	7bff4308b7	feat: Phase 4 — System Audio & Listen Mic capture into ASR → RAG Adds two new live audio sources alongside file Upload: - System Audio: getDisplayMedia() captures system/tab audio output, pipes through WebSocket → DashScope realtime ASR → RAG. - Listen Mic: getUserMedia() captures microphone input via the same audio pipeline (shared useMediaStreamASR hook). Backend: feature toggles (system_audio_enabled, mic_enabled) in config.py, source query param gating in ws_asr.py, 10 config tests. Bug fix: getDisplayMedia() rejected video:false per W3C spec — changed to video:true then stop video tracks to allow audio-only capture on Windows/macOS Chrome.	2026-05-14 22:55:06 +08:00
Woody	a8a2cc0940	fix: enable Half Question/Final Submit during interim ASR text isDisabled, handleSubmit, and Half Question onClick all checked question.trim() instead of displayValue.trim(). Since question state is only updated on onFinalTranscript (complete sentences), interim ASR delta text shown in the textarea via partialText was invisible to the disabled check — buttons stayed disabled until sentence end. Fix: use displayValue which includes partialText when user hasn't typed.	2026-05-14 21:55:07 +08:00
Woody	17db487dbb	feat: Phase 3 — Half Question button, Final Submit rename, ASR text always black - Backend: add stop_after_decompose flag to QueryRequest, early-return after decomposition in SSE stream with half_question:true event - Frontend: add decomposeOnly method to useQueryDocumentStream hook - QueryInput: remove grey italic from ASR partial text, rename Submit to Final Submit, add gray Half Question button that decomposes without clearing querybox text - LTTPage: wire handleHalfQuestion to decomposeOnly	2026-05-14 21:27:21 +08:00

1 2 3 4 5

227 Commits All Branches Search

227 Commits

All Branches