legco_ai_assistant

Commit Graph

Author	SHA1	Message	Date
Woody	17db487dbb	feat: Phase 3 — Half Question button, Final Submit rename, ASR text always black - Backend: add stop_after_decompose flag to QueryRequest, early-return after decomposition in SSE stream with half_question:true event - Frontend: add decomposeOnly method to useQueryDocumentStream hook - QueryInput: remove grey italic from ASR partial text, rename Submit to Final Submit, add gray Half Question button that decomposes without clearing querybox text - LTTPage: wire handleHalfQuestion to decomposeOnly	2026-05-14 21:27:21 +08:00
Woody	64a7a8a46b	chore: add pnpm lockfiles, Phase 4 plan, and dev plan status update	2026-05-14 20:26:17 +08:00
Woody	5832a854c5	chore: remove Phase 3 plan file after revert	2026-05-09 21:14:20 +08:00
Woody	b05c361fbd	revert: remove Phase 3 YouTube proxy — all 7 sub-phases Reverts commits `284028b` through `b4096d6`. Phase 4 (System Audio Capture) will replace the YouTube use case with a more versatile getDisplayMedia approach. Removed: YouTube router, HLS proxy, YouTubeService, YouTubeInput, YouTubeVideoPlayer, useYouTubeASR hook, all Phase 3 tests, hls.js dep, YouTube config fields, YouTube README/plan sections. Modified files restored to pre-Phase-3 state: LTTPage (no source toggle), api.ts (no YouTube extract), types (no YouTube types), config.py (no youtube fields), main.py (no YouTube router), requirements.txt (no yt-dlp), .env.example (no YouTube vars), package.json (no hls.js). Relevant Phase 2 code preserved: ws_asr.py (unchanged), useVideoASR, VideoPlayer, VideoUpload, QueryInput, Full Transcript.	2026-05-09 21:07:21 +08:00
Woody	b4096d6afc	feat: Phase 3.7 — Polish, PO token handling, docs, deployment verification - PO token handling: _is_po_token_error() detects YouTube bot-detection errors, invalidates cache on detection, logs warning for retry guidance (2 new tests) - README: YouTube Live Stream Proxy section with architecture, usage, config, limits - development_plan: Phase 3 complete, timeline updated, status → Phase 1-3 Complete - Dockerfile/compose: verified OK (ffmpeg + yt-dlp already present, no new volumes) - npm build: 1403 modules, production build clean - 59/59 backend + 44/44 frontend Phase 2+3 tests pass - Plan: 3.7 Complete, 7/7 sub-phases done	2026-05-09 17:27:54 +08:00
Woody	cee859d5d7	feat: Phase 3.6 — integration + acceptance tests for YouTube proxy - test_integration_phase3.py: 6 tests Extract→proxy flow (VOD manifest, VOD segment, live manifest), cache hit bypasses yt-dlp, upstream 404→502, extract disabled→503 Mocked yt-dlp, real FastAPI TestClient + HLSProxyService - test_acceptance_phase3_youtube.py: 3 tests Real YouTube VOD extraction, manifest proxy, segment proxy Follows master→variant→segment chain, verifies MPEG-TS sync byte - test_acceptance_phase3_live.py: 3 tests Real live stream extraction, no #EXT-X-ENDLIST assertion, cache refresh verification, graceful skip when offline - 201/201 CI pass (234 backend Phase 1-3, zero Phase 3 regressions) - Updated plan: 3.6 Complete, 6/7 sub-phases done	2026-05-09 17:18:55 +08:00
Woody	1699a249b0	feat: Phase 3.5 — YouTube → ASR integration with source toggle - useYouTubeASR.ts: adapted from useVideoASR, captures audio from HTMLAudioElement (hls.js → <audio> → AudioContext.createMediaElementSource → ScriptProcessorNode → WebSocket) Play/pause events on videoElement; same return shape as useVideoASR - LTTPage.tsx: Source toggle (Upload/YouTube tabs), YouTubeInput + YouTubeVideoPlayer wired with handleExtractSuccess → handleAudioReady → useYouTubeASR Full Transcript button hidden for YouTube source; unified asr variable - QueryInput.tsx: no changes needed (already supports partialText/value from any source) - Tests: 18 new (11 useYouTubeASR, 7 LTTPage integration) - 189/189 total pass (zero regressions) - Updated plan: 3.5 marked Complete, 5/7 sub-phases done	2026-05-09 17:00:32 +08:00
Woody	a8eea54c0f	feat: Phase 3.4 — YouTube Input + Video Player frontend components - YouTubeInput.tsx: URL input with validation (youtube.com/watch, youtu.be, /live/, /shorts/), loading/error states, Load Stream button, uses useYouTubeExtract mutation - YouTubeVideoPlayer.tsx: dual hls.js (video + hidden audio), forwardRef, thumbnail placeholder until play, LIVE badge, quality capped ≤480p, onAudioReady callback for ASR hook exposure, dynamic import('hls.js') - Types: YouTubeFormat, YouTubeStreamResponse interfaces - API: extractYouTubeStream() — POST /youtube/extract - Query: useYouTubeExtract() TanStack Query mutation hook - Tests: 16 new (7 YouTubeInput, 9 YouTubeVideoPlayer) - 171/171 total pass (zero regressions) - Updated plan: 3.4 marked Complete, 4/7 sub-phases done	2026-05-09 16:43:42 +08:00
Woody	284028bb1f	feat: Phase 3.1 + 3.2 — YouTube config infra and URL extraction Phase 3.1 — Configuration & Infrastructure: - Add youtube_proxy_enabled, yt_dlp_timeout, yt_dlp_cache_ttl config fields - Add yt-dlp and hls.js dependencies - Create models/youtube.py (request/response schemas) - Create service stubs (youtube_service, hls_proxy) - Create router stub and register in main.py - 11 config tests Phase 3.2 — YouTube URL Extraction: - yt-dlp wrapper with async extraction (run_in_executor) - Format selection: ≤480p video-only + highest-bitrate audio (VOD) - Combined format fallback: same URL for live streams - In-memory URL cache: 5min TTL live, 30min VOD - lru_cache singleton service for cache persistence - Error handling: DownloadError → 200 with error field - 18 extract tests, 82/82 total pass (zero regressions) Real-URL verified: VOD (5bF3tkO5jAA) 24 formats, Live (fN9uYWCjQaw) 6 HLS	2026-05-09 15:53:04 +08:00
Woody	09b5ea7d64	refactor: remove dead _merge_stash, add Phase 3 YouTube proxy plan - Remove _merge_stash (dead code since delta-based ASR refactor) - Replace TestMergeStash with TestTextFieldFormatting (53/53 Phase 2 tests pass) - Mark phase2_enhancement_use_text_field as Complete - Add Phase 3 YouTube live stream proxy implementation plan - README updates	2026-05-09 15:14:01 +08:00
Woody	78d1f8cc91	feat: delta-based ASR transcript — use text field, utterance boundaries, stash on pause Replace full_text responses with character-level deltas computed from DashScope's monotonically-growing 'text' field. Stash-only events (empty text) are skipped; trailing stash chars sent alongside deltas and appended on pause to complete final sentences. Backend: - Delta = text[len(prev_text):] — simple suffix diff, no merge logic - Track item_id for utterance boundaries, prepend space separator - Send stash alongside delta for frontend pause handler Frontend: - Accumulate deltas locally (transcriptRef += msg.delta) - Store lastStashRef from each message - On pause: append stash to text, fire onFinalTranscript Plan: .plans/phase2_enhancement_delta_sse.md updated to Complete	2026-05-07 11:26:19 +08:00
Woody	cb0ac07786	fix: text accumulation — stashes are sliding windows, merge via overlap detection DashScope stashes are ~7-char rolling windows, not cumulative. Each partial event replaces the previous. Completed events rarely sent. This caused text to jump/replace during streaming and disappear on pause. Backend: - Add _merge_stash() — finds overlapping suffix between successive stashes and appends only new characters, reconstructing full utterance from partials - format_transcription_event returns raw stash for read_events to merge - read_events maintains partial_buffer via _merge_stash, clears on completed - Guard against empty/whitespace-only stashes Frontend: - transcriptRef + onFinalTranscriptRef avoid stale closures in pause handler - stopStreaming fires onFinalTranscript(currentText) before clearing partial - Removed blind setPartialTranscript('') that erased text on pause Tests: 16/16 ws_protocol tests pass, frontend tests unchanged Plan: Updated phase2_implementation_plan.md to Complete with 11-bug log	2026-05-06 20:06:39 +08:00
Woody	9934749d2b	feat: Phase 2.1 config + infrastructure and 2.2 video upload backend - Add DashScope ASR and video upload config fields to Settings - Create Pydantic models (video.py, asr.py) - Create VideoService with validation, save, serve, delete - Create ASR client stub with float32_to_s16le utility - Implement POST /api/v1/video/upload with streaming validation - Implement GET /api/v1/video/{video_id} with FileResponse - Create WebSocket ASR endpoint stub - Register new routers in main.py - Update .env.example and requirements.txt - Add reference examples for DashScope integration - 8 tests passing (3 config + 5 video upload)	2026-05-06 13:08:19 +08:00
Woody	63e4c1a385	docs: add plan for configurable SubQuestions format	2026-05-04 17:22:38 +08:00
Woody	b6562f3d76	docs: add Package 6 enhancement plan Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-04 14:58:24 +08:00
Woody	2aca18d30e	docs: add vLLM structured output fix plan - Diagnose: vLLM ignores OpenAI-native response_format, causing NoneType error - Diagnose: legacy fallback prompt lacks JSON instruction → empty questions - Plan: use vLLM-native guided_json via extra_body instead of with_structured_output - Plan: update _SEED_DECOMPOSE with JSON format instruction - Plan: add diagnostic logging (exc_info, method, schema preview) wip: temporary function_calling switch for vLLM (to be replaced by guided_json)	2026-04-29 16:42:23 +08:00
Woody	41f59b396f	feat: track highlight generation prompt, response, and timing in history (Phase 5.5) - Add 3 columns to query_history: highlight_prompt, highlight_response, highlight_time_ms - HistoryService.update_highlights() updates existing row after batch LLM call - ChunkHighlightService measures timing, captures prompt and structured JSON response - SSE completed event includes history_id for frontend to pass back - Frontend captures historyId, passes as ?history_id= query param in batch POST - Highlight time tracked separately (excluded from total_time_ms) - All 153 tests pass (108 backend + 45 frontend)	2026-04-29 11:18:21 +08:00
Woody	36dedab485	docs: finalize Phase 5 enhancement plan with completion status - Mark Phase 5.4 complete with actual commit log - Add Phase 5.4 completion checklist (15 items all checked) - Add production notes (Vite proxy, port conflicts, cache location) - Update test counts to current (108 backend, 45 frontend, 153 total) - Update Decision #12 to reflect inline citation link upgrade	2026-04-29 10:54:18 +08:00
Woody	c632b9ea3b	feat: cited source extraction, background batch trigger, and View PDF link upgrade (Phase 5.4.6-5.4.8) - citationParser.ts: extractCitedSources() parses answer text for [citations], resolves against SourceMetadata, returns deduplicated cited sources - ResponsePanel.tsx: useEffect fires POST /api/v1/v2/highlights/batch after answer renders; View PDF link upgrades in-place to highlighted HTML when batch completes; stays as raw PDF on failure - Updated plan: LLM-based relevance detection, eager background computation, single batched LLM call, sqlite cache, regex sentence splitter - 45 frontend tests: 28 citationParser + 17 ResponsePanel (including 4 new sub-question highlight tests)	2026-04-29 09:27:04 +08:00
Woody	ec3b5a4ae1	docs: mark Phase 5.3 complete in enhancement plan Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 17:33:00 +08:00
Woody	091fa84443	docs: update Phase 5 plan with deferred/planned sub-phases Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 16:43:38 +08:00
Woody	f2115ae563	feat: structured LLM output for decompose + citation fuzzy matching (Phase 5) Phase 5.1 — Structured LLM output for query decomposition: - Add SubQuestions Pydantic model with sub_question, keywords, rationale - Add LLMClient.complete_structured() using langchain with_structured_output - Update QueryDecomposer with structured output path + legacy json.loads fallback - Update SQLite seed templates: add subq+citation labeling requirement - Add tests: structured output, subquestions model validation, logging Phase 5.2 — Citation format alignment and fallback links: - Add document_id to SourceMetadata (backend + frontend types) - Rewrite citationParser.ts with fuzzy matching and fallback document links - Add RAGDatabasePage auto-expand from ?document= URL param - Tighten generate_per_subq seed prompt: 'Copy exact bracket labels shown' - Add citation parser tests for fuzzy match and fallback link scenarios - Defer: DOCX/TXT PDF generation → Phase 5.3 (fallback links sufficient)	2026-04-28 15:39:17 +08:00
Woody	bb6b159315	docs(plan): add Phase PX profile export/import feature plan	2026-04-27 19:26:33 +08:00
Woody	3b868a0133	feat(prompts): integrate filter_per_subq with PromptService, fix seed bugs, restructure UI Break the hardcoded per-sub-q filter prompt into 3 editable PromptService templates (filter_intro, filter_section, filter_outro) with placeholders for the for-loop iteration pattern. Refactor RelevanceFilter._build_per_subq_prompt() to compose them at runtime, falling back to built-in defaults when PromptService is unavailable. Fix two latent bugs from Package 4: - generate_per_subq was called by rag.py but never added to _VALID_STEPS or DB seed (would ValueError at runtime) - _SEED_GENERATE placeholder mismatch: flat generate_response() expects {question}/{context} but Package 4 changed it to {context_sections}. Restored flat template; generate_per_subq now holds {context_sections}. Add database backfill migration in seed_default_profiles() to INSERT OR IGNORE missing steps into existing profile rows, ensuring all 7 steps exist on restart. Restructure System Prompts UI: remove unused flat filter/generate steps, replace with Step 2.1-2.3 (filter_intro/section/outro) and Step 3 (generate_per_subq). Update PlaceholderDocs with {context_sections}, {subq_idx}, {subq_question}. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-27 11:14:27 +08:00
Woody	d509c14b80	docs(plan): add Package 4 per-sub-question enhancement plan Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-26 23:27:36 +08:00
Woody	d7cf785452	feat(frontend): Phase 3.6 — History page with timing bars, expandable cards, and pagination	2026-04-26 13:19:52 +08:00
Woody	475306f2b1	feat(history): Phase 3.5 — Query History backend (service, API, timing, XML capture)	2026-04-25 22:59:53 +08:00
Woody	8e6597a86e	feat(frontend): Phase 3.3 — System Prompt Configuration page - SystemPromptsPage: profile selector, activation, edit with TanStack Query - ProfileList: 3 profile cards (A/B/C) with active indicator + edit button - PromptEditor: 3 monospace textareas, placeholder badges, char count, unknown placeholder warnings, per-step reset (↺), action bar - PlaceholderDocs: info box showing {question}/{chunks}/{context} - Data layer: +7 types, +6 API functions, +6 TanStack Query hooks - Routing: /system-prompts route + NavBar link - Tests: 27 tests (PlaceholderDocs 6, ProfileList 7, PromptEditor 14) - 0TS errors, 27/27 tests pass, 1 pre-existing e2e failure (unrelated)	2026-04-25 21:26:42 +08:00
Woody	e49a68b0bd	feat(prompts): Phase 3.2 — Prompt Backend (CRUD service, REST API, 33 tests) - PromptService (services/prompt_service.py): full CRUD for 3 profiles A/B/C with seed template reset, validation, and sqlite3.Row access - REST API (routers/prompts.py): 6 endpoints on /api/v1/prompts - Pydantic models (models/prompts.py): 6 schemas - DI wiring (dependencies.py): get_prompt_service() - App registration (main.py): prompts router - Mock fixture (conftest.py): mock_prompt_service - Tests: test_phase3_prompt_service.py (22) + test_phase3_prompts_router.py (11) - 162/166 total pass, 4 skipped, 0 fail	2026-04-25 21:11:17 +08:00
Woody	3b741c1844	feat(query): stream extracted questions immediately via SSE Convert /query endpoint from synchronous JSON to Server-Sent Events (SSE) streaming. The frontend now receives extracted_questions as soon as the first LLM call completes, without waiting for retrieval, filtering, and answer generation. Backend: - Add StreamingQueryEvent union type (Decomposed, Retrieving, Filtering, Generating, Completed, Error) - Convert /query to return StreamingResponse with SSE format - Yield events after each pipeline phase Frontend: - Add queryDocumentStream() using fetch + ReadableStream - Add useQueryDocumentStream() hook with phase-aware state - Update LTTPage to use streaming instead of mutation - Update ResponsePanel to show phase messages (Searching documents..., Filtering passages..., Generating answer...) - Update ExtractedQuestionsDisplay to accept null Tests: - Update query_flow e2e test to mock queryDocumentStream - 84/85 tests pass (1 pre-existing failure from removed file-input)	2026-04-25 18:29:22 +08:00
Woody	5ff4eaa104	docs: mark sub-phase 2.6 complete — Package 2 all done All 6 UX enhancements implemented and tested. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 17:53:56 +08:00
Woody	06f016c83d	docs: update enhancement plan with sub-phase 2.5 completion Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 17:10:03 +08:00
Woody	c518955d31	docs: update enhancement plan with sub-phase 2.4 completion Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 16:52:40 +08:00
Woody	f62dcad630	docs: update enhancement plan with sub-phase 2.3 completion Mark sub-phase 2.3 (Extracted Questions) as done. Status: 2.1 ✅, 2.2 ✅, 2.3 ✅. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 16:25:02 +08:00
Woody	ecaa9ebb26	docs: update enhancement plan with sub-phase 2.1 and 2.2 completion Mark sub-phases 2.1 (Remove Upload) and 2.2 (Question Display) as done. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 15:56:50 +08:00
Woody	d49756f374	feat: add chunk PDF serving endpoint and frontend clickable source links (1.5.6) - Add page_number and chunk_file_path to SourceMetadata model and query router - Add GET /chunks/{file_path}/pdf endpoint with path traversal protection - Add View PDF links in ResponsePanel source cards and ChunkList component - Update TypeScript types and API helper for chunk PDF URLs - Add backend tests (5) and frontend ChunkList tests (7) - Update enhancement plan: all 3 features complete	2026-04-24 11:49:39 +08:00
Woody	64043b75a7	docs: update enhancement plan with sub-phase 1.5.5 completion Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 10:53:53 +08:00
Woody	20b2f2c267	docs: update enhancement plan with sub-phase 1.5.4 completion status Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 10:30:55 +08:00
Woody	c10318b7f7	feat(frontend): add RAG Database management page with document CRUD UI Sub-phase 1.5.3: Full RAG Database page with document listing, expandable chunk viewer, delete with confirmation, and document upload. Adds TypeScript types, API functions, TanStack Query hooks (useQuery + useMutation with cache invalidation), and three new components (DocumentList, ChunkList, DocumentUpload). Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-24 09:41:56 +08:00
Woody	9a7329c5f8	docs: update enhancement plan with sub-phase 1.5.2 completion status Mark sub-phase 1.5.2 (backend CRUD) as complete. Update acceptance criteria, risk mitigations, and test plan. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 19:05:01 +08:00
Woody	52c09b86cb	feat(frontend): add nav bar with routing, markdown rendering, and enhancement plan - Add react-router-dom with NavBar component (LTT + RAG Database tabs) - Extract AppContent into LTTPage, add RAGDatabasePage placeholder - Refactor App.tsx to BrowserRouter + Routes layout - Switch ResponsePanel to react-markdown for rich formatting - Fix ResponsePanel test for markdown rendering - Update RAG prompt to cite source name instead of number - Save Phase 1 enhancement plan (.plans/phase1_enhancement_plan.md)	2026-04-23 18:37:30 +08:00
Woody	12f4705b72	docs: update development plans with Phase 1 completion status Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 13:27:52 +08:00
Woody	95ef502bfa	docs: add debug logs for upload 500 error investigation - Document root causes: missing python-docx, ChromaDB version mismatch, NumPy 2.x incompatibility - Document fixes: venv setup, embedding wrapper rewrite, DB clear Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-23 13:25:23 +08:00
Woody	7493b3aaf6	feat: Phase 1.4 acceptance tests, error handling, and polish - Implement acceptance tests for ingest (real ChromaDB) and query (real LLM) - Full 3-step RAG pipeline verified: decompose → retrieve → filter → generate - Add logging to ingest and query routers - Improve error handling: empty doc detection, proper HTTPException re-raising - Add .txt file support to ingest endpoint - Fix query router: strip distance from retrieve tuples before relevance filter - Update plan: Phase 1 backend complete (all acceptance criteria met) - Tests: 41 unit passed, 5 acceptance passed (real OpenRouter calls)	2026-04-22 17:45:50 +08:00
Woody	4d346dc1c6	docs: Update Phase 1 backend plan with completion status - Mark Phase 1.1 and 1.2 as complete with test results - Update acceptance criteria checklist - Add Services Status table showing implemented/pending components - Mark Phase 1.3 tasks that are already done (LLM client, retrieval, response gen)	2026-04-22 16:55:17 +08:00
Woody	abffc9cf1d	docs: add PDF support alongside DOCX in all plans	2026-04-22 15:59:55 +08:00
Woody	eeb464528a	docs: add test-first and Phase X.Y sub-phase naming to AGENTS.md and plans	2026-04-22 15:54:34 +08:00
Woody	1f4e3a2572	docs: add Phase 1 backend and frontend development plans	2026-04-22 15:47:27 +08:00

48 Commits