Go to file
Woody f4fa577fb0 feat(backend): add page-aware PDF parsing with per-page text extraction
Add parse_pdf_by_page() that returns List[Tuple[int, str]] with 1-indexed page numbers. Pages with no extractable text are skipped. Follows same error handling as existing parse_pdf().

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent)

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>
2026-04-24 10:30:04 +08:00
.plans feat(frontend): add RAG Database management page with document CRUD UI 2026-04-24 09:41:56 +08:00
backend feat(backend): add page-aware PDF parsing with per-page text extraction 2026-04-24 10:30:04 +08:00
frontend feat(frontend): add RAG Database management page with document CRUD UI 2026-04-24 09:41:56 +08:00
test materials test: add sample documents for manual testing 2026-04-23 13:28:13 +08:00
.env.txt init: project setup with AGENTS.md, test structure, and plan directory 2026-04-22 15:22:29 +08:00
.gitignore feat(backend): add rotating file logging to backend/app/log/ 2026-04-23 14:09:48 +08:00
AGENTS.md docs: add logging anti-patterns to AGENTS.md 2026-04-23 14:10:09 +08:00
development_plan.md docs: update development plans with Phase 1 completion status 2026-04-23 13:27:52 +08:00