legco_ai_assistant

History

Woody f4fa577fb0 feat(backend): add page-aware PDF parsing with per-page text extraction Add parse_pdf_by_page() that returns List[Tuple[int, str]] with 1-indexed page numbers. Pages with no extractable text are skipped. Follows same error handling as existing parse_pdf(). Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>		2026-04-24 10:30:04 +08:00
..
core	fix(backend): add embed_query method to EmbeddingFunctionWrapper for ChromaDB query	2026-04-24 10:15:08 +08:00
models	feat(backend): add documents CRUD service methods and Pydantic schemas	2026-04-23 19:02:07 +08:00
routers	fix(backend): preserve original filename in chunk metadata instead of temp file name	2026-04-24 10:14:58 +08:00
services	feat(backend): add documents CRUD service methods and Pydantic schemas	2026-04-23 19:02:07 +08:00
test	feat(backend): add page-aware PDF parsing with per-page text extraction	2026-04-24 10:30:04 +08:00
utils	feat(backend): add page-aware PDF parsing with per-page text extraction	2026-04-24 10:30:04 +08:00
__init__.py	feat: Phase 1.1 project setup with config, database, and models	2026-04-22 16:13:52 +08:00
main.py	feat(backend): add documents CRUD endpoints and tests	2026-04-23 19:02:28 +08:00