Woody
|
d94abaac77
|
feat: Phase 1.2 ingestion pipeline with chunking and metadata
- Add document parsers (DOCX, PDF) with lazy imports
- Add TokenChunkingStrategy with ABC for future replacement
- Add metadata extraction (filename, upload_date, content_summary)
- Add RAGService for ChromaDB ingestion/retrieval/response generation
- Add POST /api/v1/ingest endpoint with file validation
- Test-first: 20 passed, 2 skipped (python-docx not installed)
|
2026-04-22 16:49:52 +08:00 |
Woody
|
3712397d64
|
feat: Phase 1.1 project setup with config, database, and models
- Add requirements.txt with all dependencies
- Add .env.example with required environment variables
- Add Pydantic Settings (config.py) with .env loading
- Add ChromaDB persistent client (database.py)
- Add Pydantic schemas (ingest.py) for request/response
- Add FastAPI main.py with CORS middleware
- Add package __init__.py files
- Add tests: test_phase1_config.py, test_phase1_database.py
- All 5 tests pass
|
2026-04-22 16:13:52 +08:00 |
Woody
|
1518b72969
|
chore: add .gitignore with Python, Node, env, and ChromaDB exclusions
|
2026-04-22 15:57:04 +08:00 |
Woody
|
be48b1d8c7
|
docs: add sub-phase development rules and acceptance test structure
|
2026-04-22 15:27:31 +08:00 |
Woody
|
3c2d647943
|
init: project setup with AGENTS.md, test structure, and plan directory
|
2026-04-22 15:22:29 +08:00 |