docs: add LLM_ENABLE_THINKING and VLLM_ENGINE to env example and README

This commit is contained in:
Woody 2026-04-28 13:32:41 +08:00
parent 711be3dfde
commit 06ec37492c
2 changed files with 14 additions and 1 deletions

View File

@ -59,11 +59,20 @@ All configurable via `backend/.env`:
| `LLM_BASE_URL` | `https://openrouter.ai/api/v1` | LLM API endpoint | | `LLM_BASE_URL` | `https://openrouter.ai/api/v1` | LLM API endpoint |
| `LLM_API_KEY` | — | API key for LLM provider | | `LLM_API_KEY` | — | API key for LLM provider |
| `LLM_MODEL_NAME` | `qwen/qwen3.5-35b-a3b` | Chat model | | `LLM_MODEL_NAME` | `qwen/qwen3.5-35b-a3b` | Chat model |
| `LLM_TIMEOUT` | `60.0` | LLM request timeout in seconds |
| `LLM_ENABLE_THINKING` | `false` | Enable LLM thinking/reasoning tokens |
| `VLLM_ENGINE` | `false` | Use vLLM-format `extra_body` instead of OpenRouter |
| `EMBEDDING_MODEL` | `qwen/qwen3-embedding-4b` | Embedding model | | `EMBEDDING_MODEL` | `qwen/qwen3-embedding-4b` | Embedding model |
| `EMBEDDING_BASE_URL` | `https://openrouter.ai/api/v1` | Embedding API endpoint |
| `EMBEDDING_API_KEY` | — | API key for embeddings (falls back to `LLM_API_KEY`) | | `EMBEDDING_API_KEY` | — | API key for embeddings (falls back to `LLM_API_KEY`) |
| `CHROMA_DB_PATH` | `./chroma_db` | ChromaDB persistent storage |
| `CHUNK_SIZE` | `1000` | Token chunk size |
| `CHUNK_OVERLAP` | `200` | Token chunk overlap |
| `RETRIEVAL_N_RESULTS` | `10` | Chunks per sub-question | | `RETRIEVAL_N_RESULTS` | `10` | Chunks per sub-question |
| `RELEVANCE_THRESHOLD` | `7.0` | Min relevance score (0-10) | | `RELEVANCE_THRESHOLD` | `7.0` | Min relevance score (0-10) |
| `LLM_TIMEOUT` | `60.0` | LLM request timeout in seconds | | `PROMPTS_DB_PATH` | `./data/prompts.db` | Prompt templates SQLite |
| `HISTORY_DB_PATH` | `./data/history.db` | Query history SQLite |
| `CORS_ORIGINS` | `["http://localhost:5173","http://localhost:3000"]` | Allowed CORS origins |
### Production: Nginx Reverse Proxy ### Production: Nginx Reverse Proxy
@ -156,6 +165,8 @@ docker run -d --name legco_test -p 8888:8000 \
-e LLM_API_KEY=your_key_here \ -e LLM_API_KEY=your_key_here \
-e LLM_MODEL_NAME=qwen/qwen3.6-35b-a3b \ -e LLM_MODEL_NAME=qwen/qwen3.6-35b-a3b \
-e LLM_TIMEOUT=60.0 \ -e LLM_TIMEOUT=60.0 \
-e LLM_ENABLE_THINKING=false \
-e VLLM_ENGINE=false \
-e EMBEDDING_MODEL=qwen/qwen3-embedding-4b \ -e EMBEDDING_MODEL=qwen/qwen3-embedding-4b \
-e EMBEDDING_BASE_URL=https://openrouter.ai/api/v1 \ -e EMBEDDING_BASE_URL=https://openrouter.ai/api/v1 \
-e EMBEDDING_API_KEY=your_key_here \ -e EMBEDDING_API_KEY=your_key_here \

View File

@ -2,6 +2,8 @@ LLM_BASE_URL=https://openrouter.ai/api/v1
LLM_API_KEY=your_openrouter_key_here LLM_API_KEY=your_openrouter_key_here
LLM_MODEL_NAME=qwen/qwen3.5-35b-a3b LLM_MODEL_NAME=qwen/qwen3.5-35b-a3b
LLM_TIMEOUT=60.0 LLM_TIMEOUT=60.0
LLM_ENABLE_THINKING=false
VLLM_ENGINE=false
EMBEDDING_MODEL=qwen/qwen3-embedding-4b EMBEDDING_MODEL=qwen/qwen3-embedding-4b
EMBEDDING_BASE_URL=https://openrouter.ai/api/v1 EMBEDDING_BASE_URL=https://openrouter.ai/api/v1