feat: add @tag model override support and remove Qdrant dependencies
- Add model-tags parser for @Tag syntax in chat messages - Support Anthropic models (Sonnet, Haiku, Opus) via @tag - Remove Qdrant vector database from infrastructure and configs - Simplify license model config to use null fallbacks - Add greeting stream after model switch via @tag - Fix protobuf field names to camelCase for v7 compatibility - Add 429 rate limit retry logic with exponential backoff - Remove RAG references from agent harness documentation
This commit is contained in:
@@ -10,7 +10,7 @@ The platform runs across two namespaces:
|
||||
|
||||
| Namespace | Contents |
|
||||
|-----------|----------|
|
||||
| `ai` | Gateway, web UI, all infrastructure services (postgres, minio, kafka, flink, relay, ingestor, qdrant, dragonfly, iceberg-catalog) |
|
||||
| `ai` | Gateway, web UI, all infrastructure services (postgres, minio, kafka, flink, relay, ingestor, dragonfly, iceberg-catalog) |
|
||||
| `sandbox` | Per-user sandbox containers (created dynamically by the gateway) |
|
||||
|
||||
Secrets are managed via 1Password CLI (`op inject`). All `.tpl.yaml` files in `deploy/k8s/prod/secrets/` contain `op://` references and are safe to commit; actual values are never stored in git.
|
||||
@@ -217,7 +217,7 @@ kubectl --context=prod -n ai get configmaps
|
||||
|
||||
## Step 7 — Deploy Infrastructure
|
||||
|
||||
Infrastructure services (postgres, minio, kafka, iceberg-catalog, dragonfly, qdrant, relay, ingestor, flink) are defined in `deploy/k8s/prod/infrastructure.yaml` and were applied in Step 4.
|
||||
Infrastructure services (postgres, minio, kafka, iceberg-catalog, dragonfly, relay, ingestor, flink) are defined in `deploy/k8s/prod/infrastructure.yaml` and were applied in Step 4.
|
||||
|
||||
Wait for the StatefulSets and Deployments to become ready:
|
||||
|
||||
@@ -225,7 +225,6 @@ Wait for the StatefulSets and Deployments to become ready:
|
||||
kubectl --context=prod -n ai rollout status statefulset/postgres
|
||||
kubectl --context=prod -n ai rollout status statefulset/minio
|
||||
kubectl --context=prod -n ai rollout status statefulset/kafka
|
||||
kubectl --context=prod -n ai rollout status statefulset/qdrant
|
||||
kubectl --context=prod -n ai rollout status deployment/dragonfly
|
||||
kubectl --context=prod -n ai rollout status deployment/iceberg-catalog
|
||||
kubectl --context=prod -n ai rollout status deployment/relay
|
||||
|
||||
Reference in New Issue
Block a user