Intelligent Mixture-of-Models Router for Efficient LLM Inference
python kubernetes rust golang mcp fine-tuning envoyproxy pii-detection mixture-of-models huggingface-transformers bert-classification prompt-engineering vllm huggingface-candle ai-gateway semantic-router llm-tool-call prompt-guard mcp-server envoy-ext-proc
Updated Oct 17, 2025 - Go