litellm-ai-gateway
Here are 6 public repositories matching this topic...
A comprehensive, scalable ML inference architecture on Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. This Guidance provides a complete end-to-end platform for deploying LLMs with agentic AI capabilities, including RAG and MCP.
Updated Sep 26, 2025 - Python
High-performance LLM gateway built in Go: an OpenAI-compatible proxy with multi-provider support, adaptive routing, and enterprise features.
Updated Sep 14, 2025 - Go
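The value of an OpenAI-compatible proxy like the one above is that clients keep sending the standard Chat Completions request shape while the gateway routes to any provider. A minimal sketch of that request payload, using only the standard library (the endpoint path and model name here are assumptions for illustration, not taken from any listed repository):

```python
import json

# The standard OpenAI-style chat-completions payload that an
# OpenAI-compatible gateway accepts; the gateway maps "model" to a
# configured upstream provider.
payload = {
    "model": "gpt-4o-mini",  # assumed model alias, resolved by the gateway
    "messages": [
        {"role": "user", "content": "Hello"},
    ],
    "temperature": 0.7,
}

# A client would POST this body to the gateway's /v1/chat/completions route.
body = json.dumps(payload)
print(body)
```

Because the wire format is unchanged, existing OpenAI SDK clients only need their base URL pointed at the gateway.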
This repo presents resilience patterns for scaling inference for Generative AI workloads on AWS: Bedrock cross-Region inference, AWS account sharding, and intelligent routing with LLM gateways.
Updated Sep 24, 2025 - Python
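The cross-Region pattern mentioned above boils down to trying an ordered list of Regions and falling back when one throttles. A toy client-side sketch of that failover loop (the Region list, error type, and `fake_call` backend are illustrative assumptions, not the repo's actual code):

```python
# Assumed candidate Regions, in preference order.
REGIONS = ["us-east-1", "us-west-2", "eu-central-1"]

def invoke_with_failover(call, regions=REGIONS):
    """Try each Region in order; return the first successful result."""
    last_err = None
    for region in regions:
        try:
            return call(region)
        except RuntimeError as err:  # e.g. a throttling error in that Region
            last_err = err
    raise last_err  # every Region failed

# Simulated backend: the first Region throttles, the second succeeds.
def fake_call(region):
    if region == "us-east-1":
        raise RuntimeError("ThrottlingException")
    return f"ok from {region}"

print(invoke_with_failover(fake_call))  # → ok from us-west-2
```

A real gateway would layer health checks and weighted routing on top of this loop, but the failure-isolation idea is the same.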
Connect any LLM-powered client app, such as a coding agent, to any supported inference backend/model.
Updated Sep 27, 2025 - Python
LLMCallGateway is a professional LLM API gateway service built on LiteLLM. It unifies the request format of every model (including non-OpenAI models) into the OpenAI format, and provides detailed log tracing and performance monitoring so developers can clearly see the details and cost of each interaction with downstream LLM APIs.
Updated Sep 18, 2025 - Python
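The format unification that LLMCallGateway performs can be pictured as reshaping a provider-specific response into the OpenAI chat-completion structure. An illustrative sketch of that kind of normalization (this is not LLMCallGateway's actual code; the upstream response shape `{"output_text": ..., "usage": ...}` is an assumption):

```python
def to_openai_format(provider_response: dict, model: str) -> dict:
    """Reshape an assumed provider-specific response into the
    OpenAI chat-completion format."""
    return {
        "object": "chat.completion",
        "model": model,
        "choices": [
            {
                "index": 0,
                "message": {
                    "role": "assistant",
                    "content": provider_response["output_text"],
                },
                "finish_reason": "stop",
            }
        ],
        # Usage passthrough lets the gateway track per-call cost.
        "usage": provider_response.get("usage", {}),
    }

raw = {"output_text": "Hi there!", "usage": {"total_tokens": 12}}
print(to_openai_format(raw, "claude-3-haiku"))
```

With every backend normalized to one shape, logging and cost accounting can be implemented once rather than per provider.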