litellm/litellm/proxy
2026-06-06 20:59:33 -07:00
..
_experimental feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
a2a feat(a2a): well-known agent-card discovery + LangGraph Platform mode (#28860) 2026-05-29 20:50:42 -07:00
agent_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
analytics_endpoints
anthropic_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
auth feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
batches_endpoints Title: Fix managed batch cancel credential resolution (#29734) 2026-06-06 12:35:18 -07:00
client feat(proxy): persist allowlisted OIDC claims in CLI SSO poll (#28463) 2026-05-22 09:58:50 -07:00
common_utils feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
config_management_endpoints build(deps-dev): bump black to 26.3.1 and apply formatting (#28525) 2026-05-21 17:24:18 -07:00
container_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
credential_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
custom_hooks
db feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
discovery_endpoints
example_config_yaml Extend the record/replay proxy to chat, embeddings, moderations, rerank, and Anthropic (#29847) 2026-06-06 14:33:42 -07:00
fine_tuning_endpoints
google_endpoints fix(proxy): omit OpenAI [DONE] on google-genai streamGenerateContent (#29426) 2026-06-01 14:38:19 -07:00
guardrails feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
health_check_utils feat(proxy): skip disable_background_health_check models on GET /health when flag set (#27716) 2026-05-13 09:49:05 -07:00
health_endpoints feat(proxy): native /health/drain preStop hook for graceful shutdown (#29439) 2026-06-02 16:30:44 -07:00
hooks feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
image_endpoints fix: tighten file input handling in image edit endpoints 2026-04-22 18:04:39 -07:00
management_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
management_helpers feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
memory feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
middleware fix(proxy): point /metrics 401 at the opt-out flag 2026-05-08 18:09:14 -07:00
ocr_endpoints Litellm oss staging 04 21 2026 2 (#26569) 2026-05-20 21:25:19 -07:00
openai_evals_endpoints
openai_files_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
pass_through_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
policy_engine feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
prompts feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
public_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
rag_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
realtime_endpoints
rerank_endpoints
response_api_endpoints Litellm websocket improvements (#29563) 2026-06-03 11:48:35 -07:00
response_polling
search_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
shutdown feat(proxy): native /health/drain preStop hook for graceful shutdown (#29439) 2026-06-02 16:30:44 -07:00
spend_tracking feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
swagger
test_prompts
types_utils fix(types_utils): drop opt-in env from remote-module runtime gate 2026-05-13 02:52:14 +00:00
ui_crud_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
vector_store_endpoints feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
vector_store_files_endpoints Title: fix(proxy): resolve vector store file list credentials from team deployments (#29739) 2026-06-06 12:36:05 -07:00
vertex_ai_endpoints build(deps-dev): bump black to 26.3.1 and apply formatting (#28525) 2026-05-21 17:24:18 -07:00
video_endpoints
workflows feat(proxy): durable agent workflow run tracking via /v1/workflows/runs (#26793) 2026-04-29 17:12:18 -07:00
__init__.py
_lazy_features.py feat(a2a): well-known agent-card discovery + LangGraph Platform mode (#28860) 2026-05-29 20:50:42 -07:00
_lazy_openapi_snapshot.json Litellm oss staging 1 (#28337) 2026-05-20 17:27:03 -07:00
_lazy_openapi_snapshot.py [Fix] Proxy: Repair Stale HTTP_METHODS Reference In Lazy OpenAPI Snapshot 2026-05-01 15:11:56 -07:00
_logging.py
_new_new_secret_config.yaml
_new_secret_config.yaml Litellm krrish staging 04 20 2026 (#26138) 2026-04-20 16:22:12 -07:00
_super_secret_config.yaml
_types.py feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
.gitignore
cached_logo.jpg
caching_routes.py Litellm oss staging 040626 (#29671) 2026-06-04 11:07:20 -07:00
common_request_processing.py fix(proxy): disable proxy buffering on streaming SSE responses (#29557) 2026-06-04 17:23:14 +05:30
compliance_checks.py
custom_auth_auto.py
custom_prompt_management.py
custom_sso.py
custom_validate.py
dd_span_tagger.py
enterprise
health_check.py Litellm agent oss staging 05 11 2026 (#27733) 2026-05-13 14:09:12 -07:00
lambda.py
litellm_pre_call_utils.py fix(proxy): enforce tag budgets for key-level tags (#29108) 2026-05-28 11:39:02 -07:00
llamaguard_prompt.txt
logo.jpg
mcp_registry.json
mcp_tools.py
model_config.yaml
openapi_registry.json
openapi.json
post_call_rules.py
prisma_migration.py
prometheus_cleanup.py
proxy_cli.py feat(proxy): hot-reload .env in dev when running with --reload (#29783) 2026-06-06 09:39:21 -07:00
proxy_config.yaml
proxy_server.py feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00
README.md
route_llm_request.py Gemini managed agents support (#28270) 2026-05-19 16:02:03 -07:00
schema.prisma feat(mcp): per-server env vars with global + per-user scopes (#28917) 2026-06-05 20:15:11 -07:00
start.sh
utils.py feat(litellm): add models and repository layers (#29686) 2026-06-06 20:59:33 -07:00

litellm-proxy

A local, fast, and lightweight OpenAI-compatible server to call 100+ LLM APIs.

usage

$ uv tool install litellm
$ litellm --model ollama/codellama 

#INFO: Ollama running on http://0.0.0.0:8000

replace openai base

import openai # openai v1.0.0+
client = openai.OpenAI(api_key="anything",base_url="http://0.0.0.0:8000") # set proxy to base_url
# request sent to model set on litellm proxy, `litellm --model`
response = client.chat.completions.create(model="gpt-3.5-turbo", messages = [
    {
        "role": "user",
        "content": "this is a test request, write a short poem"
    }
])

print(response)

See how to call Huggingface,Bedrock,TogetherAI,Anthropic, etc.


Folder Structure

Routes

  • proxy_server.py - all openai-compatible routes - /v1/chat/completion, /v1/embedding + model info routes - /v1/models, /v1/model/info, /v1/model_group_info routes.
  • health_endpoints/ - /health, /health/liveliness, /health/readiness
  • management_endpoints/key_management_endpoints.py - all /key/* routes
  • management_endpoints/team_endpoints.py - all /team/* routes
  • management_endpoints/internal_user_endpoints.py - all /user/* routes
  • management_endpoints/ui_sso.py - all /sso/* routes