History

Yassin Kortam 5e2db7eee4 feat(litellm): add models and repository layers (#29686 )		2026-06-06 20:59:33 -07:00
..
_experimental	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
a2a	feat(a2a): well-known agent-card discovery + LangGraph Platform mode (#28860 )	2026-05-29 20:50:42 -07:00
agent_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
analytics_endpoints
anthropic_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
auth	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
batches_endpoints	Title: Fix managed batch cancel credential resolution (#29734 )	2026-06-06 12:35:18 -07:00
client	feat(proxy): persist allowlisted OIDC claims in CLI SSO poll (#28463 )	2026-05-22 09:58:50 -07:00
common_utils	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
config_management_endpoints	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
container_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
credential_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
custom_hooks
db	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
discovery_endpoints
example_config_yaml	Extend the record/replay proxy to chat, embeddings, moderations, rerank, and Anthropic (#29847 )	2026-06-06 14:33:42 -07:00
fine_tuning_endpoints
google_endpoints	fix(proxy): omit OpenAI [DONE] on google-genai streamGenerateContent (#29426 )	2026-06-01 14:38:19 -07:00
guardrails	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
health_check_utils	feat(proxy): skip disable_background_health_check models on GET /health when flag set (#27716 )	2026-05-13 09:49:05 -07:00
health_endpoints	feat(proxy): native /health/drain preStop hook for graceful shutdown (#29439 )	2026-06-02 16:30:44 -07:00
hooks	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
image_endpoints	fix: tighten file input handling in image edit endpoints	2026-04-22 18:04:39 -07:00
management_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
management_helpers	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
memory	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
middleware	fix(proxy): point /metrics 401 at the opt-out flag	2026-05-08 18:09:14 -07:00
ocr_endpoints	Litellm oss staging 04 21 2026 2 (#26569 )	2026-05-20 21:25:19 -07:00
openai_evals_endpoints
openai_files_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
pass_through_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
policy_engine	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
prompts	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
public_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
rag_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
realtime_endpoints
rerank_endpoints
response_api_endpoints	Litellm websocket improvements (#29563 )	2026-06-03 11:48:35 -07:00
response_polling
search_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
shutdown	feat(proxy): native /health/drain preStop hook for graceful shutdown (#29439 )	2026-06-02 16:30:44 -07:00
spend_tracking	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
swagger
test_prompts
types_utils	fix(types_utils): drop opt-in env from remote-module runtime gate	2026-05-13 02:52:14 +00:00
ui_crud_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
vector_store_endpoints	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
vector_store_files_endpoints	Title: fix(proxy): resolve vector store file list credentials from team deployments (#29739 )	2026-06-06 12:36:05 -07:00
vertex_ai_endpoints	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
video_endpoints
workflows	feat(proxy): durable agent workflow run tracking via /v1/workflows/runs (#26793 )	2026-04-29 17:12:18 -07:00
__init__.py
_lazy_features.py	feat(a2a): well-known agent-card discovery + LangGraph Platform mode (#28860 )	2026-05-29 20:50:42 -07:00
_lazy_openapi_snapshot.json	Litellm oss staging 1 (#28337 )	2026-05-20 17:27:03 -07:00
_lazy_openapi_snapshot.py	[Fix] Proxy: Repair Stale HTTP_METHODS Reference In Lazy OpenAPI Snapshot	2026-05-01 15:11:56 -07:00
_logging.py
_new_new_secret_config.yaml
_new_secret_config.yaml	Litellm krrish staging 04 20 2026 (#26138 )	2026-04-20 16:22:12 -07:00
_super_secret_config.yaml
_types.py	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
.gitignore
cached_logo.jpg
caching_routes.py	Litellm oss staging 040626 (#29671 )	2026-06-04 11:07:20 -07:00
common_request_processing.py	fix(proxy): disable proxy buffering on streaming SSE responses (#29557 )	2026-06-04 17:23:14 +05:30
compliance_checks.py
custom_auth_auto.py
custom_prompt_management.py
custom_sso.py
custom_validate.py
dd_span_tagger.py
enterprise
health_check.py	Litellm agent oss staging 05 11 2026 (#27733 )	2026-05-13 14:09:12 -07:00
lambda.py
litellm_pre_call_utils.py	fix(proxy): enforce tag budgets for key-level tags (#29108 )	2026-05-28 11:39:02 -07:00
llamaguard_prompt.txt
logo.jpg
mcp_registry.json
mcp_tools.py
model_config.yaml
openapi_registry.json
openapi.json
post_call_rules.py
prisma_migration.py
prometheus_cleanup.py
proxy_cli.py	feat(proxy): hot-reload .env in dev when running with --reload (#29783 )	2026-06-06 09:39:21 -07:00
proxy_config.yaml
proxy_server.py	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
README.md
route_llm_request.py	Gemini managed agents support (#28270 )	2026-05-19 16:02:03 -07:00
schema.prisma	feat(mcp): per-server env vars with global + per-user scopes (#28917 )	2026-06-05 20:15:11 -07:00
start.sh
utils.py	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00

README.md

litellm-proxy

A local, fast, and lightweight OpenAI-compatible server to call 100+ LLM APIs.

usage

$ uv tool install litellm

$ litellm --model ollama/codellama 

#INFO: Ollama running on http://0.0.0.0:8000

replace openai base

import openai # openai v1.0.0+
client = openai.OpenAI(api_key="anything",base_url="http://0.0.0.0:8000") # set proxy to base_url
# request sent to model set on litellm proxy, `litellm --model`
response = client.chat.completions.create(model="gpt-3.5-turbo", messages = [
    {
        "role": "user",
        "content": "this is a test request, write a short poem"
    }
])

print(response)

See how to call Huggingface,Bedrock,TogetherAI,Anthropic, etc.

Folder Structure

Routes

proxy_server.py - all openai-compatible routes - /v1/chat/completion, /v1/embedding + model info routes - /v1/models, /v1/model/info, /v1/model_group_info routes.
health_endpoints/ - /health, /health/liveliness, /health/readiness
management_endpoints/key_management_endpoints.py - all /key/* routes
management_endpoints/team_endpoints.py - all /team/* routes
management_endpoints/internal_user_endpoints.py - all /user/* routes
management_endpoints/ui_sso.py - all /sso/* routes