litellm

Author	SHA1	Message	Date
user	bfdd786962	chore(deps): refresh dependency locks	2026-05-04 11:36:18 -07:00
Ishaan Jaffer	e8461b5b97	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
Yuneng Jiang	966be2982a	[Docs] Add missed content PRs to v1.83.7.rc.1 and update runbook - Add 8 content PRs that merged directly to the release branch outside the listed staging PRs: #23769 (Ramp callback), #25252 (JWT OAuth2 override), #25254 (AWS GovCloud mode), #25258 (batch-limit cleanup), #25334 (router custom_llm_provider), #25345 (Triton embeddings), #25347 (tag-based routing), #25358 (Baseten pricing attribution) - Add @kedarthakkar to new contributors (first-ever PR via #23769) - Update RELEASE_NOTES_GENERATION_INSTRUCTIONS: require walking git log range between release tags in addition to staging PRs, and verify new-contributor status per author rather than trusting the GH release body floor	2026-04-14 16:13:09 -07:00
Yuneng Jiang	8eec2c69b7	[Docs] Add release notes for v1.83.3-stable and v1.83.7.rc.1 - Retitle existing v1.83.3 preview file to v1.83.3-stable (same commit) - Add new v1.83.7.rc.1 preview release notes - Update RELEASE_NOTES_GENERATION_INSTRUCTIONS runbook with guidance on resolving staging PRs to their underlying commits	2026-04-14 15:58:13 -07:00
user	637ff30f97	fix(security): bump litellm in cookbook to 1.83.5 The cookbook example pinned litellm==1.61.15 which has 3 known vulnerabilities (CVE-2026-35029, CVE-2026-35030, and a password hash exposure issue), all patched in 1.83.0. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 19:22:20 +00:00
David Chen	d1df4e838b	Litellm fix update bedrock models (#24947 ) * update bedrock models in tests * updated more tests and model_prices_and_context_window * fix model id and pricing * replace more sonnet models * update tests * git push * update pricing * flaky total cost * monkey patch * relax the cost change * fix and revert some changes * revert the pricing * chore: move cost/pricing changes to bedrock-cost-fixes branch * chore: split Bedrock file-api beta stripping to separate branch Removes strip_unsupported_file_api_betas_for_bedrock_invoke from this branch; see litellm_bedrock_invoke_strip_file_api_betas for that fix. Made-with: Cursor	2026-04-01 19:22:54 -07:00
Krrish Dholakia	df2a36dd27	docs: document new github + gitlab ci scripts	2026-03-25 20:17:10 -07:00
Ishaan Jaffer	a2f02aa139	docs: remove phone numbers from readme and docs	2026-03-25 12:40:40 -07:00
yuneng-jiang	71c3503e57	Revert "[Feature] Add /public/supported_endpoints endpoint"	2026-02-26 17:21:43 -08:00
yuneng-jiang	efcc856234	Move provider_endpoints_support.json into litellm package The file was at the repo root and excluded from pip distributions. Moving it to litellm/proxy/public_endpoints/ alongside the other provider JSON files ensures it is packaged correctly. Updates all references in the endpoint handler, coverage tests, and release notes instructions. Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-26 15:15:16 -08:00
Krrish Dholakia	a26f83fd3c	fix: update calendly on repo	2026-02-23 06:13:59 -08:00
Trevor Prater	66ccbe37cd	Add gollem Go agent framework cookbook example (#21747 ) Show how to use gollem, a production Go agent framework, with LiteLLM proxy for multi-provider LLM access including tool use and streaming.	2026-02-21 19:51:28 -08:00
Krrish Dholakia	a39a234cf4	doc: add right readme.md	2026-02-18 03:33:23 +05:30
Harshit Jain	5da5a1478e	Merge branch 'litellm_prompt_registry_fix' of https://github.com/Harshit28j/litellm into litellm_prompt_registry_fix	2026-02-18 02:50:29 +05:30
Harshit Jain	d061ae9370	fix docs and format	2026-02-18 02:49:52 +05:30
Harshit Jain	2efe4ba165	Update cookbook/mock_prompt_management_server/mock_prompt_management_server.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-18 02:31:27 +05:30
Harshit Jain	56fab12fbe	fix: prompt registry	2026-02-18 00:34:54 +05:30
Krrish Dholakia	26cd194d97	feat: final improvements for prompt management api	2025-12-13 18:10:21 -08:00
Krish Dholakia	7f5a097e96	Prompt Management API - new API to interact with Prompt Management integrations (no PR required) (#17800 ) * feat: initial commit adding prompt management api * feat: initial commit adding prompt management api * fix: refactoring to make sure get prompt is async * fix: additional fixes	2025-12-10 17:30:19 -08:00
Ishaan Jaff	a9b654224e	1.80.8 RC docs (#17605 ) * stash docs * docs fix * doc fix * docs fix	2025-12-06 10:40:00 -08:00
Krish Dholakia	b3a3081e8e	Guardrails API - new `structured_messages` param (#17518 ) * fix(generic_guardrail_api.py): add 'structured_messages' support allows guardrail provider to know if text is from system or user * fix(generic_guardrail_api.md): document 'structured_messages' parameter give api provider a way to distinguish between user and system messages * feat(anthropic/): return openai chat completion format structured messages when calls made via `/v1/messages` on Anthropic * feat(responses/guardrail_translation): support 'structured_messages' param for guardrails structured openai chat completion spec messages, for guardrail checks when using /v1/responses api allows guardrail checks to work consistently across APIs	2025-12-04 22:08:00 -08:00
Krish Dholakia	51cc102c30	fix(unified_guardrail.py): support during_call event type for unified guardrails (#17514 ) * fix(unified_guardrail.py): support during_call event type for unified guardrails allows guardrails overriding apply_guardrails to work 'during_call' * feat(generic_guardrail_api.py): support new 'tool_calls' field for generic guardrail api returns the tool calls emitted by the LLM API to the user * fix(generic_guardrail_api.py): working anthropic /v1/messages tool call response send llm tool calls to guardrail api when called via `/v1/messages` API * fix(responses/): run generic_guardrail_api on responses api tool call responses * fix: fix tests * test: fix tests * fix: fix tests	2025-12-04 22:06:13 -08:00
Krish Dholakia	32013f63a0	Guardrail API - support tool call checks on OpenAI `/chat/completions`, OpenAI `/responses`, Anthropic `/v1/messages` (#17459 ) * fix(unified_guardrail.py): correctly map a v1/messages call to the anthropic unified guardrail * fix: add more rigorous call type checks * fix(anthropic_endpoints/endpoints.py): initialize logging object at the beginning of endpoint ensures call id + trace id are emitted to guardrail api * feat(anthropic/chat/guardrail_translation): support streaming guardrails sample on every 5 chunks * fix(openai/chat/guardrail_translation): support openai streaming guardrails * fix: initial commit fixing output guardrails for responses api * feat(openai/responses/guardrail_translation): handler.py - fix output checks on responses api * fix(openai/responses/guardrail_translation/handler.py): ensure responses api guardrails work on streaming * test: update tests * test: update tests * fix: support multiple kinds of input to the guardrail api * feat(guardrail_translation/handler.py): support extracting tool calls from openai chat completions for guardrail api's * feat(generic_guardrail_api.py): support extracting + returning modified tool calls on generic_guardrails_api allows guardrail api to analyze tool call being sent to provider - to run any analysis on it * fix(guardrails.py): support anthropic /v1/messages tool calls * feat(responses_api/): extract tool calls for guardrail processing * docs(generic_guardrail_api.md): document tools param support * docs: generic_guardrail_api.md improve documentation	2025-12-03 21:20:39 -08:00
Krish Dholakia	be0530a6b3	fix(unified_guardrail.py): correctly map a v1/messages call to the anthropic unified guardrail (#17424 ) * fix(unified_guardrail.py): correctly map a v1/messages call to the anthropic unified guardrail * fix: add more rigorous call type checks * fix(anthropic_endpoints/endpoints.py): initialize logging object at the beginning of endpoint ensures call id + trace id are emitted to guardrail api * feat(anthropic/chat/guardrail_translation): support streaming guardrails sample on every 5 chunks * fix(openai/chat/guardrail_translation): support openai streaming guardrails * fix: initial commit fixing output guardrails for responses api * feat(openai/responses/guardrail_translation): handler.py - fix output checks on responses api * fix(openai/responses/guardrail_translation/handler.py): ensure responses api guardrails work on streaming * test: update tests * test: update tests * test: update tests * fix(bedrock_guardrails.py): fix post call streaming iterator logic * fix: fix return * fix(bedrock_guardrails.py): fix	2025-12-03 20:54:56 -08:00
Krish Dholakia	4c7a988454	Guardrail API V2 - user api key metadata, session id, specify input type (request/response), image support (#17338 ) * refactor(generic_guardrail_api.py): refactor to update to new guardrail api logic * refactor: refactor llm api integrations to support passing in text as a list[str] instead of one at a time * refactor: fix linting errors * refactor: pass request type to guardrail api allows request vs. response processing to occur * feat: pass user api key dict information to the guardrail api * fix: pass user api key dict information to the guardrail api * feat: pass litellm call id + trace id, if present * docs: update docs	2025-12-01 20:11:58 -08:00
Krish Dholakia	b6d6f834e0	(feat) Generic Guardrail API - allows guardrail providers to add INSTANT support for LiteLLM w/out PR to repo (#17175 ) * feat(generic_guardrail_api.py): new generic api for guardrails Allows guardrail providers to work with litellm for guardrails without needing to make a PR to LiteLLM * docs(generic_guardrail_api.md): document new generic guardrail api * Fix: Improve PII detection and guardrail API integration Co-authored-by: krrishdholakia <krrishdholakia@gmail.com> * feat: correctly extract raw request from guardrail api * docs(generic_guardrail_api.md): document this is a beta feature --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2025-12-01 14:29:52 -08:00
Ishaan Jaffer	b43b68a072	docs fix	2025-11-22 14:02:14 -08:00
Ishaan Jaffer	badbadba0d	fix img URL for tests	2025-11-22 09:41:15 -08:00
Ishaan Jaff	661117678c	Revert "remove deprecated embedding model (#16724 )" (#16970 ) This reverts commit `b9bc903536`.	2025-11-22 09:34:53 -08:00
Ishaan Jaffer	95caa2e3de	bump openai 2.8.0	2025-11-19 17:47:18 -08:00
Sameer Kankute	b9bc903536	remove deprecated embedding model (#16724 )	2025-11-17 18:46:20 -08:00
Ishaan Jaff	630a746c84	[Feat] Add Custom Secret Manager - Allow users to define and write a custom secret manager (#16297 ) * add CustomSecretManager class * docs custom secret manager * add TestCustomSecretManager * add KeyManagementSystem.CUSTOM * add get_secret_from_manager * add custom secret manager * Potential fix for code scanning alert no. 3662: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * docs fix * load_custom_secret_manager * initialize_secret_manager * add custom_secret_manager * fix add custom secret manager * add custom secret manager to KeyManagementSystem * fix KeyManagementSystem.CUSTOM * fix custom secret manager within cookbook * fix link for custom secret manager * Potential fix for code scanning alert no. 3663: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-11-05 17:20:26 -08:00
TensorNull	e3566faf15	fix(cookbook): Remove the CometAPI key used for testing.	2025-10-16 14:14:58 +08:00
TensorNull	55a6dd3a8b	feat(cometapi): Add CometAPI provider support (embeddings, image generation, docs) - Add CometAPI embedding and image generation transformations and configs - Add image cost calculator and export/init files - Register provider in constants, utils, main (embedding path) and sidebars - Add CometAPI docs page and cookbook notebook (Colab) for usage examples	2025-10-16 13:08:14 +08:00
Ishaan Jaffer	10a801ce83	docs fix	2025-09-27 16:52:02 -07:00
Alexsander Hamir	eaa04cd8ce	fix: use fastuuid helper (#14903 ) * fix: use fastuuid helper across the codebase First batch of changes, simple drop in replacement. * second batch of changes * fixed: script mistake on helper file	2025-09-25 15:47:01 -07:00
Ishaan Jaff	b9ffa98c55	[Feat] Proxy CLI: Create a python method to login using litellm proxy (#14782 ) * fix: cli auth with SSO okta * fix: add LITTELM_CLI_SERVICE_ACCOUNT_NAME * fix: get_litellm_cli_user_api_key_auth * use existing_key CLI * fix: use existing key * test auth commands * test_cli_sso_callback_regenerate_vs_create_flow * feat: add CLI Token Utilities * fix: get_stored_api_key * move file * fix: get_valid_models * fix config.yaml * TestCLITokenUtils * TestGetValidModelsWithCLI * fix: tie user id to keys created through CLI * fix: add teams interface to CLI * add /keys/update to the list client commands * fix /sso/cli/poll to return the user_id * fix: working TeamsManagementClient * fix CLI Login command * fixes for auth * Potential fix for code scanning alert no. 3400: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * ruff fix --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-09-22 21:28:38 -07:00
Ishaan Jaff	10de012e12	[Docs] - v1.77.3 (#14751 ) * fix sidebar * v1 * docs fix * docs fix * docs fix * docs fix * docs fix * docs fix * docs fix * docs fix * docs fix * docs fix	2025-09-20 15:23:44 -07:00
Ishaan Jaff	f37dd6bb95	Litellm 1.77.2 stable notes (#14544 ) * fix release notes instructions * docs v1 * fix doc * fix highlights * docs fix * docs fix	2025-09-13 18:41:34 -07:00
Ishaan Jaff	075a089d82	[Feat] Bedrock Batches - Ensure correct transformation applied to incoming requests (#14522 ) * use is_batch_jsonl_file * fix valid_content_type * fix transform_create_file_request * fix _transform_openai_jsonl_content_to_bedrock_jsonl_content * test_transform_openai_jsonl_content_to_bedrock_jsonl_content * fix mypy linting errors * fix BEDROCK_BATCH_MODEL * fix working sample * fix comment * fix model list * fix: use with managed batches * refactor	2025-09-12 18:32:57 -07:00
Ishaan Jaff	a13aa4740a	[Fixes] Bug fixes to using LiteLLM MCP Gateway (#14392 ) * fix: use _get_mcp_servers_in_path * fix checks for using litellm_proxy as MCP tool provider * fix: fix mcp_tools_with_litellm_proxy * fix: fix aresponses_api_with_mcp * aresponses_api_with_mcp * test_mcp_allowed_tools_filtering * fix: _filter_mcp_tools_by_allowed_tools * fix: _filter_mcp_tools_by_allowed_tools * test_streaming_responses_api_with_mcp_tools * fixes: test tools transfrom MCP->OpenaI spec * test_streaming_responses_api_with_mcp_tools * fix: chat ui allow multi select with allowed tools * fix: use correct MCP events with litellm proxy response API * fix get_event_model_class * fix litellm proxy MCP handler * fix MCPEnhancedStreamingIterator * chat ui show list tools result * UI: show MCP events * fix stream iterator * fixes: litellm proxy mcp handler * test responses + mcp * fix: update responses api with mcp handling * ruff check fix * central: _process_mcp_tools_to_openai_format * fix: refactor code * test_mcp_allowed_tools_filtering * test mcp with litellm proxy * fix mcp call * demo: video using MCP ui * fixes for using stream iterator * test_no_duplicate_mcp_tools_in_streaming_e2e * docs fix * fix code snippet	2025-09-10 19:12:11 -07:00
Ishaan Jaff	23ae7170d1	[Feat] Allow using Veo Video Generation through LiteLLM Pass through routes (#14228 ) * fix: add follow_redirects=True, * test_pass_through_with_httpbin_redirect * cook book veo video * docs Veo Video Generation with Google AI Studio * add veo-3.0-generate-preview cost tracking details * track vertex_video_models	2025-09-03 18:25:43 -07:00
Philip Kiely	7c3d522435	Update Baseten LiteLLM integration	2025-08-19 12:21:05 -07:00
Ishaan Jaff	4d941c914e	[Feat] Responses API Session Handling - Multi media support (#13347 ) * rename ResponsesSessionHandler * use ResponsesSessionHandler * test session handler * refactor ResponsesSessionHandler * fix get_proxy_server_request_from_spend_log * use constant for LITELLM_TRUNCATED_PAYLOAD_FIELD * add _should_check_cold_storage_for_full_payload * add get_class_type_for_custom_logger_name * get_active_custom_logger_for_callback_name * add get_proxy_server_request_from_cold_storage to CustomLogger * add ColdStorageHandler * start using cold storage integration * add get_proxy_server_request_from_cold_storage * fixes from manual testing * s3 v2 fix getting region name * ChatCompletionImageUrlObject * use _get_configured_cold_storage_custom_logger * fixes for _should_check_cold_storage_for_full_payload * fix _download_object_from_s3 * test_s3_v2_with_cold_storage * add cold_storage_object_key to StandardLoggingMetadata * use get_proxy_server_request_from_cold_storage_with_object_key * add cold_storage_object_key to SpendLogsMetadata * add cold_storage_object_key * get_proxy_server_request_from_cold_storage_with_object_key * use get_proxy_server_request_from_cold_storage_with_object_key * test responses API * add get_proxy_server_request_from_cold_storage_with_object_key * session handler fixes * test session handler * fix ruff checks * _download_object_from_s3 * cleanup * test * lint fix * test_e2e_cold_storage_successful_retrieval * test_e2e_generate_cold_storage_object_key_successful * test_async_gcs_pub_sub_v1 * test fix * test fix * test fix * test_standard_logging_metadata_has_cold_storage_object_key_field * test_sanitize_request_body_for_spend_logs_payload_basic * test_transform_input_image_item_to_image_item_with_image_data	2025-08-07 10:59:53 -07:00
Krish Dholakia	d37cc63250	Add new model provider Novita AI (#7582 ) (#9527 ) * Add new model provider Novita AI (#7582) * feat: add new model provider Novita AI * feat: use deepseek r1 model for examples in Novita AI docs * fix: fix tests * fix: fix tests for novita * fix: fix novita transformation * ci: fix ci yaml * fix: fix novita transformation and test (#10056) --------- Co-authored-by: Jason <ggbbddjm@gmail.com>	2025-05-12 21:49:30 -07:00
Ishaan Jaff	2cc4a87861	[Docs] Using litellm with Google ADK (#10777 ) * docs litellm ADK usage * docs litellm google adk * docs litellm ADK * docs litellm with ADK usage examples * docs litellm proxy with ADK * cookbook litellm ADK	2025-05-12 16:41:49 -07:00
minatoaquaMK2	65b99d6bc3	feat(grafana_dashboard): enable datasource selection via templating (#10257 ) This commit updates the Grafana dashboard configuration to include a datasource template variable. This allows users to dynamically select the datasource directly within the Grafana dashboard, improving flexibility and user experience.	2025-04-25 08:49:29 -07:00
Krish Dholakia	34bdf36eab	Add inference providers support for Hugging Face (#8258 ) (#9738 ) (#9773 ) * Add inference providers support for Hugging Face (#8258) * add first version of inference providers for huggingface * temporarily skipping tests * Add documentation * Fix titles * remove max_retries from params and clean up * add suggestions * use llm http handler * update doc * add suggestions * run formatters * add tests * revert * revert * rename file * set maxsize for lru cache * fix embeddings * fix inference url * fix tests following breaking change in main * use ChatCompletionRequest * fix tests and lint * [Hugging Face] Remove outdated chat completion tests and fix embedding tests (#9749) * remove or fix tests * fix link in doc * fix(config_settings.md): document hf api key --------- Co-authored-by: célina <hanouticelina@gmail.com>	2025-04-05 10:50:15 -07:00
Ishaan Jaff	5965680176	fix dev release.txt	2025-04-01 12:02:51 -07:00
dependabot[bot]	8f35bdffb0	build(deps): bump litellm in /cookbook/litellm-ollama-docker-image Bumps [litellm](https://github.com/BerriAI/litellm) from 1.55.3 to 1.61.15. - [Release notes](https://github.com/BerriAI/litellm/releases) - [Commits](https://github.com/BerriAI/litellm/commits) --- updated-dependencies: - dependency-name: litellm dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2025-03-20 21:03:29 +00:00

1 2 3 4 5 ...

256 Commits