History

Yassin Kortam 410b892f77 fix(register_model): preserve built-in cache pricing when registering custom overrides under unmapped keys (#30044 ) * fix(spend-tracking): fall back to direct spend-counter increment when reservation reconcile fails When the reservation-reconcile path in `_reconcile_budget_reservation_for_counter_update` hits a Redis error, it now correctly returns an empty set so that `increment_spend_counters` re-runs the direct increment for the affected counters. Previously, the function logged the failure, invalidated the reserved counters, and still returned the reserved counter keys, which caused the caller to skip the direct increment. With the increment skipped and the counter deleted, the next request reseeded the counter from `LiteLLM_VerificationToken.spend`, a column the batched flusher only updates every few seconds, so the enforced cross-pod spend value collapsed to a stale snapshot and budget gating stopped firing for affected keys. Adds a regression test that exercises the failure path with a flaky redis backend and asserts the actual response cost lands in the shared counter. * fix(register_model): preserve built-in cache pricing when registering custom overrides under unmapped keys When a custom-priced model is registered under a key shape that get_model_info cannot resolve (e.g. litellm_params.model set to bedrock/bedrock/us.anthropic.claude-sonnet-4-6 or another non-canonical alias), register_model previously fell back to an empty existing_model. The merged entry then carried only the fields the user set explicitly (input/output cost, provider) and dropped cache pricing. Downstream the cost calculator defaulted cache_creation_input_token_cost and cache_read_input_token_cost to 0, silently dropping the bulk of the bill for cache-heavy Anthropic traffic. register_model now attempts to resolve a canonical built-in entry by stripping provider prefixes, region prefixes, and provider-specific suffixes before giving up. When a variant resolves, its defaults (notably cache pricing) are inherited while the user's explicit overrides still win. When nothing resolves and the user supplied no cache pricing, it logs a warning instead of silently under-billing. * fix(router): inherit built-in cache pricing on deployments with partial custom pricing A deployment configured with only input_cost_per_token and output_cost_per_token under model_info was being registered under its model_info.id with no cache cost fields. The cost calculator then defaulted cache_creation_input_token_cost and cache_read_input_token_cost to 0, silently billing cache_read and cache_creation tokens at zero. For cache-heavy Anthropic traffic this drops the bulk of the bill. When the deployment's litellm_params.model resolves to a built-in cost-map entry, pull the cache pricing fields from there before registering. User-specified cache fields still win on merge; only missing fields are inherited. Pairs with the register_model fallback added earlier in this branch: that handles unmapped key shapes like bedrock/bedrock/x, this handles deploy-id keys whose backend model is mapped. * fix(register_model): inherit only cache pricing on unmapped-key fallback, not provider The unmapped-key fallback in register_model copied the entire resolved built-in entry, so registering openai/command-r-plus inherited the cohere built-in's litellm_provider and get_model_info(custom_llm_provider=openai) could no longer resolve it. Restrict the fallback to the cache-pricing fields, matching the router-side _inherit_builtin_cache_pricing, so the cache-cost dropout stays fixed without clobbering the registered provider. Add a direct unit test for Router._inherit_builtin_cache_pricing so the router coverage check sees it, and pin the fixed spend-counter contract: when reservation reconcile fails the counter must hold the directly incremented cost rather than being left at None.		2026-06-10 12:11:03 -07:00
..
a2a_protocol	Fix : a2a bugs 030626 (#29566 )	2026-06-03 11:14:15 -07:00
anthropic_interface/exceptions	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
caching	fix(caching): restore stored prompt_tokens on embedding cache hits instead of recomputing (#30046 )	2026-06-10 15:49:20 +05:30
completion_extras	fix(responses-bridge): map system-only chat request to system input item (#29817 )	2026-06-06 16:11:54 -07:00
containers	fix(containers): record ownership for service-account keys + fix Prisma Json serialization (#28990 )	2026-05-27 21:00:07 -07:00
enterprise	feat(enterprise): add RESEND_FROM_EMAIL for self-hosted Resend sends (#28830 )	2026-05-30 10:09:05 -07:00
expected_fine_tuning_api	refactor: refactor testing	2026-03-28 18:39:32 -07:00
expected_responses_api_request
experimental_mcp_client	feat(mcp): per-server env vars with global + per-user scopes (#28917 )	2026-06-05 20:15:11 -07:00
google_genai	fix(gemini): normalize response_schema on native generateContent (#27775 )	2026-05-12 23:26:34 -07:00
images	fix(image_edit): read vertex_project/location from litellm_params in Imagen get_complete_url	2026-04-21 15:03:40 +05:30
integrations	feat(vantage): include organization metadata in FOCUS Tags export (#28184 )	2026-06-09 02:59:21 +03:00
interactions	fix(interactions): never drop streamed text deltas; always emit terminal completion (#28394 )	2026-05-20 16:41:40 -07:00
litellm_core_utils	Litellm oss 090626 (#30021 )	2026-06-10 10:34:07 -07:00
llms	Litellm oss 090626 (#30021 )	2026-06-10 10:34:07 -07:00
models	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
ocr	chore: reject bare str at file-input sinks to prevent local-file read (#27762 )	2026-05-12 16:40:07 -07:00
passthrough	refactor: trim explanatory comments from streaming-flush fix	2026-04-30 02:39:28 +00:00
proxy	fix(register_model): preserve built-in cache pricing when registering custom overrides under unmapped keys (#30044 )	2026-06-10 12:11:03 -07:00
repositories	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
responses	Litellm oss staging 080626 (#29932 )	2026-06-08 13:49:52 -07:00
router_strategy	fix(router): enforce deployment budgets for dynamically added models (#29273 )	2026-05-29 19:43:14 -07:00
router_utils	Litellm oss 090626 (#30021 )	2026-06-10 10:34:07 -07:00
secret_managers	Implement normalize_nonempty_secret_str function to trim whitespace from secrets and treat empty values as unset. Update proxy_server to use this function for Grafana credentials. Enhance tests to validate the new normalization behavior.	2026-05-04 18:17:31 +00:00
test_router
types	Litellm oss staging 080626 (#29932 )	2026-06-08 13:49:52 -07:00
vector_stores	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
__init__.py
conftest.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
log.txt
readme.md
test__types.py	Litellm OSS Staging 010626 (#29422 )	2026-06-01 21:42:51 -07:00
test_a2a_registry_lookup.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_acompletion_session_reuse_e2e.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_add_deployment_no_master_key.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_aembedding_session_reuse_e2e.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_anthropic_beta_headers_filtering.py	Litellm oss 090626 (#30021 )	2026-06-10 10:34:07 -07:00
test_anthropic_skills_transformation.py	fix: encode upstream URL path identifiers	2026-04-29 22:02:39 -07:00
test_azure_video_router.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_bedrock_anthropic_1hr_cache_pricing.py	Litellm oss staging 030626 (#29578 )	2026-06-03 11:01:51 -07:00
test_bedrock_usgov_haiku_1hr_cache.py	Litellm OSS Staging 010626 (#29422 )	2026-06-01 21:42:51 -07:00
test_bedrock_usgov_pricing.py	Litellm OSS Staging 010626 (#29422 )	2026-06-01 21:42:51 -07:00
test_chat_ui_responses_session.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_check_licenses.py	fix(check_licenses): read PEP 639 license-expression metadata (#28529 )	2026-05-22 11:22:38 -07:00
test_claude_fable_5_config.py	Add Claude Fable 5 across Anthropic, Bedrock, Vertex AI, and Azure AI (#30064 )	2026-06-10 08:50:15 +05:30
test_claude_haiku_4_5_config.py	feat(anthropic): add Claude Opus 4.8 and prune reasoning-effort flags (#29238 )	2026-05-28 18:50:33 -07:00
test_claude_opus_4_6_config.py	feat(anthropic): add Claude Opus 4.8 and prune reasoning-effort flags (#29238 )	2026-05-28 18:50:33 -07:00
test_claude_opus_4_8_config.py	fix(anthropic): route Claude Opus 4.8 through adaptive thinking (#29702 )	2026-06-05 16:19:01 +05:30
test_claude_sonnet_4_6_config.py	feat(anthropic): add Claude Opus 4.8 and prune reasoning-effort flags (#29238 )	2026-05-28 18:50:33 -07:00
test_completion_timeout_resolution.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_compression.py	Prompt Compression - add it to the proxy (#25729 )	2026-04-20 15:08:00 -07:00
test_constants.py	Fix test	2026-03-27 21:21:43 +05:30
test_container_router.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_cost_calculation_log_level.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_cost_calculator.py	[internal copy of #27491 ] fix(realtime): Fix Realtime Audio Token Cost Tracking (#29722 )	2026-06-05 18:53:17 +05:30
test_count_tokens_public_api.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_dashscope_image_generation.py	fix(dashscope): fail fast on image generation API errors	2026-04-23 18:41:01 +05:30
test_deepseek_model_metadata.py	Revert "Merge pull request #16590 from Chesars/refactor/remove-backup-file-dry-principle"	2026-04-25 17:10:41 -03:00
test_dockerfile_non_root.py	fix(docker.non_root): use numeric UID 65534 for K8s runAsNonRoot (#26268 )	2026-04-22 18:00:04 -07:00
test_eager_tiktoken_load.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_exception_exports.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_exception_header_preservation.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_exception_mapping_request_attribute.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_filter_out_litellm_params.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_get_blog_posts.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_gpt_image_cost_calculator.py	Add gpt-image-2 support (#26644 ) (#26705 )	2026-04-28 20:10:42 -07:00
test_groq_streaming_encoding.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_guardrail_exception_status_codes.py	Litellm oss staging (#28161 )	2026-05-18 16:27:44 -07:00
test_lazy_imports.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_litellm_params_reserved_keys.py	fix(snowflake): transform tool_choice string to object format (#23268 )	2026-03-11 01:41:24 +05:30
test_logging.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_lowest_latency_zero_tokens.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_main_module_header.py	Add main module header comment	2026-05-06 00:26:17 +00:00
test_main.py	Litellm oss staging 040626 (#29671 )	2026-06-04 11:07:20 -07:00
test_model_cost_aliases.py	[Feat] - Ishaan main merge branch (#23596 )	2026-03-14 09:40:00 -07:00
test_model_param_helper.py	style: black format test_model_param_helper.py	2026-04-15 18:18:52 -07:00
test_model_response_normalization.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_nested_drop_params.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_openai_embedding_encoding_format_default.py	fix(embeddings): allow omitting encoding_format via env sentinel none	2026-05-01 23:03:13 +05:30
test_project_alias_tracking.py	feat(proxy): add project_alias tracking through callback metadata pipeline	2026-03-23 10:44:17 -07:00
test_project_tags_pydantic.py
test_rate_limit_error_unification.py	feat: standardize rate limit errors with category, rate_limit_type, model, and llm_provider fields (#27687 )	2026-06-06 17:50:29 -07:00
test_redact_string_in_error_paths.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_redis.py	Litellm oss staging (#28161 )	2026-05-18 16:27:44 -07:00
test_register_model_custom_pricing.py	fix(register_model): preserve built-in cache pricing when registering custom overrides under unmapped keys (#30044 )	2026-06-10 12:11:03 -07:00
test_responses_api_bridge_non_stream.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_responses_id_security.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_retrieve_batch_bedrock_dispatch.py	feat(batch-job): bedrock batch model invocation job retrieval (#26834 )	2026-05-11 13:22:26 -07:00
test_router_google_genai.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_router_model_cost_isolation.py	fix(register_model): preserve built-in cache pricing when registering custom overrides under unmapped keys (#30044 )	2026-06-10 12:11:03 -07:00
test_router_order_fallback.py	fix(router): enable order fallback for wildcard model groups	2026-04-15 17:48:25 +05:30
test_router_per_deployment_num_retries.py
test_router_redis_init.py
test_router_retry_non_retryable_errors.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_router_silent_experiment.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_router_weighted_failover.py	feat: add weighted-routing failover (#27980 )	2026-05-15 17:28:54 +00:00
test_router.py	Litellm oss 090626 (#30021 )	2026-06-10 10:34:07 -07:00
test_secret_redaction.py	Litellm oss staging (#29492 )	2026-06-02 08:48:10 -07:00
test_service_logger.py	fix: missing span for guardrail passthrough (#29552 )	2026-06-03 01:25:15 +00:00
test_setup_wizard.py	test: test	2026-03-28 19:17:38 -07:00
test_shared_session_integration.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_ssl_verify_unit.py	Litellm OSS Staging (#29161 )	2026-06-01 21:22:35 -07:00
test_stream_chunk_builder_annotations.py	fix: merge annotations from all streaming chunks in stream_chunk_builder	2026-03-15 14:20:45 +05:30
test_streaming_connection_cleanup.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_system_message_format_bug.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
test_thinking_enabled.py	Litellm oss staging 250526 (#28770 )	2026-05-26 11:57:39 -07:00
test_utils_module_docstring.py	Add utils module docstring	2026-05-06 00:42:49 +00:00
test_utils.py	Litellm oss 090626 (#30021 )	2026-06-10 10:34:07 -07:00
test_uuid_helper.py
test_vcr_safe_body_matcher.py	test: stabilize batch VCR coverage and stop live upload/network leaks (#29477 )	2026-06-02 16:11:52 -07:00
test_video_generation.py	fix(vertex-ai): use DB credentials in video handlers + implement Veo video edit (#29098 )	2026-05-28 11:45:41 -07:00
test_xai_grok_4_3_model_metadata.py	feat(xai): add grok-4.3 and grok-4.3-latest to model_prices_and_conte… (#27154 )	2026-05-07 09:06:56 -07:00
test_xai_responses_auto_routing.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00

readme.md

Testing for `litellm/`

This directory 1:1 maps the the litellm/ directory, and can only contain mocked tests.

The point of this is to:

Increase test coverage of litellm/
Make it easy for contributors to add tests for the litellm/ package and easily run tests without needing LLM API keys.

File name conventions

litellm/proxy/test_caching_routes.py maps to litellm/proxy/caching_routes.py
test_<filename>.py maps to litellm/<filename>.py

readme.md

Testing for litellm/

File name conventions

Testing for `litellm/`