litellm

Author	SHA1	Message	Date
Yassin Kortam	b5d3a5fc85	feat: add read-replica routing for Prisma DB via DATABASE_URL_READ_REPLICA (#27493 ) - Introduce RoutingPrismaWrapper that transparently routes read operations (find_*, count, group_by, query_raw, query_first) to a reader endpoint while writes remain on the writer, enabling Aurora-style reader/writer endpoint splits - Add IAMEndpoint dataclass and parse_iam_endpoint_from_url() to capture static connection fields from a reader URL so only the IAM token needs to rotate, avoiding the need for separate DATABASE_HOST_READ_REPLICA/etc. env vars - Enhance PrismaWrapper with per-instance knobs (db_url_env_var, iam_endpoint, recreate_uses_datasource, log_prefix) so writer and reader wrappers are independent: the reader writes its fresh URL to DATABASE_URL_READ_REPLICA and passes datasource override to Prisma since Prisma only auto-reads DATABASE_URL - Fix deadlock in PrismaWrapper.__getattr__: when called from inside a running event loop, schedule the token refresh as a background task instead of blocking with run_coroutine_threadsafe + future.result(), which would deadlock the loop thread waiting for a coroutine that needs the loop to run - Fix botocore crash when DATABASE_PORT is unset by defaulting to "5432" in both proxy_cli.py and PrismaWrapper.get_rds_iam_token(); passing None caused botocore to embed the literal string "None" in the presigned URL - Implement graceful reader degradation: reader connect/recreate failures are non-fatal; wrapper sets _reader_unavailable=True and silently routes reads to the writer to keep the proxy serving traffic during transient reader outages - Add PrismaClient.writer_db property so the reconnect smoke-test always validates the writer engine specifically; query_raw on the routing wrapper would route to the reader and not verify the newly-recreated writer - Expose DATABASE_URL_READ_REPLICA in Helm chart (values.yaml + deployment.yaml) via both plain value and secret key reference, and document the field in docker-compose.yml - Add 887-line test suite covering routing logic, IAM token refresh paths, reader degradation scenarios, datasource override behavior, and the deadlock regression Co-authored-by: Yassin Kortam <yassinkortam@g.ucla.edu>	2026-05-08 21:05:50 -07:00
Yassin Kortam	451ce161fc	fix: remove separate health app	2026-05-07 16:04:56 -07:00
Yassin Kortam	618df94433	helm: increase default probe timeouts, disable debug logging by default (#27237 ) Co-authored-by: Yassin Kortam <yassinkortam@g.ucla.edu>	2026-05-05 16:58:34 -07:00
Chesars	1be6b31e2f	merge: resolve conflicts between main and litellm_oss_staging_03_11_2026	2026-03-12 09:38:31 -03:00
RJ Duffner	0c95d415e1	Add Abilty To Set minReadySeconds From values Files (#23173 ) * Add Abilty To Set minReadySeconds From values Files * typo * uppercase Min as it comes after deployment * Don't use defaults, just omit	2026-03-11 23:29:15 +05:30
Harshit28j	3127d79da8	feat: add strategy to deployment for helmchart	2026-03-10 05:49:46 +05:30
Sean Marsh Glover	4652c73259	feat(proxy): limit concurrent health checks with health_check_concurrency (#20584 ) * staged first pass * black * Update litellm/proxy/health_check.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * simpler * restore cached logo * fix tests for perform_health_check max_concurrency arg * implement pr suggestion * and the helm chart * add configureable resources and probes to the deployment in the helm chart * more helm chart unittests * move some background healthcheck loggin to debug --------- Co-authored-by: Sean Glover <sglover@athenahealth.com> Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-24 08:16:59 -08:00
Pragya Sardana	b4a27712a1	Add Init Containers in the community helm chart (#19816 )	2026-01-27 18:10:47 -08:00
Harshit Jain	9084c1d1bd	feat(helm): Enable PreStop hook configuration in values.yaml (#19613 )	2026-01-22 19:28:52 -08:00
R.Sicart	608979c7e9	feat: add support for keda in helm chart (#19337 ) * feat: add support for keda in helm chart Signed-off-by: R.Sicart <roger.sicart@gmail.com> * chore: bump chart version --------- Signed-off-by: R.Sicart <roger.sicart@gmail.com>	2026-01-19 10:38:41 -08:00
expruc	2d112fc8b2	add option to include additional resources to chart (#17627 )	2025-12-07 23:25:57 -08:00
Lukas de Boer	3b8a6ec888	Helm Chart: Add possibility to override command, args and add deployment labels (#17535 ) * Helm Chart: Add possibility to override command, args and also add deployment labels * Helm Chart: Fix helm lint issue * Helm Chart: Fix helm unit tests	2025-12-06 14:01:09 -08:00
Fabian Reinold	c173a4a275	Helm Chart: add ingress-only labels (#17348 ) * feat(helm): add ingress-only labels * feat(helm): add ingress configuration tests * chore(helm): bump chart version	2025-12-02 22:30:54 -08:00
Saar wintrov	777ef628d2	Enhancement(helm): ServiceMonitor template rendering (#17038 ) * Metadata: fix 401 when audio/transcriptions * check if str, CR fixes * Added new helmchart functionality * . * . * adding new tests	2025-11-24 20:53:02 -08:00
Abhinav	b6c26c3365	helm(chart): add optional PodDisruptionBudget for litellm proxy (#14062 ) (#14093 )	2025-09-01 12:21:44 -07:00
Const-antine	ff4040bbe1	add functionality to mount existing configmap if needed	2025-08-28 13:18:05 -04:00
Jugal D. Bhatt	d63f5f99e9	Enhance database configuration: add support for optional endpointKey in values.yaml and update deployment/migrations job templates to conditionally source DATABASE_HOST from the secret if endpointKey is set. (#13763 )	2025-08-21 14:58:50 -07:00
Mattias Andersson	89f71af4cd	Add possibility to configure resources for migrations-job in Helm chart	2025-08-14 17:08:26 +02:00
Jugal D. Bhatt	7cf3b4682a	[Separate Health App] Update Helm Deployment.yaml (#13162 ) * add helm deployment fix * clean deployment	2025-08-01 16:50:23 -07:00
unique-jakub	3edb71e617	allow helm hooks for migrations job (#13174 )	2025-07-31 21:51:07 -07:00
Victor Krylov	1d58fc5429	Add deployment annotations (#11849 ) * Add deployment annotations * Correct the indent and simplify if 0 annotations	2025-06-19 20:11:31 -07:00
Steven Aldinger	b8bdf98a4b	feat(helm): [BerriAI/litellm#11648] support extraContainers in migrations-job.yaml (#11649 )	2025-06-11 23:16:06 -07:00
Gunjan Solanki	f1cc2d544e	feat(helm): Add loadBalancerClass support for LoadBalancer services (#11064 ) * feat(helm): Add loadBalancerClass support for LoadBalancer services Adds the ability to specify a loadBalancerClass when using LoadBalancer service type. This enables integration with custom load balancer implementations like Tailscale. * fixup! feat(helm): Add loadBalancerClass support for LoadBalancer services	2025-05-22 22:45:14 -07:00
Manuel Cañete	c4ea1ab61b	feat: add extraEnvVars to the helm deployment (#9292 )	2025-04-11 10:32:16 -07:00
Manuel Cañete	44373baeee	feat: make masterkey secret configurable	2025-03-16 12:39:13 +01:00
Ashwin Madavan	40a3af7d61	chore: set ttlSecondsAfterFinished on the migration job in the litellm-helm chart (#8593 ) * chore: add ttlSecondsAfterFinished to values.yaml * chore: set ttlSecondsAfterFinished on the job * chore: bump version in Chart.yaml	2025-02-26 14:21:37 -08:00
Ishaan Jaff	4081aeb15e	(helm) - allow specifying envVars on values.yaml + add helm lint test (#7789 ) * litellm use envVars values.yaml * fix values.yaml * add helm lint to ci/cd pipeline * working values.yaml * add helm tests to ci/cd * fix helm chart testing * update helm tests * fix helm test * fix use test values in ci * fix busy box testing on helm * fix test-values.yaml * update helm tests * fix test connection	2025-01-15 21:11:17 -08:00
Low Jian Sheng	9cfd7c790a	Prevent istio injection for db migrations cron job (#7513 ) * allow option to prevent istio injection * make annotations more generic * make annotations empty * revert comment deletions	2025-01-04 22:45:19 -08:00
Ishaan Jaff	688d513459	(feat) helm hook to sync db schema (#6715 ) * v0 migration job * fix job	2024-11-12 11:58:35 -08:00
Krish Dholakia	cc8dd80209	allow configuring httpx hooks for AsyncHTTPHandler (#6290 ) (#6415 ) * allow configuring httpx hooks for AsyncHTTPHandler (#6290) Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * Fixes and minor improvements for Helm Chart (#6402) * reckoner hack * fix default * add extracontainers option * revert chart * fix extracontainers * fix deployment * remove init container * update docs * add helm lint to deploy step * change name * (refactor) prometheus async_log_success_event to be under 100 LOC (#6416) * unit testig for prometheus * unit testing for success metrics * use 1 helper for _increment_token_metrics * use helper for _increment_remaining_budget_metrics * use _increment_remaining_budget_metrics * use _increment_top_level_request_and_spend_metrics * use helper for _set_latency_metrics * remove noqa violation * fix test prometheus * test prometheus * unit testing for all prometheus helper functions * fix prom unit tests * fix unit tests prometheus * fix unit test prom * (refactor) router - use static methods for client init utils (#6420) * use InitalizeOpenAISDKClient * use InitalizeOpenAISDKClient static method * fix # noqa: PLR0915 * (code cleanup) remove unused and undocumented logging integrations - litedebugger, berrispend (#6406) * code cleanup remove unused and undocumented code files * fix unused logging integrations cleanup * update chart version * add circleci tests --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev> * fix: fix linting error * fix(http_handler.py): fix linting error --------- Co-authored-by: Alejandro Rodríguez <alejorro70@gmail.com> Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>	2024-10-24 22:00:24 -07:00
Peter Laß	e8066f13cb	Update litellm helm envconfigmap (#5872 ) * Added the possibility to use ConfigMapRef for env values. * bumped aapVersion to 1.46.6 and chartVersion to 0.3.0 --------- Co-authored-by: Peter Laß <peter.lass@maibornwolff.de>	2024-09-25 21:43:08 -07:00
Peter Laß	b1ecfe065c	fix #5614 (#5615 ) Co-authored-by: Peter Laß <peter.lass@maibornwolff.de>	2024-09-10 09:26:44 -07:00
ishaan-jaff	54dceea4b0	(fix) releasing litellm helm as "litellm-helm"	2024-03-16 10:06:14 -07:00
Lunik	880213d4a6	✨ Refresh Helm chart structure Signed-off-by: Lunik <lunik@tiwabbit.fr>	2024-02-16 22:59:14 +01:00
Shaun Maher	4d0e369e7a	Authored a Helm chart for LiteLLM. Added GitHub workflows/actions to build and push the helm chart to the ghcr.io OCI registry.	2024-01-25 11:53:59 +11:00

35 Commits