Commit Graph

35 Commits

Author SHA1 Message Date
Yassin Kortam
b5d3a5fc85
feat: add read-replica routing for Prisma DB via DATABASE_URL_READ_REPLICA (#27493)
- Introduce RoutingPrismaWrapper that transparently routes read operations (find_*, count, group_by, query_raw, query_first) to a reader endpoint while writes remain on the writer, enabling Aurora-style reader/writer endpoint splits
- Add IAMEndpoint dataclass and parse_iam_endpoint_from_url() to capture static connection fields from a reader URL so only the IAM token needs to rotate, avoiding the need for separate DATABASE_HOST_READ_REPLICA/etc. env vars
- Enhance PrismaWrapper with per-instance knobs (db_url_env_var, iam_endpoint, recreate_uses_datasource, log_prefix) so writer and reader wrappers are independent: the reader writes its fresh URL to DATABASE_URL_READ_REPLICA and passes datasource override to Prisma since Prisma only auto-reads DATABASE_URL
- Fix deadlock in PrismaWrapper.__getattr__: when called from inside a running event loop, schedule the token refresh as a background task instead of blocking with run_coroutine_threadsafe + future.result(), which would deadlock the loop thread waiting for a coroutine that needs the loop to run
- Fix botocore crash when DATABASE_PORT is unset by defaulting to "5432" in both proxy_cli.py and PrismaWrapper.get_rds_iam_token(); passing None caused botocore to embed the literal string "None" in the presigned URL
- Implement graceful reader degradation: reader connect/recreate failures are non-fatal; wrapper sets _reader_unavailable=True and silently routes reads to the writer to keep the proxy serving traffic during transient reader outages
- Add PrismaClient.writer_db property so the reconnect smoke-test always validates the writer engine specifically; query_raw on the routing wrapper would route to the reader and not verify the newly-recreated writer
- Expose DATABASE_URL_READ_REPLICA in Helm chart (values.yaml + deployment.yaml) via both plain value and secret key reference, and document the field in docker-compose.yml
- Add 887-line test suite covering routing logic, IAM token refresh paths, reader degradation scenarios, datasource override behavior, and the deadlock regression
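
The read/write split and reader degradation described in the bullets above can be pictured with a minimal sketch. This is not the actual RoutingPrismaWrapper: `RoutingSketch`, `FakeClient`, `is_read_operation` and `mark_reader_unavailable` are hypothetical names, and the clients are flat objects for simplicity, whereas real Prisma calls go through per-model accessors. Only the list of read operations and the `_reader_unavailable` flag are taken from the commit message.

```python
# Operation names treated as reads, as listed in the commit message.
READ_PREFIXES = ("find_",)
READ_METHODS = {"count", "group_by", "query_raw", "query_first"}


def is_read_operation(name: str) -> bool:
    """Return True for operations that may be served by a reader endpoint."""
    return name.startswith(READ_PREFIXES) or name in READ_METHODS


class RoutingSketch:
    """Hypothetical stand-in for a routing wrapper (assumption, not LiteLLM code)."""

    def __init__(self, writer, reader=None):
        self._writer = writer
        self._reader = reader
        # Without a reader, every call goes to the writer.
        self._reader_unavailable = reader is None

    def mark_reader_unavailable(self) -> None:
        """Degrade gracefully: from now on reads are served by the writer."""
        self._reader_unavailable = True

    def __getattr__(self, name):
        # __getattr__ only runs for attributes that are not found normally.
        # Private names are never forwarded, which avoids accidental recursion.
        if name.startswith("_"):
            raise AttributeError(name)
        use_reader = is_read_operation(name) and not self._reader_unavailable
        target = self._reader if use_reader else self._writer
        return getattr(target, name)


class FakeClient:
    """Tiny in-memory client used only to demonstrate the routing."""

    def __init__(self, label: str):
        self.label = label

    def find_many(self) -> str:
        return f"{self.label}:find_many"

    def create(self) -> str:
        return f"{self.label}:create"


db = RoutingSketch(writer=FakeClient("writer"), reader=FakeClient("reader"))
print(db.find_many())   # served by the reader
print(db.create())      # served by the writer
db.mark_reader_unavailable()
print(db.find_many())   # falls back to the writer
```

Routing by operation name keeps callers unchanged: they keep calling the same methods, and only the wrapper decides which endpoint serves them.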

Co-authored-by: Yassin Kortam <yassinkortam@g.ucla.edu>
2026-05-08 21:05:50 -07:00
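
The deadlock fix described in commit b5d3a5fc85 can be illustrated with a small asyncio sketch. The names here (`schedule_refresh`, `refresh_token`) are hypothetical and this is not the PrismaWrapper code; it only shows the general pattern the message describes: when already inside a running event loop, schedule the refresh as a background task rather than blocking the loop thread on its result.

```python
import asyncio
from typing import Any, Callable, Coroutine, Optional


def schedule_refresh(
    refresh: Callable[[], Coroutine[Any, Any, None]],
) -> Optional["asyncio.Task[None]"]:
    """Run `refresh` without ever blocking a running event loop.

    Blocking the loop thread on a future whose coroutine needs that same loop
    can never complete, which is the deadlock the commit message describes.
    """
    try:
        loop = asyncio.get_running_loop()
    except RuntimeError:
        # No loop is running in this thread, so blocking here is safe.
        asyncio.run(refresh())
        return None
    # A loop is running: hand the coroutine to it and return immediately.
    # The caller should keep a reference to the task so it is not
    # garbage-collected before it finishes.
    return loop.create_task(refresh())


refreshed = []


async def refresh_token() -> None:
    """Hypothetical refresh; a real one would fetch a new IAM token."""
    await asyncio.sleep(0)
    refreshed.append("token")


async def main() -> None:
    task = schedule_refresh(refresh_token)  # returns at once, no blocking
    if task is not None:
        await task  # awaited here only so the demo can observe completion


schedule_refresh(refresh_token)  # synchronous caller: runs to completion
asyncio.run(main())              # asynchronous caller: runs as a task
print(len(refreshed))
```

The trade-off is that a caller inside the loop may briefly proceed with the old token while the background task finishes.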
Yassin Kortam
451ce161fc fix: remove separate health app 2026-05-07 16:04:56 -07:00
Yassin Kortam
618df94433
helm: increase default probe timeouts, disable debug logging by default (#27237)
Co-authored-by: Yassin Kortam <yassinkortam@g.ucla.edu>
2026-05-05 16:58:34 -07:00
Chesars
1be6b31e2f merge: resolve conflicts between main and litellm_oss_staging_03_11_2026 2026-03-12 09:38:31 -03:00
RJ Duffner
0c95d415e1
Add Ability To Set minReadySeconds From values Files (#23173)
* Add Ability To Set minReadySeconds From values Files

* typo

* uppercase Min as it comes after deployment

* Don't use defaults, just omit
2026-03-11 23:29:15 +05:30
Harshit28j
3127d79da8 feat: add strategy to deployment for helmchart 2026-03-10 05:49:46 +05:30
Sean Marsh Glover
4652c73259
feat(proxy): limit concurrent health checks with health_check_concurrency (#20584)
* staged first pass

* black

* Update litellm/proxy/health_check.py

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

* simpler

* restore cached logo

* fix tests for perform_health_check max_concurrency arg

* implement pr suggestion

* and the helm chart

* add configurable resources and probes to the deployment in the helm chart

* more helm chart unittests

* move some background healthcheck logging to debug

---------

Co-authored-by: Sean Glover <sglover@athenahealth.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
2026-02-24 08:16:59 -08:00
Pragya Sardana
b4a27712a1
Add Init Containers in the community helm chart (#19816) 2026-01-27 18:10:47 -08:00
Harshit Jain
9084c1d1bd
feat(helm): Enable PreStop hook configuration in values.yaml (#19613) 2026-01-22 19:28:52 -08:00
R.Sicart
608979c7e9
feat: add support for keda in helm chart (#19337)
* feat: add support for keda in helm chart

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

* chore: bump chart version

---------

Signed-off-by: R.Sicart <roger.sicart@gmail.com>
2026-01-19 10:38:41 -08:00
expruc
2d112fc8b2
add option to include additional resources to chart (#17627) 2025-12-07 23:25:57 -08:00
Lukas de Boer
3b8a6ec888
Helm Chart: Add possibility to override command, args and add deployment labels (#17535)
* Helm Chart: Add possibility to override command, args and also add deployment labels

* Helm Chart: Fix helm lint issue

* Helm Chart: Fix helm unit tests
2025-12-06 14:01:09 -08:00
Fabian Reinold
c173a4a275
Helm Chart: add ingress-only labels (#17348)
* feat(helm): add ingress-only labels

* feat(helm): add ingress configuration tests

* chore(helm): bump chart version
2025-12-02 22:30:54 -08:00
Saar wintrov
777ef628d2
Enhancement(helm): ServiceMonitor template rendering (#17038)
* Metadata: fix 401 when audio/transcriptions

* check if str, CR fixes

* Added new helmchart functionality

* .

* .

* adding new tests
2025-11-24 20:53:02 -08:00
Abhinav
b6c26c3365
helm(chart): add optional PodDisruptionBudget for litellm proxy (#14062) (#14093) 2025-09-01 12:21:44 -07:00
Const-antine
ff4040bbe1 add functionality to mount existing configmap if needed 2025-08-28 13:18:05 -04:00
Jugal D. Bhatt
d63f5f99e9
Enhance database configuration: add support for optional endpointKey in values.yaml and update deployment/migrations job templates to conditionally source DATABASE_HOST from the secret if endpointKey is set. (#13763) 2025-08-21 14:58:50 -07:00
Mattias Andersson
89f71af4cd Add possibility to configure resources for migrations-job in Helm chart 2025-08-14 17:08:26 +02:00
Jugal D. Bhatt
7cf3b4682a
[Separate Health App] Update Helm Deployment.yaml (#13162)
* add helm deployment fix

* clean deployment
2025-08-01 16:50:23 -07:00
unique-jakub
3edb71e617
allow helm hooks for migrations job (#13174) 2025-07-31 21:51:07 -07:00
Victor Krylov
1d58fc5429
Add deployment annotations (#11849)
* Add deployment annotations

* Correct the indent and simplify if 0 annotations
2025-06-19 20:11:31 -07:00
Steven Aldinger
b8bdf98a4b
feat(helm): [BerriAI/litellm#11648] support extraContainers in migrations-job.yaml (#11649) 2025-06-11 23:16:06 -07:00
Gunjan Solanki
f1cc2d544e
feat(helm): Add loadBalancerClass support for LoadBalancer services (#11064)
* feat(helm): Add loadBalancerClass support for LoadBalancer services

Adds the ability to specify a loadBalancerClass when using LoadBalancer service type.
This enables integration with custom load balancer implementations like Tailscale.

* fixup! feat(helm): Add loadBalancerClass support for LoadBalancer services
2025-05-22 22:45:14 -07:00
Manuel Cañete
c4ea1ab61b
feat: add extraEnvVars to the helm deployment (#9292) 2025-04-11 10:32:16 -07:00
Manuel Cañete
44373baeee feat: make masterkey secret configurable 2025-03-16 12:39:13 +01:00
Ashwin Madavan
40a3af7d61
chore: set ttlSecondsAfterFinished on the migration job in the litellm-helm chart (#8593)
* chore: add ttlSecondsAfterFinished to values.yaml

* chore: set ttlSecondsAfterFinished on the job

* chore: bump version in Chart.yaml
2025-02-26 14:21:37 -08:00
Ishaan Jaff
4081aeb15e
(helm) - allow specifying envVars on values.yaml + add helm lint test (#7789)
* litellm use envVars values.yaml

* fix values.yaml

* add helm lint to ci/cd pipeline

* working values.yaml

* add helm tests to ci/cd

* fix helm chart testing

* update helm tests

* fix helm test

* fix use test values in ci

* fix busy box testing on helm

* fix test-values.yaml

* update helm tests

* fix test connection
2025-01-15 21:11:17 -08:00
Low Jian Sheng
9cfd7c790a
Prevent istio injection for db migrations cron job (#7513)
* allow option to prevent istio injection

* make annotations more generic

* make annotations empty

* revert comment deletions
2025-01-04 22:45:19 -08:00
Ishaan Jaff
688d513459
(feat) helm hook to sync db schema (#6715)
* v0 migration job

* fix job
2024-11-12 11:58:35 -08:00
Krish Dholakia
cc8dd80209
allow configuring httpx hooks for AsyncHTTPHandler (#6290) (#6415)
* allow configuring httpx hooks for AsyncHTTPHandler (#6290)

Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>

* Fixes and minor improvements for Helm Chart (#6402)

* reckoner hack

* fix default

* add extracontainers option

* revert chart

* fix extracontainers

* fix deployment

* remove init container

* update docs

* add helm lint to deploy step

* change name

* (refactor) prometheus async_log_success_event to be under 100 LOC  (#6416)

* unit testing for prometheus

* unit testing for success metrics

* use 1 helper for _increment_token_metrics

* use helper for _increment_remaining_budget_metrics

* use _increment_remaining_budget_metrics

* use _increment_top_level_request_and_spend_metrics

* use helper for _set_latency_metrics

* remove noqa violation

* fix test prometheus

* test prometheus

* unit testing for all prometheus helper functions

* fix prom unit tests

* fix unit tests prometheus

* fix unit test prom

* (refactor) router - use static methods for client init utils  (#6420)

* use InitalizeOpenAISDKClient

* use InitalizeOpenAISDKClient static method

* fix  # noqa: PLR0915

* (code cleanup) remove unused and undocumented logging integrations - litedebugger, berrispend  (#6406)

* code cleanup remove unused and undocumented code files

* fix unused logging integrations cleanup

* update chart version

* add circleci tests

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>

* fix: fix linting error

* fix(http_handler.py): fix linting error

---------

Co-authored-by: Alejandro Rodríguez <alejorro70@gmail.com>
Co-authored-by: Robert Brennan <accounts@rbren.io>
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>
2024-10-24 22:00:24 -07:00
Peter Laß
e8066f13cb
Update litellm helm envconfigmap (#5872)
* Added the possibility to use ConfigMapRef for env values.

* bumped appVersion to 1.46.6 and chartVersion to 0.3.0

---------

Co-authored-by: Peter Laß <peter.lass@maibornwolff.de>
2024-09-25 21:43:08 -07:00
Peter Laß
b1ecfe065c
fix #5614 (#5615)
Co-authored-by: Peter Laß <peter.lass@maibornwolff.de>
2024-09-10 09:26:44 -07:00
ishaan-jaff
54dceea4b0 (fix) releasing litellm helm as "litellm-helm" 2024-03-16 10:06:14 -07:00
Lunik
880213d4a6
Refresh Helm chart structure
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-02-16 22:59:14 +01:00
Shaun Maher
4d0e369e7a Authored a Helm chart for LiteLLM. Added GitHub workflows/actions to build and push the helm chart to the ghcr.io OCI registry. 2024-01-25 11:53:59 +11:00