litellm

Author	SHA1	Message	Date
Yassin Kortam	b5d3a5fc85	feat: add read-replica routing for Prisma DB via DATABASE_URL_READ_REPLICA (#27493 ) - Introduce RoutingPrismaWrapper that transparently routes read operations (find_*, count, group_by, query_raw, query_first) to a reader endpoint while writes remain on the writer, enabling Aurora-style reader/writer endpoint splits - Add IAMEndpoint dataclass and parse_iam_endpoint_from_url() to capture static connection fields from a reader URL so only the IAM token needs to rotate, avoiding the need for separate DATABASE_HOST_READ_REPLICA/etc. env vars - Enhance PrismaWrapper with per-instance knobs (db_url_env_var, iam_endpoint, recreate_uses_datasource, log_prefix) so writer and reader wrappers are independent: the reader writes its fresh URL to DATABASE_URL_READ_REPLICA and passes datasource override to Prisma since Prisma only auto-reads DATABASE_URL - Fix deadlock in PrismaWrapper.__getattr__: when called from inside a running event loop, schedule the token refresh as a background task instead of blocking with run_coroutine_threadsafe + future.result(), which would deadlock the loop thread waiting for a coroutine that needs the loop to run - Fix botocore crash when DATABASE_PORT is unset by defaulting to "5432" in both proxy_cli.py and PrismaWrapper.get_rds_iam_token(); passing None caused botocore to embed the literal string "None" in the presigned URL - Implement graceful reader degradation: reader connect/recreate failures are non-fatal; wrapper sets _reader_unavailable=True and silently routes reads to the writer to keep the proxy serving traffic during transient reader outages - Add PrismaClient.writer_db property so the reconnect smoke-test always validates the writer engine specifically; query_raw on the routing wrapper would route to the reader and not verify the newly-recreated writer - Expose DATABASE_URL_READ_REPLICA in Helm chart (values.yaml + deployment.yaml) via both plain value and secret key reference, and document the field in docker-compose.yml - Add 887-line test suite covering routing logic, IAM token refresh paths, reader degradation scenarios, datasource override behavior, and the deadlock regression Co-authored-by: Yassin Kortam <yassinkortam@g.ucla.edu>	2026-05-08 21:05:50 -07:00
Krrish Dholakia	f90dea7315	fix(docker-compose.yml): move to docker.litellm.ai	2025-12-16 08:50:34 +05:30
Sergei Silnov	f5dc8b7c38	fix: Use python instead of wget for healthcheck in docker-compose.yml (#17646 ) Fixes #17645	2025-12-08 18:54:54 -08:00
Pablo Gomez	f27a823256	Deletion of unnecessary and error causing volume section comment (#15425 )	2025-10-10 17:51:27 -07:00
Mateo Di Loreto	6e5fe51184	add openssl in apk install in runtime stage in dockerfile.non_root (#13168 ) * add openssl in apk install in runtime stage in dockerfile.non_rootdocker-compose logs -f litellm * Improve Docker-compose.yaml for local debugging --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>	2025-07-31 21:52:11 -07:00
Kowyo	c35003bba0	fix: remove obsolete attribute `version` in docker compose (#13172 ) Fix the warning: WARN[0000] docker-compose.yml: the attribute `version` is obsolete, it will be ignored, please remove it to avoid potential confusion	2025-07-30 22:56:06 -07:00
Andy Gajdosik	828f9491dd	Fix #9295 docker-compose healthcheck test uses curl but curl is not in the image (#9737 ) Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-05-26 10:19:59 -07:00
Krish Dholakia	290e2528cd	Schedule budget resets at expectable times (#10331 ) (#10333 ) * Schedule budget resets at expectable times (#10331) * Enhance budget reset functionality with timezone support and standardized reset times - Added `get_next_standardized_reset_time` function to calculate budget reset times based on specified durations and timezones. - Introduced `timezone_utils.py` to manage timezone retrieval and budget reset time calculations. - Updated budget reset logic in `reset_budget_job.py`, `internal_user_endpoints.py`, `key_management_endpoints.py`, and `team_endpoints.py` to utilize the new timezone-aware reset time calculations. - Added unit tests for the new reset time functionality in `test_duration_parser.py`. - Updated `.gitignore` to include `test.py` and made minor formatting adjustments in `docker-compose.yml` for consistency. * Fixed linting * Fix for mypy * Fixed testcase for reset * fix(duration_parser.py): move off zoneinfo - doesn't work with python 3.8 * test: update test * refactor: improve budget reset time calculation and update related tests for accuracy * clean up imports in team_endpoints.py * test: update budget remaining hours assertions to reflect new reset time logic * build(model_prices_and_context_window.json): update model --------- Co-authored-by: Prathamesh Saraf <pratamesh1867@gmail.com>	2025-04-29 20:59:44 -07:00
Ishaan Jaff	b19529a46e	fix docker compose	2025-03-25 07:03:43 -07:00
Ishaan Jaff	08a4ba1b7e	Merge branch 'main' into litellm_exp_mcp_server	2025-03-24 19:03:56 -07:00
xucai	3382ca2f60	add healthcheck	2025-03-13 22:38:37 +08:00
xucai	3944f67b1a	add healthcheck	2025-03-13 22:32:19 +08:00
xucai	9a03e1dfd8	add postgres-volumes comment	2025-03-12 03:30:14 +08:00
xucai	c771a7aab4	feat/postgres-volumes	2025-02-23 16:17:39 +08:00
Sebastian Sosa	64ccf4cd6e	expose port & required env vars & instructions for running in dev (#8404 )	2025-02-08 16:29:44 -08:00
Marcos Cannabrava	c0a7e8352f	docs: cleanup docker compose comments (#7414 ) * docs: cleanup docker compose comments * pr template: fix typo	2024-12-25 16:10:31 -08:00
Krish Dholakia	d46660ea0f	LiteLLM Minor Fixes & Improvements (09/18/2024) (#5772 ) * fix(proxy_server.py): fix azure key vault logic to not require client id/secret * feat(cost_calculator.py): support fireworks ai cost tracking * build(docker-compose.yml): add lines for mounting config.yaml to docker compose Closes https://github.com/BerriAI/litellm/issues/5739 * fix(input.md): update docs to clarify litellm supports content as a list of dictionaries Fixes https://github.com/BerriAI/litellm/issues/5755 * fix(input.md): update input.md to include all message values * fix(image_handling.py): follow image url redirects Fixes https://github.com/BerriAI/litellm/issues/5763 * fix(router.py): Fix model key/base leak in error message Fixes https://github.com/BerriAI/litellm/issues/5762 * fix(http_handler.py): fix linting error * fix(azure.py): fix logging to show azure_ad_token being used Fixes https://github.com/BerriAI/litellm/issues/5767 * fix(_redis.py): add redis sentinel support Closes https://github.com/BerriAI/litellm/issues/4381 * feat(_redis.py): add redis sentinel support Closes https://github.com/BerriAI/litellm/issues/4381 * test(test_completion_cost.py): fix test * Databricks Integration: Integrate Databricks SDK as optional mechanism for fetching API base and token, if unspecified (#5746) * LiteLLM Minor Fixes & Improvements (09/16/2024) (#5723) * coverage (#5713) Signed-off-by: dbczumar <corey.zumar@databricks.com> * Move (#5714) Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix(litellm_logging.py): fix logging client re-init (#5710) Fixes https://github.com/BerriAI/litellm/issues/5695 * fix(presidio.py): Fix logging_hook response and add support for additional presidio variables in guardrails config Fixes https://github.com/BerriAI/litellm/issues/5682 * feat(o1_handler.py): fake streaming for openai o1 models Fixes https://github.com/BerriAI/litellm/issues/5694 * docs: deprecated traceloop integration in favor of native otel (#5249) * fix: fix linting errors * fix: fix linting errors * fix(main.py): fix o1 import --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> * feat(spend_management_endpoints.py): expose `/global/spend/refresh` endpoint for updating material view (#5730) * feat(spend_management_endpoints.py): expose `/global/spend/refresh` endpoint for updating material view Supports having `MonthlyGlobalSpend` view be a material view, and exposes an endpoint to refresh it * fix(custom_logger.py): reset calltype * fix: fix linting errors * fix: fix linting error * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix: fix import * Fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * DB test Signed-off-by: dbczumar <corey.zumar@databricks.com> * Coverage Signed-off-by: dbczumar <corey.zumar@databricks.com> * progress Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix test name Signed-off-by: dbczumar <corey.zumar@databricks.com> --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> * test: fix test * test(test_databricks.py): fix test * fix(databricks/chat.py): handle custom endpoint (e.g. sagemaker) * Apply code scanning fix for clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix(__init__.py): fix known fireworks ai models --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2024-09-19 13:25:29 -07:00
Krrish Dholakia	d9539e518e	build(docker-compose.yml): add prometheus scraper to docker compose persists prometheus data across restarts	2024-07-24 10:09:23 -07:00
Wanis Elabbar	d7556020b3	Fix errors with docker-compose file The Docker Compose file is causing an error during the healthcheck, stating "cannot find role 'account used to run compose'". I've modified the file to set a database, username, and password, and ensured the database and username are configured correctly in the healthcheck.	2024-07-22 16:45:59 +01:00
Ishaan Jaff	5e12364fad	update docker compose to show how to pass a config.yaml	2024-07-09 17:59:02 -07:00
Krrish Dholakia	ce4ba80fd4	build(docker-compose.yml): load local .env in docker compose quick start	2024-06-01 16:27:07 -07:00
Krrish Dholakia	9b4a19b3aa	build(docker-compose.yml): startup docker compose with postgres	2024-06-01 16:18:12 -07:00
Krrish Dholakia	30798126eb	build(docker-compose.yml): fix default docker compose to run with config	2024-04-09 16:27:03 -07:00
Ishaan Jaff	a8fa4e28e9	Update docker-compose.yml	2024-03-02 17:19:11 -08:00
ishaan-jaff	9bb55a0671	add clickhouse docs	2024-02-26 18:31:10 -08:00
ishaan-jaff	031e0eabf8	(v0) start clickhouse	2024-02-26 14:18:56 -08:00
ishaan-jaff	8d3897592e	(fix) docker compose	2024-02-16 14:01:27 -08:00
ishaan-jaff	bdb7c0a0a7	(ci/cd) docker compose up with ui	2024-01-25 17:13:19 -08:00

28 Commits