litellm

History

Sameer Kankute dfd6cbc514 fix(vertex): propagate Vertex AI metadata in streaming success callbacks (#29899 ) * fix(vertex): propagate Vertex AI metadata in streaming success callbacks Streaming calls assembled via stream_chunk_builder were missing vertex_ai_grounding_metadata and vertex_ai_url_context_metadata in standard_logging_object.response. Merge metadata from chunks into the assembled response and mirror non-streaming hidden_params on Gemini chunks. Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(vertex): move streaming metadata merge into provider config hook Address review feedback by delegating assembled-stream metadata propagation to VertexGeminiConfig via BaseConfig.apply_assembled_streaming_response_metadata, and only write chunk hidden_params when metadata is non-empty. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(redaction): scrub Vertex provider metadata when message logging is off Clear vertex_ai_grounding_metadata and related fields from standard logging responses and assembled streaming ModelResponse objects so turn_off_message_logging cannot leak prompt-derived web search queries. Co-authored-by: Cursor <cursoragent@cursor.com> * Use assembled model for streaming metadata hook * Fix Vertex metadata redaction bypass in logging callbacks. Scrub Vertex provider fields from litellm_params.metadata.hidden_params during perform_redaction so streaming success_handler merges do not leak prompt-derived metadata when message logging is disabled. Co-authored-by: Cursor <cursoragent@cursor.com> * Fix Vertex streaming metadata from hidden params * fix(vertex): mirror vertex_ai_safety_results on assembled streaming responses The non-streaming transform_response stores safety data under vertex_ai_safety_results, but the streaming path only wrote vertex_ai_safety_ratings. Assembled streaming responses therefore never carried vertex_ai_safety_results, so any consumer reading that field saw a silent difference between streaming and non-streaming calls. Set vertex_ai_safety_results alongside vertex_ai_safety_ratings in the shared stream metadata setter and add it to the assembled metadata field list so it propagates through stream_chunk_builder. * fix(streaming): log provider streaming metadata hook failures instead of swallowing them * refactor(vertex): share single Vertex metadata field tuple across redaction and streaming * refactor(vertex): move Vertex metadata redaction helpers into llms/vertex_ai --------- Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: mateo-berri <277851410+mateo-berri@users.noreply.github.com>		2026-06-08 16:14:30 -07:00
..
a2a_protocol	Fix : a2a bugs 030626 (#29566 )	2026-06-03 11:14:15 -07:00
anthropic_interface	feat(context_management): compact_20260112 polyfill for non-Anthropic providers (#28868 )	2026-05-30 09:20:05 -07:00
assistants
batch_completion
batches	fix(vertex-ai): fix zero cost/usage on completed Vertex AI batch jobs (#27912 )	2026-05-15 04:47:02 -07:00
caching	Litellm oss staging 080626 (#29932 )	2026-06-08 13:49:52 -07:00
completion_extras	fix(responses-bridge): map system-only chat request to system input item (#29817 )	2026-06-06 16:11:54 -07:00
compression	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
containers
endpoints/speech/speech_to_completion_bridge
evals
experimental_mcp_client	feat(mcp): per-server env vars with global + per-user scopes (#28917 )	2026-06-05 20:15:11 -07:00
files	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
fine_tuning
google_genai	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
images
integrations	feat(galileo): add health check support for UI callback test (#29908 )	2026-06-08 13:57:03 -07:00
interactions	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
litellm_core_utils	fix(vertex): propagate Vertex AI metadata in streaming success callbacks (#29899 )	2026-06-08 16:14:30 -07:00
llms	fix(vertex): propagate Vertex AI metadata in streaming success callbacks (#29899 )	2026-06-08 16:14:30 -07:00
models	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
ocr	chore: reject bare str at file-input sinks to prevent local-file read (#27762 )	2026-05-12 16:40:07 -07:00
passthrough	fix(proxy): Bedrock Knowledge Base pass-through: preserve SigV4 headers and signed request body (#27526 )	2026-05-25 19:21:55 +05:30
proxy	fix(team_endpoints): don't block /team/update on unchanged team budget (#29525 )	2026-06-09 01:14:24 +03:00
proxy_auth
rag	style: black formatting	2026-04-25 14:47:54 -07:00
realtime_api	Litellm oss staging (#29492 )	2026-06-02 08:48:10 -07:00
repositories	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
rerank_api	Fix review	2026-04-30 09:10:24 +05:30
responses	Litellm oss staging 080626 (#29932 )	2026-06-08 13:49:52 -07:00
router_strategy	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
router_utils	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
search	Litellm oss staging 050626 (#29774 )	2026-06-05 13:51:51 -07:00
secret_managers	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
skills	chore(proxy): scope skills and container resources	2026-04-30 18:23:58 -07:00
types	fix(vertex): propagate Vertex AI metadata in streaming success callbacks (#29899 )	2026-06-08 16:14:30 -07:00
vector_store_files	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
vector_stores	feat(litellm): add models and repository layers (#29686 )	2026-06-06 20:59:33 -07:00
videos
__init__.py	feat: standardize rate limit errors with category, rate_limit_type, model, and llm_provider fields (#27687 )	2026-06-06 17:50:29 -07:00
_internal_context.py
_lazy_imports_registry.py	Litellm oss staging 050626 (#29774 )	2026-06-05 13:51:51 -07:00
_lazy_imports.py
_logging.py	fix(bedrock-mantle): use /anthropic/v1/messages path for Mantle endpo… (#27976 )	2026-05-15 13:31:59 -07:00
_redis_credential_provider.py	feat: add ability to auth to azure with token (#27556 )	2026-05-09 22:34:09 +00:00
_redis.py	Litellm oss staging (#28161 )	2026-05-18 16:27:44 -07:00
_service_logger.py	fix: missing span for guardrail passthrough (#29552 )	2026-06-03 01:25:15 +00:00
_uuid.py	build(deps-dev): bump black to 26.3.1 and apply formatting (#28525 )	2026-05-21 17:24:18 -07:00
_version.py
anthropic_beta_headers_config.json	fix(anthropic,bedrock,databricks): four reasoning_effort follow-ups	2026-05-03 10:03:53 -07:00
anthropic_beta_headers_manager.py
blog_posts.json
budget_manager.py	docs(budget_manager): add docstring to BudgetManager.reset_cost (#27867 )	2026-05-13 13:28:22 -07:00
constants.py	Litellm oss staging 080626 (#29932 )	2026-06-08 13:49:52 -07:00
cost_calculator.py	[internal copy of #27491 ] fix(realtime): Fix Realtime Audio Token Cost Tracking (#29722 )	2026-06-05 18:53:17 +05:30
cost.json
exceptions.py	feat: standardize rate limit errors with category, rate_limit_type, model, and llm_provider fields (#27687 )	2026-06-06 17:50:29 -07:00
main.py	fix(vertex): propagate Vertex AI metadata in streaming success callbacks (#29899 )	2026-06-08 16:14:30 -07:00
model_prices_and_context_window_backup.json	fix(fireworks): enable tool calling for glm-5p1 in model cost map (#29697 )	2026-06-08 15:54:19 -07:00
mypy.ini
policy_templates_backup.json
provider_endpoints_support_backup.json	Litellm oss staging 050626 (#29774 )	2026-06-05 13:51:51 -07:00
py.typed
router.py	Title: Fix managed batch cancel credential resolution (#29734 )	2026-06-06 12:35:18 -07:00
scheduler.py
setup_wizard.py	feat(anthropic): add Claude Opus 4.8 and prune reasoning-effort flags (#29238 )	2026-05-28 18:50:33 -07:00
timeout.py	docs: add class docstring to _LoopWrapper (#27870 )	2026-05-13 13:54:00 -07:00
utils.py	Litellm oss staging 080626 (#29932 )	2026-06-08 13:49:52 -07:00