litellm

History

Krrish Dholakia 386f334fee Prompt Compression - add it to the proxy (#25729 ) * refactor: new agentic loop event hook simplifies how to create logic for tool based multi llm calls * fix: compress - make it work on anthropic input as well * fix(compress.py): working prompt compression for claude code ensures claude code messages can run through proxy easily * docs: add agentic loop hook guide * docs: add agentic_loop_hook to sidebar * fix: fix multiple arguments error * fix: fix tool call loop for compression on streaming /v1/messages * fix: fix linting errors * fix: fix ci/cd errors * feat(litellm_pre_call_utils.py): use claude code session for litellm session id allows claude code logs to be stitched together, making it easy to know they were all part of the same conversation * fix: suppress incorrect mypy warning rE: module * revert: drop PR's changes to litellm/proxy/_experimental/out/ Restores the 34 HTML files under _experimental/out/ to their pre-PR paths (X/index.html -> X.html). All renames are R100 (content unchanged); no other files are touched. * fix: address greptile review comments on PR #25729 - Skip ``kwargs["tools"] = []`` injection when compression is a no-op — Anthropic Messages rejects empty tool arrays on requests that did not originally declare tools. - Move agentic-loop safety guards (fingerprint cycle / max depth) out of the per-callback try/except so they propagate instead of being swallowed by the generic exception handler. Extracted _check_agentic_loop_safety. - Gate generic ``x-<vendor>-session-id`` capture behind the LITELLM_CAPTURE_VENDOR_SESSION_HEADERS env var (off by default) to preserve backwards compatibility; explicit x-litellm-* headers are unaffected. - Fix monkeypatch target in pre-call-hook test to patch the actual module-level binding (litellm.integrations.compression_interception.handler.compress). - Add regression tests for empty-tools skip and opt-in session capture. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * revert: drop LITELLM_CAPTURE_VENDOR_SESSION_HEADERS flag Generic x-<vendor>-session-id header capture is a new feature and only runs after the explicit x-litellm-trace-id / x-litellm-session-id checks, so it does not change behavior for any existing caller that was already using the LiteLLM headers — no backwards-incompatibility to gate. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor(compress): replace input_type with CallTypes call_type Drop the bespoke ``CompressionInputType`` literal and use the existing ``litellm.types.utils.CallTypes`` enum instead. ``litellm.compress()`` now takes ``call_type: Union[CallTypes, str]`` (default ``CallTypes.completion``) — no new concept to learn, and the enum is already the way the rest of the codebase talks about request shapes. Supported values: ``completion`` / ``acompletion`` (OpenAI chat-completions shape) and ``anthropic_messages`` (Anthropic structured content blocks). Updated: compress(), the compression_interception handler, tests, docs, and the two eval scripts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>		2026-04-20 15:08:00 -07:00
..
health_check	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
benchmark_mock.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
benchmark_proxy_vs_provider.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00
create_litellm_branch.ps1	feat: add script to create branches with litellm_ prefix (#17606 )	2025-12-06 10:41:39 -08:00
create_litellm_branch.sh	enhance: create_litellm_branch tool to be more robust (#17874 )	2025-12-12 05:35:50 -08:00
create_team_key_and_submit_guardrail.sh	feat(guardrails): team-based guardrail registration and approval workflow (#22459 )	2026-03-02 22:06:49 -08:00
eval_compression.py	Prompt Compression - add it to the proxy (#25729 )	2026-04-20 15:08:00 -07:00
install.sh	build: migrate packaging, CI, and Docker from Poetry to uv (#25007 )	2026-04-09 11:46:23 -07:00
mock_grayswan_timeout_server.py	implement failopen option default to True on grayswan guardrail (#18266 )	2026-01-06 15:17:05 +05:30
test_agent_mcp_endpoints.sh	Agents - assign tools (#22064 )	2026-02-25 11:44:30 -08:00
test_guardrails_register_endpoints.sh	feat(guardrails): team-based guardrail registration and approval workflow (#22459 )	2026-03-02 22:06:49 -08:00
test_tool_allowlist_script.py	style: run black formatter on files from main merge	2026-04-17 13:02:59 -07:00