litellm/tests/proxy_e2e_anthropic_messages_tests/test_config.yaml
Mateo Wang 33c363d4d4
Extend the record/replay proxy to chat, embeddings, moderations, rerank, and Anthropic (#29847)
* test(ci): extend record/replay proxy to chat, embeddings, moderations, rerank, anthropic

The record/replay proxy that took the gpt-image-1 spend E2E off the live OpenAI
path now fronts every provider, so the other real-provider E2Es stop paying for
and depending on live calls each commit. It keys per upstream and selects a
non-OpenAI provider by a /__recorder_upstream/<host>/ path prefix carried on the
model's api_base, since some litellm handlers (cohere rerank) drop custom
request headers. Wired into build_and_test (chat, embeddings, moderations,
image), the otel job (cohere rerank), and the anthropic-messages job via a
reusable start_openai_record_replay_proxy command.

Dropped the time.time()/uuid prompt cache-busters in the build_and_test chat
tests, whose config has the response cache off, so identical requests are
recordable. The image spend test now asserts a repeat call still bills spend,
failing loudly if the proxy response cache is ever turned on.

Responses, the anthropic passthrough, bedrock, and fake-endpoint tests are left
live: their lifecycles, api_base assertions, providers, or fake targets make a
stateless body-keyed cache either break them or add nothing.

* docs(ci): note the recorder command's OpenAI default upstream and prefix override

Addresses a review note: the shared start_openai_record_replay_proxy command
defaults the upstream to OpenAI, so a non-OpenAI model must carry the
/__recorder_upstream/<host>/ prefix on its api_base. Document that in the
command description so a future caller does not assume the default follows the
provider.
2026-06-06 14:33:42 -07:00

59 lines
1.8 KiB
YAML

model_list:
- model_name: claude-sonnet-4-5-20250929
litellm_params:
model: "anthropic/claude-sonnet-4-5-20250929"
api_key: os.environ/ANTHROPIC_API_KEY
api_base: os.environ/RECORDER_ANTHROPIC_BASE_URL # In CI, routes through the record/replay proxy; unset elsewhere -> direct to Anthropic
- model_name: bedrock-claude-sonnet-3.5
litellm_params:
model: "bedrock/us.anthropic.claude-haiku-4-5-20251001-v1:0"
aws_region_name: "us-east-1"
- model_name: bedrock-claude-sonnet-4.6
litellm_params:
model: "bedrock/us.anthropic.claude-sonnet-4-6"
aws_region_name: "us-east-1"
- model_name: bedrock-claude-sonnet-4.5
litellm_params:
model: "bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0"
aws_region_name: "us-east-1"
- model_name: bedrock-claude-opus-4.5
litellm_params:
model: "bedrock/us.anthropic.claude-opus-4-5-20251101-v1:0"
aws_region_name: "us-east-1"
- model_name: bedrock-nova-pro
litellm_params:
model: "bedrock/us.amazon.nova-pro-v1:0"
aws_region_name: "us-east-1"
# Converse API models
- model_name: bedrock-converse-claude-sonnet-4.5
litellm_params:
model: "bedrock/converse/us.anthropic.claude-sonnet-4-5-20250929-v1:0"
aws_region_name: "us-east-1"
# Azure AI models
- model_name: azure-ai-claude-opus-4.5
litellm_params:
model: "azure_ai/claude-opus-4.5"
api_key: os.environ/AZURE_AI_API_KEY
api_base: os.environ/AZURE_AI_API_BASE
# Vertex AI models
- model_name: vertex-ai-claude-opus-4-6
litellm_params:
model: "vertex_ai/claude-opus-4-6"
vertex_ai_project: "pathrise-convert-1606954137718"
vertex_ai_location: "asia-southeast1"
general_settings:
forward_client_headers_to_llm_api: true
litellm_settings:
drop_params: true
modify_params: true