Achieving Sub-Millisecond Proxy Overhead
Our Q1 performance target and architectural direction for achieving sub-millisecond proxy overhead on modest hardware.
Guides, announcements, and best practices from the LiteLLM team.
Our Q1 performance target and architectural direction for achieving sub-millisecond proxy overhead on modest hardware.
Guide to using Gemini 3 Flash on LiteLLM Proxy and SDK with day 0 support.
Guide to Claude Opus 4.5 and advanced features in LiteLLM: Tool Search, Programmatic Tool Calling, and Effort Parameter.
Common questions and best practices for using gemini-3-pro-preview with LiteLLM Proxy and SDK.