Release date 2025/09/05
Feature
-
Support for Gemini Rerank in native format
Bugfix
-
Fix panic in LLM observability when populating token usage for AI Proxy token usage details
-
AI Proxy Advanced: Fixed an issue where gemini provider not support model garden
-
Fixed an issue where Mistral models would return
Unsupported field: seed
when using some inference libraries. -
Skip unknown cached details key from o11y observation.
-
Fixed an issue where array input is not recognized as valid request for some providers. Bedrock, Gemini Public and Mistral provider don’t accept array input as before.
-
Fixed an issue where some Titan embeddings model reported malformed request.