Release date 2025/12/18
Feature
-
Added support for Cerebras - a new AI Provider.
-
Added support for HuggingFace’s new serverless API in the AI Proxy plugin, enabling seamless integration and improved compatibility.
-
Added support for native format video generation in Bedrock and Gemini LLM drivers, allowing direct passthrough of provider-specific video generation requests without format conversion.
-
Added support for batch API of bedrock.
-
Introduced Alibaba Dashscope as new provider.
-
Added support for batch mode of anthropic.
-
Added support for inline batch mode of gemini
Bugfix
-
Added dimension configuration support for Amazon Titan Embed v2 models.
-
Fixed an issue where cohere embedding model on Bedrock returned a bad request error
-
Fixed an issue where
tls_certificate_verifyconfiguration was not respected in JSON-RPC and AWS STS calls. -
Added tool_calls passthrough and preserved finish_reason in Huggingface driver responses.
-
Fixed an issue where HuggingFace embedding driver were incorrectly parsed responses from embedding API
-
Fixed an issue where extra inputs were not permitted for huggingface inference provider
-
Fixed an issue where using ai-proxy-advanced in conjunction with a logging plugin (such as file-log) resulting in missing information on the last entry of the balancer “tries” section.