Release date 2025/11/18
Bugfix
- Fixed an issue where Cohere and Hugging Face models did not work with the semantic cache because of a polluted request format.
- Fixed an issue where the plugin did not report time to first token (TTFT) metrics on a cache hit.