You can proxy requests to Cerebras AI models through AI Gateway using the AI Proxy and AI Proxy Advanced plugins. This reference documents all supported AI capabilities, configuration requirements, and provider-specific details needed for proper integration.
Cerebras provider
Upstream paths
AI Gateway automatically routes requests to the appropriate Cerebras API endpoints. The following table shows the upstream paths used for each capability.
| Capability | Upstream path or API |
|---|---|
| Chat completions | /v1/chat/completions |
Supported capabilities
The following tables show the AI capabilities supported by Cerebras provider when used with the AI Proxy or the AI Proxy Advanced plugin.
Set the plugin’s
route_typebased on the capability you want to use. See the tables below for supported route types.
Text generation
Support for Cerebras basic text generation capabilities including chat, completions, and embeddings:
| Capability | Route type | Streaming | Model example | Min version |
|---|---|---|---|---|
| Chat completions | llm/v1/chat |
llama-3.3-70b | 3.13 |
Cerebras base URL
The base URL is https://api.cerebras.ai/{route_type_path}, where {route_type_path} is determined by the capability.
AI Gateway uses this URL automatically. You only need to configure a URL if you’re using a self-hosted or Cerebras-compatible endpoint, in which case set the upstream_url plugin option.
Configure Cerebras with AI Proxy
To use Cerebras with AI Gateway, configure the AI Proxy or AI Proxy Advanced.
Here’s a minimal configuration for chat completions:
For more configuration options and examples, see: