AI Proxy Advanced

AI License Required

OpenAI SDK: Multi-deployment chat routing (v3.8+)

Use separate Routes to map Azure OpenAI SDK requests to specific deployments of GPT-4o and GPT-4.1-mini.

Using the OpenAI SDK with the AI Proxy Advanced plugin, you can configure multiple Routes in Kong Gateway to represent different Azure OpenAI deployments. Each Route maps a unique path segment (such as azure-gpt-4-1-mini or azure-gpt-4o) to the corresponding deployment ID and model name. This setup allows you to use a single Azure-compatible OpenAI SDK client and switch between deployments by changing only the base URL.

For example:

from openai import OpenAI

client = OpenAI(
  api_key="anything",  # the SDK requires a value; Kong Gateway injects the real Azure key
  base_url="http://127.0.0.1:8000/openai/deployments/azure-gpt-4-1-mini"
)

Or:

client = OpenAI(
  api_key="anything",  # the SDK requires a value; Kong Gateway injects the real Azure key
  base_url="http://127.0.0.1:8000/openai/deployments/azure-gpt-4o"
)

Kong Gateway reads the deployment path, maps it to the appropriate Azure deployment ID and model, and handles authentication automatically.
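Since the deployment path segment is the only part of the request that changes between models, the switch can be sketched with a small helper. The `kong_base_url` function and the default host below are hypothetical, for illustration only; they are not part of the plugin or the SDK:

```python
# Minimal sketch: the only client-side difference between deployments
# is the deployment segment in the base URL. `kong_base_url` is a
# hypothetical helper, not a plugin or SDK function.
def kong_base_url(deployment: str, host: str = "http://127.0.0.1:8000") -> str:
    """Build the Kong proxy base URL for a given deployment path segment."""
    return f"{host}/openai/deployments/{deployment}"

# Kong matches this path to a Route and forwards the request
# to the mapped Azure deployment.
print(kong_base_url("azure-gpt-4o"))
# → http://127.0.0.1:8000/openai/deployments/azure-gpt-4o
```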

For this configuration to work properly, you need Routes with the following configuration:

routes:
 - name: azure-chat-gpt-4-1-mini
   paths:
     - "~/openai/deployments/azure-gpt-4-1-mini/chat/completions$"
   methods:
     - POST

and:

routes:
 - name: azure-chat-gpt-4o
   paths:
     - "~/openai/deployments/azure-gpt-4o/chat/completions$"
   methods:
     - POST
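Each Route then needs the AI Proxy Advanced plugin attached, pointing at the matching Azure deployment. The following is a sketch only; verify the field names against the ai-proxy-advanced schema for your Kong Gateway version, and the angle-bracket values are placeholders for your own Azure settings:

```yaml
plugins:
  - name: ai-proxy-advanced
    route: azure-chat-gpt-4o
    config:
      targets:
        - route_type: llm/v1/chat
          auth:
            header_name: api-key        # Azure OpenAI expects the key in this header
            header_value: <AZURE_API_KEY>
          model:
            provider: azure
            name: gpt-4o
            options:
              azure_instance: <AZURE_INSTANCE>
              azure_deployment_id: <AZURE_DEPLOYMENT>
```

A second plugin entry with `route: azure-chat-gpt-4-1-mini` and the corresponding model name and deployment ID would cover the other Route.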

For a complete walkthrough of this configuration, see the Azure OpenAI SDK example.

Prerequisites

  • Azure account

Environment variables

  • AZURE_API_KEY: The API key to authenticate requests to Azure.

  • AZURE_INSTANCE: The name of your Azure OpenAI instance.

  • AZURE_DEPLOYMENT: The deployment ID of the Azure OpenAI model you want to use.
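These variables can be exported in your shell before applying the configuration. The values below are placeholders, not real credentials:

```shell
# Placeholder values — substitute your own Azure OpenAI settings.
export AZURE_API_KEY='my-azure-api-key'
export AZURE_INSTANCE='my-openai-instance'
export AZURE_DEPLOYMENT='my-gpt-4o-deployment'
```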

Set up the plugin
