AI PII Sanitizer

AI License Required

Overview Examples Guides Configuration reference Changelog

How it works

The AI PII Sanitizer plugin can be applied to:

Input data (requests)
Output data (responses) v3.12+
Both input and output data v3.12+

Here’s how it works if you apply it to both requests and responses:

The plugin intercepts the request body and sends it to the external PII service.
1. The PII service detects sensitive data and applies the chosen sanitization method (placeholders or synthetic replacements).
The sanitized request is forwarded upstream with the AI Proxy or AI Proxy Advanced plugin.
On the way back, the plugin intercepts the response body and sends it to the external PII service. v3.12+
1. The PII service detects sensitive data and applies the chosen sanitization method (placeholders or synthetic replacements).
(Only applies to input data sanitization) If restoration is enabled, the plugin restores the original request data in responses before returning them to the client.

 
sequenceDiagram
    autonumber
    participant Client
    participant Plugin as AI PII Sanitizer
    participant PII as PII Service
    participant Proxy as AI Proxy/Advanced
    participant AI as Upstream AI Service

    Client->>Plugin: Send request
    Plugin->>PII: Intercept & send request body
    PII->>PII: Detect sensitive data in request
    PII->>Plugin: Return sanitized request
(placeholders/synthetic data)
    Plugin->>Proxy: Forward sanitized request
    Proxy->>AI: Process sanitized request
    AI->>Proxy: Return AI response
    Proxy->>Plugin: Forward response
    Plugin->>PII: Intercept & send response body
    PII->>PII: Detect sensitive data in response
    PII->>Plugin: Return sanitized response
(placeholders/synthetic data)
    Plugin->>Client: Return sanitized response

Figure 1: Diagram showing the request and response flow with the AI PII Sanitizer plugin.

AI PII Anonymizer service

Kong provides several AI PII Anonymizer service Docker images in a private repository. Each image includes a built-in NLP model and is tagged using the version-lang_code format. For example:

service:v0.1.4-en: English model, version 0.1.4
service:v0.1.4-it: Italian model, version 0.1.4
service:v0.1.4-fr: French model, version 0.1.4

All models are bundled into a single image per version, tagged using the format v<version>. For example: v0.1.4 If you need to add or modify models, edit the configuration file at ai_pii_service/nlp_engine_conf.yml.

Access the Docker images

Kong distributes these images via a private Cloudsmith registry. Contact Kong Support to request access.

Authenticate with the private Cloudsmith registry

To pull images, you must authenticate first with the token provided by the Support:

docker login docker.cloudsmith.io

Copied!

Docker will then prompt you to enter username and password:

Username: kong/ai-pii
Password: YOUR-TOKEN

Copied!

This is a token-based login with read-only access. You can pull images but not push them.

Pull the AI PII service image

To pull an image:

docker pull docker.cloudsmith.io/kong/ai-pii/IMAGE-NAME:TAG

Copied!

Replace IMAGE-NAME and TAG with the appropriate image and version, such as:

docker pull docker.cloudsmith.io/kong/ai-pii/service:v0.1.4-en

Copied!

AI PII service Dockerfile usage

To use an image in a Dockerfile, reference it as follows:

FROM docker.cloudsmith.io/kong/ai-pii/ai-pii-service:v0.1.4-en

Copied!

Available language tags

The following language-specific images are currently available:

-en (English)
-es (Spanish)
-fr (French)
-de (German)
-it (Italian)
-ja (Japanese)
-ko (Korean)
-pt (Portuguese)
-tr (Turkish)

The PII Anonymizer service loads one NLP model by default. Ensure at least 600MB of free memory is available when running the container.

Image configuration options

This service takes the following optional environment variables at startup:

GUNICORN_WORKERS: Specifies the number of Gunicorn processes to run
PII_SERVICE_ENGINE_CONF: Specifies the natural language processing (NLP) engine configuration file
GUNICORN_LOG_LEVEL: Specifies log level

Sanitization endpoints

POST /llm/v1/sanitize: Sanitize specified types of PII information, including credentials, and custom patterns
POST /llm/v1/sanitize_credentials: Only for sanitizing credentials

Available anonymization modes

You can anonymize data in requests using the following redact modes:

placeholder: Replaces sensitive data with a fixed placeholder pattern, PLACEHOLDER{i}, where i is a sequence number. Identical original values receive the same placeholder.

For example, the location New York City might be replaced with LOCATION.
synthetic: Redact the sensitive data with a word in the same type.

For example, the name John might be replaced with Amir.
- Custom patterns are replaced with CUSTOM{i}.
- Credentials are replaced with a string of # characters matching the original length.

Custom patterns

You can define an array of custom patterns on a per-request basis. Currently, only regex patterns are supported, and all fields are required: name, regex, and score.

The name must be unique for each pattern.

Fields that can be anonymized

You can use the following fields in the anonymize array:

general: Anonymizes general PII entities such as person names, locations, and organizations.
phone: Anonymizes phone numbers (for example, mobile, landline).
email: Anonymizes email addresses.
creditcard: Anonymizes credit card numbers.
crypto: Anonymizes cryptocurrency addresses.
date: Anonymizes dates and timestamps.
ip: Anonymizes IP addresses (both IPv4 and IPv6).
nrp: Anonymizes a person’s nationality, religious, or political group.
ssn: Anonymizes Social Security Numbers (SSN) and other related identifiers like ITIN, NIF, ABN, and more.
domain: Anonymizes domain names. It was deprecated, use url instead.
url: Anonymizes web URLs.
medical: Anonymizes medical identifiers (for example, medical license numbers, NHS numbers, medicare numbers).
driverlicense: Anonymizes driver’s license numbers.
passport: Anonymizes passport numbers.
bank: Anonymizes bank account numbers and related banking identifiers (for example, VAT codes, IBAN).
nationalid: Anonymizes various national identification numbers (for example, Aadhaar, PESEL, NRIC, social security, or voter IDs).
custom: Anonymizes user-defined custom PII patterns using regular expressions only when custom patterns are provided.
credentials: Anonymizes the credentials, similar to /sanitize_credentials.
all: Includes all the fields above, including custom ones.

FAQs

Can I use a custom PII anonymization service instead of Kong’s AI PII Anonymizer?

To use a custom PII service, configure the Request Callout or Datakit plugin to:

Send the request payload to your PII service.
Receive the sanitized response.
Forward the transformed payload to the upstream service.

Your custom service must implement Kong’s PII service interface if you want to use the AI PII Sanitizer plugin with it.

AI PII Sanitizer

How it works

AI PII Anonymizer service

Access the Docker images

Authenticate with the private Cloudsmith registry

Pull the AI PII service image

AI PII service Dockerfile usage

Available language tags

Image configuration options

Sanitization endpoints

Available anonymization modes

Custom patterns

Fields that can be anonymized

FAQs

Help us make these docs great!

Still need help?