Self-Hosted Quickstart

In this self-hosted quickstart, you will run AISIX AI Gateway locally, create the minimum resources needed for traffic, and send one request through the OpenAI-compatible proxy API.

AISIX AI Gateway uses etcd for self-hosted configuration storage. In this quickstart, Docker Compose starts a local etcd container and a local AISIX AI Gateway container.

This quickstart uses OpenAI as the example upstream provider. In AISIX, the client sends a caller API key, and the gateway uses the configured provider key when it calls the upstream provider.

The request follows this path:

The client sends the caller API key and the AISIX model name gpt-4o-mini. AISIX authenticates the client and uses the stored provider key to call OpenAI with the same upstream model. The client never sends the upstream OpenAI key.

Prerequisites

Install Docker with Docker Compose to start local etcd and AISIX AI Gateway containers.
Install cURL to send requests to the admin and proxy APIs.
Install jq to read IDs from admin API responses.
Prepare an OpenAI API key for an account with access to gpt-4o-mini and available quota.

Get AISIX AI Gateway

First, create the local files and start AISIX AI Gateway with Docker Compose.

Create a Working Directory

Create a directory for the local configuration files:

mkdir aisix-quickstart
cd aisix-quickstart

Create the Local Configuration

Create a config.yaml file for the local gateway container:

config.yaml
etcd:
  endpoints:
    - "http://etcd:2379"
  prefix: "/aisix"
  dial_timeout_ms: 5000
  request_timeout_ms: 5000

proxy:
  addr: "0.0.0.0:3000"

admin:
  addr: "0.0.0.0:3001"
  admin_keys:
    - "YOUR_ADMIN_KEY"

observability:
  service_name: "aisix"
  log_level: "info"

cache:
  backend: "memory"

Create the Compose Stack

Create a docker-compose.yml file:

docker-compose.yml
services:
  etcd:
    image: quay.io/coreos/etcd:v3.5.18
    command:
      - /usr/local/bin/etcd
      - --advertise-client-urls=http://etcd:2379
      - --listen-client-urls=http://0.0.0.0:2379

  aisix:
    image: ghcr.io/api7/aisix:dev
    volumes:
      - ./config.yaml:/etc/aisix/config.yaml:ro
    ports:
      - "3000:3000"
      - "3001:3001"
    depends_on:
      - etcd

Start AISIX AI Gateway

Start the local stack:

docker compose up -d

The gateway exposes the proxy listener on http://127.0.0.1:3000 and the admin listener on http://127.0.0.1:3001. Check both listeners:

curl -sS "http://127.0.0.1:3000/livez"
curl -sS "http://127.0.0.1:3001/livez"

Each command should return ok.

Create Gateway Resources

Next, create the provider key, model, and caller API key that AISIX needs before it can proxy traffic.

Export Local Variables

Export the values used by the commands below:

# Same value as admin.admin_keys in config.yaml.
export AISIX_ADMIN_KEY="YOUR_ADMIN_KEY"

# Replace with your OpenAI API key.
export OPENAI_API_KEY="YOUR_OPENAI_API_KEY"

# Caller API key for client requests.
export AISIX_API_KEY="YOUR_CALLER_API_KEY"

Create a Provider Key

Create a provider key that stores the OpenAI credential. The command prints the response and saves the returned ID for the next step:

PROVIDER_KEY_RESPONSE=$(curl -sS -X POST "http://127.0.0.1:3001/admin/v1/provider_keys" \
  -H "Authorization: Bearer ${AISIX_ADMIN_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "display_name": "openai-upstream",
    "provider": "openai",
    "adapter": "openai",
    "secret": "'"${OPENAI_API_KEY}"'",
    "api_base": "https://api.openai.com/v1"
  }')

printf '%s\n' "${PROVIDER_KEY_RESPONSE}" | jq .
PROVIDER_KEY_ID=$(jq -r '.id // empty' <<< "${PROVIDER_KEY_RESPONSE}")

For more information about provider, adapter, and base URL settings, see Provider Credentials.

Create a Model

This resource connects the client-facing AISIX model name to the upstream OpenAI model. In this example, both names are gpt-4o-mini.

curl -sS -X POST "http://127.0.0.1:3001/admin/v1/models" \
  -H "Authorization: Bearer ${AISIX_ADMIN_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "display_name": "gpt-4o-mini",
    "provider": "openai",
    "model_name": "gpt-4o-mini",
    "provider_key_id": "'"${PROVIDER_KEY_ID}"'"
  }'

Clients use gpt-4o-mini as the model value on the proxy API. The upstream provider receives the same model name.

Create a Client API Key

AISIX stores only the SHA-256 hash of the caller API key. Create the hash before writing the API key resource:

AISIX_API_KEY_HASH=$(printf '%s' "${AISIX_API_KEY}" | shasum -a 256 | awk '{print $1}')

Then create the API key resource that allows the caller API key to use gpt-4o-mini:

curl -sS -X POST "http://127.0.0.1:3001/admin/v1/apikeys" \
  -H "Authorization: Bearer ${AISIX_ADMIN_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "key_hash": "'"${AISIX_API_KEY_HASH}"'",
    "allowed_models": ["gpt-4o-mini"]
  }'

Verify Traffic

The proxy reads gateway resources after they are written to etcd. List the models visible to the caller API key:

curl -sS "http://127.0.0.1:3000/v1/models" \
  -H "Authorization: Bearer ${AISIX_API_KEY}" \
  | jq -r '.data[].id'

The output should include gpt-4o-mini. Now the gateway is running and the proxy can see the configured model.

Send a request to the gateway:

curl -sS -X POST "http://127.0.0.1:3000/v1/chat/completions" \
  -H "Authorization: Bearer ${AISIX_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "Say hello from AISIX AI Gateway."}
    ]
  }'

You should receive an OpenAI chat-completions response similar to the following:

{
  "id": "chatcmpl-DnHR8vV2AmxQioGhIXRVvEGcf22hC",
  "object": "chat.completion",
  "created": **********,
  "model": "gpt-4o-mini",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello from AISIX AI Gateway! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 15,
    "completion_tokens": 14,
    "total_tokens": 29
  }
}

Next Steps

You have now set up a local gateway, a caller API key, and an AISIX model name for sending provider-backed requests through AISIX. From here, read Resource Model to see how the gateway resources fit together.

To call the same gateway from application code, use the OpenAI SDK Quickstart or Anthropic SDK Quickstart.

Prerequisites​

Get AISIX AI Gateway​

Create a Working Directory​

Create the Local Configuration​

Create the Compose Stack​

Start AISIX AI Gateway​

Create Gateway Resources​

Export Local Variables​

Create a Provider Key​

Create a Model​

Create a Client API Key​

Verify Traffic​

Next Steps​