Skip to main content

Supported Endpoints

AISIX exposes proxy endpoints for the request formats application teams already use. Start here when you need to know which client API shape to call before choosing a provider, model alias, or traffic policy.

Main API Families

API familyRoutesUse for
Chat completionsPOST /v1/chat/completionsOpenAI-compatible chat requests, streaming chat, tool calling, routed models, and ensemble models.
Anthropic messagesPOST /v1/messages, POST /v1/messages/count_tokensAnthropic-style messages requests and token counting for Anthropic-backed models.
ResponsesPOST /v1/responsesOpenAI Responses API clients and agent-style response flows.
Text completionsPOST /v1/completionsLegacy OpenAI-compatible text completion clients.
EmbeddingsPOST /v1/embeddingsVector embeddings through supported providers.
RerankPOST /v1/rerankDocument reranking through supported providers.
Image generationPOST /v1/images/generationsText-to-image generation requests.
Speech and audioPOST /v1/audio/transcriptions, POST /v1/audio/translations, POST /v1/audio/speechSpeech-to-text, translation, and text-to-speech requests.
Provider passthroughANY /passthrough/:provider/*restProvider-specific calls that should use AISIX authentication and quota checks without AISIX normalizing the request body.

Discovery and Health

EndpointUse for
GET /v1/modelsReturn the model aliases the caller API key can access. Use it when a client needs to discover gateway-facing model names.
GET /livezCheck whether the proxy listener is alive. Use it for proxy listener health checks, not for model or provider readiness.

Gateway Behavior

Modeled proxy routes share the same core gateway behavior: AISIX authenticates the caller API key, checks model access, resolves the requested model alias, applies configured controls, dispatches to the selected upstream provider, and records usage and telemetry when the route can be attributed to a model.

Some behavior is route-specific. For example, response caching applies to chat completions when a matching cache policy is configured, ensemble models are supported on chat completions, and token counting is limited to Anthropic-backed models. For provider and route constraints, see Provider Compatibility.

For exact request and response details, see the Proxy API Reference.

API7.ai Logo

The digital world is connected by APIs,
API7.ai exists to make APIs more efficient, reliable, and secure.

Sign up for API7 newsletter

Product

API7 Gateway

SOC2 Type IIISO 27001HIPAAGDPRRed Herring

Copyright © APISEVEN PTE. LTD 2019 – 2026. Apache, Apache APISIX, APISIX, and associated open source project names are trademarks of the Apache Software Foundation