qwen logo

Qwen: Qwen3.5 397B A17B

GPU TEE
Chat
qwen/qwen3.5-397b-a17b
Created Feb 28, 2026|262K context|$0.55/M input tokens|$3.50/M output tokens|$0.225/M cache read
Intel TDXNVIDIA CC

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers state-of-the-art performance comparable to leading-edge models across a wide range of tasks, including language understanding, logical reasoning, code generation, agent-based tasks, image understanding, video understanding, and graphical user interface (GUI) interactions. With its robust code-generation and agent capabilities, the model exhibits strong generalization across diverse agent.

Providers for Qwen: Qwen3.5 397B A17B

RedPill routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

phala logo
phala
Total Context
262K
Input
$0.55/M
Output
$3.50/M
Cache Read
$0.225/M

API

RedPill provides an OpenAI-compatible completion API to all models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

fetch("https://api.redpill.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer <YOUR-REDPILL-API-KEY>",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "qwen/qwen3.5-397b-a17b",
    "messages": [
      {
        "role": "user",
        "content": "What is the meaning of life?"
      }
    ]
  })
})