Z.AI: GLM 4.7 Flash

GPU TEE

Deprecated

Chat

z-ai/glm-4.7-flash

Created Jan 20, 2026|203K context|$0.10/M input tokens|$0.43/M output tokens

Intel TDX|NVIDIA CC|BETA

GLM-4.7-Flash is a 30B-class state-of-the-art model that balances performance and efficiency. It is further optimized for agentic coding, with stronger coding capabilities, long-horizon task planning, and tool collaboration. It has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Providers for Z.AI: GLM 4.7 Flash

RedPill routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

phala|Total Context: 203K|Input: $0.10/M|Output: $0.43/M
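
The per-token prices listed above translate into a request cost by simple arithmetic. The following is a minimal sketch of that calculation; `estimateCostUsd` is a hypothetical helper written for illustration, not part of any RedPill SDK, and it assumes billing is strictly proportional to token counts with no minimums or other fees.

```javascript
// Prices copied from the listing (USD per one million tokens).
const INPUT_USD_PER_MILLION = 0.10;
const OUTPUT_USD_PER_MILLION = 0.43;

// Hypothetical helper: estimates the cost of one request in USD,
// assuming purely proportional per-token billing.
function estimateCostUsd(inputTokens, outputTokens) {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: a prompt near the 203K context limit with a 10K-token reply.
console.log(estimateCostUsd(200_000, 10_000).toFixed(4));
```

Because output tokens cost roughly four times as much as input tokens here, long replies can dominate the bill even when the prompt is large.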

API

RedPill provides an OpenAI-compatible completion API for all models and providers. You can call it directly or through the OpenAI SDK. Some third-party SDKs are also available.

fetch("https://api.redpill.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer <YOUR-REDPILL-API-KEY>",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "z-ai/glm-4.7-flash",
    "messages": [
      {
        "role": "user",
        "content": "What is the meaning of life?"
      }
    ]
  })
})
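
The request above does not read the response. The sketch below builds the same request and extracts the reply text, assuming the response follows the OpenAI chat-completion shape (`choices[0].message.content`), which is what "OpenAI-compatible" suggests but which this page does not spell out. The helper names `buildChatRequest`, `extractReply`, and `askGlm` are hypothetical.

```javascript
const REDPILL_CHAT_URL = "https://api.redpill.ai/v1/chat/completions";

// Hypothetical helper: builds the same fetch options as the snippet above.
function buildChatRequest(apiKey, userPrompt) {
  return {
    method: "POST",
    headers: {
      "Authorization": `Bearer ${apiKey}`,
      "Content-Type": "application/json"
    },
    body: JSON.stringify({
      model: "z-ai/glm-4.7-flash",
      messages: [{ role: "user", content: userPrompt }]
    })
  };
}

// Assumption: the response body uses the OpenAI chat-completion shape.
function extractReply(responseJson) {
  const content = responseJson?.choices?.[0]?.message?.content;
  if (typeof content !== "string") {
    throw new Error("Unexpected response shape: no choices[0].message.content");
  }
  return content;
}

// Sends the request and returns the assistant's reply text.
async function askGlm(apiKey, userPrompt) {
  const response = await fetch(REDPILL_CHAT_URL, buildChatRequest(apiKey, userPrompt));
  if (!response.ok) {
    throw new Error(`Request failed with HTTP ${response.status}`);
  }
  return extractReply(await response.json());
}
```

A call might look like `askGlm("<YOUR-REDPILL-API-KEY>", "What is the meaning of life?").then(console.log)`. Checking `response.ok` before parsing keeps authentication and quota errors from surfacing as confusing parse failures.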
