phala logo

Phala: Qwen3.6 35B-A3B Uncensored (Aggressive)

GPU TEE
Chat
phala/qwen3.6-35b-a3b-uncensored
Created May 23, 2026|131K context|$0.30/M input tokens|$1.50/M output tokens
Intel TDXNVIDIA CC

Uncensored "Aggressive" variant of Qwen3.6-35B-A3B from Alibaba's Qwen team. The fine-tune by HauhauCS removes refusal behaviors (0/465 refusals) without modifying datasets or core capabilities. The base architecture is a 35B-parameter Mixture-of-Experts model with 256 experts routing 8 per token (~3B active params), 40 layers, and a hybrid linear+full-softmax attention mechanism (3:1 ratio). Supports a native 262K context and is natively multimodal across text, images, and video. Served on Phala in TDX-attested H200 enclave with end-to-end ECDSA response signing; FP8 quantization by lamianlbe.

Providers for Phala: Qwen3.6 35B-A3B Uncensored (Aggressive)

RedPill routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

phala logo
phala
Total Context
131K

API

RedPill provides an OpenAI-compatible completion API to all models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

fetch("https://api.redpill.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer <YOUR-REDPILL-API-KEY>",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "phala/qwen3.6-35b-a3b-uncensored",
    "messages": [
      {
        "role": "user",
        "content": "What is the meaning of life?"
      }
    ]
  })
})