Phala: Qwen3.6 35B-A3B Uncensored (Aggressive)

GPU TEE

phala/qwen3.6-35b-a3b-uncensored

Created May 23, 2026|131K context|$0.30/M input tokens|$1.50/M output tokens

Intel TDXNVIDIA CC

Uncensored "Aggressive" variant of Qwen3.6-35B-A3B from Alibaba's Qwen team. The fine-tune by HauhauCS removes refusal behaviors (0/465 refusals) without modifying datasets or core capabilities. The base architecture is a 35B-parameter Mixture-of-Experts model with 256 experts routing 8 per token (~3B active params), 40 layers, and a hybrid linear+full-softmax attention mechanism (3:1 ratio). Supports a native 262K context and is natively multimodal across text, images, and video. Served on Phala in TDX-attested H200 enclave with end-to-end ECDSA response signing; FP8 quantization by lamianlbe.

Providers for Phala: Qwen3.6 35B-A3B Uncensored (Aggressive)

RedPill routes requests across these providers with automatic fallbacks to maximize uptime. Pricing is unified — you pay the same price no matter which provider serves your request.

Total Context

131K

Input

$0.30/M

Output

$1.50/M

Provider	TTFT	Throughput	Uptime
phala

API

RedPill provides a unified completion API to all models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

fetch("https://api.redpill.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer <YOUR-REDPILL-API-KEY>",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "phala/qwen3.6-35b-a3b-uncensored",
    "messages": [
      {
        "role": "user",
        "content": "What is the meaning of life?"
      }
    ]
  })
})

Verify Evidence

Confidential GPU-TEE responses carry two proof layers you can check yourself: a nonce-bound attestation report for the gateway, and a signed receipt that binds your request and response to an attested upstream session.

# 1. Attest the gateway (nonce-bound, proves which TEE workload serves you)
NONCE="$(openssl rand -hex 16)"
curl -s "https://api.redpill.ai/v1/aci/attestation?nonce=$NONCE" \
  -H "Authorization: Bearer $REDPILL_API_KEY" -o report.json

# 2. Call the model and capture the x-receipt-id response header
curl -s "https://api.redpill.ai/v1/chat/completions" -D headers.txt \
  -H "Authorization: Bearer $REDPILL_API_KEY" -H "Content-Type: application/json" \
  -d '{"model":"phala/qwen3.6-35b-a3b-uncensored","messages":[{"role":"user","content":"Hello"}]}' -o response.json
RECEIPT_ID="$(grep -i ^x-receipt-id headers.txt | tr -d '\r' | awk '{print $2}')"

# 3. Fetch the signed receipt, then follow it to the attested session
curl -s "https://api.redpill.ai/v1/aci/receipts/$RECEIPT_ID" \
  -H "Authorization: Bearer $REDPILL_API_KEY" -o receipt.json
SESSION_ID="$(jq -r '.event_log[]|select(.type=="upstream.verified").session_id' receipt.json)"
curl -s "https://api.redpill.ai/v1/aci/sessions/$SESSION_ID" \
  -H "Authorization: Bearer $REDPILL_API_KEY"

Full verification walkthrough →

The confidential AI cloud: verifiable inference with attestation reports, signed receipts, audit sessions, and E2EE paths.

Phala: Qwen3.6 35B-A3B Uncensored (Aggressive)

Providers for Phala: Qwen3.6 35B-A3B Uncensored (Aggressive)

API

Products

Developers

Resources