Build with Private AI.

Integrate RedPill's Private AI into your app with a simple API. Access dozens of AI models through one secure endpoint. No more juggling multiple AI APIs or worrying about data compliance.

Key Features for Developers

Unified API for 60+ Models

One API key unlocks GPT-4, Claude, Llama, Mistral and more. No vendor lock-in - switch models or use Smart Router to auto-select the best model per request.

Privacy & Security Built-In

All API calls are processed in confidential enclaves. Feed sensitive data to the API and even we can't read it. Ideal for healthcare, legal, or enterprise apps.

Simple SDKs & Docs

SDKs available in Python, JavaScript, and more. Robust REST API with clear documentation. Get started in minutes with our quickstart guides.

Example Use Cases

Add a confidential AI assistant to your app. Process user data with AI without storing it. Use RedPill as a secure backend for chatbots or automation.

Flexible Deployment

Enterprise options for dedicated private instances. Deploy on-prem or in your VPC for maximum control and compliance with your organization's policies.

Performance & Cost Controls

Smart Router ensures efficient model usage. Save costs by routing to appropriate models per request. Rate limits and flexible pricing tiers available.

Just a few lines of code

YOUR CODE.
OUR PRIVACY.

Integrate private AI into your app with simple SDKs. OpenAI-compatible API means minimal code changes to switch from other providers.

View Full Docs
redpill-chat.js
// Node.js / JavaScript SDKimport RedPill from 'redpill-sdk';const client = new RedPill({  apiKey: process.env.REDPILL_API_KEY});// Simple chat completionconst response = await client.chat.completions.create({  model: 'gpt-4',  messages: [    { role: 'user', content: 'Summarize this contract' }  ]});console.log(response.choices[0].message.content);// With streamingconst stream = await client.chat.completions.create({  model: 'claude-3-opus',  messages: [{ role: 'user', content: 'Write a haiku' }],  stream: true});for await (const chunk of stream) {  process.stdout.write(chunk.choices[0]?.delta?.content || '');}

Explore AI Models

From private models in GPU TEE to all your favorites.

deepseek logo
DeepSeek: DeepSeek V3.2
GPU TEE
DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving quality in long-context scenarios. A scalable reinforcement learning post-training framework further improves reasoning, with reported performance in the GPT-5 class, and the model has demonstrated gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task synthesis pipeline to better integrate reasoning into tool-use settings, boosting compliance and generalization in interactive environments.
by phala|164K context|$0.27/M input|$0.40/M output
Intel TDXNVIDIA CC
sentence-transformers logo
Sentence Transformers: all-MiniLM-L6-v2
GPU TEE
The all-MiniLM-L6-v2 embedding model maps sentences and short paragraphs into a 384-dimensional dense vector space, enabling high-quality semantic representations that are ideal for downstream tasks such as information retrieval, clustering, similarity scoring, and text ranking.
by phala|512 context|$0.005/M input|$0.00/M output
Intel TDXNVIDIA CC
qwen logo
Qwen2.5 7B Instruct
GPU TEE
Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2:
  • Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains.
  • Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g, tables), and generating structured outputs especially JSON. More resilient to the diversity of system prompts, enhancing role-play implementation and condition-setting for chatbots.
  • Long-context Support up to 128K tokens and can generate up to 8K tokens.
  • Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
Usage of this model is subject to .
by phala|33K context|$0.04/M input|$0.10/M output
Intel TDXNVIDIA CC
qwen logo
Qwen: Qwen2.5 VL 72B Instruct
GPU TEE
Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
by phala|66K context|$0.59/M input|$0.59/M output
Intel TDXNVIDIA CC
google logo
Google: Gemma 3 27B
GPU TEE
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Gemma 3 27B is Google's latest open source model, successor to
by phala|54K context|$0.11/M input|$0.40/M output
Intel TDXNVIDIA CC
openai logo
OpenAI: GPT OSS 120B
GPU TEE
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.
by phala|131K context|$0.10/M input|$0.49/M output
Intel TDXNVIDIA CC

Start Building.

API Documentation

Comprehensive guides, API references, and tutorials to help you integrate RedPill into your applications. Try the interactive playground or get a free API key.

Developer Community

Join our Discord community to connect with other developers, get help with integration questions, and share what you're building with RedPill.

Ready to experience private AI?

Try RedPill in our Private AI Playground - no signup needed. Your conversations stay encrypted and completely private.

Try RedPill Free
Private Chat
E2E Encrypted
AI
Hi! I'm your private AI assistant. Ask me anything - your conversations are fully encrypted.
Zero data retentionTEE secured