COCOON Architecture

COCOON is a decentralized AI inference platform on TON that securely connects GPU owners who provide compute with privacy-conscious applications that need to run AI models. For GPU Providers, it defines how suitable hardware can become part of a confidential, attested compute layer – for Developers, it is the backend that executes model requests and settles payments on-chain.

This document outlines how COCOON provides private, attested model execution on confidential-compute hardware. At a high level, it covers its main components, the end-to-end request flow and the system’s trust model.

For a complete technical overview, this page should be read alongside our in-depth technical documentation available here.

Index

Overview
Components
- Worker
- Proxy
- Client
Smart Contracts
Security Properties
Network Topology

Overview

COCOON is a decentralized AI inference platform built on TON blockchain, enabling GPU owners to earn cryptocurrency by serving AI models in trusted execution environments (TEE).

Key Goals:

Anyone with a GPU server can rent it out and earn money
Requests and responses remain private, known only to the client
Clients can verify that responses come from the requested model
Payment happens through TON blockchain

Components

COCOON consists of three parts:

Client: The client pays for requests and sends them to the proxy.
Proxy: The proxy selects a suitable worker and forwards the request.
Worker: The worker executes requests on GPU.

Both proxy and worker run inside TEE, ensuring all data (prompts, responses) remains private and cannot be accessed by server owners.

Worker

Role: Executes AI inference requests inside TEE-protected VMs.

Runs AI models (e.g., LLMs via vllm) inside confidential virtual machines
Protected by TEE (currently Intel TDX)
Ensures all requests are kept private and the correct model is used
Receives payment from proxies for completed work
Minimal setup required: install image, provide config (model name, TON config, wallet address)

Proxy

Role: Routes requests from clients to workers

Protected by TEE (currently Intel TDX)
Selects appropriate workers based on model type, load, and reputation
Accepts payment from clients
Pays workers for completed requests
Takes commission on each transaction
Significantly fewer proxies than workers in the network

Current deployment: Proxies are operated by the COCOON team for simplicity.

Future: Anyone will be able to run their own proxy (just like workers), creating a fully decentralized network.

Client

Role: Library for sending inference requests.

Sends requests to proxies
Validates proxy TEE attestations to ensure requests are sent only to trusted proxies
Pays proxies for completed requests

Note: The client library is designed to run on servers (backend infrastructure). For example, the Telegram backend would run multiple client instances to handle user requests.

For direct device/app usage (mobile apps, desktop apps), a separate lightweight library is being developed (WIP).

Smart Contracts

Root Contract (On-Chain Registry)

Role: Stores allowed image and model hashes, addresses of proxies, and other network-wide settings.

Current deployment: Currently managed by the COCOON team for simplicity.

Future: DAO will govern this smart contract.

Payment Contracts

Role: Stores payment information for clients and proxies. Similar to payment channels.

Request Workflow

Client establishes RA-TLS connection with proxy, verifying proxy's TEE attestation against root contract
Proxy establishes RA-TLS connection with selected worker, verifying worker's TEE attestation
Client sends inference request (with prepayment) → Proxy forwards to worker → Worker processes in TEE
Worker returns response → Proxy pays worker via smart contract → Proxy returns response to client

All communication is encrypted via RA-TLS. Attestations are verified at connection time, not per request. Only the client can see prompts and responses.

Security Properties

COCOON enforces security through static measurements, on-chain registries and runtime hardware verification. This section details the properties used to validate identity and secure communication channels across the infrastructure.

Cocoon welcomes security researchers to audit its code and protocol. Relevant findings can be submitted here.

Image Verification

Each TEE VM's identity is defined by its image hash, which includes:

Base image - The root filesystem and all binaries (measured by TEE)
Static config - Configuration that affects security/correctness (measured by TEE)

Runtime config (not measured) affects behavior but not safety or correctness. Examples:

Model name and download location
Root smart contract address
Network endpoints

Root Contract

The root smart contract on TON blockchain serves as the trusted registry containing:

List of proxy IPs - Known proxy endpoints
Allowed image hashes - Valid proxy and worker TEE measurements
Supported model hashes - Verified AI model hashes
Config parameters - Network-wide settings (pricing, limits, etc.)
Smart contract code - Code for worker and proxy contracts

Anyone can verify against this on-chain registry.

Governance: Currently centralized (COCOON team manages updates). Future: decentralized governance via DAO.

Attestation and Communication

RA-TLS ensures we communicate with correct VMs:

All connections verify TEE attestations via RA-TLS
Handled transparently by router - inner VM services don't need attestation logic
Each party verifies the remote party's image hash matches expected values

GPU Verification

GPU is verified by the VM itself during boot (via attestation script)
Not explicitly verified by remote parties (included in user claims if needed)
Security note: Malicious host could disable GPU, but cannot affect correctness of successful computations
Failed GPU verification prevents VM from starting

Performance and Availability

TEE does not guarantee performance:

Slow or hanging workers are handled via reputation system
Proxies track worker response times and success rates
Clients can choose proxies and verify service quality
Reputation scores stored on-chain for transparency

Network Topology

Multiple Clients
      ↓
Few Proxies (e.g., 10-100)
      ↓
Many Workers (e.g., 1000+)

Workers connect to proxies. Proxies track worker reputation and load-balance requests.

Full Documentation

TDX and Images - Intel TDX fundamentals, boot sequence, and image generation
RA-TLS - Remote attestation over TLS, proxy-cli, and certificate generation
Smart Contracts - Payment system and TON blockchain integration
Seal Keys - Persistent key derivation via SGX/TDX interaction
GPU - GPU passthrough and confidential computing validation
Deployment - Deployment, testing, and debugging
Developers - Information for developers looking to integrate COCOON

Cocoon welcomes security researchers to audit its code and protocol. Relevant findings can be submitted here.