GPUs were built for graphics.
We're built for inference.

Every other inference provider is running your workloads on repurposed gaming hardware. We're not. Purpose-built ASICs, 1,000 tokens per second, 7x faster inference.

Sign up with your agent

Your agent can sign up for you.

Hand this prompt to any autonomous coding agent. It'll handle the whole signup flow and return with an API key ready to use.

Codex · Claude Code · OpenCode · Cursor · Aider
Sign me up for a General Compute API account and get an API key. Fetch instructions from https://docs.generalcompute.com/agent-signup and follow them.
Live Inference Benchmark

Same model. Not the same hardware.

Both responses below run GPT OSS 120B. One on your old GPU infrastructure. One on ours. Run the test and watch the gap.

General Compute (us)
Together AI (competitor)

Try preset prompts or enter your own to compare inference speed in real time.
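The same comparison can be approximated from your own terminal. Below is a minimal throughput-timer sketch; it assumes an OpenAI-compatible streaming client where each streamed chunk carries roughly one token, and `fake_stream` is a hypothetical stand-in for a real response stream:

```python
import time

def tokens_per_second(chunks):
    """Consume a stream of text chunks and return (count, elapsed_s, rate).

    Approximation: each streamed chunk is counted as one token, which is
    roughly true for OpenAI-compatible streaming responses.
    """
    start = time.perf_counter()
    count = sum(1 for _ in chunks)
    elapsed = time.perf_counter() - start
    return count, elapsed, count / elapsed if elapsed > 0 else float("inf")

def fake_stream(n=50, delay=0.001):
    """Stand-in for a real API stream; yields n chunks with a small delay."""
    for _ in range(n):
        time.sleep(delay)
        yield "tok"

count, elapsed, rate = tokens_per_second(fake_stream())
print(f"{count} chunks in {elapsed:.3f}s -> {rate:.0f} chunks/s")
```

Against a real endpoint, pass the `stream=True` response from `client.chat.completions.create(...)` in place of `fake_stream()`.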

The GPU wasn't designed for this. We were.

GPUs carry 70 years of legacy architecture — designed for rendering pixels, adapted for training, and now pressed into inference. We skipped all of that.

Built from scratch for inference

  • Purpose-built AI accelerators — one job: fast inference
  • 17 kW per rack vs. 120 kW for GPU equivalents
  • Air cooled — no liquid cooling overhead passed to you
  • Energy at $0.035/kWh vs. the $0.13 US commercial average
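The rack figures above compound: a back-of-the-envelope sketch, assuming sustained full-rack draw at the quoted power and price figures:

```python
# Per-rack hourly energy cost at the figures quoted above
gc_kw, nv_kw = 17, 120          # power draw per rack (kW)
gc_rate, nv_rate = 0.035, 0.13  # energy price ($/kWh)

gc_cost = gc_kw * gc_rate  # ~$0.595 per rack-hour
nv_cost = nv_kw * nv_rate  # ~$15.60 per rack-hour
print(f"GC: ${gc_cost:.3f}/h  GPU: ${nv_cost:.2f}/h  ratio: {nv_cost / gc_cost:.0f}x")
```

At those rates the per-rack energy bill differs by roughly 26x.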

General Compute vs NVIDIA GPU Cloud

MiniMax M2.5 model comparison

                                        General Compute              NVIDIA Cloud
Hardware                                Purpose-built accelerators   Graphics Processing Units
Throughput (tok/s, higher is better)    950*                         ~100
Energy Usage (kW, lower is better)      17*                          120
Energy Cost ($/kWh, lower is better)    0.035                        0.13

*Projected figures for next-generation racks. NVIDIA throughput from Together AI benchmarks. Energy cost compares our rate ($0.035/kWh) to the US commercial average ($0.13/kWh).

From first API call to full production.

Whether you're prototyping with our models or deploying your own weights at scale — same hardware, same speed, your choice of setup.

API Access

REST API with OpenAI-compatible endpoints. Access the fastest models with a single API key.

Get API Key

Custom Deployments

Dedicated infrastructure with SLAs, custom scaling, and guaranteed capacity for your workloads.

Contact Sales

Bring Your Own Model

Deploy any model on our optimized infrastructure. Same speed, your weights.

Learn More

The numbers GPU clouds can't match.

7x*

Faster Inference

1,000+*

Tokens per Second

Time to First Token

Uptime SLA

*Performance varies by model and geography.

Switch in 30 seconds.
No GPU required.

OpenAI-compatible API. Change your base URL, swap your key, and you're running on ASIC infrastructure. Your existing code doesn't change.

View Docs
main.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.generalcompute.com",
    api_key="your-api-key",
)

response = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,
)

# Print tokens to stdout as they stream in
for chunk in response:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
$5 in free credit when you sign up

Stop paying the GPU tax.

Get your API key in seconds. OpenAI-compatible — just change your base URL. $5 free credit to see the difference yourself.
