Launch
Start training specialized open models and pay only for the GPU time you use.
- Credit top-ups from the dashboard
- Training jobs & experiment traces
- Provider GPU rates plus markup
- Community support
PRICING
Train a smaller open model on your workflow. When it matches or beats Claude Opus or GPT on that task, host your model and pay only for the GPU-hours you use.
GPU HOURS
For teams replacing high-volume frontier API calls with owned, hosted models.
For orgs that need private clusters, security review, and committed capacity.
WHY?
Say you make 1,000,000 requests per month, each with 1,500 input tokens and 500 output tokens. That is 2,000 tokens per request, or 2.0B tokens per month. A batched Qwen3-8B endpoint on 1x H100 handles that volume in about 122 GPU-hours, which costs $395/month at our current H100 rate.
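To make the arithmetic easy to check or adapt, here is a minimal Python sketch of the same estimate. The request volume, token counts, 122 GPU-hours, and $395 come from the example above. The throughput and hourly rate are not published figures: they are assumptions backed out from those numbers (roughly 4,550 tokens per second and $3.24 per GPU-hour), and the function names are purely illustrative.

```python
SECONDS_PER_HOUR = 3600


def monthly_tokens(requests: int, input_tokens: int, output_tokens: int) -> int:
    """Total tokens processed per month (input plus output)."""
    return requests * (input_tokens + output_tokens)


def gpu_hours(total_tokens: float, tokens_per_second: float) -> float:
    """GPU-hours needed to process the tokens at a sustained throughput."""
    return total_tokens / tokens_per_second / SECONDS_PER_HOUR


def monthly_cost(hours: float, rate_per_hour: float) -> float:
    """Cost in dollars for the given GPU-hours at an hourly rate."""
    return hours * rate_per_hour


# Figures quoted in the example above.
tokens = monthly_tokens(1_000_000, 1_500, 500)
quoted_hours = 122
quoted_cost = 395

# Assumptions: derived from the quoted figures, not published values.
implied_tokens_per_second = tokens / (quoted_hours * SECONDS_PER_HOUR)
implied_rate_per_hour = quoted_cost / quoted_hours

# Recompute the estimate from the implied throughput and rate.
hours = gpu_hours(tokens, implied_tokens_per_second)
cost = monthly_cost(hours, implied_rate_per_hour)

print(f"{tokens:,} tokens/month")
print(f"{implied_tokens_per_second:,.0f} tokens/s implied throughput")
print(f"${implied_rate_per_hour:.2f}/GPU-hour implied rate")
print(f"{hours:.0f} GPU-hours, ${cost:.0f}/month")
```

To estimate your own workload, change the arguments to monthly_tokens. The result scales linearly only if batched throughput stays constant, which is itself an assumption: real throughput depends on the model, batch size, and sequence lengths.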
WHY IT SAVES
Frontier models are priced for general intelligence across every domain. A fine-tuned Qwen-class model can be optimized for one workflow, deployed behind an OpenAI-compatible endpoint, and scaled at a transparent per-GPU-hour price.