Simple, transparent pricing.
8.9K GitHub Stars
Free Forever
Everything AI researchers need to build state-of-the-art models in a single library.
100x faster development
$25/month
with rollover to next month
Build high-performing custom models faster. No need to bring your own compute.
Tailored to your needs
Custom Pricing
Platform access and services pricing tailored to your team's needs. Oumi's team works alongside yours to build custom models and agents for your most critical use cases.
Detailed breakdown of tools, storage, training, and inference pricing.
| Service | Rate |
|---|---|
| Evaluation | 1,000 judgments / $1 |
| Data Synthesis | 1,000 rows / $1 |
| Storage | 4 GB/month / $1 |
Training is priced per 1M training tokens. The token count is the number of tokens in your training dataset multiplied by the number of epochs.
| Model Size | Price / 1M training tokens |
|---|---|
| Up to 16B | $0.49 |
| 16.1–32B | $2.00 |
| 32.1–80B | $3.00 |
| 80.1–300B | $6.00 |
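
To make the training-cost rule concrete (dataset tokens times epochs, billed per 1M tokens at the rate for the model-size tier), here is a minimal Python sketch. The function name `training_cost` and the `TRAINING_TIERS` list are hypothetical and not part of any Oumi API. It also assumes that partial millions are billed proportionally without rounding and that each tier's upper bound is inclusive, neither of which the pricing page states.

```python
from decimal import Decimal

# USD per 1M training tokens, keyed by the upper bound of each model-size
# tier in billions of parameters (values copied from the table above).
TRAINING_TIERS = [
    (16, Decimal("0.49")),
    (32, Decimal("2.00")),
    (80, Decimal("3.00")),
    (300, Decimal("6.00")),
]


def training_cost(model_size_b: float, dataset_tokens: int, epochs: int) -> Decimal:
    """Estimate training cost in USD (hypothetical helper, not an Oumi API)."""
    if dataset_tokens < 0 or epochs < 0:
        raise ValueError("dataset_tokens and epochs must be non-negative")
    # Billable tokens = tokens in the training dataset x number of epochs.
    billable_tokens = dataset_tokens * epochs
    for upper_bound_b, price_per_million in TRAINING_TIERS:
        # Assumption: tier upper bounds are inclusive.
        if model_size_b <= upper_bound_b:
            # Assumption: partial millions are pro-rated, with no rounding.
            return price_per_million * billable_tokens / Decimal(1_000_000)
    raise ValueError("model sizes above 300B are not listed in the table")


if __name__ == "__main__":
    # 8B model, 50M-token dataset, 3 epochs -> 150M billable tokens.
    cost = training_cost(8, 50_000_000, 3)
    print(f"Estimated training cost: ${cost:.2f}")
```

Under these assumptions, an 8B model trained for 3 epochs on a 50M-token dataset is billed for 150M tokens at $0.49 per 1M. `Decimal` is used rather than floats so that currency arithmetic stays exact.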
| Model | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| Llama 3.1 70B | $1.00 | $1.00 |
| Llama 3.1 8B | $0.22 | $0.22 |
| Qwen2.5 7B Instruct | $0.22 | $0.22 |
| Qwen3 235B A22B Instruct | $0.25 | $0.90 |
| Mixtral 8x7B Instruct v0.1 | $0.55 | $0.55 |
| Mistral 7B Instruct v0.3 | $0.22 | $0.22 |
| Kimi K2 Instruct | $0.70 | $2.80 |
| Kimi K2 Thinking | $0.70 | $2.80 |
| Kimi K2 Instruct 0905 | $0.70 | $2.80 |
| gpt-oss-120b | $0.15 | $0.60 |
| DeepSeek V3.1 | $0.60 | $1.70 |
| GLM 4.6 | $0.60 | $2.40 |
Inference is charged only when you use Oumi-hosted models to power an action on the platform.
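
As an illustration of how the input and output rates in the table above combine, here is a minimal Python sketch that estimates the cost of a single inference workload. The `inference_cost` function and the `INFERENCE_PRICES` mapping are hypothetical names, not an Oumi API. Only a few models are included, and the sketch assumes that tokens are billed proportionally per 1M with no rounding or minimum charge.

```python
from decimal import Decimal

# (input, output) USD per 1M tokens for a few rows of the table above.
INFERENCE_PRICES = {
    "Llama 3.1 8B": (Decimal("0.22"), Decimal("0.22")),
    "Qwen3 235B A22B Instruct": (Decimal("0.25"), Decimal("0.90")),
    "Kimi K2 Instruct": (Decimal("0.70"), Decimal("2.80")),
    "gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
}


def inference_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate inference cost in USD (hypothetical helper, not an Oumi API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_price, output_price = INFERENCE_PRICES[model]
    # Assumption: input and output tokens are each pro-rated per 1M tokens.
    total = input_price * input_tokens + output_price * output_tokens
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # 2M input tokens and 500K output tokens on gpt-oss-120b.
    cost = inference_cost("gpt-oss-120b", 2_000_000, 500_000)
    print(f"Estimated inference cost: ${cost:.2f}")
```

For models whose output rate is higher than their input rate, such as Kimi K2 Instruct, the output token count dominates the estimate, so it is worth tracking the two counts separately.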
Deploy fully fine-tuned or LoRA models for inference and pay: