Pricing

Simple, transparent pricing.

8.9K GitHub Stars

Open Source Stack

Free Forever

Everything AI researchers need to build state-of-the-art models in a single library.

Pre-training, fine-tuning, and evaluation at any scale
Text and multimodal, open and closed models
Data synthesis and curation
Run anywhere, from your laptop to the cloud

Visit GitHub

100x faster development

Pro Platform

$25/month

with rollover to next month

Build high-performing custom models faster. No need to bring your own compute.

Free starter credits: $50 corporate, $25 personal
Automated evaluation, synthesis, and training pipelines
Deploy to the inference provider of your choice
Expert support
25 monthly credits with rollover

Get Started with Free Credits

Tailored to your needs

Enterprise

Custom Pricing

Platform access and services pricing tailored to your team's needs. Oumi's team works alongside yours to build custom models and agents for your most critical use cases.

Dedicated slack channel
Designated expert
Models and agents tuned to your domain and task
Fixed price and fixed bid professional services engagement

Contact Us

Hosted Platform – detailed pricing

Detailed breakdown of tools, storage, training, and inference pricing.

Tools & Storage

Evaluation	1,000 judgments / $1
Data Synthesis	1,000 rows / $1
Storage	4 GB/month / $1

Training

Priced per 1M training tokens — calculated as the number of tokens in your training dataset multiplied by the number of epochs.

Model Size	Price
Up to 16B	$0.49
16.1–32B	$2.00
32.1–80B	$3.00
80.1–300B	$6.00

Inference for evaluation & synthesis

Model Size	Input / 1M	Output / 1M
Llama 3.1 70B	$1.00	$1.00
Llama 3.1 8B	$0.22	$0.22
Qwen2.5 7B Instruct	$0.22	$0.22
Qwen3 235B A22B Instruct	$0.25	$0.90
Mixtral 8x7B Instruct v0.1	$0.55	$0.55
Mistral 7B Instruct v0.3	$0.22	$0.22
Kimi K2 Instruct	$0.70	$2.80
Kimi K2 Thinking	$0.70	$2.80
Kimi K2 Instruct 0905	$0.70	$2.80
gpt-oss-120b	$0.15	$0.60
DeepSeek V3.1	$0.60	$1.70
GLM 4.6	$0.60	$2.40

Inference is only charged when you utilize models hosted by Oumi to power an action on the platform.

Production Inference

Deploy fully fine-tuned or LoRA models for inference and pay:

Per token (auto-scales based on traffic)

Per GPU hour (you control capacity)

Contact us for pricing

Frequently asked questions

Sign up today for $25 in credits with a personal email or $50 with a professional email.