Blog
Insights on custom AI models, open-source ML, and the future of specialized intelligence.
Newsletter: Stop throwing money away on frontier AI (and other news)
Updates for Week of May 18, 2026
Read Post
How much money are you throwing away on Anthropic and OpenAI models?
Specialized AI can be 10x to 100x cheaper, or more, than a general-purpose frontier model while delivering the same or better performance. Learn how much you could save by moving from Anthropic to specialized AI.
Read Post
Oumi OSS v0.8: Deploy, MCP, and Batch Inference Everywhere
Read Post
Case Study: Kaizen Gaming builds a specialized text-to-SQL model for sports analytics that beats Kimi-K2
With Oumi, Kaizen fine-tuned a 3B model that matched or exceeded frontier-scale alternatives — at a fraction of the cost — by treating rule compliance as a fine-tuning problem, not a prompting problem
Read Post
Case Study: Wired Informatics trains a specialized clinical-NLP model on open weights that keeps sensitive data safe
With Oumi, a healthcare AI team moved from 84.5% to 88.9% concept-validity precision on messy clinical text — without sending a single patient record to a third-party API
Read Post
Case Study: Original Voices builds a voice-authentic AI on proprietary persona data with 31% higher authenticity
With Oumi’s technology, Original Voices’s personal-twin product moved from 52% to 83% authenticity pass rate — driven entirely by better evaluation, not more data.
Read Post
Newsletter: Oumi can now host your Custom AI Models with a single click (and other news)
Updates for Week of May 11, 2026
Read Post
Case Study: Aurasell builds an 8B model that outperforms Sonnet 4.5 by 8% at extracting information from websites
With Oumi’s technology, Aurasell's custom AI beat Anthropic's Sonnet 4.5 at extracting information from webpages by 8% in coverage and 12% in groundedness
Read Post
Case Study: DMG achieves 6% higher quality and 100x lower costs for invoice validation
With Oumi’s technology, DMG’s custom AI beat GPT-5.2 by 6% in accuracy and 6% in validity at 100x lower cost
Read Post
Building Customer Support with a sub-1B Small Language Model that Beats GPT-5.4 (Part 1)
Small Language Models occupy a sweet spot in speed and cost between frontier LLMs and BERT models for a wide range of NLP tasks
Read Post
Oumi’s Study Finds 50% of AI Overviews Untrustworthy
A not-so-simple study of SimpleQA and AI search results
Read Post
The Era of General Purpose AI Is Over
General-purpose models were built for the average.
Read Post
The Case for Specialized Intelligence
Our vision for AI at Oumi.ai
Read Post
Lambda and Oumi partner for end-to-end custom model development
Enterprises can now build and deploy custom models for their specific use cases 100x faster, with 10x better cost efficiency, and superior accuracy
Read Post
Wrapping Up 2025, Looking Ahead to 2026
That’s a wrap on 2025. Hello 2026. 🎉
Read Post
Oumi v0.5.0: Data Synthesis, OpenEnv, Hyper-param Tuning
Major new features!
Read Post
DCVLR Competition Results: Data Curation for Vision-Language Reasoning
Read Post
Less (Data) is More for Fine-tuning
1000 Samples or Less for Amazing Fine-tuning
Read Post
Why Less is More for Fine-Tuning
What is the evidence for successful fine-tuning with small data?
Watch on Substack
Small Fine-tuned Models are All You Need
But the devil is in the details—how can you get them right?
Read Post
Hours, Not Months – The Custom AI Era is Now
Read Post
OpenAI Just Dropped Two Massive Open-weight Models — But How Do We Separate The Reality From The Hype?
Evaluating GPT-OSS-20B and GPT-OSS-120B with LLM-as-a-Judge — strong on truthfulness, but overly conservative refusals hold them back
Read Post
Training Frontier Reasoning VLMs for the 2025 NeurIPS DCVLR Workshop with Oumi
Baseline data curation strategies for the NeurIPS DCVLR competition — synthesis, filtration, and a 37.6% improvement on reasoning benchmarks
Read Post