Clarifai and Vultr Showcase Record-Breaking AI Inference Performance on GPUs at NVIDIA GTC AI Conference
New benchmark results confirm the Clarifai Reasoning Engine on Vultr's GPU clusters delivers unprecedented speed, efficiency, and scalability for next-gen AI workloads
WASHINGTON, Oct. 28, 2025 /PRNewswire/ -- Clarifai, a global leader in AI and pioneer of the full-stack AI platform, and Vultr, the world's largest privately held cloud computing platform, today announced new benchmark results demonstrating unprecedented inference speed and cost efficiency on GPUs. The announcement comes as the two companies showcase their collaboration at the Vultr booth during NVIDIA GTC Washington, D.C., taking place October 28–30.
The joint results highlight the Clarifai Reasoning Engine, optimized for agentic AI inference, running on Vultr's large-scale dedicated GPU clusters. Independent testing by Artificial Analysis showed the Clarifai Reasoning Engine achieving 544 tokens per second, 0.36 seconds time to first token, and an industry-leading $0.16 per million tokens on the GPT-OSS-120B model—outpacing every GPU-based inference platform tested, and rivaling specialized ASIC accelerators.
"Clarifai's benchmark-topping performance is a testament to what's possible when software innovation meets cloud engineering excellence," said Kevin Cochrane, CMO of Vultr. "Our GPU clusters deliver extraordinary inference speed and efficiency while keeping costs under control. Together with Clarifai, we're enabling organizations to develop competitive, high-performance AI solutions faster than ever before."
"Vultr's GPU infrastructure is a cornerstone of how we help customers unlock the full potential of the Clarifai Reasoning Engine," said Matthew Zeiler, Founder & CEO at Clarifai. "As organizations build AI workloads and agentic systems, they need performance and cost efficiency without compromising reasoning quality. Vultr provides the speed and reliability that make those outcomes possible."
The latest benchmarks are part of the Clarifai 11.9 release, which introduces expanded capabilities for building and deploying advanced AI systems, including:
- Added new Vultr cloud instances on NVIDIA HGX B200 and NVIDIA GH200 Grace Hopper™ Superchips.
- Toolkit compatibility for vLLM, LMStudio, and Hugging Face, enabling seamless model initialization and management.
- New model releases, including Qwen3-Next-80B-A3B-Thinking and Qwen3-30B-A3B-Instruct, further expanding Clarifai's model-agnostic ecosystem.
Built for enterprise-scale workloads, the Clarifai Reasoning Engine continuously optimizes kernels, batching, and memory use based on workload behavior, improving performance over time without sacrificing accuracy. It supports any large reasoning model, allowing customers to bring their own models and achieve similar levels of throughput and efficiency.
Clarifai and Vultr are setting a new standard for AI inference performance, scalability, and cost control, enabling developers and enterprises to accelerate innovation in reasoning, agentic systems, and generative AI.
Visit Clarifai and Vultr at booth #453 to learn more.
About Clarifai
Clarifai is a global leader in AI and the pioneer of the full-stack AI platform that helps organizations, teams, and developers build, deploy, and operationalize AI at the highest scale and availability. Clarifai's cutting-edge AI platform supports today's modern AI technologies like Large Language Models (LLMs), Large Vision Models (LVMs), Retrieval Augmented Generation (RAG), automated data labeling, high-volume production inference, and more. Founded in 2013, Clarifai is available in cloud, on-premises, or hybrid environments and has been used to build more than 1.5 million AI models, serving more than 500,000 users across 170 countries. Learn more at www.clarifai.com.
For more information or media requests, contact: [email protected].
About Vultr
Vultr is on a mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible for enterprises and AI innovators around the world. Vultr is trusted by hundreds of thousands of active customers across 185 countries for its flexible, scalable, global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage solutions. Founded by David Aninowsky and self-funded for over a decade, Vultr has grown to become the world's largest privately held cloud infrastructure company.
SOURCE Clarifai
WANT YOUR COMPANY'S NEWS FEATURED ON PRNEWSWIRE.COM?
Newsrooms &
Influencers
Digital Media
Outlets
Journalists
Opted In
Share this article