
Benchling Inference powered by Baseten gives biotech teams on-demand GPU capacity built for scientific model workloads
SAN FRANCISCO, May 20, 2026 /PRNewswire/ -- Benchling and Baseten today announced Benchling Inference, giving biotech customers scalable, cost-effective GPU capacity to train and run scientific models, without managing infrastructure. It comes preloaded with today's top scientific models and the integrations to make in silico discovery work out-of-the-box for biopharma companies.
Between 2020 and 2025, the number of new scientific AI models released annually grew from 28 to more than 380. These models are now standard in R&D workflows, but the compute layer hasn't kept up. Drug discovery is bursty by nature: teams wait on the physical lab, data comes in waves, then need to run 100,000 predictions in a few hours before going quiet for days. For most computational teams, that plays out as HPC queues with multi-week backlogs, GPU reservations sitting idle between data collection cycles, and predictions rationed during active campaigns.
Benchling Inference is built on the Baseten Inference Stack, a tightly integrated combination of a high-performance runtime (custom kernels, speculative decoding, KV cache optimizations) and inference-optimized infrastructure spanning 15+ cloud providers, with cold starts in 5–10 seconds. Benchling adds a biotech layer on top with pre-configured defaults for scientific models and deployment options for organizations with strict data residency requirements. By aggregating demand across the industry, Benchling also brings better economics to biotech startups.
With Benchling Inference, scientists can deploy third-party models or serve internal models built on their own experimental data from a unified compute environment. For teams with data sovereignty requirements, the Baseten Inference Stack runs identically in Baseten Cloud, inside a customer's virtual private cloud (VPC), or a hybrid of both so predictions never have to leave their environment. Computational scientists working in Jupyter notebooks or via SDK can call inference directly through Benchling.
"Biotech has entered a new era where AI models trained on proprietary experimental data could unlock breakthroughs that weren't possible before. The bottleneck has been infrastructure and biotech research labs should not have to become GPU experts to run frontier models on their data. By partnering with Benchling, we bring six years of inference expertise directly into the environments where the science happens" said Amir Highighat, CTO & Co-Founder of Baseten.
"Access to compute is becoming a strategic advantage. But we hear from computational scientists that getting inference to work in drug discovery is harder than it should be; workloads are bursty, the data is sensitive, compute costs are too high," said Ashu Singhal, co-founder and President of Benchling. "We've been running Baseten internally for Benchling's Model Hub and learned a lot about tailoring inference for drug discovery. Now we want customers to have the same access."
Companies interested in Benchling Inference can sign up here.
About Benchling: Benchling is the AI platform for biotech R&D, unifying scientific data and automating workflows to accelerate discovery and development. Trusted by more than 1,300 companies worldwide, from pioneering startups to global leaders like Merck, Moderna, and Sanofi, Benchling gives scientists a single place to capture, connect, and act on data across the entire R&D lifecycle. With Benchling AI, agents and models work directly inside scientific workflows, grounded in structured data. The result is faster teams, better molecules, and breakthroughs that reach the world sooner.
About Baseten: Baseten is the inference company behind the world's best AI products. The company builds systems software that runs the entire workload for AI applications—from GPUs and autoscaling to observability, billing, and developer tools—so teams can focus on models and user experience instead of infrastructure. Baseten's customers include leading AI companies such as Cursor, Mercor, Clay, OpenEvidence, Lovable, Abridge and others building specialized models for their domains. Founded in 2019 and based in San Francisco, Baseten has raised $585 million to date from investors including IVP, CapitalG, Conviction, Bond, Greylock, and Spark Capital.
SOURCE Benchling
Share this article