WEKA Maximizes Token Output With Lower Cost Per Token on NVIDIA BlueField-4 STX

News provided by

WEKA

17 Mar, 2026, 06:00 CST

NeuralMesh and Augmented Memory Grid Integration with NVIDIA STX Increases Token Production by 6.5x in the Same GPU Footprint, Slashing Cost of Inference for AI-Driven Organizations

SAN JOSE, Calif. and CAMPBELL, Calif., March 17, 2026 /PRNewswire/ -- From GTC 2026: WEKA, the AI storage and memory systems company, today announced the integration of its NeuralMesh™ software with the NVIDIA STX reference architecture. WEKA's breakthrough Augmented Memory Grid™ memory extension technology running on NeuralMesh will support NVIDIA STX to bring high-throughput context memory storage to agentic AI factories, making long-context reasoning seamless across sessions, tools, and tasks. Leveraging NVIDIA Vera Rubin NVL72, NVIDIA BlueField-4, and NVIDIA Spectrum-X Ethernet, the NeuralMesh solution based on NVIDIA STX will deliver an estimated 4-10x more tokens per second for context memory while supporting at least 320 GB/s of read throughput and 150 GB/s of write throughput for AI workloads, more than double the throughput of conventional AI storage platforms.

WEKA and NVIDIA unlock cost-efficient AI inference at scale

Solving the Inference Cost Problem with Shared KV Cache Infrastructure
Scaling agentic systems, especially for software engineering applications, exposes a hard truth: today's AI economics are decided at the memory infrastructure layer. Every large-scale inference fleet hits the memory wall: limited high-bandwidth memory (HBM) on the GPU is rapidly exhausted, key-value (KV) cache is evicted, context is lost, and the system is forced to repeat work it already completed. This architectural inefficiency sends inference costs soaring. The answer is a shared KV cache infrastructure that keeps context live across agents, users, and sessions. It eliminates redundant computation, sustains token throughput, and maintains predictable performance. Without shared KV cache infrastructure, every increase in concurrent users and agents becomes a liability — costs rise, experiences degrade, and the inference fleet becomes harder to operate the larger it grows. With STX for context memory, NVIDIA is introducing a blueprint to address these core inference bottlenecks.
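The eviction-and-recompute cycle described above can be illustrated with a toy model. The sketch below is not WEKA's or NVIDIA's implementation; the class and function names, the LRU eviction policy, and the slot counts are all assumptions chosen for illustration. It counts how many context tokens must be prefilled again when sessions cycle through a small GPU-resident tier, with and without a larger shared tier behind it.

```python
from collections import OrderedDict


class LRUCache:
    """Fixed-capacity LRU set of session ids whose KV cache is resident (hypothetical)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()

    def touch(self, key):
        """Return True on a hit; on a miss, insert the key and evict the oldest entry."""
        if key in self.entries:
            self.entries.move_to_end(key)
            return True
        self.entries[key] = True
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)
        return False


def prefill_tokens(requests, context_len, hbm_slots, shared_slots=0):
    """Count tokens that must be prefilled for a stream of session ids.

    A request needs no prefill if its session's KV cache is still in the
    GPU tier or, when shared_slots > 0, in the shared tier.
    """
    hbm = LRUCache(hbm_slots)
    shared = LRUCache(shared_slots) if shared_slots else None
    recomputed = 0
    for session in requests:
        in_hbm = hbm.touch(session)
        in_shared = shared.touch(session) if shared else False
        if not (in_hbm or in_shared):
            recomputed += context_len
    return recomputed


# Four sessions taking turns for three rounds, with room for only two in the GPU tier.
requests = [0, 1, 2, 3] * 3
gpu_only = prefill_tokens(requests, context_len=1000, hbm_slots=2)
with_shared = prefill_tokens(requests, context_len=1000, hbm_slots=2, shared_slots=8)
print(gpu_only, with_shared)
```

In this toy setup the GPU-only tier misses on every request, because each session is evicted before it returns, while the shared tier limits prefill to the first request of each session. Real systems differ in eviction policy, granularity, and transfer cost, none of which is modeled here.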

Context Memory Storage: The Foundation of Agentic AI Factories
With co-designed WEKA solutions based on NVIDIA STX architecture, AI clouds, enterprises, and AI model builders can deploy the infrastructure foundation they need to run GPUs at peak productivity, sustain high-volume token production, and make large-scale inference more energy and cost-efficient.

Leading AI innovators and cloud providers, such as Firmus, are already transforming their inference economics with Augmented Memory Grid on NeuralMesh.

"Real-world AI doesn't run in a lab— it has power constraints, cooling limits, and relentless workload demand. Firmus is built for exactly that. Paired with NVIDIA AI infrastructure, WEKA Augmented Memory Grid delivers up to 6.5x higher tokens per second and 4x faster TTFT at scale, proving we can get more performance from the same GPU footprint. With NeuralMesh and Augmented Memory Grid integrated into our NVIDIA-aligned AI Factory and NVIDIA STX reference architecture, we'll be able to deliver the fastest context memory network for predictable and efficient inference at scale," said Daniel Kearney, Chief Technology Officer at Firmus.

NeuralMesh and NVIDIA STX: Purpose-Built for Agentic AI
NeuralMesh is WEKA's intelligent, adaptive storage system built on over 170 patents. It will run across the full-stack STX reference architecture, providing the next-generation storage organizations need to standardize high-performance AI data services and accelerate agentic AI outcomes. WEKA's Augmented Memory Grid is a purpose-built memory extension layer that pools and persists KV cache outside of GPU memory, keeping long-context sessions stable and concurrency high as inference workloads grow. First unveiled at GTC 2025 and generally available to NeuralMesh customers today, Augmented Memory Grid has been validated with Supermicro on NVIDIA Grace CPUs and BlueField-3 DPUs to deliver numerous benefits that improve AI economics, including:

  • Faster User Experiences: Augmented Memory Grid on NeuralMesh delivers up to 4-20x improvement in time-to-first-token, keeping AI agents and applications responsive under real-world load.
  • More Revenue from the Same Hardware: Serve 6.5x more tokens per GPU — without adding infrastructure.
  • Sustained Performance at Scale: Augmented Memory Grid maintains high KV cache hit rates even as sessions, agents, and context windows grow — preventing the performance cliff that hits DRAM-only architectures.
  • GPU-Native Efficiency: BlueField-4 integration offloads the storage data path from the CPU, keeping GPUs fully productive and eliminating I/O bottlenecks.
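The link between tokens per GPU and cost per token is simple arithmetic, sketched below. The function name and its parameters are hypothetical, and the calculation assumes total infrastructure cost stays fixed, which ignores the cost of the memory extension layer itself.

```python
def relative_cost_per_token(throughput_gain, infrastructure_cost_ratio=1.0):
    """Cost per token relative to a baseline of 1.0.

    throughput_gain: tokens served relative to the baseline (e.g. 6.5).
    infrastructure_cost_ratio: total cost relative to the baseline
    (1.0 assumes no added cost, which is an idealization).
    """
    if throughput_gain <= 0:
        raise ValueError("throughput_gain must be positive")
    return infrastructure_cost_ratio / throughput_gain


# With the 6.5x figure quoted above and unchanged cost (an assumption):
print(round(relative_cost_per_token(6.5), 3))
```

Under those assumptions, serving 6.5x more tokens on the same hardware brings the cost per token to roughly 15% of the baseline. Any added infrastructure cost raises that figure proportionally through `infrastructure_cost_ratio`.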

"With coding LLMs advancing, we're seeing unprecedented adoption of Agentic AI use cases for software engineering, where productivity increases by 100-1000x. As coding assistants make repeated calls against largely unchanged codebases and prompts, WEKA's Augmented Memory Grid reuses cached context instead of forcing redundant prefill, even as context windows grow to incredible lengths. This provides a major boost in response times and greatly increases the number of concurrent users running on the same infrastructure," said Liran Zvibel, co-founder and CEO at WEKA. "WEKA first identified this need for context memory storage more than a year ago and launched Augmented Memory Grid at GTC 2025. Now, NVIDIA STX opens the door to organizations running their storage and memory extension infrastructure on state-of-the-art NVIDIA Vera Rubin architecture, including NVIDIA BlueField-4 and NVIDIA Spectrum-X Ethernet. Running Augmented Memory Grid on NeuralMesh for NVIDIA STX delivers extreme performance and efficiency that translates directly to game-changing AI economics."
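The idea of reusing cached context for repeated calls against a largely unchanged codebase can be sketched as prefix matching over fixed-size token blocks. This is a generic illustration, not a description of how Augmented Memory Grid works internally; the function names, the block size, and the use of a Python set as the cache are all assumptions.

```python
import hashlib


def block_keys(tokens, block_size):
    """Return one key per full block, each covering the block and everything before it."""
    keys = []
    running = hashlib.sha256()
    full = len(tokens) - len(tokens) % block_size
    for start in range(0, full, block_size):
        block = tokens[start:start + block_size]
        running.update(repr(block).encode())
        keys.append(running.hexdigest())
    return keys


def tokens_to_prefill(tokens, cache, block_size=4):
    """Return how many tokens still need prefill, then record this prompt's blocks."""
    keys = block_keys(tokens, block_size)
    reused_blocks = 0
    for key in keys:
        if key not in cache:
            break  # reuse stops at the first block that differs
        reused_blocks += 1
    cache.update(keys)
    return len(tokens) - reused_blocks * block_size


# Hypothetical token ids: a 16-token "codebase" followed by two different questions.
codebase = list(range(16))
cache = set()
first = tokens_to_prefill(codebase + [100, 101, 102, 103], cache)
second = tokens_to_prefill(codebase + [200, 201, 202, 203], cache)
print(first, second)
```

The first call prefills all 20 tokens; the second reuses the four cached codebase blocks and prefills only the 4 new tokens. Because each key depends on every preceding block, a change early in the prompt invalidates everything after it, which is why this kind of reuse pays off most when the shared prefix is stable.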

Availability

WEKA's Augmented Memory Grid is commercially available with NeuralMesh today.

Organizations that don't address the memory wall today will find it harder and more expensive to scale tomorrow. As agentic workloads grow and context windows expand, DRAM-only architectures face a compounding cost problem: each additional concurrent user or session increases recomputation overhead, GPU idle time, and operational cost. The organizations that architect for persistent KV cache now will have a structural cost and performance advantage over those that wait.

For more information about NeuralMesh, visit: weka.io/NeuralMesh.
For more information about Augmented Memory Grid, visit: weka.io/augmented-memory-grid.

Organizations can learn more at weka.io/nvidia or visit WEKA at GTC 2026, booth #1034.

About WEKA
WEKA is transforming how organizations build, run, and scale AI workflows with NeuralMesh™ by WEKA®, its intelligent, adaptive mesh storage system. Unlike traditional data infrastructure, which becomes slower and more fragile as workloads expand, NeuralMesh becomes faster, stronger, and more efficient as it scales, dynamically adapting to AI environments to provide a flexible foundation for enterprise AI and agentic AI innovation. Trusted by 30% of the Fortune 50, NeuralMesh helps leading enterprises, AI cloud providers, and AI builders optimize GPUs, scale AI faster, and reduce innovation costs. Learn more at www.weka.io or connect with us on LinkedIn and X.

WEKA and the W logo are registered trademarks of WekaIO, Inc. Other trade names herein may be trademarks of their respective owners.

SOURCE WEKA
