Accessibility Statement Skip Navigation
  • Back to Global Sites
  • +972-77-2005042
  • Blog
  • Journalists
  • GDPR
  • Send a Release
PR Newswire: news distribution, targeting and monitoring
  • News
  • Products
  • Contact
  • Hamburger menu
  • PR Newswire: news distribution, targeting and monitoring
  • Send a Release
    • Telephone

    • +972-77-2005042 from 8 AM - 11 PM IL

    • Contact
    • Contact

      +972-77-2005042
      from 8 AM - 11 PM IL

  • Request More Information
  • Journalists
  • GDPR
  • Request More Information
  • Journalists
  • GDPR
  • Request More Information
  • Journalists
  • GDPR
  • Request More Information
  • Journalists
  • GDPR

aiOla releases breakthrough AI model that's 50% faster than OpenAI's Whisper


News provided by

aiOla

02 Aug, 2024, 15:30 IDT

Share this article

Share toX

Share this article

Share toX

The open-source model improves on Whisper by using multi-head attention to achieve speedup and reduced latency while retaining full speech recognition accuracy

TEL AVIV, Israel, Aug. 2, 2024 /PRNewswire/ -- aiOla, a leader in speech recognition technology, has announced today the release of its new open-source AI model, Whisper-Medusa. The new model, based on a multi-head attention architecture, outperforms OpenAI's Whisper, the most popular and best available AI speech recognition model, by performing 50% faster with no loss in performance.

The automatic speech recognition market size is projected to grow to $7.14 billion this year. As voice becomes an integrated feature in most connected devices and AI chatbots, speech recognition has emerged as a vital technology field. Amid this rapid expansion, OpenAI disrupted the automatic speech recognition landscape by releasing Whisper, an open-source model considered superior to any other commercial or open-source speech recognition model available today. Whisper, with more than 5 million downloads per month, has become the gold standard for automatic speech recognition systems and is powering tens of thousands of applications.

aiOla's new open-source model, Whisper-Medusa, greatly improves the speed compared to Whisper by altering how the model predicts tokens. While Whisper predicts one token at a time, Whisper-Medusa can predict ten at a time, resulting in a 50% increase in speech prediction speed and generation runtime. As a result of this significant advancement, aiOla has decided to release the model's weights and code today on GitHub and Hugging Face for the community to access.

"Creating Whisper-Medusa was not an easy task, but its significance to the community is profound," said Gill Hetz, VP of Research at aiOla."Improving the speed and latency of LLMs is much easier to do than with automatic speech recognition systems. The encoder and decoder architectures present unique challenges due to the complexity of processing continuous audio signals and handling noise or accents. We addressed these challenges by employing our novel multi-head attention approach, which resulted in a model with nearly double the prediction speed while maintaining Whisper's high levels of accuracy. It's a major feat, and we are very proud to be the first in the industry to successfully leverage multi-head attention architecture for automatic speech recognition systems and bring it to the public. "

Whisper-Medusa, based on multi-head attention, is trained using weak supervision. In this process, the main components of Whisper are initially frozen while additional parameters are trained. This training process involves using Whisper to transcribe audio datasets and employing these transcriptions as labels for training Medusa's additional token prediction modules. aiOla currently offers Whisper-Medusa as a 10-head model, with future plans to release a 20-head version with equivalent accuracy.

About aiOla:

aiOla's patented technology comprehends over 100 languages, and discerns jargon, abbreviations, and acronyms, demonstrating a low error rate even in noisy environments. aiOla's technology converts manual processes in critical industries into data-driven, paperless, AI-powered workflows through cutting-edge speech recognition.

Contact:
Ali Goldberg
Concrete Media for aiOla
[email protected] 

SOURCE aiOla

Modal title

Also from this source

aiOla unveils Drax, an open-source speech model with state-of-the-art accuracy and up to 5× faster than models from direct competitors

aiOla unveils Drax, an open-source speech model with state-of-the-art accuracy and up to 5× faster than models from direct competitors

aiOla, a voice-AI lab advancing speech recognition technology, is announcing today Drax, an open-source AI model that brings flow-matching-based...

UST Invests in aiOla to Scale Hands-free, Voice-agentic Automation for Frontline Operations Globally

UST Invests in aiOla to Scale Hands-free, Voice-agentic Automation for Frontline Operations Globally

UST, a leading AI and technology transformation solutions company, has strengthened its presence at the intersection of human-AI interaction with its ...

More Releases From This Source

Explore

Computer & Electronics

Computer & Electronics

Computer Software

Computer Software

Computer Software

Computer Software

Artificial Intelligence

Artificial Intelligence

News Releases in Similar Topics

Contact PR Newswire

  • +972-77-2005042
    from 8 AM - 11 PM IL

Global Sites

  • APAC
  • APAC - Traditional Chinese
  • Asia
  • Brazil
  • Canada
  • Czech
  • Denmark
  • Finland
  • France
  • Germany

 

  • India
  • Indonesia
  • Israel
  • Italy
  • Mexico
  • Middle East
  • Middle East - Arabic
  • Netherlands
  • Norway
  • Poland

 

  • Portugal
  • Russia
  • Slovakia
  • Spain
  • Sweden
  • United Kingdom
  • United States

Do not sell or share my personal information:

  • Submit via [email protected] 
  • Call Privacy toll-free: 877-297-8921
Global Sites
  • Asia
  • Brazil
  • Canada
  • Csezh
  • Denmark
  • Finland
  • France
  • Germany
  • India
  • Israel
  • Italie
  • Mexico
  • Middle East
  • Netherlands
  • Norway
  • Poland
  • Portugal
  • Russia
  • Slovakia
  • Spain
  • Sweden
  • United Kingdom
  • United States
+972-77-2005042
from 8 AM - 11 PM IL
  • Terms of Use
  • Privacy Policy
  • Information Security Policy
  • Site Map
  • Cookie Settings
Copyright © 2025 Cision US Inc.