Erklärung zur Barrierefreiheit Navigation überspringen
  • Zurück zu Global Sites
  • +44 (0)20 7454 5110
  • DSGVO
  • Journalisten
  • Weitere Informationen anfordern
PR Newswire: news distribution, targeting and monitoring
  • Nachrichten
  • Produkte
  • Kontakt
When typing in this field, a list of search results will appear and be automatically updated as you type.

Suche nach Ihrem Inhalt...

Keine Ergebnisse gefunden. Bitte verwenden Sie die erweiterte Suche, um den vollständigen Inhalt zu überprüfen.
  • Hamburger menu
  • PR Newswire: news distribution, targeting and monitoring
  • Weitere Informationen anfordern
    • Telefon

    • +44 (0)20 7454 5110 von 8 AM - 5 PM GMT

    • Kontakt
    • Kontakt

      +44 (0)20 7454 5110
      von 8 AM - 5 PM GMT

  • When typing in this field, a list of search results will appear and be automatically updated as you type.

  • Weitere Informationen anfordern
  • Journalisten
  • DSGVO
  • Weitere Informationen anfordern
  • Journalisten
  • DSGVO
  • Weitere Informationen anfordern
  • Journalisten
  • DSGVO
  • Weitere Informationen anfordern
  • Journalisten
  • DSGVO

Nota AI Reduces Memory Usage of Upstage's Solar LLM by 72%, Demonstrating Proprietary Quantization Technology
  • USA - English
  • Deutschland - Deutsch


Vom Nachrichtendienst

Nota AI

06 März, 2026, 07:00 GMT

Artikel teilen

Share toX

Artikel teilen

Share toX

New "Nota AI MoE Quantization" approach preserves model performance while significantly improving memory efficiency

SEOUL, South Korea, March 6, 2026 /PRNewswire/ -- Nota AI, an AI optimization technology company behind the Nota AI brand, announced that it has developed a next-generation quantization technology that significantly compresses the size of Solar, a high-performance large language model (LLM) developed by Upstage, while maintaining high accuracy. The breakthrough reduces inference costs and improves processing speed without sacrificing performance.

Continue Reading
Nota AI Reduces Memory Usage of Upstage’s Solar LLM by 72%, Demonstrating Proprietary Quantization Technology
Nota AI Reduces Memory Usage of Upstage’s Solar LLM by 72%, Demonstrating Proprietary Quantization Technology

The development was carried out as part of the "Sovereign AI Foundation Model Project" led by South Korea's Ministry of Science and ICT. By applying Nota AI's lightweighting and optimization technologies to Solar Open 100B, the company significantly improved memory efficiency while preserving model performance. The achievement lowers the memory requirements of the 100B-parameter model while maintaining its capabilities, enabling more practical deployment of Korean AI foundation models in physical AI environments such as mobility and robotics.

The newly developed technology focuses on addressing technical challenges associated with the Mixture of Experts (MoE) architecture, which is rapidly gaining adoption in next-generation LLMs. Conventional quantization methods typically compress the entire model uniformly without considering the distinct characteristics of individual expert models. To overcome this limitation, Nota AI developed a proprietary algorithm optimized for MoE architectures, called "Nota AI MoE Quantization."

The approach is designed to minimize quantization distortion during the inference process of MoE models. Unlike conventional methods that uniformly reduce precision across all operations, Nota AI's algorithm selectively preserves precision in critical components while compressing less sensitive parts of the model. This enables effective model compression while minimizing performance loss.

Applying the technology to the Solar 100B model yielded significant improvements compared with conventional quantization methods. Nota AI successfully reduced Solar's memory usage from 191.2GB to 51.9GB, representing a 72.8% reduction. At the same time, the model maintained performance levels comparable to the original version, achieving a Perplexity (PPL) score of 6.81, close to the baseline model's 6.06. In contrast, some generic quantization approaches resulted in performance degradation exceeding fivefold. Nota AI has filed a patent application for the technology to strengthen its intellectual property portfolio.

While conventional quantization techniques often sacrifice model performance to reduce memory usage, Nota AI's technology demonstrates that it is possible to maintain performance while delivering AI services faster and to more users on limited GPU infrastructure. As a result, enterprises can deploy large-scale LLMs more easily on their own devices—models that were previously difficult to implement due to hardware constraints.

The significant reduction in Solar 100B's memory footprint while preserving performance also creates new opportunities for deploying high-performance AI in real-world on-device environments, including robotics and automotive systems. Additionally, the technology enables organizations facing limited access to high-end GPU infrastructure to serve more users on the same hardware, directly contributing to lower operational costs.

"This achievement is meaningful because we were able to apply Nota AI's proprietary quantization technology to Solar 100B, a Korean AI foundation model, significantly reducing memory usage while maintaining performance," said Myungsu Chae, CEO of Nota AI, said, "As demand grows for deploying large-scale models directly on devices, Nota AI's lightweighting and optimization technologies will play a critical role in enabling high-performance AI."

Photo - https://mma.prnewswire.com/media/2926544/PR__Solar_NotaAI_260220.jpg 

Modal title

Mehr von dieser Quelle

Nota AI reduziert den Speicherbedarf des Solar LLM von Upstage um 72 % und demonstriert seine firmeneigene Quantisierungstechnologie

Nota AI reduziert den Speicherbedarf des Solar LLM von Upstage um 72 % und demonstriert seine firmeneigene Quantisierungstechnologie

Nota AI, ein Unternehmen für KI-Optimierungstechnologie, gab bekannt, eine Quantisierungstechnologie der nächsten Generation entwickelt zu haben, mit ...

Nota AI unterzeichnet Vertrag über die Lieferung von Technologien zur Optimierung von KI-Modellen mit FuriosaAI und erweitert damit die Kommerzialisierung auf Rechenzentren

Nota AI unterzeichnet Vertrag über die Lieferung von Technologien zur Optimierung von KI-Modellen mit FuriosaAI und erweitert damit die Kommerzialisierung auf Rechenzentren

Nota AI (Geschäftsführer Myungsu Chae), ein auf die Komprimierung und Optimierung von KI-Modellen spezialisiertes Unternehmen, gab bekannt, dass es...

Weitere Pressemitteilungen von dieser Quelle

Suchen

Artificial Intelligence

Artificial Intelligence

The Latest Artificial Intelligence News

The Latest Artificial Intelligence News

Cloud Computing/Internet of Things

Cloud Computing/Internet of Things

Data Analytics

Data Analytics

Pressemeldungen zu ähnlichen Themen

Kontaktaufnahme zu PR Newswire

  • +44 (0)20 7454 5110
    von 8 AM - 5 PM GMT

Globale Seiten

  • APAC
  • APAC – Traditionelles Chinesisch
  • Asien
  • Brasilien
  • Kanada
  • Tschechische Republik
  • Dänemark
  • Finnland
  • Frankreich
  • Deutschland

 

  • Indien
  • Indonesia
  • Israel
  • Italien
  • Mexiko
  • Naher Osten
  • Naher Osten – Arabisch
  • Niederlande
  • Norwegen
  • Polen

 

  • Portugal
  • Russland
  • Slowakei
  • Spanien
  • Schweden
  • Großbritannien
  • Vereinigte Staaten

Do not sell or share my personal information:

  • Submit via [email protected] 
  • Call Privacy toll-free: 877-297-8921
Globale Seiten
  • Asien
  • Brasilien
  • Kanada
  • Tschechische Republik
  • Dänemark
  • Finnland
  • Frankreich
  • Deutschland
  • Indien
  • Israel
  • Italien
  • Mexiko
  • Naher Osten
  • Niederlande
  • Norwegen
  • Polen
  • Portugal
  • Russland
  • Slowakei
  • Spanien
  • Schweden
  • Großbritannien
  • Vereinigte Staaten
+44 (0)20 7454 5110
von 8 AM - 5 PM GMT
  • Terms of Use
  • Privacy Policy
  • Information Security Policy
  • Site Map
  • Cookie Settings
Copyright © 2026 Cision US Inc.