Accessibility Statement Skip Navigation
  • Resources
  • Investor Relations
  • Journalists
  • Agencies
  • Client Login
  • Send a Release
Return to PR Newswire homepage
  • News
  • Products
  • Contact
When typing in this field, a list of search results will appear and be automatically updated as you type.

Searching for your content...

No results found. Please change your search terms and try again.
  • News in Focus
      • Browse News Releases

      • All News Releases
      • All Public Company
      • English-only
      • News Releases Overview

      • Multimedia Gallery

      • All Multimedia
      • All Photos
      • All Videos
      • Multimedia Gallery Overview

      • Trending Topics

      • All Trending Topics
  • Business & Money
      • Auto & Transportation

      • All Automotive & Transportation
      • Aerospace, Defense
      • Air Freight
      • Airlines & Aviation
      • Automotive
      • Maritime & Shipbuilding
      • Railroads and Intermodal Transportation
      • Supply Chain/Logistics
      • Transportation, Trucking & Railroad
      • Travel
      • Trucking and Road Transportation
      • Auto & Transportation Overview

      • View All Auto & Transportation

      • Business Technology

      • All Business Technology
      • Blockchain
      • Broadcast Tech
      • Computer & Electronics
      • Computer Hardware
      • Computer Software
      • Data Analytics
      • Electronic Commerce
      • Electronic Components
      • Electronic Design Automation
      • Financial Technology
      • High Tech Security
      • Internet Technology
      • Nanotechnology
      • Networks
      • Peripherals
      • Semiconductors
      • Business Technology Overview

      • View All Business Technology

      • Entertain­ment & Media

      • All Entertain­ment & Media
      • Advertising
      • Art
      • Books
      • Entertainment
      • Film and Motion Picture
      • Magazines
      • Music
      • Publishing & Information Services
      • Radio & Podcast
      • Television
      • Entertain­ment & Media Overview

      • View All Entertain­ment & Media

      • Financial Services & Investing

      • All Financial Services & Investing
      • Accounting News & Issues
      • Acquisitions, Mergers and Takeovers
      • Banking & Financial Services
      • Bankruptcy
      • Bond & Stock Ratings
      • Conference Call Announcements
      • Contracts
      • Cryptocurrency
      • Dividends
      • Earnings
      • Earnings Forecasts & Projections
      • Financing Agreements
      • Insurance
      • Investments Opinions
      • Joint Ventures
      • Mutual Funds
      • Private Placement
      • Real Estate
      • Restructuring & Recapitalization
      • Sales Reports
      • Shareholder Activism
      • Shareholder Meetings
      • Stock Offering
      • Stock Split
      • Venture Capital
      • Financial Services & Investing Overview

      • View All Financial Services & Investing

      • General Business

      • All General Business
      • Awards
      • Commercial Real Estate
      • Corporate Expansion
      • Earnings
      • Environmental, Social and Governance (ESG)
      • Human Resource & Workforce Management
      • Licensing
      • New Products & Services
      • Obituaries
      • Outsourcing Businesses
      • Overseas Real Estate (non-US)
      • Personnel Announcements
      • Real Estate Transactions
      • Residential Real Estate
      • Small Business Services
      • Socially Responsible Investing
      • Surveys, Polls and Research
      • Trade Show News
      • General Business Overview

      • View All General Business

  • Science & Tech
      • Consumer Technology

      • All Consumer Technology
      • Artificial Intelligence
      • Blockchain
      • Cloud Computing/Internet of Things
      • Computer Electronics
      • Computer Hardware
      • Computer Software
      • Consumer Electronics
      • Cryptocurrency
      • Data Analytics
      • Electronic Commerce
      • Electronic Gaming
      • Financial Technology
      • Mobile Entertainment
      • Multimedia & Internet
      • Peripherals
      • Social Media
      • STEM (Science, Tech, Engineering, Math)
      • Supply Chain/Logistics
      • Wireless Communications
      • Consumer Technology Overview

      • View All Consumer Technology

      • Energy & Natural Resources

      • All Energy
      • Alternative Energies
      • Chemical
      • Electrical Utilities
      • Gas
      • General Manufacturing
      • Mining
      • Mining & Metals
      • Oil & Energy
      • Oil and Gas Discoveries
      • Utilities
      • Water Utilities
      • Energy & Natural Resources Overview

      • View All Energy & Natural Resources

      • Environ­ment

      • All Environ­ment
      • Conservation & Recycling
      • Environmental Issues
      • Environmental Policy
      • Environmental Products & Services
      • Green Technology
      • Natural Disasters
      • Environ­ment Overview

      • View All Environ­ment

      • Heavy Industry & Manufacturing

      • All Heavy Industry & Manufacturing
      • Aerospace & Defense
      • Agriculture
      • Chemical
      • Construction & Building
      • General Manufacturing
      • HVAC (Heating, Ventilation and Air-Conditioning)
      • Machinery
      • Machine Tools, Metalworking and Metallurgy
      • Mining
      • Mining & Metals
      • Paper, Forest Products & Containers
      • Precious Metals
      • Textiles
      • Tobacco
      • Heavy Industry & Manufacturing Overview

      • View All Heavy Industry & Manufacturing

      • Telecomm­unications

      • All Telecomm­unications
      • Carriers and Services
      • Mobile Entertainment
      • Networks
      • Peripherals
      • Telecommunications Equipment
      • Telecommunications Industry
      • VoIP (Voice over Internet Protocol)
      • Wireless Communications
      • Telecomm­unications Overview

      • View All Telecomm­unications

  • Lifestyle & Health
      • Consumer Products & Retail

      • All Consumer Products & Retail
      • Animals & Pets
      • Beers, Wines and Spirits
      • Beverages
      • Bridal Services
      • Cannabis
      • Cosmetics and Personal Care
      • Fashion
      • Food & Beverages
      • Furniture and Furnishings
      • Home Improvement
      • Household, Consumer & Cosmetics
      • Household Products
      • Jewelry
      • Non-Alcoholic Beverages
      • Office Products
      • Organic Food
      • Product Recalls
      • Restaurants
      • Retail
      • Supermarkets
      • Toys
      • Consumer Products & Retail Overview

      • View All Consumer Products & Retail

      • Entertain­ment & Media

      • All Entertain­ment & Media
      • Advertising
      • Art
      • Books
      • Entertainment
      • Film and Motion Picture
      • Magazines
      • Music
      • Publishing & Information Services
      • Radio & Podcast
      • Television
      • Entertain­ment & Media Overview

      • View All Entertain­ment & Media

      • Health

      • All Health
      • Biometrics
      • Biotechnology
      • Clinical Trials & Medical Discoveries
      • Dentistry
      • FDA Approval
      • Fitness/Wellness
      • Health Care & Hospitals
      • Health Insurance
      • Infection Control
      • International Medical Approval
      • Medical Equipment
      • Medical Pharmaceuticals
      • Mental Health
      • Pharmaceuticals
      • Supplementary Medicine
      • Health Overview

      • View All Health

      • Sports

      • All Sports
      • General Sports
      • Outdoors, Camping & Hiking
      • Sporting Events
      • Sports Equipment & Accessories
      • Sports Overview

      • View All Sports

      • Travel

      • All Travel
      • Amusement Parks and Tourist Attractions
      • Gambling & Casinos
      • Hotels and Resorts
      • Leisure & Tourism
      • Outdoors, Camping & Hiking
      • Passenger Aviation
      • Travel Industry
      • Travel Overview

      • View All Travel

  • Policy & Public Interest
      • Policy & Public Interest

      • All Policy & Public Interest
      • Advocacy Group Opinion
      • Animal Welfare
      • Congressional & Presidential Campaigns
      • Corporate Social Responsibility
      • Domestic Policy
      • Economic News, Trends, Analysis
      • Education
      • Environmental
      • European Government
      • FDA Approval
      • Federal and State Legislation
      • Federal Executive Branch & Agency
      • Foreign Policy & International Affairs
      • Homeland Security
      • Labor & Union
      • Legal Issues
      • Natural Disasters
      • Not For Profit
      • Patent Law
      • Public Safety
      • Trade Policy
      • U.S. State Policy
      • Policy & Public Interest Overview

      • View All Policy & Public Interest

  • People & Culture
      • People & Culture

      • All People & Culture
      • Aboriginal, First Nations & Native American
      • African American
      • Asian American
      • Children
      • Diversity, Equity & Inclusion
      • Hispanic
      • Lesbian, Gay & Bisexual
      • Men's Interest
      • People with Disabilities
      • Religion
      • Senior Citizens
      • Veterans
      • Women
      • People & Culture Overview

      • View All People & Culture

      • In-Language News

      • Arabic
      • español
      • português
      • Česko
      • Danmark
      • Deutschland
      • España
      • France
      • Italia
      • Nederland
      • Norge
      • Polska
      • Portugal
      • Россия
      • Slovensko
      • Suomi
      • Sverige
  • Explore Our Platform
  • Plan Campaigns
  • Create with AI
  • Distribute Press Releases
  • Amplify Content
  • All Products
  • General Inquiries
  • Editorial Bureaus
  • Partnerships
  • Media Inquiries
  • Worldwide Offices
  • Hamburger menu
  • PR Newswire: news distribution, targeting and monitoring
  • Send a Release
    • ALL CONTACT INFO
    • Contact Us

      888-776-0942
      from 8 AM - 10 PM ET

  • Send a Release
  • Client Login
  • Resources
  • Blog
  • Journalists
  • RSS
  • News in Focus
    • Browse All News
    • Multimedia Gallery
    • Trending Topics
  • Business & Money
    • Auto & Transportation
    • Business Technology
    • Entertain­ment & Media
    • Financial Services & Investing
    • General Business
  • Science & Tech
    • Consumer Technology
    • Energy & Natural Resources
    • Environ­ment
    • Heavy Industry & Manufacturing
    • Telecomm­unications
  • Lifestyle & Health
    • Consumer Products & Retail
    • Entertain­ment & Media
    • Health
    • Sports
    • Travel
  • Policy & Public Interest
  • People & Culture
    • People & Culture
  • Send a Release
  • Client Login
  • Resources
  • Blog
  • Journalists
  • RSS
  • Explore Our Platform
  • Plan Campaigns
  • Create with AI
  • Distribute Press Releases
  • Amplify Content
  • All Products
  • Send a Release
  • Client Login
  • Resources
  • Blog
  • Journalists
  • RSS
  • General Inquiries
  • Editorial Bureaus
  • Partnerships
  • Media Inquiries
  • Worldwide Offices
  • Send a Release
  • Client Login
  • Resources
  • Blog
  • Journalists
  • RSS

Stop AI from Guessing: Appier Enables Agents to Assess Confidence Before Acting
  • APAC - English
  • USA - English

Appier Company Logo (PRNewsfoto/Appier)

News provided by

Appier

Mar 24, 2026, 03:35 ET

Share this article

Share toX

Share this article

Share toX

New Framework Boosts Reliability, Cost Efficiency, and Scalability for Enterprise AI

SINGAPORE, March 24, 2026 /PRNewswire/ -- As an AI-native Agentic AI-as-a-Service (AaaS) company, Appier today announced its latest research paper, On Calibration of Large Language Models: From Response to Capability, as part of its ongoing investment in advanced AI innovation. The study introduces Capability Calibration[1]—a new framework designed to address the overconfidence and hallucination challenges of large language models (LLMs) by enabling AI systems to better assess their own ability to solve a given task.

This research equips AI agents with a critical capability: estimating the likelihood of solving a problem before generating an answer. By introducing a quantifiable self-assessment mechanism, AI systems can make more reliable decisions and allocate computational resources more efficiently—improving the reliability, cost efficiency, and scalability of enterprise AI deployments.

From Response Accuracy to Problem-Solving Capability
Traditional LLM calibration focuses on response-level confidence, estimating whether a single generated answer is correct. However, because LLM outputs are inherently stochastic, the same query may produce different responses across multiple attempts. Therefore, a single response often fails to reflect the model's true capability.

In practice, organizations are less concerned with whether one answer is correct and more interested in whether a model can consistently solve the task. Appier's capability calibration framework addresses this by shifting evaluation from single-response confidence to the model's expected success rate for a given query. This moves the evaluation target from a single answer to the model's broader problem-solving capability, providing a more practical measure of real-world performance.

Teaching AI Agents to "Know Their Limits"
"AI agents should not only generate answers but also understand the limits of their own capabilities," said Chih-Han Yu, CEO and Co-Founder of Appier. "With capability calibration, an agent can estimate its probability of success before responding and allocate resources intelligently. Simple queries can be handled quickly, while complex tasks can automatically leverage stronger models or additional compute. This transforms AI from a passive tool into a system that actively manages resources, optimizes costs, and improves decision quality—an essential foundation for scaling enterprise-grade AI agents."

Experimental Results: High-Quality Calibration at Low Cost
The research clarifies the theoretical relationship between capability calibration and traditional response calibration[2], and evaluates multiple confidence estimation approaches across three large language models and seven datasets covering knowledge-intensive and reasoning-intensive tasks. Methods tested include:

  • Verbalized confidence[3]: The model explicitly states its confidence, in text or as a percentage.
  • P(True)[4]: Estimates the probability that the answer is correct based on generation signals.
  • Linear probes[5]: Use internal model signals to assess whether it truly understands.

Results show that the linear probe method provides the best balance between cost and performance, with computational cost even lower than generating a single token while maintaining reliable confidence estimation.

Two Key Applications: Improving Inference Efficiency and Resource Allocation
The framework enables two practical use cases. First, pass@k[6] prediction, a widely used metric for evaluating LLMs in complex tasks. Capability-calibrated confidence estimates the probability that a model will produce at least one correct answer after k attempts, without actually generating multiple responses. Second, inference resource allocation, where computational resources are dynamically distributed based on predicted task difficulty. Harder problems receive more attempts, allowing more tasks to be solved within the same compute budget.

Building a Decision Foundation for Trustworthy AI Agents
Capability calibration enables AI agents to establish a stable and quantifiable confidence signal before taking action. This allows agents to determine whether they can solve a task independently, when to call external tools, and when to seek human assistance—helping AI systems operate more reliably in uncertain environments.

Advancing Capability Calibration to Power Agentic AI Applications
Looking ahead, Appier's AI research team will continue advancing capability calibration by improving model evaluation methods and expanding the framework to applications such as model routing, human–AI collaboration, and trustworthy AI systems. Leveraging Appier's deep expertise in AI and marketing technology, these research advances will be translated into product capabilities, accelerating the deployment of Agentic AI in advertising and marketing decision-making and helping enterprises operate more efficiently in an increasingly complex digital landscape.

About Appier
Appier (TSE: 4180) is an AI-native Agentic AI as a Service (AaaS) company that empowers business decision-making with cutting-edge AdTech and MarTech solutions. Founded in 2012 with the vision of "Making AI Easy by making software intelligent," Appier endeavors to help businesses turn AI into ROI with its Ad Cloud, Personalization Cloud, and Data Cloud solutions. Now Appier has 17 offices across APAC, the US and EMEA, and is listed on the Tokyo Stock Exchange. Visit www.appier.com for more company information, and visit ir.appier.com/en/ for more IR information.

[1] Capability Calibration – A method for evaluating an AI model's overall problem-solving ability by estimating the probability that it will successfully answer a given query, rather than judging a single response.

[2] Response Calibration – A traditional AI evaluation approach that measures a model's confidence in the correctness of a single generated response.

[3] Verbalized Confidence – A method where the model explicitly states its confidence in the correctness of an answer in natural language, such as a percentage or confidence level.

[4] P(True) – A technique that estimates the probability that an answer is correct by analyzing the token probability distribution generated by the model.

[5] Linear Probe – A lightweight linear classifier trained on a model's internal representations to analyze whether the model has learned specific knowledge or capabilities, and to estimate confidence.

[6] pass@k – A common AI evaluation metric estimating the probability that a model produces at least one correct answer within k attempts, reflecting the need to explore multiple reasoning paths in complex tasks.

For media queries, please email [email protected]

SOURCE Appier

21%

more press release views with 
Request a Demo

Modal title

Also from this source

Stop AI from Guessing: Appier Enables Agents to Assess Confidence Before Acting

Stop AI from Guessing: Appier Enables Agents to Assess Confidence Before Acting

As an AI-native Agentic AI-as-a-Service (AaaS) company, Appier today announced its latest research paper, On Calibration of Large Language Models:...

Appier Releases Whitepaper on the Future of Autonomous Marketing with Agentic AI

Appier Releases Whitepaper on the Future of Autonomous Marketing with Agentic AI

Appier, an AI-native AaaS (Agentic AI as a Service) company, today announced the release of its latest whitepaper, "The Future of Autonomous...

More Releases From This Source

Explore

STEM (Science, Tech, Engineering, Math)

STEM (Science, Tech, Engineering, Math)

Computer & Electronics

Computer & Electronics

Artificial Intelligence

Artificial Intelligence

The Latest Artificial Intelligence News

The Latest Artificial Intelligence News

News Releases in Similar Topics

Contact PR Newswire

  • Call PR Newswire at 888-776-0942
    from 8 AM - 9 PM ET
  • Chat with an Expert
  • General Inquiries
  • Editorial Bureaus
  • Partnerships
  • Media Inquiries
  • Worldwide Offices

Products

  • For Marketers
  • For Public Relations
  • For IR & Compliance
  • For Agency
  • All Products

About

  • About PR Newswire
  • About Cision
  • Become a Publishing Partner
  • Become a Channel Partner
  • Careers
  • Accessibility Statement
  • APAC
  • APAC - Simplified Chinese
  • APAC - Traditional Chinese
  • Brazil
  • Canada
  • Czech
  • Denmark
  • Finland
  • France
  • Germany
  • India
  • Indonesia
  • Israel
  • Italy
  • Japan
  • Korea
  • Mexico
  • Middle East
  • Middle East - Arabic
  • Netherlands
  • Norway
  • Poland
  • Portugal
  • Russia
  • Slovakia
  • Spain
  • Sweden
  • United Kingdom
  • Vietnam

My Services

  • All New Releases
  • Platform Login
  • ProfNet
  • Data Privacy

Do not sell or share my personal information:

  • Submit via [email protected] 
  • Call Privacy toll-free: 877-297-8921

Contact PR Newswire

Products

About

My Services
  • All News Releases
  • Platform Login
  • ProfNet
Call PR Newswire at
888-776-0942
  • Terms of Use
  • Privacy Policy
  • Information Security Policy
  • Site Map
  • RSS
  • Cookies
Copyright © 2026 Cision US Inc.