VIDEO DOI: https://doi.org/10.48448/esmy-ta22

poster

ACL 2024

August 22, 2024

Bangkok, Thailand

MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs

keywords: token weighting, semantic contribution, semantic importance, probability-based UE methods, generative LLMs, uncertainty estimation

Generative Large Language Models (LLMs) are widely utilized for their excellence in various tasks. However, their tendency to produce inaccurate or misleading outputs poses a potential risk, particularly in high-stakes environments. Therefore, estimating the correctness of generative LLM outputs is an important task for enhanced reliability. Uncertainty Estimation (UE) in generative LLMs is an evolving domain, where SOTA probability-based methods commonly employ length-normalized scoring. In this work, we propose Meaning-Aware Response Scoring (MARS) as an alternative to length-normalized scoring for UE methods. MARS is a novel scoring function that considers the semantic contribution of each token in the generated sequence in the context of the question. We demonstrate that integrating MARS into UE methods results in a universal and significant improvement in UE performance. We conduct experiments using three distinct closed-book question-answering datasets across five popular pre-trained LLMs. Lastly, we validate the efficacy of MARS on a Medical QA dataset. Code can be found at https://github.com/Ybakman/LLM_Uncertainty.
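
To make the contrast in the abstract concrete, the sketch below compares length-normalized scoring with a generic token-weighted score. It is an illustration under stated assumptions, not the MARS formula itself: the abstract does not give the exact weighting, so the function names (`length_normalized_score`, `weighted_score`), the example probabilities, and the importance weights are all hypothetical. The only idea taken from the abstract is that tokens contributing more to the meaning of the answer should influence the score more than filler tokens.

```python
import math


def length_normalized_score(token_probs):
    """Geometric mean of token probabilities: exp(mean(log p_i)).

    Every token gets the same weight, 1 / len(token_probs).
    """
    log_sum = sum(math.log(p) for p in token_probs)
    return math.exp(log_sum / len(token_probs))


def weighted_score(token_probs, weights):
    """Weighted geometric mean: prod(p_i ** w_i), weights normalized to sum to 1.

    `weights` stands in for a per-token semantic-importance estimate.
    How such weights are obtained is NOT shown here (assumption).
    """
    if len(token_probs) != len(weights):
        raise ValueError("token_probs and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    log_score = sum((w / total) * math.log(p)
                    for p, w in zip(token_probs, weights))
    return math.exp(log_score)


# Hypothetical answer with four tokens; the third carries most of the meaning
# and is also the one the model is least sure about.
token_probs = [0.9, 0.9, 0.2, 0.9]
importance = [0.1, 0.1, 0.7, 0.1]  # made-up weights for illustration

ln_score = length_normalized_score(token_probs)
w_score = weighted_score(token_probs, importance)
```

With uniform weights the weighted score reduces to the length-normalized one. With the made-up weights above, the low-probability token dominates, so the weighted score is lower than the length-normalized score, reflecting more uncertainty about the part of the answer that matters.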


