Lecture image placeholder

Premium content

Access to this content requires a subscription. You must be a premium user to view this content.

Monthly subscription - $9.99Pay per view - $4.99Access through your institutionLogin with Underline account
Need help?
Contact us
Lecture placeholder background
VIDEO DOI: https://doi.org/10.48448/wk8w-j979

poster

ACL 2024

August 13, 2024

Bangkok, Thailand

AttributionBench: How Hard is Automatic Attribution Evaluation?

keywords:

attribution evaluation

llm evaluation

benchmark

Modern generative search engines enhance the reliability of large language model (LLM) responses by providing cited evidence. However, evaluating the answer's attribution, i.e., whether every claim within the generated responses is fully supported by its cited evidence, remains an open problem. This verification, traditionally dependent on costly human evaluation, underscores the urgent need for automatic attribution evaluation methods. To bridge the gap in the absence of standardized benchmarks for these methods, we present \texttt{AttributionBench}, a comprehensive benchmark compiled from various existing attribution datasets. Our extensive experiments on \texttt{AttributionBench} reveal the challenges of automatic attribution evaluation, even for state-of-the-art LLMs. Specifically, our findings show that even a fine-tuned GPT-3.5 only achieves around 80\% macro-F1 under a binary classification formulation. A detailed analysis of more than 300 error cases indicates that a majority of failures stem from the model's inability to process nuanced information, and the discrepancy between the information the model has access to and that human annotators do.

Downloads

SlidesTranscript English (automatic)

Next from ACL 2024

SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graph
poster

SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graph

ACL 2024

+3
Hanzhu Chen and 5 other authors

13 August 2024

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2023 Underline - All rights reserved