Lecture image placeholder

Premium content

Access to this content requires a subscription. You must be a premium user to view this content.

Monthly subscription - $9.99Pay per view - $4.99Access through your institutionLogin with Underline account
Need help?
Contact us
Lecture placeholder background
VIDEO DOI: https://doi.org/10.48448/nqb6-ng84

poster

ACL 2024

August 12, 2024

Bangkok, Thailand

Monotonic Representation of Numeric Attributes in Language Models

keywords:

numeric properties

monotonic representations

world knowledge

interpretability

Language models (LMs) can express factual knowledge involving numeric properties such as Karl Popper was born in 1902. However, how this information is encoded in the model’s internal representations is not understood well. Here, we introduce a method for finding and editing representations of numeric properties such as an entity’s birth year. We find directions that encode numeric properties monotonically, in an interpretable fashion. When editing representations along these directions, LM output changes accordingly. For example, by patching activations along a "birthyear" direction we can make the LM express an increasingly late birthyear. Property-encoding directions exist across several numeric properties in all models under consideration, suggesting the possibility that monotonic representation of numeric properties consistently emerges during LM pretraining. Code: https://github.com/bheinzerling/numeric-property-repr A long version of this short paper is available at: https://arxiv.org/abs/2403.10381

Downloads

SlidesTranscript English (automatic)

Next from ACL 2024

Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts
poster

Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts

ACL 2024

+3Yixin Cao
Jiahao Ying and 5 other authors

12 August 2024

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2023 Underline - All rights reserved