VIDEO DOI: https://doi.org/10.48448/7j76-dw24

Poster • ACL 2022 • May 24, 2022 • Dublin, Ireland

Softmax Bottleneck Makes Language Models Unable to Represent Multi-mode Word Distributions
