poster

EMNLP 2021

November 08, 2021

Live on Underline

Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent


