
Sergey Edunov
Facebook, AI Research
multilinguality
nmt
bitext mining
positional encoding
large-scale
wmt evaluation
continual pre-training
long context
data mix recipe
1
presentations
Presentations

Effective Long-Context Scaling of Foundation Models
Wenhan Xiong and 20 other authors