Junyuan Shang
Senior algorithm engineer @ Baidu Online Network Technology (Beijing) Co., Ltd
large language models
ensemble
efficient inference
pre-trained model
long-document, retrospective, recurrence mechanism
large language model
semeval2022-task7
winning system
kv cache eviction
efficient model architecture
latent thinking
3
presentations
2
number of views
SHORT BIO
Senior Algorithm Engineer@Baidu NLP working on large-scale pre-trained models and its application.