
Junyuan Shang
Senior algorithm engineer @ Baidu Online Network Technology (Beijing) Co., Ltd
large language models
ensemble
efficient inference
pre-trained model
long-document, retrospective, recurrence mechanism
semeval2022-task7
winning system
kv cache eviction
3 presentations
1 view
SHORT BIO
Senior Algorithm Engineer at Baidu NLP, working on large-scale pre-trained models and their applications.
Presentations

NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time
Yilong Chen and 9 other authors

X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
Junyuan Shang

ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
Siyu Ding and 3 other authors