
Yihong Liu
low-resource languages
benchmark
reasoning
normalizing flows
unsupervised machine translation
representation learning
transliteration
multilingual embeddings
turkish
llm
evaluation dataset
colexification
continued pretraining
multilinguaility
embedding initialization
6
presentations
1
number of views
Presentations

SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation
Raoyuan Zhao and 5 other authors

Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
Orgest Xhelili and 2 other authors

TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
Yihong Liu and 3 other authors

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
Yihong Liu and 3 other authors

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs
Yihong Liu and 4 other authors

Flow-Adapter Architecture for Unsupervised Machine Translation
Yihong Liu and 2 other authors