
Yifan Qiao
University of California, Santa Barbara
machine translation
low resource languages
quantization
efficient
document ranking
multi-lingual large language models
2 presentations
Presentations

Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval
Yifan Qiao and 4 other authors

Compact Token Representations with Contextual Quantization for Efficient Document Re-ranking
Yingrui Yang and 2 other authors