
Minghan Li
University of Waterloo
dense retrieval
information retrieval
multilinguality
open-domain question answering
efficiency
out-of-domain
set prediction
reranking
attribution analysis
candidate set pruning
error control
zero-shot robustness
hybrid representations
lexical and semantic matching
multi-vector retrieval
6
presentations
3
number of views
SHORT BIO
I’m a CS PhD student (3rd year) at Universtiy of Waterloo, supervised by Professor Jimmy Lin. I am interested in building efficient, scalable, and robust neural models for information retrieval and its downstream applications. My research goal is to design task-aware retrievers to help large language models efficiently aggregate knowledge from large, unstructured databases.
Presentations

Unifying Multimodal Retrieval via Document Screenshot Embedding
Xueguang Ma and 4 other authors

CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders.
Xinyu Zhang and 2 other authors

CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval
Minghan Li and 1 other author

Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval
Sheng-Chieh Lin and 2 other authors

Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking
Minghan Li and 1 other author

An Encoder Attribution Analysis for Dense Passage Retriever in Open Domain Question Answering
Minghan Li