EMNLP 2025

November 05, 2025

Suzhou, China

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.

Recent advances in large language models (LLMs) have boosted performance across a broad spectrum of natural‑language tasks, yet no single model excels uniformly across domains. Sending each query to the most suitable model mitigates this limitation, but deciding among all available LLMs for each query is prohibitively expensive. Both the accuracy and the latency can improve if the decision space for the model choice is first narrowed, followed by selecting the suitable model for the given query. We introduce Select-then-Route (StR), a two‑stage framework that first selects a small, task‑appropriate pool of LLMs and then routes each query within that pool through an adaptive cascade. StR first employs a lightweight, taxonomy‑guided selector that maps each query to models proven proficient for its semantic class (e.g., reasoning, code, summarisation). Within the selected pool, a confidence‑based cascade begins with the cheapest model and escalates only when a multi‑judge agreement test signals low reliability. Across six public benchmarks of various domains, StR improves the end‑to‑end accuracy from 91.7% (best single model) to 94.3% while reducing inference cost by 4X. Because both the taxonomy and multi-judge evaluation thresholds are tunable, StR exposes a smooth cost–accuracy frontier, enabling users to dial in the trade‑off that best fits their latency and budget constraints.

Downloads

Paper

Next from EMNLP 2025

QuackIR: Retrieval in DuckDB and Other Relational Database Management Systems
poster

QuackIR: Retrieval in DuckDB and Other Relational Database Management Systems

EMNLP 2025

Jimmy Lin
Zijian Chen and 2 other authors

05 November 2025

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved