EMNLP 2025

November 07, 2025

Suzhou, China

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.

Neural models for Japanese pronunciation estimation often suffer from errors such as hallucinations (generating pronunciations that are not grounded in the input) and omissions (skipping parts of the input). Although attention-based alignment has been used to detect such errors, selecting reliable attention heads is difficult, and developing methods that can both detect and correct these errors remains challenging.

In this paper, we propose a simple method called existence-based alignment check. In this approach, we consider alignment candidates independently extracted from all attention heads, and check whether at least one of these candidates satisfies two conditions derived from the linguistic properties of Japanese pronunciation: monotonicity and pronunciation length per character. We generate multiple hypotheses using beam search and use the alignment check as a filtering mechanism to correct hallucinations and omissions.

We apply this method to a dataset of Japanese facility names and demonstrate that it improves pronunciation estimation accuracy by over 2.5\%.

Downloads

Paper

Next from EMNLP 2025

More Data or Better Data? A Critical Analysis of Data Selection and Synthesis for Mathematical Reasoning
poster

More Data or Better Data? A Critical Analysis of Data Selection and Synthesis for Mathematical Reasoning

EMNLP 2025

+3Fei Tan
Simin Guo and 5 other authors

07 November 2025

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2026 Underline - All rights reserved