EMNLP 2025

November 07, 2025

Suzhou, China

Recent advancements in large language models (LLMs) have revolutionized natural language processing through their remarkable capabilities in understanding and executing diverse tasks. While supervised fine-tuning, particularly in Retrieval-Augmented Generation (RAG) scenarios, has proven effective for enhancing task-specific performance, it often leads to catastrophic forgetting, where models lose their previously acquired knowledge and general capabilities. Existing solutions either require access to general instruction data or face limitations in preserving the model's original distribution. To overcome these limitations, we propose SelfAug, a novel self-distribution alignment method. By aligning distributions through the logits of input sequences, SelfAug preserves the model’s semantic distribution, thereby simultaneously mitigating catastrophic forgetting and improving downstream task performance. Through extensive experiments, we show that SelfAug achieves a better balance between downstream task learning and the retention of general capabilities compared to existing methods. Our comprehensive empirical analysis reveals a direct correlation between distribution shifts and the severity of catastrophic forgetting in RAG scenarios, particularly highlighting how the absence of RAG capabilities in general instruction tuning leads to significant distribution shifts during fine-tuning.
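
As a rough illustration of the idea of aligning distributions through the logits of the input sequence, the sketch below adds a penalty to the fine-tuning loss that measures how far the fine-tuned model's next-token distributions over the input positions have drifted from those of the original (frozen) model. This is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the divergence, its direction, or how it is weighted, so the use of KL(reference || fine-tuned), the mean over positions, the weight `alpha`, and all function names here are hypothetical.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one vocabulary-sized list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def kl_divergence(ref_logits, new_logits):
    """KL(ref || new) for a single token position.

    Assumption: the abstract does not state which divergence or direction is used.
    """
    ref_lp = log_softmax(ref_logits)
    new_lp = log_softmax(new_logits)
    return sum(math.exp(r) * (r - n) for r, n in zip(ref_lp, new_lp))


def input_alignment_loss(ref_seq_logits, new_seq_logits):
    """Mean per-position KL over the input (prompt) positions only."""
    if not ref_seq_logits or len(ref_seq_logits) != len(new_seq_logits):
        raise ValueError("logit sequences must be non-empty and of equal length")
    per_position = [
        kl_divergence(r, n) for r, n in zip(ref_seq_logits, new_seq_logits)
    ]
    return sum(per_position) / len(per_position)


def total_loss(task_loss, ref_seq_logits, new_seq_logits, alpha=0.5):
    """Downstream task loss plus a weighted alignment penalty.

    `alpha` is a hypothetical weighting hyperparameter.
    """
    return task_loss + alpha * input_alignment_loss(ref_seq_logits, new_seq_logits)
```

In this sketch the reference logits would come from a frozen copy of the model before fine-tuning, so no general instruction data is needed: the penalty is computed on the same inputs used for the downstream task.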
