profile picture

Xiaoyu Tan

large language models

language models

weak supervision

graph theory

entity matching

in-context learning

transportation

prompt learning

temporal point process

semi-markov decision process

geocoding

instruction fine-tuning

reinforcement learning

supervised fine-tuning

4

presentations

2

number of views

SHORT BIO

Tan Xiaoyu obtained his PhD from the National University of Singapore and is currently working at INF Technology (Shanghai) Co., Ltd. in the field of Large Language Model algorithms. His academic interests include instruction fine-tuning of Large Language Models, generalization, and causality.

Presentations

ULMR: Unlearning Large Language Models via Negative Response and Model Parameter Average

Shaojie Shi and 8 other authors

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching | VIDEO

Xiaoyu Tan

Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness

Xiaoyu Tan

Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes

Chao Qu and 5 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved