
Xiaoyu Tan
large language models
language models
weak supervision
graph theory
entity matching
in-context learning
transportation
prompt learning
temporal point process
semi-markov decision process
geocoding
instruction fine-tuning
reinforcement learning
supervised fine-tuning
4
presentations
2
number of views
SHORT BIO
Tan Xiaoyu obtained his PhD from the National University of Singapore and is currently working at INF Technology (Shanghai) Co., Ltd. in the field of Large Language Model algorithms. His academic interests include instruction fine-tuning of Large Language Models, generalization, and causality.
Presentations

ULMR: Unlearning Large Language Models via Negative Response and Model Parameter Average
Shaojie Shi and 8 other authors

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching | VIDEO
Xiaoyu Tan

Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness
Xiaoyu Tan

Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu and 5 other authors