
Samuel Cahyawijaya
Student @ HKUST
benchmark
indonesian
reinforcement learning
conversational question answering
question rewriting
multilingual
low-resource
code-switching
code-mixing
machine translation
sentiment analysis
curriculum learning
style
low-resource languages
evaluation
18 presentations
29 views
SHORT BIO
Samuel Cahyawijaya is a PhD student at HKUST. He is a passionate machine learning researcher and enjoys working on data technology, machine learning, and autonomous systems.
Presentations

Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages
Samuel Cahyawijaya and 1 other author

LLMs Are Few-Shot In-Context Low-Resource Language Learners
Samuel Cahyawijaya and 2 other authors

Multilingual Large Language Models Are Not (Yet) Code-Switchers
Ruochen Zhang and 4 other authors

GlobalBench: A Benchmark for Global Progress in Natural Language Processing
Yueqi Song and 10 other authors

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Yejin Bang and 12 other authors

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems
Bryan Wilie and 5 other authors

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Samuel Cahyawijaya

InstructAlign: High-and-Low Resource Language Alignment via Continual Crosslingual Instruction Tuning
Samuel Cahyawijaya

NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
Genta Indra Winata and 9 other authors

How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling
Samuel Cahyawijaya and 6 other authors

Every picture tells a story: Image-grounded controllable stylistic story generation
Romain Barraud and 5 other authors

SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study
Samuel Cahyawijaya and 6 other authors

Can Question Rewriting Help Conversational Question Answering?
Etsuko Ishii and 3 other authors

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Alham Fikri Aji and 10 other authors

Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach
Etsuko Ishii and 4 other authors