profile picture

Samuel Cahyawijaya

Student @ HKUST

benchmark

indonesian

reinforcement learning

conversational question answering

question rewriting

multilingual

low-resource

code-switching

code-mixing

machine translation

sentiment analysis

curriculum learning

style

low-resource languages

evaluation

18

presentations

29

number of views

SHORT BIO

Samuel Cahyawijaya is a PhD student at HKUST. He is a passionate machine learning researcher and enjoys working on data technology, machine learning, and autonomous systems.

Presentations

Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages

Samuel Cahyawijaya and 1 other author

LLMs Are Few-Shot In-Context Low-Resource Language Learners

Samuel Cahyawijaya and 2 other authors

Multilingual Large Language Models Are Not (Yet) Code-Switchers

Ruochen Zhang and 4 other authors

GlobalBench: A Benchmark for Global Progress in Natural Language Processing

Yueqi Song and 10 other authors

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

Yejin Bang and 12 other authors

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems

Bryan Wilie and 5 other authors

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

Samuel Cahyawijaya

InstructAlign: High-and-Low Resource Language Alignment via Continual CrosslingualInstruction Tuning

Samuel Cahyawijaya

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Samuel Cahyawijaya

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages

Genta Indra Winata and 9 other authors

How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling

Samuel Cahyawijaya and 6 other authors

Every picture tells a story: Image-grounded controllable stylistic story generation

Romain Barraud and 5 other authors

SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study

Samuel Cahyawijaya and 6 other authors

Can Question Rewriting Help Conversational Question Answering?

Etsuko Ishii and 3 other authors

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia

Alham Fikri Aji and 10 other authors

Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach

Etsuko Ishii and 4 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved