Siwen Luo

Post-graduate student @ The University of Sydney

scene graph

document layout analysis

graph convolutional networks

multi-modal learning

text to image generation

document component detection

2 presentations

29 views

SHORT BIO

Siwen Luo is currently a final-year PhD student at the School of Computer Science, The University of Sydney. Her research lies at the intersection of computer vision and natural language processing and aims to explore and develop interpretable multimodal models. Her work spans a range of multimodal tasks, including Visual Question Answering, text-to-image generation, and document layout analysis. She has published several papers at top-tier NLP conferences and received the Best Paper Award at ICONIP 2020.

Presentations

Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis

Siwen Luo and 1 other author

VICTR: Visual Information Captured Text Representation for Text-to-Image Multimodal Tasks

Caren Han and 4 other authors
