Avi Caciularu

Research Scientist @ Google

language models

evaluation

question answering

nlp

probing

large language models

benchmark

retrieval

word embeddings

interpretability

search

language modeling

transformers

tokenization

transformer

16 presentations

21 views

SHORT BIO

Avi Caciularu received the B.Sc. degree (cum laude) in computer science and electrical engineering and the M.Sc. degree in electrical engineering from Tel Aviv University in 2018 and 2019, respectively. He is currently pursuing his Ph.D. degree in computer science with Bar-Ilan University, and working as a research intern with Google Research. His research interests include topics in semantics and representation learning for natural language processing, mostly for multi-document tasks.

Presentations

Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance

Omer Goldman and 5 other authors

Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks

Alon Jacovi and 3 other authors

Optimizing Retrieval-augmented Reader Models via Token Elimination

Moshe Berchansky and 4 other authors

The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models

Aviv Slobodkin and 4 other authors

A Comprehensive Evaluation of Tool-Assisted Generation Strategies

Alon Jacovi and 5 other authors

Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering

Avi Caciularu and 4 other authors

Revisiting Sentence Union Generation as a Testbed for Text Consolidation

Eran Hirsch and 5 other authors

QASem Parsing: Text-to-text Modeling of QA-based Semantics

Ayal Klein and 5 other authors

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Avi Caciularu and 3 other authors

Cross-document Event Coreference Search: Task, Dataset and Modeling

Alon Eirew and 2 other authors

LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models

Mor Geva and 7 other authors

Long Context Question Answering via Supervised Contrastive Learning

Avi Caciularu and 3 other authors

Proposition-Level Clustering for Multi-Document Summarization

Ori Ernst and 6 other authors

CDLM: Cross-Document Language Modeling

Avi Caciularu and 5 other authors

Denoising Word Embeddings by Averaging in a Shared Space

Avi Caciularu and 2 other authors
