
Avi Caciularu
Research Scientist @ Google
language models
evaluation
question answering
nlp
probing
large language models
benchmark
retrieval
word embeddings
interpretability
search
language modeling
transformers
tokenization
transformer
16
presentations
21
number of views
SHORT BIO
Avi Caciularu received the B.Sc. degree (cum laude) in computer science and electrical engineering and the M.Sc. degree in electrical engineering from Tel Aviv University in 2018 and 2019, respectively. He is currently pursuing his Ph.D. degree in computer science with Bar-Ilan University, and working as a research intern with Google Research. His research interests include topics in semantics and representation learning for natural language processing, mostly for multi-document tasks.
Presentations

Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance
omer goldman and 5 other authors

Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks
Alon Jacovi and 3 other authors

Optimizing Retrieval-augmented Reader Models via Token Elimination
Moshe Berchansky and 4 other authors

The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models
Aviv Slobodkin and 4 other authors

A Comprehensive Evaluation of Tool-Assisted Generation Strategies
Alon Jacovi and 5 other authors

Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering
Avi Caciularu and 4 other authors

Revisiting Sentence Union Generation as a Testbed for Text Consolidation
Eran Hirsch and 5 other authors

QASem Parsing: Text-to-text Modeling of QA-based Semantics
Ayal Klein and 5 other authors

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Avi Caciularu and 3 other authors

Cross-document Event Coreference Search: Task, Dataset and Modeling
Alon Eirew and 2 other authors

LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models
Mor Geva and 7 other authors

Long Context Question Answering via Supervised Contrastive Learning
Avi Caciularu and 3 other authors

Proposition-Level Clustering for Multi-Document Summarization
Ori Ernst and 6 other authors

CDLM: Cross-Document Language Modeling
Avi Caciularu and 5 other authors

CDLM: Cross-Document Language Modeling
Avi Caciularu and 5 other authors

Denoising Word Embeddings by Averaging in a Shared Space
Avi Caciularu and 2 other authors