
Andrew McCallum
softmax bottleneck
mixture of softmax
hallucination
reranker
copy mechanism
repetition
BART
factuality
summarization
GPT-2
pointer network
T5
box embeddings
representation learning
next word distribution
24 presentations
43 views
1 citation
Presentations

Analysis of Plan-based Retrieval for Grounded Text Generation
Ameya Godbole and 5 other authors

Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval
Jonghyun Song and 4 other authors

Every Answer Matters: Evaluating Commonsense with Probabilistic Measures
Qi Cheng and 6 other authors

Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond
Haw-Shiuan Chang and 4 other authors

Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling
Haw-Shiuan Chang and 3 other authors

Low-Resource Compositional Semantic Parsing with Concept Pretraining
Subendhu Rongali and 5 other authors

Unsupervised Partial Sentence Matching for Cited Text Identification
Kathryn Ricci and 3 other authors

Softmax Bottleneck Makes Language Models Unable to Represent Multi-mode Word Distributions
Haw-Shiuan Chang and 1 other author

Event-Event Relation Extraction using Probabilistic Box Embedding
EunJeong Hwang and 5 other authors

Word2Box: Capturing Set-Theoretic Semantics of Words using Box Embeddings
Shib Dasgupta and 6 other authors

An Evaluative Measure of Clustering Methods Incorporating Hyperparameter Sensitivity
Siddhartha Mishra and 4 other authors

Sublinear Time Approximation of Text Similarity Matrices
Archan Ray and 3 other authors

Case-based Reasoning for Natural Language Queries over Knowledge Bases
Rajarshi Das and 8 other authors

Event and Entity Coreference using Trees to Encode Uncertainty in Joint Decisions
Nishant Yadav and 3 other authors

Improved Latent Tree Induction with Distant Supervision via Span Constraints
Zhiyang Xu and 8 other authors