
Alessandro Stolfo
Graduate student @ ETH Zürich
reasoning
math
llms
mechanistic interpretability
mechanistic interpretation
robustness
language model understanding
evaluation
interpretability
3 presentations
10 views
SHORT BIO
Alessandro is a doctoral candidate at the Institute for Machine Learning at ETH Zürich, advised by Prof. Mrinmaya Sachan. He is interested in studying and improving machine learning models for natural language processing, and his research explores language models' capabilities on complex tasks such as solving arithmetic problems and reasoning over factual and commonsense knowledge. Before starting his doctorate, Alessandro earned a Master's in Data Science from ETH Zürich and worked as a software developer at a start-up. He obtained his undergraduate degree in Computer Engineering from Politecnico di Milano. He is a grateful recipient of the CYD Doctoral Fellowship.
Presentations

A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Alessandro Stolfo and 2 other authors

Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Yifan Hou and 7 other authors

A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
Alessandro Stolfo and 4 other authors