Alessandro Stolfo

OAT Y 24

Andreasstrasse 5

8050 Zürich, Switzerland

Hi! I am a doctoral student in the Institute for Machine Learning at ETH Zürich, where I am advised by Prof. Mrinmaya Sachan, and co-advised by Prof. Yonatan Belinkov (Technion).

My research focuses on evaluating and interpreting machine learning models for natural language processing. I am particularly interested in exploring the capabilities of (large) language models in areas such as solving arithmetic problems and reasoning over factual and commonsense knowledge.

In summer 2024, I interned with the AI Frontiers group at Microsoft Research in Redmond, WA, where I had the opportunity to collaborate with Besmira Nushi and Eric Horvitz. Previously, in summer 2023, I interned with the Machine Learning Research Group at Oracle Labs in Burlington, MA, working with Ari Kobren.

Before starting my doctoral studies, I obtained a Master’s degree in Data Science at ETH Zürich, and I worked at Rethink-Resource on the development of Circado. I completed my undergraduate studies in Computer Engineering at Politecnico di Milano.

I am grateful to be a recipient of the CYD Doctoral Fellowship.

news

May 28, 2024	I am interning in the AI Frontiers group at Microsoft Research in Redmond, WA.
Nov 22, 2023	I am attending the ML Alignment & Theory Scholars (MATS) Program, mentored by Neel Nanda.
Jul 17, 2023	I started my internship in the ML Research Group at Oracle Labs in Burlington, MA.
Apr 22, 2022	I answered a couple of questions for this EPFL News article. Check it out!

selected publications

NeurIPS 2024

Confidence Regulation Neurons in Language Models

A. Stolfo*, B. Wu*, W. Gurnee, Y. Belinkov, X. Song, M. Sachan, and N. Nanda

OpenReview arXiv
ICML 2024

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

A. Opedal*, A. Stolfo*, H. Shirakami, Y. Jiao, R. Cotterell, B. Schölkopf, A. Saparov, and M. Sachan

OpenReview arXiv
NAACL 2024 (F)

Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study

A. Stolfo

ACL arXiv
EMNLP 2023

A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis

A. Stolfo, Y. Belinkov, and M. Sachan

ACL arXiv
ACL 2023

A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models

A. Stolfo*, Z. Jin*, K. Shridhar, B. Schölkopf, and M. Sachan

ACL arXiv