A">
[go: up one dir, main page]

Alessandro Stolfo

prof_pic.jpeg

OAT Y 24

Andreasstrasse 5

8050 Zürich, Switzerland

Hi! I am a doctoral student in the Institute for Machine Learning at ETH Zürich, where I am advised by Prof. Mrinmaya Sachan, and co-advised by Prof. Yonatan Belinkov (Technion).

My research focuses on evaluating and interpreting machine learning models for natural language processing. I am particularly interested in exploring the capabilities of (large) language models in areas such as solving arithmetic problems and reasoning over factual and commonsense knowledge.

In summer 2024, I interned with the AI Frontiers group at Microsoft Research in Redmond, WA, where I had the opportunity to collaborate with Besmira Nushi and Eric Horvitz. Previously, in summer 2023, I interned with the Machine Learning Research Group at Oracle Labs in Burlington, MA, working with Ari Kobren.

Before starting my doctoral studies, I obtained a Master’s degree in Data Science at ETH Zürich, and I worked at Rethink-Resource on the development of Circado. I completed my undergraduate studies in Computer Engineering at Politecnico di Milano.

I am grateful to be a recipient of the CYD Doctoral Fellowship.

news

May 28, 2024 I am interning in the AI Frontiers group at Microsoft Research in Redmond, WA.
Nov 22, 2023 I am attending the ML Alignment & Theory Scholars (MATS) Program, mentored by Neel Nanda.
Jul 17, 2023 I started my internship in the ML Research Group at Oracle Labs in Burlington, MA.
Apr 22, 2022 I answered a couple of questions for this EPFL News article. Check it out!

selected publications

  1. NeurIPS 2024
    Confidence Regulation Neurons in Language Models
    A. Stolfo*, B. Wu*, W. Gurnee, Y. Belinkov, X. Song, M. Sachan, and N. Nanda
  2. ICML 2024
    Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?
    A. Opedal*, A. Stolfo*, H. Shirakami, Y. Jiao, R. Cotterell, B. Schölkopf, A. Saparov, and M. Sachan
  3. NAACL 2024 (F)
    Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
    A. Stolfo
  4. EMNLP 2023
    A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
    A. Stolfo, Y. Belinkov, and M. Sachan
  5. ACL 2023
    A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
    A. Stolfo*, Z. Jin*, K. Shridhar, B. Schölkopf, and M. Sachan