I am a Research Fellow at Goodfire, working on model diffing to surface unexpected behavior in LLMs. Before that, I completed the MATS training phase with Neel Nanda, where I worked on new techniques for model diffing (SAEs on activation differences). I also worked on a variant of crosscoders for understanding how chat behavior arises from the base model.
I am a fourth-year PhD student at New York University. My earlier PhD research, with Eric Vanden-Eijnden and Arthur Jacot, focused on scaling laws and phase transitions in diffusion models and neural networks. I obtained my B.S. in Mathematics from Stanford University, where I worked on interacting particle systems with Amir Dembo.