I am a PhD student at New York University working on Mechanistic Interpretability. My previous research in the PhD was on scaling laws and phase transitions of diffusion models and neural networks with Eric Vanden-Eijnden and Arthur Jacot. I obtained my B.S. in Mathematics at Stanford University, where I worked on interacting particle systems with Amir Dembo
Current research
Tied Crosscoders: new architecture for model-diffing to understand how chat model behavior arises from the base model and to measure how novel chat features are when compared to base features.