All Publications


  • Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability. Journal of Machine Learning Research. Geiger, A., Ibeling, D., Zur, A., Chaudhary, M., Chauhan, S., Huang, J., Arora, A., Wu, Z., Goodman, N., Potts, C., Icard, T. 2025; 26