Max Lamparth
Postdoctoral Scholar, Infectious Diseases
Bio
Max is a postdoctoral fellow at the Center for International Security and Cooperation, the Stanford Center for AI Safety, the Stanford Existential Risks Initiative, and the Brainstorm Lab at the Department of Psychiatry and Behavioral Sciences at Stanford University. He is advised by Prof. Clark Barrett, Prof. Steve Luby, and Prof. Paul Edwards.
His research aims to make AI systems more secure and safer to use. Specifically, he focuses on improving the ethical behavior of language models, making their inner workings more interpretable, and increasing their robustness against misuse.
Max received his Ph.D. in August 2023 from the School of Natural Sciences at the Technical University of Munich. He previously earned a B.Sc. and an M.Sc. in Physics from the Ruprecht Karl University of Heidelberg.
2024-25 Courses
- Introduction to AI Governance
CS 134, STS 14 (Win)
- Introduction to AI Safety
CS 120 (Aut)
Prior Year Courses
2023-24 Courses
- Introduction to AI Safety
CS 120, STS 10 (Spr)
All Publications
- Escalation Risks from Language Models in Military and Diplomatic Decision-Making
ASSOC COMPUTING MACHINERY. 2024: 836-898
DOI: 10.1145/3630106.3658942
Web of Science ID: 001253359300057
- Analyzing And Editing Inner Mechanisms of Backdoored Language Models
ASSOC COMPUTING MACHINERY. 2024: 2362-2373
DOI: 10.1145/3630106.3659042
Web of Science ID: 001253359300156