
Stefano Ermon
Assistant Professor of Computer Science and Center Fellow, by courtesy, at the Woods Institute for the Environment
Bio
I am an Assistant Professor in the Department of Computer Science at Stanford University, where I am affiliated with the Artificial Intelligence Laboratory and a fellow of the Woods Institute for the Environment.
My research is centered on techniques for scalable and accurate inference in graphical models, statistical modeling of data, large-scale combinatorial optimization, and robust decision making under uncertainty, and is motivated by a range of applications, in particular ones in the emerging field of computational sustainability.
Academic Appointments
-
Assistant Professor, Computer Science
-
Center Fellow (By courtesy), Stanford Woods Institute for the Environment
-
Faculty Affiliate, Institute for Human-Centered Artificial Intelligence (HAI)
Honors & Awards
-
Sloan Research Fellowship, Alfred P. Sloan Foundation
-
IJCAI Computers and Thought Award, IJCAI
-
Microsoft Research Faculty Fellowship, Microsoft Research
-
NSF CAREER Award, National Science Foundation
-
ONR Young Investigator Award, Office of Naval Research
-
AFOSR Young Investigator Award, Air Force Office of Scientific Research
-
AWS Machine Learning Research Award, Amazon Web Services (AWS)
-
Sony Faculty Innovation Award, Sony
-
Hellman Fellowship, Hellman Foundation
-
AAAI 2017 Outstanding Paper Award, AAAI
-
Bloomberg Data Science Research Grant, Bloomberg
-
10 World Changing Ideas of 2016, Scientific American
-
First Place, World Bank Big Data Innovation Challenge, World Bank
-
Finalist, NVIDIA Global Impact Award, NVIDIA
Professional Education
-
Ph.D., Cornell University, Computer Science (2015)
2020-21 Courses
- Probabilistic Graphical Models: Principles and Techniques
CS 228 (Win) -
Independent Studies (14)
- Advanced Reading and Research
CS 499 (Aut, Win, Spr, Sum) - Advanced Reading and Research
CS 499P (Aut, Win, Spr) - Computer Laboratory
CS 393 (Win) - Curricular Practical Training
CS 390A (Aut, Win, Spr, Sum) - Curricular Practical Training
CS 390B (Aut, Win, Spr, Sum) - Curricular Practical Training
CS 390C (Spr, Sum) - Independent Project
CS 399 (Aut, Win, Spr) - Independent Project
CS 399P (Win, Spr) - Independent Work
CS 199 (Aut, Win, Spr) - Part-time Curricular Practical Training
CS 390D (Aut, Win, Sum) - Research
PHYSICS 490 (Spr) - Senior Project
CS 191 (Aut, Win, Spr) - Supervised Undergraduate Research
CS 195 (Win) - Writing Intensive Senior Project (WIM)
CS 191W (Aut, Win, Spr)
- Advanced Reading and Research
-
Prior Year Courses
2019-20 Courses
- Data for Sustainable Development
CS 325B, EARTHSYS 162, EARTHSYS 262 (Aut) - Deep Generative Models
CS 236 (Aut) - Probabilistic Graphical Models: Principles and Techniques
CS 228 (Win)
2018-19 Courses
- Data for Sustainable Development
CS 325B, EARTHSYS 162, EARTHSYS 262 (Aut) - Deep Generative Models
CS 236 (Aut) - Probabilistic Graphical Models: Principles and Techniques
CS 228 (Win)
2017-18 Courses
- Artificial Intelligence: Principles and Techniques
CS 221 (Aut) - Data for Sustainable Development
CS 325B, EARTHSYS 162, EARTHSYS 262 (Aut, Win) - Probabilistic Graphical Models: Principles and Techniques
CS 228 (Win)
- Data for Sustainable Development
Stanford Advisees
-
Doctoral Dissertation Reader (AC)
Panos Achlioptas, Sherrie Wang, Jiaxuan You -
Postdoctoral Faculty Sponsor
Willie Neiswanger, Andreas Schlueter -
Doctoral Dissertation Advisor (AC)
Rishi Sharma -
Master's Program Advisor
Constantine Athanitis, Cheng-Min Chiang, Kyle Clark, Ben Gaiarin, Jonathan Gomes Selman, JK Hunt, Lingjie Kong, Nazli Koyluoglu, Thomas Maheras, Henrik Marklund, Shahab Mousavi, Eun Jee Sung, Eric Tang, Andre Turati, Avery Wang -
Doctoral Dissertation Co-Advisor (AC)
Bryan He, Pratyusha Kalluri, Andy Shih, Sharon Zhou -
Doctoral (Program)
Kristy Choi, Chris Cundy, Kun Ho Kim, Rui Shu, Jiaming Song, Yang Song, Shengjia Zhao
All Publications
-
Computational Sustainability: Computing for a Better World and a Sustainable Future
COMMUNICATIONS OF THE ACM
2019; 62 (9): 56–65
View details for DOI 10.1145/3339399
View details for Web of Science ID 000483033900019
-
High-Voltage Charging-Induced Strain, Heterogeneity, and Micro-Cracks in Secondary Particles of a Nickel-Rich Layered Cathode Material
ADVANCED FUNCTIONAL MATERIALS
2019; 29 (18)
View details for DOI 10.1002/adfm.201900247
View details for Web of Science ID 000471330500021
-
InfoVAE: Balancing Learning and Inference in Variational Autoencoders
ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2019: 5885–92
View details for Web of Science ID 000486572500051
-
Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data
ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2019: 3967–74
View details for Web of Science ID 000485292603121
-
Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning.
Nature communications
2019; 10 (1): 4927
Abstract
Raman optical spectroscopy promises label-free bacterial detection, identification, and antibiotic susceptibility testing in a single step. However, achieving clinically relevant speeds and accuracies remains challenging due to weak Raman signal from bacterial cells and numerous bacterial species and phenotypes. Here we generate an extensive dataset of bacterial Raman spectra and apply deep learning approaches to accurately identify 30 common bacterial pathogens. Even on low signal-to-noise spectra, we achieve average isolate-level accuracies exceeding 82% and antibiotic treatment identification accuracies of 97.0±0.3%. We also show that this approach distinguishes between methicillin-resistant and -susceptible isolates of Staphylococcus aureus (MRSA and MSSA) with 89±0.1% accuracy. We validate our results on clinical isolates from 50 patients. Using just 10 bacterial spectra from each patient isolate, we achieve treatment identification accuracies of 99.7%. Our approach has potential for culture-free pathogen identification and antibiotic susceptibility testing, and could be readily extended for diagnostics on blood, urine, and sputum.
View details for DOI 10.1038/s41467-019-12898-9
View details for PubMedID 31666527
-
Predicting Economic Development using Geolocated Wikipedia Articles
ASSOC COMPUTING MACHINERY. 2019: 2698–2706
View details for DOI 10.1145/3292500.3330784
View details for Web of Science ID 000485562502077
-
Using machine learning to discover shape descriptors for predicting emulsion stability in a microfluidic channel.
Soft matter
2018
Abstract
In soft matter consisting of many deformable objects, object shapes often carry important information about local forces and their interactions with the local environment, and can be tightly coupled to the bulk properties and functions. In a concentrated emulsion, for example, the shapes of individual droplets are directly related to the local stress arising from interactions with neighboring drops, which in turn determine their stability and the resulting rheological properties. Shape descriptors used in prior work on single drops and dilute emulsions, where droplet-droplet interactions are largely negligible and the drop shapes are simple, are insufficient to fully capture the broad range of droplet shapes in a concentrated system. This paper describes the application of a machine learning method, specifically a convolutional autoencoder model, that learns to: (1) discover a low-dimensional code (8-dimensional) to describe droplet shapes within a concentrated emulsion, and (2) predict whether the drop will become unstable and undergo break-up. The input consists of images (N = 500002) of two-dimensional droplet boundaries extracted from movies of a concentrated emulsion flowing through a confined microfluidic channel as a monolayer. The model is able to faithfully reconstruct droplet shapes, as well as to achieve a classification accuracy of 91.7% in the prediction of droplet break-up, compared with 60% using conventional scalar descriptors based on droplet elongation. It is observed that 4 out of the 8 dimensions of the code are interpretable, corresponding to drop skewness, elongation, throat size, and surface curvature, respectively. Furthermore, the results show that drop elongation, throat size, and surface curvature are dominant factors in predicting droplet break-up for the flow conditions tested. The method presented is expected to facilitate follow-on work to identify the relationship between drop shapes and the interactions with other drops, and to identify potentially new modes of break-up mechanisms in a concentrated system. Finally, the method developed here should also apply to other soft materials such as foams, gels, and cells and tissues.
View details for PubMedID 30570628
-
Learning with Weak Supervision from Physics and Data-Driven Constraints
AI MAGAZINE
2018; 39 (1): 27–38
View details for Web of Science ID 000435949900005
-
Bias and Generalization in Deep Generative Models: An Empirical Study
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461852005038
-
The Information Autoencoding Family: A Lagrangian Perspective on Latent Variable Generative Models
AUAI PRESS. 2018: 1031–41
View details for Web of Science ID 000493119200101
-
Bayesian optimization and attribute adjustment
AUAI PRESS. 2018: 1042–52
View details for Web of Science ID 000493119200102
-
End-to-End Learning of Motion Representation for Video Understanding
IEEE. 2018: 6016–25
View details for DOI 10.1109/CVPR.2018.00630
View details for Web of Science ID 000457843606018
-
Semi-supervised Deep Kernel Learning: Regression with Unlabeled Data by Minimizing Predictive Variance
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461823305035
-
Infrastructure Quality Assessment in Africa using Satellite Imagery and Deep Learning
ASSOC COMPUTING MACHINERY. 2018: 616–25
View details for DOI 10.1145/3219819.3219924
View details for Web of Science ID 000455346400065
-
Deep Transfer Learning for Crop Yield Prediction with Remote Sensing Data
ASSOC COMPUTING MACHINERY. 2018
View details for DOI 10.1145/3209811.3212707
View details for Web of Science ID 000455345900050
-
Amortized Inference Regularization
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461823304041
-
Flow-GAN: Combining Maximum Likelihood and Adversarial Learning in Generative Models
ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2018: 3069–76
View details for Web of Science ID 000485488903019
-
Boosted Generative Models
ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2018: 3077–84
View details for Web of Science ID 000485488903020
-
Approximate Inference via Weighted Rademacher Complexity
ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2018: 6376–83
View details for Web of Science ID 000485488906057
-
Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces
ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2018: 3474–81
View details for Web of Science ID 000485488903069
-
Constructing Unrestricted Adversarial Examples with Generative Models
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461852002082
-
Multi-Agent Generative Adversarial Imitation Learning
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461852002005
-
Streamlining Variational Inference for Constraint Satisfaction Problems
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461852005016
-
Coupling between oxygen redox and cation migration explains unusual electrochemistry in lithium-rich layered oxides
NATURE COMMUNICATIONS
2017; 8
View details for DOI 10.1038/s41467-017-02041-x
View details for Web of Science ID 000417702300042
-
Autotuning Stencil Computations with Structural Ordinal Regression Learning
IEEE. 2017: 287–96
View details for DOI 10.1109/IPDPS.2017.102
View details for Web of Science ID 000427044800030
-
A-NICE-MC: Adversarial Training for MCMC
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2017
View details for Web of Science ID 000452649405022
-
InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2017
View details for Web of Science ID 000452649403085
-
Coupling between oxygen redox and cation migration explains unusual electrochemistry in lithium-rich layered oxides.
Nature communications
2017; 8 (1): 2091
Abstract
Lithium-rich layered transition metal oxide positive electrodes offer access to anion redox at high potentials, thereby promising high energy densities for lithium-ion batteries. However, anion redox is also associated with several unfavorable electrochemical properties, such as open-circuit voltage hysteresis. Here we reveal that in Li1.17-x Ni0.21Co0.08Mn0.54O2, these properties arise from a strong coupling between anion redox and cation migration. We combine various X-ray spectroscopic, microscopic, and structural probes to show that partially reversible transition metal migration decreases the potential of the bulk oxygen redox couple by > 1 V, leading to a reordering in the anionic and cationic redox potentials during cycling. First principles calculations show that this is due to the drastic change in the local oxygen coordination environments associated with the transition metal migration. We propose that this mechanism is involved in stabilizing the oxygen redox couple, which we observe spectroscopically to persist for 500 charge/discharge cycles.
View details for PubMedID 29233965
View details for PubMedCentralID PMC5727078
-
Monitoring Ethiopian Wheat Fungus with Satellite Imagery and Deep Feature Learning
IEEE. 2017: 1524–32
View details for DOI 10.1109/CVPRW.2017.196
View details for Web of Science ID 000426448300189
-
Unsupervised Data Mining in nanoscale X-ray Spectro-Microscopic Study of NdFeB Magnet
SCIENTIFIC REPORTS
2016; 6
Abstract
Novel developments in X-ray based spectro-microscopic characterization techniques have increased the rate of acquisition of spatially resolved spectroscopic data by several orders of magnitude over what was possible a few years ago. This accelerated data acquisition, with high spatial resolution at nanoscale and sensitivity to subtle differences in chemistry and atomic structure, provides a unique opportunity to investigate hierarchically complex and structurally heterogeneous systems found in functional devices and materials systems. However, handling and analyzing the large volume data generated poses significant challenges. Here we apply an unsupervised data-mining algorithm known as DBSCAN to study a rare-earth element based permanent magnet material, Nd2Fe14B. We are able to reduce a large spectro-microscopic dataset of over 300,000 spectra to 3, preserving much of the underlying information. Scientists can easily and quickly analyze in detail three characteristic spectra. Our approach can rapidly provide a concise representation of a large and complex dataset to materials scientists and chemists. For example, it shows that the surface of common Nd2Fe14B magnet is chemically and structurally very different from the bulk, suggesting a possible surface alteration effect possibly due to the corrosion, which could affect the material's overall properties.
View details for DOI 10.1038/srep34406
View details for Web of Science ID 000384188300001
View details for PubMedID 27680388
View details for PubMedCentralID PMC5041149
-
Combining satellite imagery and machine learning to predict poverty.
Science
2016; 353 (6301): 790-794
Abstract
Reliable data on economic livelihoods remain scarce in the developing world, hampering efforts to study these outcomes and to design policies that improve them. Here we demonstrate an accurate, inexpensive, and scalable method for estimating consumption expenditure and asset wealth from high-resolution satellite imagery. Using survey and satellite data from five African countries--Nigeria, Tanzania, Uganda, Malawi, and Rwanda--we show how a convolutional neural network can be trained to identify image features that can explain up to 75% of the variation in local-level economic outcomes. Our method, which requires only publicly available data, could transform efforts to track and target poverty in developing countries. It also demonstrates how powerful machine learning techniques can be applied in a setting with limited training data, suggesting broad potential application across many scientific domains.
View details for DOI 10.1126/science.aaf7894
View details for PubMedID 27540167