
Bio
Juan Carlos Niebles received an Engineering degree in Electronics from Universidad del Norte (Colombia) in 2002, an M.Sc. degree in Electrical and Computer Engineering from University of Illinois at Urbana-Champaign in 2007, and a Ph.D. degree in Electrical Engineering from Princeton University in 2011. He is a Senior Research Scientist at the Stanford AI Lab and Associate Director of Research at the Stanford-Toyota Center for AI Research since 2015. He is also an Associate Professor of Electrical and Electronic Engineering in Universidad del Norte (Colombia) since 2011. His research interests are in computer vision and machine learning, with a focus on visual recognition and understanding of human actions and activities, objects, scenes, and events. He is a recipient of a Google Faculty Research award (2015), the Microsoft Research Faculty Fellowship (2012), a Google Research award (2011) and a Fulbright Fellowship (2005).
Honors & Awards
-
Faculty Research Award, Google (2015)
-
Senior Member, IEEE (2015)
-
Faculty Fellow, Microsoft Research (2012)
-
Research Award, Google (2011)
-
Fulbright PhD Fellowship, Fulbright-Colciencias-DNP (2005)
Boards, Advisory Committees, Professional Organizations
-
Steering Committee, AI Index (2018 - Present)
-
Associate Director of Research, Stanford AI Lab-Toyota Center for AI Research (2015 - Present)
-
Senior Member, IEEE (2015 - Present)
-
Member, IEEE Computer Society (2014 - Present)
-
Member, IEEE (2007 - Present)
Professional Education
-
Ph.D., Princeton University, Electrical Engineering (2011)
-
M.A., Princeton University, Electrical Engineering (2009)
-
M.Sc., University of Illinois at Urbana-Champaign, Electrical and Computer Engineering (2007)
-
Engineer, Universidad del Norte, Electronics Engineering (2002)
Current Research and Scholarly Interests
My research work is in computer vision. The goal of my research is to enable computers and robots to perceive the visual world by developing novel computer vision algorithms for automatic analysis of images and videos. From the scientific point of view, we tackle fundamental open problems in computer vision research related to the visual recognition and understanding of human actions and activities, objects, scenes, and events. From the application perspective, we develop systems that solve practical world problems by introducing cutting-edge computer vision technologies into new application domains.
2020-21 Courses
- Computer Vision: Foundations and Applications
CS 131 (Aut) -
Independent Studies (6)
- Curricular Practical Training
CS 390A (Sum) - Independent Project
CS 399 (Aut, Win, Spr) - Independent Work
CS 199 (Aut) - Senior Project
CS 191 (Spr) - Supervised Undergraduate Research
CS 195 (Win) - Writing Intensive Senior Project (WIM)
CS 191W (Win)
- Curricular Practical Training
-
Prior Year Courses
2019-20 Courses
- Computer Vision: Foundations and Applications
CS 131 (Aut)
2018-19 Courses
- AI-Assisted Care
MED 277 (Aut) - Computer Vision: Foundations and Applications
CS 131 (Aut)
2017-18 Courses
- Computer Vision: Foundations and Applications
CS 131 (Aut)
- Computer Vision: Foundations and Applications
All Publications
-
Vision-based Estimation of MDS-UPDRS Gait Scores for Assessing Parkinson's Disease Motor Severity.
Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention
2020; 12263: 637–47
Abstract
Parkinson's disease (PD) is a progressive neurological disorder primarily affecting motor function resulting in tremor at rest, rigidity, bradykinesia, and postural instability. The physical severity of PD impairments can be quantified through the Movement Disorder Society Unified Parkinson's Disease Rating Scale (MDS-UPDRS), a widely used clinical rating scale. Accurate and quantitative assessment of disease progression is critical to developing a treatment that slows or stops further advancement of the disease. Prior work has mainly focused on dopamine transport neuroimaging for diagnosis or costly and intrusive wearables evaluating motor impairments. For the first time, we propose a computer vision-based model that observes non-intrusive video recordings of individuals, extracts their 3D body skeletons, tracks them through time, and classifies the movements according to the MDS-UPDRS gait scores. Experimental results show that our proposed method performs significantly better than chance and competing methods with an F 1-score of 0.83 and a balanced accuracy of 81%. This is the first benchmark for classifying PD patients based on MDS-UPDRS gait severity and could be an objective biomarker for disease severity. Our work demonstrates how computer-assisted technologies can be used to non-intrusively monitor patients and their motor impairments. The code is available at https://github.com/mlu355/PD-Motor-Severity-Estimation.
View details for DOI 10.1007/978-3-030-59716-0_61
View details for PubMedID 33103164
-
Socially and Contextually Aware Human Motion and Pose Forecasting
IEEE ROBOTICS AND AUTOMATION LETTERS
2020; 5 (4): 6033–40
View details for DOI 10.1109/LRA.2020.3010742
View details for Web of Science ID 000554894900027
-
Explaining VQA predictions using visual grounding and a knowledge base
IMAGE AND VISION COMPUTING
2020; 101
View details for DOI 10.1016/j.imavis.2020.103968
View details for Web of Science ID 000570137900006
-
Segmenting the Future
IEEE ROBOTICS AND AUTOMATION LETTERS
2020; 5 (3): 4202–9
View details for DOI 10.1109/LRA.2020.2992184
View details for Web of Science ID 000541731600001
-
Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction
IEEE ROBOTICS AND AUTOMATION LETTERS
2020; 5 (2): 3485–92
View details for DOI 10.1109/LRA.2020.2976305
View details for Web of Science ID 000520954200034
-
Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision
IEEE COMPUTER SOC. 2020: 2773–82
View details for Web of Science ID 000578444802088
-
Action-Agnostic Human Pose Forecasting
IEEE. 2019: 1423–32
View details for DOI 10.1109/WACV.2019.00156
View details for Web of Science ID 000469423400149
-
Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning
IEEE. 2019: 2635–42
View details for Web of Science ID 000544658402034
-
Peeking into the Future: Predicting Future Person Activities and Locations in Videos
IEEE COMPUTER SOC. 2019: 5718–27
View details for DOI 10.1109/CVPR.2019.00587
View details for Web of Science ID 000529484005092
-
(DTW)-T-3: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
IEEE COMPUTER SOC. 2019: 3541–50
View details for DOI 10.1109/CVPR.2019.00366
View details for Web of Science ID 000529484003071
-
Peeking into the Future: Predicting Future Person Activities and Locations in Videos
IEEE. 2019: 2960–63
View details for DOI 10.1109/CVPRW.2019.00358
View details for Web of Science ID 000569983600352
-
Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration
IEEE. 2019: 8557–66
View details for DOI 10.1109/CVPR.2019.00876
View details for Web of Science ID 000542649302018
-
Learning Temporal Action ProposalsWith Fewer Labels
IEEE. 2019: 7072–81
View details for DOI 10.1109/ICCV.2019.00717
View details for Web of Science ID 000548549202018
-
Imitation Learning for Human Pose Prediction
IEEE. 2019: 7123–32
View details for DOI 10.1109/ICCV.2019.00722
View details for Web of Science ID 000548549202023
-
Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining
IEEE. 2019: 349–57
View details for DOI 10.1109/WACV.2019.00043
View details for Web of Science ID 000469423400036
-
Learning to Decompose and Disentangle Representations for Video Prediction
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2018
View details for Web of Science ID 000461823300048
-
Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos
IEEE. 2018: 5948–57
View details for DOI 10.1109/CVPR.2018.00623
View details for Web of Science ID 000457843606011
-
What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets
IEEE. 2018: 7366–75
View details for DOI 10.1109/CVPR.2018.00769
View details for Web of Science ID 000457843607054
-
Sparse composition of body poses and atomic actions for human activity recognition in RGB-D videos
IMAGE AND VISION COMPUTING
2017; 59: 63–75
View details for DOI 10.1016/j.imavis.2016.11.004
View details for Web of Science ID 000397687900005
-
Dense-Captioning Events in Videos
IEEE. 2017: 706–15
View details for DOI 10.1109/ICCV.2017.83
View details for Web of Science ID 000425498400074
-
Visual Forecasting by Imitating Dynamics in Natural Sequences
IEEE. 2017: 3018–27
View details for DOI 10.1109/ICCV.2017.326
View details for Web of Science ID 000425498403009
-
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos
IEEE. 2017: 1032–41
View details for DOI 10.1109/CVPR.2017.116
View details for Web of Science ID 000418371401011
-
Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization
IEEE. 2017: 1330–38
View details for DOI 10.1109/CVPR.2017.146
View details for Web of Science ID 000418371401041
-
Risky Region Localization with Point Supervision
IEEE. 2017: 246–53
View details for DOI 10.1109/ICCVW.2017.38
View details for Web of Science ID 000425239600031
-
A Hierarchical Pose-Based Approach to Complex Action Understanding Using Dictionaries of Actionlets and Motion Poselets
IEEE. 2016: 1981–90
View details for DOI 10.1109/CVPR.2016.218
View details for Web of Science ID 000400012302004
-
Title Generation for User Generated Videos
SPRINGER INTERNATIONAL PUBLISHING AG. 2016: 609–25
View details for DOI 10.1007/978-3-319-46475-6_38
View details for Web of Science ID 000389383900038
-
DAPs: Deep Action Proposals for Action Understanding
SPRINGER INTERNATIONAL PUBLISHING AG. 2016: 768–84
View details for DOI 10.1007/978-3-319-46487-9_47
View details for Web of Science ID 000389384800047
-
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
SPRINGER INTERNATIONAL PUBLISHING AG. 2016: 137–53
View details for DOI 10.1007/978-3-319-46493-0_9
View details for Web of Science ID 000389385100009
-
Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos
IEEE. 2016: 1914–23
View details for DOI 10.1109/CVPR.2016.211
View details for Web of Science ID 000400012301103
-
Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification
11th European Conference on Computer Vision
SPRINGER-VERLAG BERLIN. 2010: 392–405
View details for Web of Science ID 000286164000029