Peter Broadwell
Digital Scholarship Research Developer, Center for Interdisciplinary Digital Research
Bio
Peter Broadwell is a Digital Scholarship Research Developer at the Center for Interdisciplinary Digital Research. His work applies machine learning, web-based visualization, and other methods of digital analysis to complex cultural data. Recent studies in which he has participated have involved automatic translation and indexing of folklore collections in multiple languages, deep learning-based analysis of dance choreography from video sources, and multimedia annotation of Japanese Noh theater performances.
Current Role at Stanford
Digital Scholarship Research Developer, Center for Interdisciplinary Digital Research, Stanford University Libraries
Honors & Awards
-
Jöran Sahlgren Prize, Kungl. Gustav Adolfs Akademien för Svensk Folkkultur (2022)
-
Best Conference Paper, Joint Council on Digital Libraries (2016)
-
Mellon Dissertation Fellowship in the Humanities in Original Sources, Council on Library and Information Resources (2007-2008)
Education & Certifications
-
Ph.D., University of California, Los Angeles, Musicology (2010)
-
M.A., University of California, Los Angeles, Musicology (2006)
-
M.S., University of California, Berkeley, Computer Science (2004)
-
B.A., Grinnell College, Computer Science (2001)
All Publications
-
Disentangling the Folklore Hairball A Network Approach to the Characterization of a Large Folktale Corpus
FABULA
2023; 64 (1-2): 64-91
View details for DOI 10.1515/fabula-2023-0004
View details for Web of Science ID 001031308600005
-
Comparing Themes Extracted via Topic Modeling and Manual Content Analysis: Korean-Language Discussions of Dementia on Twitter.
Studies in health technology and informatics
2022; 295: 230-233
Abstract
We randomly examined Korean-language Tweets mentioning dementia/Alzheimer's disease (n= 12,413) posted from November 28 to December 9, 2020, without limiting geographical locations. We independently applied Latent Dirichlet Allocation (LDA) topic modeling and qualitative content analysis to the texts of the Tweets. We compared the themes extracted by LDA topic modeling to those identified via manual coding methods. A total of 16 themes were detected from manual coding, with inter-rater reliability (Cohen's kappa) of 0.842. The proportions of the most prominent themes were: burdens of family caregiving (48.50%), reports of wandering/missing family members with dementia (18.12%), stigma (13.64%), prevention strategies (5.07%), risk factors (4.91%), healthcare policy (3.26%), and elder abuse/safety issues (1.75%). Seven themes whose contents were similar to themes derived from manual coding were extracted from the LDA topic modeling results (perplexity: -6.39, coherence score: 0.45). Our findings suggest that applying LDA topic modeling can be fairly effective at extracting themes from Korean Twitter discussions, in a manner analogous to qualitative coding, to gain insights regarding caregiving for family members with dementia, and our approach can be applied to other languages.
View details for DOI 10.3233/SHTI220704
View details for PubMedID 35773850
-
Comparative K-Pop Choreography Analysis through Deep-Learning Pose Estimation across a Large Video Corpus
DIGITAL HUMANITIES QUARTERLY
2021; 15 (1)
View details for Web of Science ID 000696408800016
-
Bridges, Sex Slaves, Tweets, and Guns A Multi-Domain Model of Conspiracy Theory
FOLKLORE AND SOCIAL MEDIA
2020: 39-66
View details for DOI 10.7330/9781646420599.c002
View details for Web of Science ID 000635283000003
-
Comparing published scientific journal articles to their pre-print versions
INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES
2019; 20 (4): 335-350
View details for DOI 10.1007/s00799-018-0234-1
View details for Web of Science ID 000497276800004
-
Reading the Quan Tang shi: Literary History, Topic Modeling, Divergence Measures
DIGITAL HUMANITIES QUARTERLY
2019; 13 (4)
View details for Web of Science ID 000522557300008
-
SoLoGlo - A Service to Archive, Analyze, and Link Social, Local, and Global News
SOC IMAGING SCIENCE & TECHNOLOGY. 2015: 27-29
View details for Web of Science ID 000409399100007
-
Semi-Supervised Morphosyntactic Classification of Old Icelandic
PLOS ONE
2014; 9 (7): e102366
Abstract
We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data.
View details for DOI 10.1371/journal.pone.0102366
View details for Web of Science ID 000341306600070
View details for PubMedID 25029462
View details for PubMedCentralID PMC4100772
-
A searchable meta-graph can connect even troublesome house elves and other supernatural beings to scholarly folk categories.
COMMUNICATIONS OF THE ACM
2012; 55 (7): 60-70
View details for DOI 10.1145/2209249.2209267
View details for Web of Science ID 000306310900026