Haley Lepp
Ph.D. Student in Education, admitted Autumn 2022
Ph.D. Student in Education, admitted Autumn 2022
Master of Arts Student in Sociology, admitted Autumn 2024
Research Assistant, Artiles Program
Education & Certifications
-
B.S., Georgetown University, Science, Technology, and International Affairs (2015)
-
M.S., University of Washington, Computational Linguistics (2020)
All Publications
-
Linguistic Affordances Framework: A Linguistic-Sociological Approach for the Social Study of Language Technology
SOCIAL SCIENCE COMPUTER REVIEW
2025
View details for DOI 10.1177/08944393251366242
View details for Web of Science ID 001552255600001
-
Quantifying large language model usage in scientific papers.
Nature human behaviour
2025
Abstract
Scientific publishing is the primary means of disseminating research findings. There has been speculation about how extensively large language models (LLMs) are being used in academic writing. Here we conduct a systematic analysis across 1,121,912 preprints and published papers from January 2020 to September 2024 on arXiv, bioRxiv and Nature portfolio journals, using a population-level framework based on word frequency shifts to estimate the prevalence of LLM-modified content over time. Our findings suggest a steady increase in LLM usage, with the largest and fastest growth estimated for computer science papers (up to 22%). By comparison, mathematics papers and the Nature portfolio showed lower evidence of LLM modification (up to 9%). LLM modification estimates were higher among papers from first authors who post preprints more frequently, papers in more crowded research areas and papers of shorter lengths. Our findings suggest that LLMs are being broadly used in scientific writing.
View details for DOI 10.1038/s41562-025-02273-8
View details for PubMedID 40760036
View details for PubMedCentralID 5199034
-
You Cannot Sound Like GPT": Signs of language discrimination and resistance in computer science publishing
ASSOC COMPUTING MACHINERY. 2025: 3161-3180
View details for DOI 10.1145/3715275.3732202
View details for Web of Science ID 001543679300190
-
Conversational Agents in Language Education: Where They Fit and Their Research Challenges
edited by Stephanidis, C., Antona, M., Ntoa, S.
SPRINGER INTERNATIONAL PUBLISHING AG. 2021: 272-279
View details for DOI 10.1007/978-3-030-90179-0_35
View details for Web of Science ID 000793810500035
-
Pardon the Interruption: An Analysis of Gender and Turn-Taking in US Supreme Court Oral Arguments
ISCA-INT SPEECH COMMUNICATION ASSOC. 2020: 1838-1842
View details for DOI 10.21437/Interspeech.2020-2964
View details for Web of Science ID 000833594101199
-
Visualizing Inferred Morphotactic Systems
ASSOC COMPUTATIONAL LINGUISTICS-ACL. 2019: 127-131
View details for Web of Science ID 000860933000022
https://orcid.org/0009-0003-9789-7415