All Publications


  • Thermodynamic principles link in vitro transcription factor affinities to single-molecule chromatin states in cells. bioRxiv : the preprint server for biology Schaepe, J. M., Fries, T., Doughty, B. R., Crocker, O. J., Hinks, M. M., Marklund, E., Greenleaf, W. J. 2025

    Abstract

    The molecular details governing transcription factor (TF) binding and the formation of accessible chromatin are not yet quantitatively understood - including how sequence context modulates affinity, how TFs search DNA, the kinetics of TF occupancy, and how motif grammars coordinate binding. To resolve these questions for a human TF, erythroid Krüppel-like factor (eKLF/KLF1), we quantitatively compare, in high throughput, in vitro TF binding rates and affinities with in vivo single molecule TF and nucleosome occupancies across engineered DNA sequences. We find that 40-fold flanking sequence effects on affinity are consistent with distal flanks tuning TF search parameters and captured by a linear energy model. Motif recognition probability, rather than time in the bound state, drives affinity changes, and in vitro and in nuclei measurements exhibit consistent, minutes-long TF residence times. Finally, pairing in vitro biophysical parameters with thermodynamic models accurately predicts in vivo single-molecule chromatin states for unseen motif grammars.

    View details for DOI 10.1101/2025.01.27.635162

    View details for PubMedID 39975040

    View details for PubMedCentralID PMC11838358

  • Single-molecule states link transcription factor binding to gene expression. Nature Doughty, B. R., Hinks, M. M., Schaepe, J. M., Marinov, G. K., Thurm, A. R., Rios-Martinez, C., Parks, B. E., Tan, Y., Marklund, E., Dubocanin, D., Bintu, L., Greenleaf, W. J. 2024

    Abstract

    The binding of multiple transcription factors (TFs) to genomic enhancers drives gene expression in mammalian cells1. However, the molecular details that link enhancer sequence to TF binding, promoter state and transcription levels remain unclear. Here we applied single-molecule footprinting2,3 to measure the simultaneous occupancy of TFs, nucleosomes and other regulatory proteins on engineered enhancer-promoter constructs with variable numbers of TF binding sites for both a synthetic TF and an endogenous TF involved in the type I interferon response. Although TF binding events on nucleosome-free DNA are independent, activation domains recruit cofactors that destabilize nucleosomes, driving observed TF binding cooperativity. Average TF occupancy linearly determines promoter activity, and we decompose TF strength into separable binding and activation terms. Finally, we develop thermodynamic and kinetic models that quantitatively predict both the enhancer binding microstates and gene expression dynamics. This work provides a template for the quantitative dissection of distinct contributors to gene expression, including TF activation domains, concentration, binding affinity, binding site configuration and recruitment of chromatin regulators.

    View details for DOI 10.1038/s41586-024-08219-w

    View details for PubMedID 39567683

    View details for PubMedCentralID 3514679

  • Single-molecule chromatin configurations link transcription factor binding to expression in human cells. bioRxiv : the preprint server for biology Doughty, B. R., Hinks, M. M., Schaepe, J. M., Marinov, G. K., Thurm, A. R., Rios-Martinez, C., Parks, B. E., Tan, Y., Marklund, E., Dubocanin, D., Bintu, L., Greenleaf, W. J. 2024

    Abstract

    The binding of multiple transcription factors (TFs) to genomic enhancers activates gene expression in mammalian cells. However, the molecular details that link enhancer sequence to TF binding, promoter state, and gene expression levels remain opaque. We applied single-molecule footprinting (SMF) to measure the simultaneous occupancy of TFs, nucleosomes, and components of the transcription machinery on engineered enhancer/promoter constructs with variable numbers of TF binding sites for both a synthetic and an endogenous TF. We find that activation domains enhance a TF's capacity to compete with nucleosomes for binding to DNA in a BAF-dependent manner, TF binding on nucleosome-free DNA is consistent with independent binding between TFs, and average TF occupancy linearly contributes to promoter activation rates. We also decompose TF strength into separable binding and activation terms, which can be tuned and perturbed independently. Finally, we develop thermodynamic and kinetic models that quantitatively predict both the binding microstates observed at the enhancer and subsequent time-dependent gene expression. This work provides a template for quantitative dissection of distinct contributors to gene activation, including the activity of chromatin remodelers, TF activation domains, chromatin acetylation, TF concentration, TF binding affinity, and TF binding site configuration.

    View details for DOI 10.1101/2024.02.02.578660

    View details for PubMedID 38352517

  • Short tandem repeats bind transcription factors to tune eukaryotic gene expression. Science (New York, N.Y.) Horton, C. A., Alexandari, A. M., Hayes, M. G., Marklund, E., Schaepe, J. M., Aditham, A. K., Shah, N., Suzuki, P. H., Shrikumar, A., Afek, A., Greenleaf, W. J., Gordân, R., Zeitlinger, J., Kundaje, A., Fordyce, P. M. 2023; 381 (6664): eadd1250

    Abstract

    Short tandem repeats (STRs) are enriched in eukaryotic cis-regulatory elements and alter gene expression, yet how they regulate transcription remains unknown. We found that STRs modulate transcription factor (TF)-DNA affinities and apparent on-rates by about 70-fold by directly binding TF DNA-binding domains, with energetic impacts exceeding many consensus motif mutations. STRs maximize the number of weakly preferred microstates near target sites, thereby increasing TF density, with impacts well predicted by statistical mechanics. Confirming that STRs also affect TF binding in cells, neural networks trained only on in vivo occupancies predicted effects identical to those observed in vitro. Approximately 90% of TFs preferentially bound STRs that need not resemble known motifs, providing a cis-regulatory mechanism to target TFs to genomic sites.

    View details for DOI 10.1126/science.add1250

    View details for PubMedID 37733848

  • A microwell platform for high-throughput longitudinal phenotyping and selective retrieval of organoids. Cell systems Sockell, A., Wong, W., Longwell, S., Vu, T., Karlsson, K., Mokhtari, D., Schaepe, J., Lo, Y., Cornelius, V., Kuo, C., Van Valen, D., Curtis, C., Fordyce, P. M. 2023; 14 (9): 764

    Abstract

    Organoids are powerful experimental models for studying the ontogeny and progression of various diseases including cancer. Organoids are conventionally cultured in bulk using an extracellular matrix mimic. However, bulk-cultured organoids physically overlap, making it impossible to track the growth of individual organoids over time in high throughput. Moreover, local spatial variations in bulk matrix properties make it difficult to assess whether observed phenotypic heterogeneity between organoids results from intrinsic cell differences or differences in the microenvironment. Here, we developed a microwell-based method that enables high-throughput quantification of image-based parameters for organoids grown from single cells, which can further be retrieved from their microwells for molecular profiling. Coupled with a deep learning image-processing pipeline, we characterized phenotypic traits including growth rates, cellular movement, and apical-basal polarity in two CRISPR-engineered human gastric organoid models, identifying genomic changes associated with increased growth rate and changes in accessibility and expression correlated with apical-basal polarity. A record of this paper's transparent peer review process is included in the supplemental information.

    View details for DOI 10.1016/j.cels.2023.08.002

    View details for PubMedID 37734323

  • Systematic discovery of recombinases for efficient integration of large DNA sequences into the human genome. Nature biotechnology Durrant, M. G., Fanton, A., Tycko, J., Hinks, M., Chandrasekaran, S. S., Perry, N. T., Schaepe, J., Du, P. P., Lotfy, P., Bassik, M. C., Bintu, L., Bhatt, A. S., Hsu, P. D. 2022

    Abstract

    Large serine recombinases (LSRs) are DNA integrases that facilitate the site-specific integration of mobile genetic elements into bacterial genomes. Only a few LSRs, such as Bxb1 and PhiC31, have been characterized to date, with limited efficiency as tools for DNA integration in human cells. In this study, we developed a computational approach to identify thousands of LSRs and their DNA attachment sites, expanding known LSR diversity by >100-fold and enabling the prediction of their insertion site specificities. We tested their recombination activity in human cells, classifying them as landing pad, genome-targeting or multi-targeting LSRs. Overall, we achieved up to seven-fold higher recombination than Bxb1 and genome integration efficiencies of 40-75% with cargo sizes over 7kb. We also demonstrate virus-free, direct integration of plasmid or amplicon libraries for improved functional genomics applications. This systematic discovery of recombinases directly from microbial sequencing data provides a resource of over 60 LSRs experimentally characterized in human cells for large-payload genome insertion without exposed DNA double-stranded breaks.

    View details for DOI 10.1038/s41587-022-01494-w

    View details for PubMedID 36217031

  • Short tandem repeats recruit transcription factors to tune eukaryotic gene expression Horton, C. A., Alexandari, A. M., Hayes, M. G., Schaepe, J. M., Marklund, E., Shah, N., Aditham, A. K., Shrikumar, A., Afek, A., Greenleaf, W. J., Gordan, R., Zeitlinger, J., Kundaje, A., Fordyce, P. M. CELL PRESS. 2022: 287A-288A
  • Neuronal defects in a human cellular model of 22q11.2 deletion syndrome. Nature medicine Khan, T. A., Revah, O. n., Gordon, A. n., Yoon, S. J., Krawisz, A. K., Goold, C. n., Sun, Y. n., Kim, C. H., Tian, Y. n., Li, M. Y., Schaepe, J. M., Ikeda, K. n., Amin, N. D., Sakai, N. n., Yazawa, M. n., Kushan, L. n., Nishino, S. n., Porteus, M. H., Rapoport, J. L., Bernstein, J. A., O'Hara, R. n., Bearden, C. E., Hallmayer, J. F., Huguenard, J. R., Geschwind, D. H., Dolmetsch, R. E., Paşca, S. P. 2020

    Abstract

    22q11.2 deletion syndrome (22q11DS) is a highly penetrant and common genetic cause of neuropsychiatric disease. Here we generated induced pluripotent stem cells from 15 individuals with 22q11DS and 15 control individuals and differentiated them into three-dimensional (3D) cerebral cortical organoids. Transcriptional profiling across 100 days showed high reliability of differentiation and revealed changes in neuronal excitability-related genes. Using electrophysiology and live imaging, we identified defects in spontaneous neuronal activity and calcium signaling in both organoid- and 2D-derived cortical neurons. The calcium deficit was related to resting membrane potential changes that led to abnormal inactivation of voltage-gated calcium channels. Heterozygous loss of DGCR8 recapitulated the excitability and calcium phenotypes and its overexpression rescued these defects. Moreover, the 22q11DS calcium abnormality could also be restored by application of antipsychotics. Taken together, our study illustrates how stem cell derived models can be used to uncover and rescue cellular phenotypes associated with genetic forms of neuropsychiatric disease.

    View details for DOI 10.1038/s41591-020-1043-9

    View details for PubMedID 32989314