Dr. Tan is a computational biologist who develops computational tools to quantitatively assess cell identity, improve stem cell engineering, and understand cancer heterogeneity. As a Ph.D. student, Dr. Tan routinely performs computational and quantitative analysis on scRNA-seq data, which has resulted in several publications. Currently, at her postdoctoral position, Dr. Tan integrated single-cell omics with multiplexed image data to understand high dimensional tissue architecture in cancer. Dr.Tan's long-term aims are to integrate multi-omics to understand how different cell types and their interactions contribute to development and disease.

Professional Education

  • Bachelor of Science, Chinese University of Hong Kong (2014)
  • Doctor of Philosophy, Johns Hopkins University (2021)
  • BSc(Hon), The Chinese University of Hong Kong, Cell and Molecular Biology (2014)
  • PhD, Johns Hopkins University, Computational Biology (2021)

Stanford Advisors

Lab Affiliations

All Publications

  • Gene Regulatory Network Analysis and Engineering Directs Development and Vascularization of Multilineage Human Liver Organoids CELL SYSTEMS Velazquez, J. J., LeGraw, R., Moghadam, F., Tan, Y., Kilbourne, J., Maggiore, J. C., Hislop, J., Liu, S., Cats, D., Lopes, S., Plaisier, C., Cahan, P., Kiani, S., Ebrahimkhani, M. R. 2021; 12 (1): 41-+


    Pluripotent stem cell (PSC)-derived organoids have emerged as novel multicellular models of human tissue development but display immature phenotypes, aberrant tissue fates, and a limited subset of cells. Here, we demonstrate that integrated analysis and engineering of gene regulatory networks (GRNs) in PSC-derived multilineage human liver organoids direct maturation and vascular morphogenesis in vitro. Overexpression of PROX1 and ATF5, combined with targeted CRISPR-based transcriptional activation of endogenous CYP3A4, reprograms tissue GRNs and improves native liver functions, such as FXR signaling, CYP3A4 enzymatic activity, and stromal cell reactivity. The engineered tissues possess superior liver identity when compared with other PSC-derived liver organoids and show the presence of hepatocyte, biliary, endothelial, and stellate-like cell populations in single-cell RNA-seq analysis. Finally, they show hepatic functions when studied in vivo. Collectively, our approach provides an experimental framework to direct organogenesis in vitro by systematically probing molecular pathways and transcriptional networks that promote tissue development.

    View details for DOI 10.1016/j.cels.2020.11.002

    View details for Web of Science ID 000610099400004

    View details for PubMedID 33290741

    View details for PubMedCentralID PMC8164844

  • Strategies for Accurate Cell Type Identification in CODEX Multiplexed Imaging Data. Frontiers in immunology Hickey, J. W., Tan, Y., Nolan, G. P., Goltsev, Y. 2021; 12: 727626


    Multiplexed imaging is a recently developed and powerful single-cell biology research tool. However, it presents new sources of technical noise that are distinct from other types of single-cell data, necessitating new practices for single-cell multiplexed imaging processing and analysis, particularly regarding cell-type identification. Here we created single-cell multiplexed imaging datasets by performing CODEX on four sections of the human colon (ascending, transverse, descending, and sigmoid) using a panel of 47 oligonucleotide-barcoded antibodies. After cell segmentation, we implemented five different normalization techniques crossed with four unsupervised clustering algorithms, resulting in 20 unique cell-type annotations for the same dataset. We generated two standard annotations: hand-gated cell types and cell types produced by over-clustering with spatial verification. We then compared these annotations at four levels of cell-type granularity. First, increasing cell-type granularity led to decreased labeling accuracy; therefore, subtle phenotype annotations should be avoided at the clustering step. Second, accuracy in cell-type identification varied more with normalization choice than with clustering algorithm. Third, unsupervised clustering better accounted for segmentation noise during cell-type annotation than hand-gating. Fourth, Z-score normalization was generally effective in mitigating the effects of noise from single-cell multiplexed imaging. Variation in cell-type identification will lead to significant differential spatial results such as cellular neighborhood analysis; consequently, we also make recommendations for accurately assigning cell-type labels to CODEX multiplexed imaging.

    View details for DOI 10.3389/fimmu.2021.727626

    View details for PubMedID 34484237

  • Transcriptome Dynamics of Hematopoietic Stem Cell Formation Revealed Using a Combinatorial Runx1 and Ly6a Reporter System STEM CELL REPORTS Chen, M. J., da Rocha, E., Cahan, P., Kubaczka, C., Hunter, P., Sousa, P., Mullin, N. K., Fujiwara, Y., Minh Nguyen, Tan, Y., Zhou, Y., North, T. E., Zon, L., Daley, G. Q., Schlaeger, T. M. 2020; 14 (5): 956-971


    Studies of hematopoietic stem cell (HSC) development from pre-HSC-producing hemogenic endothelial cells (HECs) are hampered by the rarity of these cells and the presence of other cell types with overlapping marker expression profiles. We generated a Tg(Runx1-mKO2; Ly6a-GFP) dual reporter mouse to visualize hematopoietic commitment and study pre-HSC emergence and maturation. Runx1-mKO2 marked all intra-arterial HECs and hematopoietic cluster cells (HCCs), including pre-HSCs, myeloid- and lymphoid progenitors, and HSCs themselves. However, HSC and lymphoid potential were almost exclusively found in reporter double-positive (DP) cells. Robust HSC activity was first detected in DP cells of the placenta, reflecting the importance of this niche for (pre-)HSC maturation and expansion before the fetal liver stage. A time course analysis by single-cell RNA sequencing revealed that as pre-HSCs mature into fetal liver stage HSCs, they show signs of interferon exposure, exhibit signatures of multi-lineage differentiation gene expression, and develop a prolonged cell cycle reminiscent of quiescent adult HSCs.

    View details for DOI 10.1016/j.stemcr.2020.03.020

    View details for Web of Science ID 000533148900015

    View details for PubMedID 32302558

    View details for PubMedCentralID PMC7220988

  • SingleCellNet: A Computational Tool to Classify Single Cell RNA-Seq Data Across Platforms and Across Species CELL SYSTEMS Tan, Y., Cahan, P. 2019; 9 (2): 207-+


    Single-cell RNA-seq has emerged as a powerful tool in diverse applications, from determining the cell-type composition of tissues to uncovering regulators of developmental programs. A near-universal step in the analysis of single-cell RNA-seq data is to hypothesize the identity of each cell. Often, this is achieved by searching for combinations of genes that have previously been implicated as being cell-type specific, an approach that is not quantitative and does not explicitly take advantage of other single-cell RNA-seq studies. Here, we describe our tool, SingleCellNet, which addresses these issues and enables the classification of query single-cell RNA-seq data in comparison to reference single-cell RNA-seq data. SingleCellNet compares favorably to other methods in sensitivity and specificity, and it is able to classify across platforms and species. We highlight SingleCellNet's utility by classifying previously undetermined cells, and by assessing the outcome of a cell fate engineering experiment.

    View details for DOI 10.1016/j.cels.2019.06.004

    View details for Web of Science ID 000483697600008

    View details for PubMedID 31377170

    View details for PubMedCentralID PMC6715530

  • SCD1 and SCD2 Form a Complex That Functions with the Exocyst and RabE1 in Exocytosis and Cytokinesis PLANT CELL Mayers, J., Hu, T., Wang, C., Cardenas, J. J., Tan, Y., Pan, J., Bednarek, S. Y. 2017; 29 (10): 2610-2625


    Although exocytosis is critical for the proper trafficking of materials to the plasma membrane, relatively little is known about the mechanistic details of post-Golgi trafficking in plants. Here, we demonstrate that the DENN (Differentially Expressed in Normal and Neoplastic cells) domain protein STOMATAL CYTOKINESIS DEFECTIVE1 (SCD1) and SCD2 form a previously unknown protein complex, the SCD complex, that functionally interacts with subunits of the exocyst complex and the RabE1 family of GTPases in Arabidopsis thaliana Consistent with a role in post-Golgi trafficking, scd1 and scd2 mutants display defects in exocytosis and recycling of PIN2-GFP. Perturbation of exocytosis using the small molecule Endosidin2 results in growth inhibition and PIN2-GFP trafficking defects in scd1 and scd2 mutants. In addition to the exocyst, the SCD complex binds in a nucleotide state-specific manner with Sec4p/Rab8-related RabE1 GTPases and overexpression of wild-type RabE1 rescues scd1 temperature-sensitive mutants. Furthermore, SCD1 colocalizes with the exocyst subunit, SEC15B, and RabE1 at the cell plate and in distinct punctae at or near the plasma membrane. Our findings reveal a mechanism for plant exocytosis, through the identification and characterization of a protein interaction network that includes the SCD complex, RabE1, and the exocyst.

    View details for DOI 10.1105/tpc.17.00409

    View details for Web of Science ID 000414861100025

    View details for PubMedID 28970336

    View details for PubMedCentralID PMC5774579

  • Assessment of engineered cells using CellNet and RNA-seq NATURE PROTOCOLS Radley, A. H., Schwab, R. M., Tan, Y., Kim, J., Lo, E. W., Cahan, P. 2017; 12 (5): 1089-1102


    CellNet is a computational platform designed to assess cell populations engineered by either directed differentiation of pluripotent stem cells (PSCs) or direct conversion, and to suggest specific hypotheses to improve cell fate engineering protocols. CellNet takes as input gene expression data and compares them with large data sets of normal expression profiles compiled from public sources, in regard to the extent to which cell- and tissue-specific gene regulatory networks are established. CellNet was originally designed to work with human or mouse microarray expression data for 21 cell or tissue (C/T) types. Here we describe how to apply CellNet to RNA-seq data and how to build a completely new CellNet platform applicable to, for example, other species or additional cell and tissue types. Once the raw data have been preprocessed, running CellNet takes only several minutes, whereas the time required to create a completely new CellNet is several hours.

    View details for DOI 10.1038/nprot.2017.022

    View details for Web of Science ID 000400371100009

    View details for PubMedID 28448485

    View details for PubMedCentralID PMC5765439

  • Understanding development and stem cells using single cell-based analyses of gene expression DEVELOPMENT Kumar, P., Tan, Y., Cahan, P. 2017; 144 (1): 17-32


    In recent years, genome-wide profiling approaches have begun to uncover the molecular programs that drive developmental processes. In particular, technical advances that enable genome-wide profiling of thousands of individual cells have provided the tantalizing prospect of cataloging cell type diversity and developmental dynamics in a quantitative and comprehensive manner. Here, we review how single-cell RNA sequencing has provided key insights into mammalian developmental and stem cell biology, emphasizing the analytical approaches that are specific to studying gene expression in single cells.

    View details for DOI 10.1242/dev.133058

    View details for Web of Science ID 000393454900005

    View details for PubMedID 28049689

    View details for PubMedCentralID PMC5278625

  • MON1/CCZ1-mediated Rab7 activation regulates tapetal PCD and pollen development in Arabidopsis Plant Physiology Cui, Y. 2017
  • Valproate-Induced Neurodevelopmental Deficits in Xenopus laevis Tadpoles JOURNAL OF NEUROSCIENCE James, E. J., Gu, J., Ramirez-Vizcarrondo, C. M., Hasan, M., Truszkowski, T. S., Tan, Y., Oupravanh, P. M., Khakhalin, A. S., Aizenman, C. D. 2015; 35 (7): 3218-3229


    Autism spectrum disorder (ASD) is increasingly thought to result from low-level deficits in synaptic development and neural circuit formation that cascade into more complex cognitive symptoms. However, the link between synaptic dysfunction and behavior is not well understood. By comparing the effects of abnormal circuit formation and behavioral outcomes across different species, it should be possible to pinpoint the conserved fundamental processes that result in disease. Here we use a novel model for neurodevelopmental disorders in which we expose Xenopus laevis tadpoles to valproic acid (VPA) during a critical time point in brain development at which neurogenesis and neural circuit formation required for sensory processing are occurring. VPA is a commonly prescribed antiepileptic drug with known teratogenic effects. In utero exposure to VPA in humans or rodents results in a higher incidence of ASD or ASD-like behavior later in life. We find that tadpoles exposed to VPA have abnormal sensorimotor and schooling behavior that is accompanied by hyperconnected neural networks in the optic tectum, increased excitatory and inhibitory synaptic drive, elevated levels of spontaneous synaptic activity, and decreased neuronal intrinsic excitability. Consistent with these findings, VPA-treated tadpoles also have increased seizure susceptibility and decreased acoustic startle habituation. These findings indicate that the effects of VPA are remarkably conserved across vertebrate species and that changes in neural circuitry resulting from abnormal developmental pruning can cascade into higher-level behavioral deficits.

    View details for DOI 10.1523/JNEUROSCI.4050-14.2015

    View details for Web of Science ID 000349992800034

    View details for PubMedID 25698756

    View details for PubMedCentralID PMC4331635