Yue Wu's Profile | Stanford Profiles

Bio

2016, BS, Nanjing University, China
2017-2022, Ph.D. University of Georgia, Advisor: Arthur Edison and Jonathan Arnold
2022-present, postdocs, Stanford University, Advisor: Michael Snyder

Honors & Awards

Graduate school travel grant, UGA (2019)
Outstanding graduate with honor, Nanjing University (2016)
First-class People’s Scholarship, Nanjing University (2014 – 2015)
Silver Medal, iGEM (International Genetically Engineered Machine) competition (2014)

Stanford Advisors

Michael Snyder, Postdoctoral Faculty Sponsor

Contact

Academic
yuewu135@stanford.edu

University - Scholar Department: Genetics Position: Postdoctoral Scholar

Additional Info

Mail Code: 5120
ORCID:
https://orcid.org/0000-0001-7170-0053

Current Research and Scholarly Interests

I built computational methods to integrate and model biological time series, including metabolic dynamics, longitudinal multi-omics data, and micro-sampling. I reduce dimensions, built clusters, and search for causal links.

1. Knowledge extraction from time-series metabolic systems. Recent developments in omics approaches provide a comprehensive view of the biological system at one time point. However, the understanding of the dynamic response to environmental perturbation is still limited in both data collection and computational analysis. I contributed to an NMR approach to collecting time-series metabolic data. I then designed the computational method to efficiently extract chemical information from the high-dimensional heavy dataset. This provides rich information regarding dynamic metabolic processes under different environments. I uncovered biological regulation in carbon metabolism and glycogen utilization from this high-dimensional time series, through modeling and time-series analysis. I built a new efficient workflow to understand metabolic dynamics and regulation, which can be expanded to other fermentation systems and the study of metabolic disease in humans.

2.Automation in phenotyping biological systems. New experimental approaches (e.g., microscopic devices) enable the recording of thousands of samples in a short time, which greatly promotes the phenotyping of plants, fungi, and human tissues. However, image annotation and information extraction are still manual intensive. I built multiple frameworks to classify phenotypes through ResNet in PyTorch, associate with genomic information, and uncover important structures through feature importance evaluation. I also built image segmentation programs through Detectron2 to annotate different symbiosis structures of Arbuscular mycorrhiza and worm population. Automation in phenotyping greatly

All Publications

Modifiable Factors Affecting the Postprandial Glycemic Response. Journal of diabetes science and technology Wu, Y., McLaughlin, T., Gorgani, S., Scheideman, A. F., Shao, M. M., Hislop, B. D., Hoang, K., Perelman, D., McGinity, C., Rodgar, M., Park, H., Wang, T., Mayer, C., DuNova, A., Ayers, A., Ho, C., Ræder, H., Klonoff, D. C., Snyder, M. P. 2026: 19322968261418614

Abstract

The postprandial glycemic response (PPGR) is associated with diabetes and cardiovascular disease and is highly individualized. The PPGR is affected by both physiological and behavioral factors. Attention to the PPGR has dramatically increased recently with the widespread use of continuous glucose monitors. It is expected that individualized control of PPGRs will be important in the prevention of diabetes and its associated complications. In this article, we discuss six modifiable factors associated with the PPGRs, including (1) the glucoregulatory hormones, (2) gastric emptying, (3) salivary or pancreatic amylase, (4) diet, (5) physical exercise, and (6) sleep and circadian rhythm. Modifying these factors may allow for personalized intervention strategies to control the PPGR-to reduce the risk for cardiovascular disease in individuals with varying degrees of glycemia.

View details for DOI 10.1177/19322968261418614

View details for PubMedID 41660725
Glucose360: An Open-Source Python Platform with Event-Based Integration for Continuous Glucose Monitoring Data Analysis. Diabetes technology & therapeutics Ehlert, B., Aron, D., Perelman, D., Wu, Y., Snyder, M. P. 2025

Abstract

Background and Aims: Continuous glucose monitoring (CGM) devices provide real-time actionable data on blood glucose levels, making them essential tools for effective glucose management. Integrating blood glucose data with food log data is crucial for understanding how dietary choices impact glucose levels. Despite their utility, many CGM applications lack integration with other external services, such as food trackers, and do not generate useful glycemic variability (GV) metrics or advanced visualizations. Existing solutions vary in functionality: some are proprietary, many require additional user programming or custom preprocessing to meet diverse research needs, and few have created solutions to connect CGM data with external services. Recent reviews highlight gaps such as insufficient postprandial analytics, absence of composite indices, and inadequate tools for nontechnical users. Methods: Glucose360 and commonly used alternative CGM applications and tools were compared by calculating GV metrics on 60 participant datasets and by contrasting their general applications for research workflows. Results: To address limitations, we developed Glucose360, featuring (1) an open-source python framework for event-based CGM data integration and analysis; (2) automated calculation of glucose metrics specific for meals and exercise events and other short-interval events; and (3) a user-friendly web application, designed for users with minimal programming experience and accessible at vurhd2.shinyapps.io/glucose360/. Discussion: Overall, Glucose360 provides a holistic analysis pipeline that is useful for both individuals and researchers to track and analyze CGM data. The source code for Glucose360 can be found at github.com/vurhd2/Glucose360.

View details for DOI 10.1177/15209156251374711

View details for PubMedID 40900178
High-resolution lifestyle profiling and metabolic subphenotypes of type 2 diabetes. NPJ digital medicine Park, H., Metwally, A. A., Delfarah, A., Wu, Y., Perelman, D., Mayer, C., McGinity, C., Rodgar, M., Celli, A., McLaughlin, T., Mignot, E., Snyder, M. 2025; 8 (1): 352

Abstract

Distinct metabolic susceptibilities (beta-cell dysfunction, insulin resistance (IR), and impaired incretin response) underlie type 2 diabetes (T2D). However, their relationships with habitual lifestyle behaviors are underexplored. This study integrated high-resolution lifestyle data from wearable devices, continuous glucose monitoring, and smartphone-based food logs with gold-standard physiological tests in 36 individuals at risk for T2D (ClinicalTrials.Gov; NCT03919877; 2019-04-18). Over 6400 timestamped records of diet, sleep, and physical activity were analyzed with in participants with measures of beta-cell function, tissue-specific IR (muscle, hepatic, adipose), and incretin response. We found that lifestyle timing and variability were strongly associated with metabolic subphenotypes: (1) eating timing was associated with muscle IR and incretin function; (2) irregular sleep correlated to IR and incretin function; and (3) Time-of-day effects of physical activity varied by subphenotype. These findings were validated in an independent cohort. Our results highlight novel physiological links between daily behaviors and metabolic risk, informing potential lifestyle modifications for T2D prevention.

View details for DOI 10.1038/s41746-025-01728-6

View details for PubMedID 40500312

View details for PubMedCentralID PMC12159136
Individual variations in glycemic responses to carbohydrates and underlying metabolic physiology. Nature medicine Wu, Y., Ehlert, B., Metwally, A. A., Perelman, D., Park, H., Brooks, A. W., Abbasi, F., Michael, B., Celli, A., Bejikian, C., Ayhan, E., Lu, Y., Lancaster, S. M., Hornburg, D., Ramirez, L., Bogumil, D., Pollock, S., Wong, F., Bradley, D., Gutjahr, G., Rangan, E. S., Wang, T., McGuire, L., Venkat Rangan, P., Ræder, H., Shipony, Z., Lipson, D., McLaughlin, T., Snyder, M. P. 2025

Abstract

Elevated postprandial glycemic responses (PPGRs) are associated with type 2 diabetes and cardiovascular disease. PPGRs to the same foods have been shown to vary between individuals, but systematic characterization of the underlying physiologic and molecular basis is lacking. We measured PPGRs using continuous glucose monitoring in 55 well-phenotyped participants challenged with seven different standard carbohydrate meals administered in replicate. We also examined whether preloading a rice meal with fiber, protein or fat ('mitigators') altered PPGRs. We performed gold-standard metabolic tests and multi-omics profiling to examine the physiologic and molecular basis for interindividual PPGR differences. Overall, rice was the most glucose-elevating carbohydrate meal, but there was considerable interindividual variability. Individuals with the highest PPGR to potatoes (potato-spikers) were more insulin resistant and had lower beta cell function, whereas grape-spikers were more insulin sensitive. Rice-spikers were more likely to be Asian individuals, and bread-spikers had higher blood pressure. Mitigators were less effective in reducing PPGRs in insulin-resistant as compared to insulin-sensitive participants. Multi-omics signatures of PPGR and metabolic phenotypes were discovered, including insulin-resistance-associated triglycerides, hypertension-associated metabolites and PPGR-associated microbiome pathways. These results demonstrate interindividual variability in PPGRs to carbohydrate meals and mitigators and their association with metabolic and molecular profiles.

View details for DOI 10.1038/s41591-025-03719-2

View details for PubMedID 40467897

View details for PubMedCentralID 4266395
MINE: a new way to design genetics experiments for discovery. Briefings in bioinformatics Torres, I., Zhang, S., Bouffier, A., Skaro, M., Wu, Y., Stupp, L., Arnold, J., Chung, Y. A., Schuttler, H. B. 2025; 26 (2)

Abstract

The Maximally Informative Next Experiment or MINE is a new experimental design approach for experiments, such as those in omics, in which the number of effects or parameters p greatly exceeds the number of samples n (p > n). Classical experimental design presumes n > p for inference about parameters and its application to p > n can lead to over-fitting. To overcome p > n, MINE is an ensemble method, which makes predictions about future experiments from an existing ensemble of models consistent with available data in order to select the most informative next experiment. Its advantages are in exploration of the data for new relationships with n < p and being able to integrate smaller and more tractable experiments to replace adaptively one large classic experiment as discoveries are made. Thus, using MINE is model-guided and adaptive over time in a large omics study. Here, MINE is illustrated in two distinct multiyear experiments, one involving genetic networks in Neurospora crassa and a second one involving a genome-wide association study in Sorghum bicolor as a comparison to classic experimental design in an agricultural setting.

View details for DOI 10.1093/bib/bbaf167

View details for PubMedID 40237762
Origin of the clock in Neurospora crassa. Frontiers in molecular biosciences Al-Omari, A., Altimus, C., Arnold, J., Arsenault, S., Bhandarkar, S., Bhusal, S., Caranica, C., Cheong, J. H., Deng, Z., Edison, A. S., Floyd, G., Griffith, J., Hull, B., Judge, M. T., Liu, Y., Mao, L., Mohanty, B., Qiu, X., Schüttler, H. B., Scruse, A., Taha, T., Wu, L., Wu, Y. 2025; 12: 1697003

Abstract

We examine the collective behavior of single cells in microbial systems to provide insights into the origin of the biological clock. Microfluidics has opened a window onto how single cells can synchronize their behavior. Four hypotheses are proposed to explain the origin of the clock from the synchronized behavior of single cells. These hypotheses depend on the presence or absence of a communication mechanism between the clocks in single cells and the presence or absence of a stochastic component in the clock mechanism. To test these models, we integrate physical models for the behavior of the clocks in single cells or filaments with new approaches to measuring clocks in single cells. As an example, we provide evidence for a quorum-sensing signal both with microfluidics experiments on single cells and with continuous in vivo metabolism NMR (CIVM-NMR). We also provide evidence for the stochastic component in clocks of single cells. Throughout this study, ensemble methods from statistical physics are used to characterize the clock at both the single-cell level and the macroscopic scale of 106 cells.

View details for DOI 10.3389/fmolb.2025.1697003

View details for PubMedID 41613003

View details for PubMedCentralID PMC12848316
Prediction of metabolic subphenotypes of type 2 diabetes via continuous glucose monitoring and machine learning. Nature biomedical engineering Metwally, A. A., Perelman, D., Park, H., Wu, Y., Jha, A., Sharp, S., Celli, A., Ayhan, E., Abbasi, F., Gloyn, A. L., McLaughlin, T., Snyder, M. P. 2024

Abstract

The classification of type 2 diabetes and prediabetes does not consider heterogeneity in the pathophysiology of glucose dysregulation. Here we show that prediabetes is characterized by metabolic heterogeneity, and that metabolic subphenotypes can be predicted by the shape of the glucose curve measured via a continuous glucose monitor (CGM) during standardized oral glucose-tolerance tests (OGTTs) performed in at-home settings. Gold-standard metabolic tests in 32 individuals with early glucose dysregulation revealed dominant or co-dominant subphenotypes (muscle or hepatic insulin-resistance phenotypes in 34% of the individuals, and β-cell-dysfunction or impaired-incretin-action phenotypes in 40% of them). Machine-learning models trained with glucose time series from OGTTs from the 32 individuals predicted the subphenotypes with areas under the curve (AUCs) of 95% for muscle insulin resistance, 89% for β-cell deficiency and 88% for impaired incretin action. With CGM-generated glucose curves obtained during at-home OGTTs, the models predicted the muscle-insulin-resistance and β-cell-deficiency subphenotypes of 29 individuals with AUCs of 88% and 84%, respectively. At-home identification of metabolic subphenotypes via a CGM may aid the risk stratification of individuals with early glucose dysregulation.

View details for DOI 10.1038/s41551-024-01311-6

View details for PubMedID 39715896

View details for PubMedCentralID 11057359
Lifestyle Profiling Using Wearables and Prediction of Glucose Metabolism in Individuals with Normoglycemia or Prediabetes. medRxiv : the preprint server for health sciences Park, H., Metwally, A. A., Delfarah, A., Wu, Y., Perelman, D., Rodgar, M., Mayer, C., Celli, A., McLaughlin, T., Mignot, E., Snyder, M. 2024

Abstract

This study examined the relationship between lifestyles (diet, sleep, and physical activity) and glucose responses at a personal level. 36 healthy adults in the Bay Area were monitored for their lifestyles and glucose levels using wearables and continuous glucose monitoring (NCT03919877). Gold-standard metabolic tests were conducted to phenotype metabolic characteristics. Through the lifestyle data (2,307 meals, 1,809 nights, and 2,447 days) and 231,206 CGM readings from metabolically-phenotyped individuals with normoglycemia or prediabetes, we found: 1) eating timing was associated with hyperglycemia, muscle insulin resistance (IR), and incretin dysfunction, whereas nutrient intakes were not; 2) timing of increased activity in muscle IS and IR participants was associated with differential benefits of glucose control; 3) Integrated ML models using lifestyle factors predicted distinct metabolic characteristics (muscle, adipose IR or incretin dysfunction). Our data indicate the differential impact of lifestyles on glucose regulation among individuals with different metabolic phenotypes, highlighting the value of personalized lifestyle modifications.

View details for DOI 10.1101/2024.09.05.24312545

View details for PubMedID 39281757

View details for PubMedCentralID PMC11398605
Predicting Type 2 Diabetes Metabolic Phenotypes Using Continuous Glucose Monitoring and a Machine Learning Framework. medRxiv : the preprint server for health sciences Metwally, A. A., Perelman, D., Park, H., Wu, Y., Jha, A., Sharp, S., Celli, A., Ayhan, E., Abbasi, F., Gloyn, A. L., McLaughlin, T., Snyder, M. 2024

Abstract

Type 2 diabetes (T2D) and prediabetes are classically defined by the level of fasting glucose or surrogates such as hemoglobin HbA1c. This classification does not take into account the heterogeneity in the pathophysiology of glucose dysregulation, the identification of which could inform targeted approaches to diabetes treatment and prevention and/or predict clinical outcomes. We performed gold-standard metabolic tests in a cohort of individuals with early glucose dysregulation and quantified four distinct metabolic subphenotypes known to contribute to glucose dysregulation and T2D: muscle insulin resistance, β-cell dysfunction, impaired incretin action, and hepatic insulin resistance. We revealed substantial inter-individual heterogeneity, with 34% of individuals exhibiting dominance or co-dominance in muscle and/or liver IR, and 40% exhibiting dominance or co-dominance in β-cell and/or incretin deficiency. Further, with a frequently-sampled oral glucose tolerance test (OGTT), we developed a novel machine learning framework to predict metabolic subphenotypes using features from the dynamic patterns of the glucose time-series ("shape of the glucose curve"). The glucose time-series features identified insulin resistance, β-cell deficiency, and incretin defect with auROCs of 95%, 89%, and 88%, respectively. These figures are superior to currently-used estimates. The prediction of muscle insulin resistance and β-cell deficiency were validated using an independent cohort. We then tested the ability of glucose curves generated by a continuous glucose monitor (CGM) worn during at-home OGTTs to predict insulin resistance and β-cell deficiency, yielding auROC of 88% and 84%, respectively. We thus demonstrate that the prediabetic state is characterized by metabolic heterogeneity, which can be defined by the shape of the glucose curve during standardized OGTT, performed in a clinical research unit or at-home setting using CGM. The use of at-home CGM to identify muscle insulin resistance and β-cell deficiency constitutes a practical and scalable method by which to risk stratify individuals with early glucose dysregulation and inform targeted treatment to prevent T2D.

View details for DOI 10.1101/2024.07.20.24310737

View details for PubMedID 39108516

View details for PubMedCentralID PMC11302614
Computer vision models enable mixed linear modeling to predict arbuscular mycorrhizal fungal colonization using fungal morphology. Scientific reports Zhang, S., Wu, Y., Skaro, M., Cheong, J. H., Bouffier-Landrum, A., Torrres, I., Guo, Y., Stupp, L., Lincoln, B., Prestel, A., Felt, C., Spann, S., Mandal, A., Johnson, N., Arnold, J. 2024; 14 (1): 10866

Abstract

The presence of Arbuscular Mycorrhizal Fungi (AMF) in vascular land plant roots is one of the most ancient of symbioses supporting nitrogen and phosphorus exchange for photosynthetically derived carbon. Here we provide a multi-scale modeling approach to predict AMF colonization of a worldwide crop from a Recombinant Inbred Line (RIL) population derived from Sorghum bicolor and S. propinquum. The high-throughput phenotyping methods of fungal structures here rely on a Mask Region-based Convolutional Neural Network (Mask R-CNN) in computer vision for pixel-wise fungal structure segmentations and mixed linear models to explore the relations of AMF colonization, root niche, and fungal structure allocation. Models proposed capture over 95% of the variation in AMF colonization as a function of root niche and relative abundance of fungal structures in each plant. Arbuscule allocation is a significant predictor of AMF colonization among sibling plants. Arbuscules and extraradical hyphae implicated in nutrient exchange predict highest AMF colonization in the top root section. Our work demonstrates that deep learning can be used by the community for the high-throughput phenotyping of AMF in plant roots. Mixed linear modeling provides a framework for testing hypotheses about AMF colonization phenotypes as a function of root niche and fungal structure allocations.

View details for DOI 10.1038/s41598-024-61181-5

View details for PubMedID 38740920

View details for PubMedCentralID 9256619
SAND: Automated Time-Domain Modeling of NMR Spectra Applied to Metabolite Quantification. Analytical chemistry Wu, Y., Sanati, O., Uchimiya, M., Krishnamurthy, K., Wedell, J., Hoch, J. C., Edison, A. S., Delaglio, F. 2024

Abstract

Developments in untargeted nuclear magnetic resonance (NMR) metabolomics enable the profiling of thousands of biological samples. The exploitation of this rich source of information requires a detailed quantification of spectral features. However, the development of a consistent and automatic workflow has been challenging because of extensive signal overlap. To address this challenge, we introduce the software Spectral Automated NMR Decomposition (SAND). SAND follows on from the previous success of time-domain modeling and automatically quantifies entire spectra without manual interaction. The SAND approach uses hybrid optimization with Markov chain Monte Carlo methods, employing subsampling in both time and frequency domains. In particular, SAND randomly divides the time-domain data into training and validation sets to help avoid overfitting. We demonstrate the accuracy of SAND, which provides a correlation of 0.9 with ground truth on cases including highly overlapped simulated data sets, a two-compound mixture, and a urine sample spiked with different amounts of a four-compound mixture. We further demonstrate an automated annotation using correlation networks derived from SAND decomposed peaks, and on average, 74% of peaks for each compound can be recovered in single clusters. SAND is available in NMRbox, the cloud computing environment for NMR software hosted by the Network for Advanced NMR (NAN). Since the SAND method uses time-domain subsampling (i.e., random subset of time-domain points), it has the potential to be extended to a higher dimensionality and nonuniformly sampled data.

View details for DOI 10.1021/acs.analchem.3c03078

View details for PubMedID 38273718
Characterizing the gene-environment interaction underlying natural morphological variation in Neurospora crassa conidiophores using high-throughput phenomics and transcriptomics G3-GENES GENOMES GENETICS Krach, E. K., Skaro, M., Wu, Y., Arnold, J. 2022; 12 (4)

Abstract

Neurospora crassa propagates through dissemination of conidia, which develop through specialized structures called conidiophores. Recent work has identified striking variation in conidiophore morphology, using a wild population collection from Louisiana, United States of America to classify 3 distinct phenotypes: Wild-Type, Wrap, and Bulky. Little is known about the impact of these phenotypes on sporulation or germination later in the N. crassa life cycle, or about the genetic variation that underlies them. In this study, we show that conidiophore morphology likely affects colonization capacity of wild N. crassa isolates through both sporulation distance and germination on different carbon sources. We generated and crossed homokaryotic strains belonging to each phenotypic group to more robustly fit a model for and estimate heritability of the complex trait, conidiophore architecture. Our fitted model suggests at least 3 genes and 2 epistatic interactions contribute to conidiophore phenotype, which has an estimated heritability of 0.47. To uncover genes contributing to these phenotypes, we performed RNA-sequencing on mycelia and conidiophores of strains representing each of the 3 phenotypes. Our results show that the Bulky strain had a distinct transcriptional profile from that of Wild-Type and Wrap, exhibiting differential expression patterns in clock-controlled genes (ccgs), the conidiation-specific gene con-6, and genes implicated in metabolism and communication. Combined, these results present novel ecological impacts of and differential gene expression underlying natural conidiophore morphological variation, a complex trait that has not yet been thoroughly explored.

View details for DOI 10.1093/g3journal/jkac050

View details for Web of Science ID 000769668200001

View details for PubMedID 35293585

View details for PubMedCentralID PMC8982394
Uncovering in vivo biochemical patterns from time-series metabolic dynamics. PloS one Wu, Y., Judge, M. T., Edison, A. S., Arnold, J. 2022; 17 (5): e0268394

Abstract

System biology relies on holistic biomolecule measurements, and untangling biochemical networks requires time-series metabolomics profiling. With current metabolomic approaches, time-series measurements can be taken for hundreds of metabolic features, which decode underlying metabolic regulation. Such a metabolomic dataset is untargeted with most features unannotated and inaccessible to statistical analysis and computational modeling. The high dimensionality of the metabolic space also causes mechanistic modeling to be rather cumbersome computationally. We implemented a faster exploratory workflow to visualize and extract chemical and biochemical dependencies. Time-series metabolic features (about 300 for each dataset) were extracted by Ridge Tracking-based Extract (RTExtract) on measurements from continuous in vivo monitoring of metabolism by NMR (CIVM-NMR) in Neurospora crassa under different conditions. The metabolic profiles were then smoothed and projected into lower dimensions, enabling a comparison of metabolic trends in the cultures. Next, we expanded incomplete metabolite annotation using a correlation network. Lastly, we uncovered meaningful metabolic clusters by estimating dependencies between smoothed metabolic profiles. We thus sidestepped the processes of time-consuming mechanistic modeling, difficult global optimization, and labor-intensive annotation. Multiple clusters guided insights into central energy metabolism and membrane synthesis. Dense connections with glucose 1-phosphate indicated its central position in metabolism in N. crassa. Our approach was benchmarked on simulated random network dynamics and provides a novel exploratory approach to analyzing high-dimensional metabolic dynamics.

View details for DOI 10.1371/journal.pone.0268394

View details for PubMedID 35550643
Wild Isolates of Neurospora crassa Reveal Three Conidiophore Architectural Phenotypes MICROORGANISMS Krach, E. K., Wu, Y., Skaro, M., Mao, L., Arnold, J. 2020; 8 (11)

Abstract

The vegetative life cycle in the model filamentous fungus, Neurospora crassa, relies on the development of conidiophores to produce new spores. Environmental, temporal, and genetic components of conidiophore development have been well characterized; however, little is known about their morphological variation. We explored conidiophore architectural variation in a natural population using a wild population collection of 21 strains from Louisiana, United States of America (USA). Our work reveals three novel architectural phenotypes, Wild Type, Bulky, and Wrap, and shows their maintenance throughout the duration of conidiophore development. Furthermore, we present a novel image-classifier using a convolutional neural network specifically developed to assign conidiophore architectural phenotypes in a high-throughput manner. To estimate an inheritance model for this discrete complex trait, crosses between strains of each phenotype were conducted, and conidiophores of subsequent progeny were characterized using the trained classifier. Our model suggests that conidiophore architecture is controlled by at least two genes and has a heritability of 0.23. Additionally, we quantified the number of conidia produced by each conidiophore type and their dispersion distance, suggesting that conidiophore architectural phenotype may impact N. crassa colonization capacity.

View details for DOI 10.3390/microorganisms8111760

View details for Web of Science ID 000593214700001

View details for PubMedID 33182369

View details for PubMedCentralID PMC7695285
RTExtract: time-series NMR spectra quantification based on 3D surface ridge tracking BIOINFORMATICS Wu, Y., Judge, M. T., Arnold, J., Bhandarkar, S. M., Edison, A. S. 2020; 36 (20): 5068-5075

Abstract

Time-series nuclear magnetic resonance (NMR) has advanced our knowledge about metabolic dynamics. Before analyzing compounds through modeling or statistical methods, chemical features need to be tracked and quantified. However, because of peak overlap and peak shifting, the available protocols are time consuming at best or even impossible for some regions in NMR spectra.We introduce Ridge Tracking-based Extract (RTExtract), a computer vision-based algorithm, to quantify time-series NMR spectra. The NMR spectra of multiple time points were formulated as a 3D surface. Candidate points were first filtered using local curvature and optima, then connected into ridges by a greedy algorithm. Interactive steps were implemented to refine results. Among 173 simulated ridges, 115 can be tracked (RMSD < 0.001). For reproducing previous results, RTExtract took less than 2 h instead of ∼48 h, and two instead of seven parameters need tuning. Multiple regions with overlapping and changing chemical shifts are accurately tracked.Source code is freely available within Metabolomics toolbox GitHub repository (https://github.com/artedison/Edison_Lab_Shared_Metabolomics_UGA/tree/master/metabolomics_toolbox/code/ridge_tracking) and is implemented in MATLAB and R.Supplementary data are available at Bioinformatics online.

View details for DOI 10.1093/bioinformatics/btaa631

View details for Web of Science ID 000605690100013

View details for PubMedID 32653900

View details for PubMedCentralID PMC7755419
Continuous in vivo Metabolism by NMR FRONTIERS IN MOLECULAR BIOSCIENCES Judge, M. T., Wu, Y., Tayyari, F., Haffori, A., Glushka, J., Ito, T., Arnold, J., Edison, A. S. 2019; 6: 26

Abstract

Dense time-series metabolomics data are essential for unraveling the underlying dynamic properties of metabolism. Here we extend high-resolution-magic angle spinning (HR-MAS) to enable continuous in vivo monitoring of metabolism by NMR (CIVM-NMR) and provide analysis tools for these data. First, we reproduced a result in human chronic lymphoid leukemia cells by using isotope-edited CIVM-NMR to rapidly and unambiguously demonstrate unidirectional flux in branched-chain amino acid metabolism. We then collected untargeted CIVM-NMR datasets for Neurospora crassa, a classic multicellular model organism, and uncovered dynamics between central carbon metabolism, amino acid metabolism, energy storage molecules, and lipid and cell wall precursors. Virtually no sample preparation was required to yield a dynamic metabolic fingerprint over hours to days at ~4-min temporal resolution with little noise. CIVM-NMR is simple and readily adapted to different types of cells and microorganisms, offering an experimental complement to kinetic models of metabolism for diverse biological systems.

View details for DOI 10.3389/fmolb.2019.00026

View details for Web of Science ID 000466811700001

View details for PubMedID 31114791

View details for PubMedCentralID PMC6502900
Genome-Wide Analysis Reveals Ancestral Lack of Seventeen Different tRNAs and Clade-Specific Loss of tRNA-CNNs in Archaea FRONTIERS IN MICROBIOLOGY Wu, Y., Wu, P., Wang, B., Shao, Z. 2018; 9: 1245

Abstract

Transfer RNA (tRNA) is a category of RNAs that specifically decode messenger RNAs (mRNAs) into proteins by recognizing a set of 61 codons commonly adopted by different life domains. The composition and abundance of tRNAs play critical roles in shaping codon usage and pairing bias, which subsequently modulate mRNA translation efficiency and accuracy. Over the past few decades, effort has been concentrated on evaluating the specificity and redundancy of different tRNA families. However, the mechanism and processes underlying tRNA evolution have only rarely been investigated. In this study, by surveying tRNA genes in 167 completely sequenced genomes, we systematically investigated the composition and evolution of tRNAs in Archaea from a phylogenetic perspective. Our data revealed that archaeal genomes are compact in both tRNA types and copy number. Generally, no more than 44 different types of tRNA are present in archaeal genomes to decode the 61 canonical codons, and most of them have only one gene copy per genome. Among them, tRNA-Met was significantly overrepresented, with an average of three copies per genome. In contrast, the tRNA-UAU and 16 tRNAs with A-starting anticodons (tRNA-ANNs) were rarely detected in all archaeal genomes. The conspicuous absence of these tRNAs across the archaeal phylogeny suggests they might have not been evolved in the common ancestor of Archaea, rather than have lost independently from different clades. Furthermore, widespread absence of tRNA-CNNs in the Methanococcales and Methanobacteriales genomes indicates convergent loss of these tRNAs in the two clades. This clade-specific tRNA loss may be attributing to the reductive evolution of their genomes. Our data suggest that the current tRNA profiles in Archaea are contributed not only by the ancestral tRNA composition, but also by differential maintenance and loss of redundant tRNAs.

View details for DOI 10.3389/fmicb.2018.01245

View details for Web of Science ID 000434397000001

View details for PubMedID 29930548

View details for PubMedCentralID PMC6000648
Systematic analyses of glutamine and glutamate metabolisms across different cancer types CHINESE JOURNAL OF CANCER Tian, Y., Du, W., Cao, S., Wu, Y., Dong, N., Wang, Y., Xu, Y. 2017; 36: 88

Abstract

Glutamine and glutamate are known to play important roles in cancer biology. However, no detailed information is available in terms of their levels of involvement in various biological processes across different cancer types, whereas such knowledge could be critical for understanding the distinct characteristics of different cancer types. Our computational study aimed to examine the functional roles of glutamine and glutamate across different cancer types.We conducted a comparative analysis of gene expression data of cancer tissues versus normal control tissues of 11 cancer types to understand glutamine and glutamate metabolisms in cancer. Specifically, we developed a linear regression model to assess differential contributions by glutamine and/or glutamate to each of seven biological processes in cancer versus control tissues.While our computational predictions were consistent with some of the previous observations, multiple novel predictions were made: (1) glutamine is generally not involved in purine synthesis in cancer except for breast cancer, and is similarly not involved in pyridine synthesis except for kidney cancer; (2) glutamine is generally not involved in ATP production in cancer; (3) glutamine's contribution to nucleotide synthesis is minimal if any in cancer; (4) glutamine is not involved in asparagine synthesis in cancer except for bladder and lung cancers; and (5) glutamate does not contribute to serine synthesis except for bladder cancer.We comprehensively predicted the roles of glutamine and glutamate metabolisms in selected metabolic pathways in cancer tissues versus control tissues, which may lead to novel approaches to therapeutic development targeted at glutamine and/or glutamate metabolism. However, our predictions need further functional validation.

View details for DOI 10.1186/s40880-017-0255-y

View details for Web of Science ID 000414851000002

View details for PubMedID 29116024

View details for PubMedCentralID PMC5678792
Large-Scale Analyses of Angiosperm Nucleotide-Binding Site-Leucine-Rich Repeat Genes Reveal Three Anciently Diverged Classes with Distinct Evolutionary Patterns PLANT PHYSIOLOGY Shao, Z., Xue, J., Wu, P., Zhang, Y., Wu, Y., Hang, Y., Wang, B., Chen, J. 2016; 170 (4): 2095-2109

Abstract

Nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes make up the largest plant disease resistance gene family (R genes), with hundreds of copies occurring in individual angiosperm genomes. However, the expansion history of NBS-LRR genes during angiosperm evolution is largely unknown. By identifying more than 6,000 NBS-LRR genes in 22 representative angiosperms and reconstructing their phylogenies, we present a potential framework of NBS-LRR gene evolution in the angiosperm. Three anciently diverged NBS-LRR classes (TNLs, CNLs, and RNLs) were distinguished with unique exon-intron structures and DNA motif sequences. A total of seven ancient TNL, 14 CNL, and two RNL lineages were discovered in the ancestral angiosperm, from which all current NBS-LRR gene repertoires were evolved. A pattern of gradual expansion during the first 100 million years of evolution of the angiosperm clade was observed for CNLs. TNL numbers remained stable during this period but were eventually deleted in three divergent angiosperm lineages. We inferred that an intense expansion of both TNL and CNL genes started from the Cretaceous-Paleogene boundary. Because dramatic environmental changes and an explosion in fungal diversity occurred during this period, the observed expansions of R genes probably reflect convergent adaptive responses of various angiosperm families. An ancient whole-genome duplication event that occurred in an angiosperm ancestor resulted in two RNL lineages, which were conservatively evolved and acted as scaffold proteins for defense signal transduction. Overall, the reconstructed framework of angiosperm NBS-LRR gene evolution in this study may serve as a fundamental reference for better understanding angiosperm NBS-LRR genes.

View details for DOI 10.1104/pp.15.01487

View details for Web of Science ID 000375424200016

View details for PubMedID 26839128

View details for PubMedCentralID PMC4825152
Identification of Arbuscular Mycorrhiza (AM)-Responsive microRNAs in Tomato FRONTIERS IN PLANT SCIENCE Wu, P., Wu, Y., Liu, C., Liu, L., Ma, F., Wu, X., Wu, M., Hang, Y., Chen, J., Shao, Z., Wang, B. 2016; 7: 429

Abstract

A majority of land plants can form symbiosis with arbuscular mycorrhizal (AM) fungi. MicroRNAs (miRNAs) have been implicated to regulate this process in legumes, but their involvement in non-legume species is largely unknown. In this study, by performing deep sequencing of sRNA libraries in tomato roots and comparing with tomato genome, a total of 700 potential miRNAs were predicted, among them, 187 are known plant miRNAs that have been previously deposited in miRBase. Unlike the profiles in other plants such as rice and Arabidopsis, a large proportion of predicted tomato miRNAs was 24 nt in length. A similar pattern was observed in the potato genome but not in tobacco, indicating a Solanum genus-specific expansion of 24-nt miRNAs. About 40% identified tomato miRNAs showed significantly altered expressions upon Rhizophagus irregularis inoculation, suggesting the potential roles of these novel miRNAs in AM symbiosis. The differential expression of five known and six novel miRNAs were further validated using qPCR analysis. Interestingly, three up-regulated known tomato miRNAs belong to a known miR171 family, a member of which has been reported in Medicago truncatula to regulate AM symbiosis. Thus, the miR171 family likely regulates AM symbiosis conservatively across different plant lineages. More than 1000 genes targeted by potential AM-responsive miRNAs were provided and their roles in AM symbiosis are worth further exploring.

View details for DOI 10.3389/fpls.2016.00429

View details for Web of Science ID 000373264200004

View details for PubMedID 27066061

View details for PubMedCentralID PMC4814767

Yue Wu

Postdoctoral Scholar, Genetics

Bio

Honors & Awards

Stanford Advisors

Contact

Additional Info

Links

Current Research and Scholarly Interests

All Publications

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract