Billy Tsz Cheong Lau's Profile

Academic Appointments

Instructor, Medicine - Oncology

Professional Education

B.A.Sc., University of British Columbia, Engineering Physics, Electrical Engineering Option (2006)
Ph.D., Harvard University, Engineering Sciences (2012)

Contact

Academic
billylau@stanford.edu

University - Other Teaching/Research Department: Medicine - Med/Oncology Position: Instructor

Additional Info

Mail Code: 5151

All Publications

Cancer subclone detection based on DNA copy number in single-cell and spatial omic sequencing data. Nature methods Wu, C. Y., Rong, J., Sathe, A., Hess, P. R., Lau, B. T., Grimes, S. M., Huang, S., Ji, H. P., Zhang, N. R. 2025

Abstract

Somatic mutations such as copy number alterations accumulate during cancer progression, driving intratumor heterogeneity that impacts therapy effectiveness. Understanding the characteristics and spatial distribution of genetically distinct subclones is essential for unraveling tumor evolution and improving cancer treatment. Here we present Clonalscope, a subclone detection method using copy number profiles, applicable to spatial transcriptomics and single-cell sequencing data. Clonalscope implements a nested Chinese Restaurant Process to identify de novo tumor subclones, which can incorporate prior information from matched bulk DNA sequencing data for improved subclone detection and malignant cell labeling. On single-cell RNA sequencing and single-cell assay for transposase-accessible chromatin using sequencing data from gastrointestinal tumors, Clonalscope successfully labeled malignant cells and identified genetically different subclones with thorough validations. On spatial transcriptomics data from various primary and metastasized tumors, Clonalscope labeled malignant spots, traced subclones and identified spatially segregated subclones with distinct differentiation levels and expression of genes associated with drug resistance and survival.

View details for DOI 10.1038/s41592-025-02773-5

View details for PubMedID 40954304

View details for PubMedCentralID 3267864
Nanopore-based cell-free DNA fragmentation and methylation profiles from the cerebral spinal fluid of patients with lung cancer brain metastases. bioRxiv : the preprint server for biology Chen, T., Bai, X., Burnside, G., Trinh, T. T., Gephart, M. H., Lau, B. T., Ji, H. P. 2025

Abstract

Non-small cell lung cancer (NSCLC) patients with brain metastases (BMET) have a poor prognosis. Cerebrospinal fluid (CSF) is a source of cell free DNA (cfDNA) from the brain and its methylation and fragmentation properties may be an indicator of NSCLC-BMET.We applied a nanopore single-molecule sequencing approach to characterize the fragmentation, methylation and hydroxymethylation patterns present in CSF-derived cfDNA from NSCLC-BMET patients (N=15). We compared the cancer cfDNA finding to non-cancer healthy controls (N=11) and their CSF cfDNA. We also compared the fragmentation patterns between CSF-derived cfDNA and plasma-derived cfDNA.We observed enriched mono-nucleosome levels and significantly higher mono-/trinucleosome ratios in cancer patients. Comparison with plasma-derived cfDNA further confirmed the unique fragmentation features of CSF-derived cfDNA. Distinct methylation and hydroxymethylation patterns were observed between cancer and control CSF samples. We observed significantly lower degree of hydroxymethylation in cancer patients compared to healthy controls and the affected genes had different pathway profiles.CSF cfDNA in patients with NSCLC-BMET had a distinct profiles of DNA fragmentation, methylation and hydroxymethylation.

View details for DOI 10.1101/2025.07.28.667300

View details for PubMedID 40766439

View details for PubMedCentralID PMC12324296
Single cell and spatial alternative splicing analysis with Nanopore long read sequencing. Nature communications Fu, Y., Kim, H., Roy, S., Huang, S., Adams, J. I., Grimes, S. M., Lau, B. T., Sathe, A., Ji, H. P., Zhang, N. R. 2025; 16 (1): 6654

Abstract

Long-read sequencing boosts alternative splicing analysis but faces technical and computational barriers in single-cell and spatial settings. High Nanopore error rates compromise cell barcode and UMI recovery, while read truncation and misalignment undermine isoform quantification. Downstream, a statistical framework to assess splicing variation within and between cells or spatial spots is lacking. We introduce Longcell, a statistical and computational pipeline for isoform quantification from single-cell and spatially barcoded Nanopore long reads. Longcell efficiently recovers cell barcodes and UMIs, corrects sequencing errors, and models splicing diversity within and between cells or spots. Applied across multiple datasets, Longcell allows accurate identification of spatial isoform switching. Longcell also reveals widespread high intra-cell isoform heterogeneity for highly expressed genes. Finally, on a perturbation experiment for 9 splicing factors, Longcell identifies regulatory targets that are validated by targeted sequencing.

View details for DOI 10.1038/s41467-025-60902-2

View details for PubMedID 40683866

View details for PubMedCentralID PMC12276307
Direct measurement of engineered cancer mutations and their transcriptional phenotypes in single cells. Nature biotechnology Kim, H. S., Grimes, S. M., Chen, T., Sathe, A., Lau, B. T., Hwang, G. H., Bae, S., Ji, H. P. 2023

Abstract

Genome sequencing studies have identified numerous cancer mutations across a wide spectrum of tumor types, but determining the phenotypic consequence of these mutations remains a challenge. Here, we developed a high-throughput, multiplexed single-cell technology called TISCC-seq to engineer predesignated mutations in cells using CRISPR base editors, directly delineate their genotype among individual cells and determine each mutation's transcriptional phenotype. Long-read sequencing of the target gene's transcript identifies the engineered mutations, and the transcriptome profile from the same set of cells is simultaneously analyzed by short-read sequencing. Through integration, we determine the mutations' genotype and expression phenotype at single-cell resolution. Using cell lines, we engineer and evaluate the impact of >100 TP53 mutations on gene expression. Based on the single-cell gene expression, we classify the mutations as having a functionally significant phenotype.

View details for DOI 10.1038/s41587-023-01949-8

View details for PubMedID 37697151

View details for PubMedCentralID 8018281
Single-cell multi-gene identification of somatic mutations and gene rearrangements in cancer. NAR cancer Grimes, S. M., Kim, H. S., Roy, S., Sathe, A., Ayala, C. I., Bai, X., Almeda-Notestine, A. F., Haebe, S., Shree, T., Levy, R., Lau, B. T., Ji, H. P. 2023; 5 (3): zcad034

Abstract

In this proof-of-concept study, we developed a single-cell method that provides genotypes of somatic alterations found in coding regions of messenger RNAs and integrates these transcript-based variants with their matching cell transcriptomes. We used nanopore adaptive sampling on single-cell complementary DNA libraries to validate coding variants in target gene transcripts, and short-read sequencing to characterize cell types harboring the mutations. CRISPR edits for 16 targets were identified using a cancer cell line, and known variants in the cell line were validated using a 352-gene panel. Variants in primary cancer samples were validated using target gene panels ranging from 161 to 529 genes. A gene rearrangement was also identified in one patient, with the rearrangement occurring in two distinct tumor sites.

View details for DOI 10.1093/narcan/zcad034

View details for PubMedID 37435532

View details for PubMedCentralID PMC10331933
Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing. Scientific reports Lau, B., Chandak, S., Roy, S., Tatwawadi, K., Wootters, M., Weissman, T., Ji, H. P. 2023; 13 (1): 8514

Abstract

The storage of data in DNA typically involves encoding and synthesizing data into short oligonucleotides, followed by reading with a sequencing instrument. Major challenges include the molecular consumption of synthesized DNA, basecalling errors, and limitations with scaling up read operations for individual data elements. Addressing these challenges, we describe a DNA storage system called MDRAM (Magnetic DNA-based Random Access Memory) that enables repetitive and efficient readouts of targeted files with nanopore-based sequencing. By conjugating synthesized DNA to magnetic agarose beads, we enabled repeated data readouts while preserving the original DNA analyte and maintaining data readout quality. MDRAM utilizes an efficient convolutional coding scheme that leverages soft information in raw nanopore sequencing signals to achieve information reading costs comparable to Illumina sequencing despite higher error rates. Finally, we demonstrate a proof-of-concept DNA-based proto-filesystem that enables an exponentially-scalable data address space using only small numbers of targeting primers for assembly and readout.

View details for DOI 10.1038/s41598-023-29575-z

View details for PubMedID 37231057
Single-molecule methylation profiles of cell-free DNA in cancer with nanopore sequencing. Genome medicine Lau, B. T., Almeda, A., Schauer, M., McNamara, M., Bai, X., Meng, Q., Partha, M., Grimes, S. M., Lee, H., Heestand, G. M., Ji, H. P. 2023; 15 (1): 33

Abstract

Epigenetic characterization of cell-free DNA (cfDNA) is an emerging approach for detecting and characterizing diseases such as cancer. We developed a strategy using nanopore-based single-molecule sequencing to measure cfDNA methylomes. This approach generated up to 200 million reads for a single cfDNA sample from cancer patients, an order of magnitude improvement over existing nanopore sequencing methods. We developed a single-molecule classifier to determine whether individual reads originated from a tumor or immune cells. Leveraging methylomes of matched tumors and immune cells, we characterized cfDNA methylomes of cancer patients for longitudinal monitoring during treatment.

View details for DOI 10.1186/s13073-023-01178-3

View details for PubMedID 37138315

View details for PubMedCentralID 1283450
Single cell and spatial alternative splicing analysis with long read sequencing. Research square Fu, Y., Kim, H., Adams, J. I., Grimes, S. M., Huang, S., Lau, B. T., Sathe, A., Hess, P., Ji, H. P., Zhang, N. R. 2023

Abstract

Long-read sequencing has become a powerful tool for alternative splicing analysis. However, technical and computational challenges have limited our ability to explore alternative splicing at single cell and spatial resolution. The higher sequencing error of long reads, especially high indel rates, have limited the accuracy of cell barcode and unique molecular identifier (UMI) recovery. Read truncation and mapping errors, the latter exacerbated by the higher sequencing error rates, can cause the false detection of spurious new isoforms. Downstream, there is yet no rigorous statistical framework to quantify splicing variation within and between cells/spots. In light of these challenges, we developed Longcell, a statistical framework and computational pipeline for accurate isoform quantification for single cell and spatial spot barcoded long read sequencing data. Longcell performs computationally efficient cell/spot barcode extraction, UMI recovery, and UMI-based truncation- and mapping-error correction. Through a statistical model that accounts for varying read coverage across cells/spots, Longcell rigorously quantifies the level of inter-cell/spot versus intra-cell/ spot diversity in exon-usage and detects changes in splicing distributions between cell populations. Applying Longcell to single cell long-read data from multiple contexts, we found that intra-cell splicing heterogeneity, where multiple isoforms co-exist within the same cell, is ubiquitous for highly expressed genes. On matched single cell and Visium long read sequencing for a tissue of colorectal cancer metastasis to the liver, Longcell found concordant signals between the two data modalities. Finally, on a perturbation experiment for 9 splicing factors, Longcell identified regulatory targets that are validated by targeted sequencing.

View details for DOI 10.21203/rs.3.rs-2674892/v1

View details for PubMedID 36993612

View details for PubMedCentralID PMC10055662
Tumor-associated microbiome features of metastatic colorectal cancer and clinical implications. Frontiers in oncology An, H. J., Partha, M. A., Lee, H., Lau, B. T., Pavlichin, D. S., Almeda, A., Hooker, A. C., Shin, G., Ji, H. P. 2023; 13: 1310054

Abstract

Colon microbiome composition contributes to the pathogenesis of colorectal cancer (CRC) and prognosis. We analyzed 16S rRNA sequencing data from tumor samples of patients with metastatic CRC and determined the clinical implications.We enrolled 133 patients with metastatic CRC at St. Vincent Hospital in Korea. The V3-V4 regions of the 16S rRNA gene from the tumor DNA were amplified, sequenced on an Illumina MiSeq, and analyzed using the DADA2 package.After excluding samples that retained <5% of the total reads after merging, 120 samples were analyzed. The median age of patients was 63 years (range, 34-82 years), and 76 patients (63.3%) were male. The primary cancer sites were the right colon (27.5%), left colon (30.8%), and rectum (41.7%). All subjects received 5-fluouracil-based systemic chemotherapy. After removing genera with <1% of the total reads in each patient, 523 genera were identified. Rectal origin, high CEA level (≥10 ng/mL), and presence of lung metastasis showed higher richness. Survival analysis revealed that the presence of Prevotella (p = 0.052), Fusobacterium (p = 0.002), Selenomonas (p<0.001), Fretibacterium (p = 0.001), Porphyromonas (p = 0.007), Peptostreptococcus (p = 0.002), and Leptotrichia (p = 0.003) were associated with short overall survival (OS, <24 months), while the presence of Sphingomonas was associated with long OS (p = 0.070). From the multivariate analysis, the presence of Selenomonas (hazard ratio [HR], 6.35; 95% confidence interval [CI], 2.38-16.97; p<0.001) was associated with poor prognosis along with high CEA level.Tumor microbiome features may be useful prognostic biomarkers for metastatic CRC.

View details for DOI 10.3389/fonc.2023.1310054

View details for PubMedID 38304032

View details for PubMedCentralID PMC10833227
Large Cancer Pedigree Involving Multiple Cancer Genes including Likely Digenic MSH2 and MSH6 Lynch Syndrome (LS) and an Instance of Recombinational Rescue from LS. Cancers Vogelaar, I. P., Greer, S., Wang, F., Shin, G., Lau, B., Hu, Y., Haraldsdottir, S., Alvarez, R., Hazelett, D., Nguyen, P., Aguirre, F. P., Guindi, M., Hendifar, A., Balcom, J., Leininger, A., Fairbank, B., Ji, H., Hitchins, M. P. 2022; 15 (1)

Abstract

Lynch syndrome (LS), caused by heterozygous pathogenic variants affecting one of the mismatch repair (MMR) genes (MSH2, MLH1, MSH6, PMS2), confers moderate to high risks for colorectal, endometrial, and other cancers. We describe a four-generation, 13-branched pedigree in which multiple LS branches carry the MSH2 pathogenic variant c.2006G>T (p.Gly669Val), one branch has this and an additional novel MSH6 variant c.3936_4001+8dup (intronic), and other non-LS branches carry variants within other cancer-relevant genes (NBN, MC1R, PTPRJ). Both MSH2 c.2006G>T and MSH6 c.3936_4001+8dup caused aberrant RNA splicing in carriers, including out-of-frame exon-skipping, providing functional evidence of their pathogenicity. MSH2 and MSH6 are co-located on Chr2p21, but the two variants segregated independently (mapped in trans) within the digenic branch, with carriers of either or both variants. Thus, MSH2 c.2006G>T and MSH6 c.3936_4001+8dup independently confer LS with differing cancer risks among family members in the same branch. Carriers of both variants have near 100% risk of transmitting either one to offspring. Nevertheless, a female carrier of both variants did not transmit either to one son, due to a germline recombination within the intervening region. Genetic diagnosis, risk stratification, and counseling for cancer and inheritance were highly individualized in this family. The finding of multiple cancer-associated variants in this pedigree illustrates a need to consider offering multicancer gene panel testing, as opposed to targeted cascade testing, as additional cancer variants may be uncovered in relatives.

View details for DOI 10.3390/cancers15010228

View details for PubMedID 36612224
Colorectal cancer metastases in the liver establish immunosuppressive spatial networking between tumor associated SPP1+ macrophages and fibroblasts. Clinical cancer research : an official journal of the American Association for Cancer Research Sathe, A., Mason, K., Grimes, S. M., Zhou, Z., Lau, B. T., Bai, X., Su, A., Tan, X., Lee, H., Suarez, C. J., Nguyen, Q., Poultsides, G., Zhang, N. R., Ji, H. P. 2022

Abstract

The liver is the most frequent metastatic site for colorectal cancer (CRC). Its microenvironment is modified to provide a niche that is conducive for CRC cell growth.This study focused on characterizing the cellular changes in the metastatic CRC (mCRC) liver tumor microenvironment (TME).We analyzed a series of microsatellite stable (MSS) mCRCs to the liver, paired normal liver tissue and peripheral blood mononuclear cells using single cell RNA-seq (scRNA-seq). We validated our findings using multiplexed spatial imaging and bulk gene expression with cell deconvolution.We identified TME-specific SPP1-expressing macrophages with altered metabolism features, foam cell characteristics and increased activity in extracellular matrix (ECM) organization. SPP1+ macrophages and fibroblasts expressed complementary ligand receptor pairs with the potential to mutually influence their gene expression programs. TME lacked dysfunctional CD8 T cells and contained regulatory T cells, indicative of immunosuppression. Spatial imaging validated these cell states in the TME. Moreover, TME macrophages and fibroblasts had close spatial proximity, which is a requirement for intercellular communication and networking.In an independent cohort of mCRCs in the liver, we confirmed the presence of SPP1+ macrophages and fibroblasts using gene expression data. An increased proportion of TME fibroblasts was associated with a worst prognosis in these patients.We demonstrated that mCRC in the liver is characterized by transcriptional alterations of macrophages in the TME. Intercellular networking between macrophages and fibroblasts supports CRC growth in the immunosuppressed metastatic niche in the liver. These features can be used to target immune checkpoint resistant MSS tumors.

View details for DOI 10.1158/1078-0432.CCR-22-2041

View details for PubMedID 36239989
Germline variants of ATG7 in familial cholangiocarcinoma alter autophagy and p62. Scientific reports Greer, S. U., Chen, J., Ogmundsdottir, M. H., Ayala, C., Lau, B. T., Delacruz, R. G., Sandoval, I. T., Kristjansdottir, S., Jones, D. A., Haslem, D. S., Romero, R., Fulde, G., Bell, J. M., Jonasson, J. G., Steingrimsson, E., Ji, H. P., Nadauld, L. D. 2022; 12 (1): 10333

Abstract

Autophagy is a housekeeping mechanism tasked with eliminating misfolded proteins and damaged organelles to maintain cellular homeostasis. Autophagy deficiency results in increased oxidative stress, DNA damage and chronic cellular injury. Among the core genes in the autophagy machinery, ATG7 is required for autophagy initiation and autophagosome formation. Based on the analysis of an extended pedigree of familial cholangiocarcinoma, we determined that all affected family members had a novel germline mutation (c.2000C>T p.Arg659* (p.R659*)) in ATG7. Somatic deletions of ATG7 were identified in the tumors of affected individuals. We applied linked-read sequencing to one tumor sample and demonstrated that the ATG7 somatic deletion and germline mutation were located on distinct alleles, resulting in two hits to ATG7. From a parallel population genetic study, we identified a germline polymorphism of ATG7 (c.1591C>G p.Asp522Glu (p.D522E)) associated with increased risk of cholangiocarcinoma. To characterize the impact of these germline ATG7 variants on autophagy activity, we developed an ATG7-null cell line derived from the human bile duct. The mutant p.R659* ATG7 protein lacked the ability to lipidate its LC3 substrate, leading to complete loss of autophagy and increased p62 levels. Our findings indicate that germline ATG7 variants have the potential to impact autophagy function with implications for cholangiocarcinoma development.

View details for DOI 10.1038/s41598-022-13569-4

View details for PubMedID 35725745
Reconstructing the spatial evolution of cancer through subclone detection on copy number profiles in tumor sequencing data. Wu, C., Hess, P. R., Sathe, A., Rong, J., Lau, B. T., Grimes, S. M., Ji, H. P., Zhang, N. R. AMER ASSOC CANCER RESEARCH. 2022

View details for Web of Science ID 000892509502259
A single-cell solution for solid tumors to detect mutations and quantify copy number variations. Wu, C., Hess, P. R., Sathe, A., Rong, J., Lau, B. T., Grimes, S. M., Ji, H. P., Zhang, N. R. AMER ASSOC CANCER RESEARCH. 2022

View details for Web of Science ID 000892509502260
Reconstructing the spatial evolution of cancer through subclone detection on copy number profiles in tumor sequencing data Wu, C., Hess, P. R., Sathe, A., Rong, J., Lau, B. T., Grimes, S. M., Ji, H. P., Zhang, N. R. AMER ASSOC CANCER RESEARCH. 2022

View details for Web of Science ID 000892509500207
Analysis of 16S rRNA sequencing in advanced colorectal cancer tissue samples An, H., Partha, M. A., Lee, H., Lau, B., Shin, G., Almeda, A., Ji, H. P. LIPPINCOTT WILLIAMS & WILKINS. 2022

View details for DOI 10.1200/JCO.2022.40.4_suppl.163

View details for Web of Science ID 000770995900159
Single-cell characterization of CRISPR-modified transcript isoforms with nanopore sequencing. Genome biology Kim, H. S., Grimes, S. M., Hooker, A. C., Lau, B. T., Ji, H. P. 2021; 22 (1): 331

Abstract

We developed a single-cell approach to detect CRISPR-modified mRNA transcript structures. This method assesses how genetic variants at splicing sites and splicing factors contribute to alternative mRNA isoforms. We determine how alternative splicing is regulated by editing target exon-intron segments or splicing factors by CRISPR-Cas9 and their consequences on transcriptome profile. Our method combines long-read sequencing to characterize the transcript structure and short-read sequencing to match the single-cell gene expression profiles and gRNA sequence and therefore provides targeted genomic edits and transcript isoform structure detection at single-cell resolution.

View details for DOI 10.1186/s13059-021-02554-1

View details for PubMedID 34872615
Integrative single-cell analysis of allele-specific copy number alterations and chromatin accessibility in cancer. Nature biotechnology Wu, C., Lau, B. T., Kim, H. S., Sathe, A., Grimes, S. M., Ji, H. P., Zhang, N. R. 2021

Abstract

Cancer progression is driven by both somatic copy number aberrations (CNAs) and chromatin remodeling, yet little is known about the interplay between these two classes of events in shaping the clonal diversity of cancers. We present Alleloscope, a method for allele-specific copy number estimation that can be applied to single-cell DNA- and/or transposase-accessible chromatin-sequencing (scDNA-seq, ATAC-seq) data, enabling combined analysis of allele-specific copy number and chromatin accessibility. On scDNA-seq data from gastric, colorectal and breast cancer samples, with validation using matched linked-read sequencing, Alleloscope finds pervasive occurrence of highly complex, multiallelic CNAs, in which cells that carry varying allelic configurations adding to the same total copy number coevolve within a tumor. On scATAC-seq from two basal cell carcinoma samples and a gastric cancer cell line, Alleloscope detected multiallelic copy number events and copy-neutral loss-of-heterozygosity, enabling dissection of the contributions of chromosomal instability and chromatin remodeling to tumor evolution.

View details for DOI 10.1038/s41587-021-00911-w

View details for PubMedID 34017141
Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies. Genome medicine Lau, B. T., Pavlichin, D., Hooker, A. C., Almeda, A., Shin, G., Chen, J., Sahoo, M. K., Huang, C. H., Pinsky, B. A., Lee, H. J., Ji, H. P. 2021; 13 (1): 62

Abstract

BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing.METHODS: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints.RESULTS: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and a combination of mutations that appear in only a small number of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of 100 SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome.CONCLUSIONS: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients and determined their general prevalence when compared to over 70,000 other strains. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing.

View details for DOI 10.1186/s13073-021-00882-2

View details for PubMedID 33875001
Joint single cell DNA-seq and RNA-seq of gastric cancer cell lines reveals rules of in vitro evolution. NAR genomics and bioinformatics Andor, N. n., Lau, B. T., Catalanotti, C. n., Sathe, A. n., Kubit, M. n., Chen, J. n., Blaj, C. n., Cherry, A. n., Bangs, C. D., Grimes, S. M., Suarez, C. J., Ji, H. P. 2020; 2 (2): lqaa016

Abstract

Cancer cell lines are not homogeneous nor are they static in their genetic state and biological properties. Genetic, transcriptional and phenotypic diversity within cell lines contributes to the lack of experimental reproducibility frequently observed in tissue-culture-based studies. While cancer cell line heterogeneity has been generally recognized, there are no studies which quantify the number of clones that coexist within cell lines and their distinguishing characteristics. We used a single-cell DNA sequencing approach to characterize the cellular diversity within nine gastric cancer cell lines and integrated this information with single-cell RNA sequencing. Overall, we sequenced the genomes of 8824 cells, identifying between 2 and 12 clones per cell line. Using the transcriptomes of more than 28 000 single cells from the same cell lines, we independently corroborated 88% of the clonal structure determined from single cell DNA analysis. For one of these cell lines, we identified cell surface markers that distinguished two subpopulations and used flow cytometry to sort these two clones. We identified substantial proportions of replicating cells in each cell line, assigned these cells to subclones detected among the G0/G1 population and used the proportion of replicating cells per subclone as a surrogate of each subclone's growth rate.

View details for DOI 10.1093/nargab/lqaa016

View details for PubMedID 32215369

View details for PubMedCentralID PMC7079336
Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies. medRxiv : the preprint server for health sciences Lau, B. T., Pavlichin, D. n., Hooker, A. C., Almeda, A. n., Shin, G. n., Chen, J. n., Sahoo, M. K., Huang, C. n., Pinsky, B. A., Lee, H. n., Ji, H. P. 2020

Abstract

The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contract tracing.Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints.We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and rare ones that occur in only small fraction of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome.We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients, mutations demarcating dominant species and the prevalence of mutation signatures, of which a significant number were relatively unique. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing.

View details for DOI 10.1101/2020.11.02.20224816

View details for PubMedID 33173909

View details for PubMedCentralID PMC7654905
Single cell genomic characterization reveals the cellular reprogramming of the gastric tumor microenvironment. Clinical cancer research : an official journal of the American Association for Cancer Research Sathe, A. n., Grimes, S. M., Lau, B. T., Chen, J. n., Suarez, C. n., Huang, R. J., Poultsides, G. A., Ji, H. P. 2020

Abstract

The tumor microenvironment (TME) consists of a heterogenous cellular milieu that can influence cancer cell behavior. Its characteristics havean impact on treatments such as immunotherapy. These features can be revealed with single-cell RNA sequencing (scRNA-seq). We hypothesized that scRNA-seq analysis ofgastric cancer (GC) together with paired normal tissue and peripheral blood mononuclear cells (PBMCs) would identify critical elements of cellular deregulation not apparent with other approaches.scRNA-seq was conducted on seven patients with GC and one patient with intestinal metaplasia. We sequenced 56,167 cells comprising GC (32,407 cells), paired normal tissue (18,657 cells) and PBMCs (5,103 cells). Protein expression was validated by multiplex immunofluorescence.Tumor epithelium had copy number alterations, a distinct gene expression program from normal, with intra-tumor heterogeneity. GC TME was significantly enriched for stromal cells, macrophages, dendritic cells (DCs) and Tregs. TME-exclusive stromal cells expressed distinct extracellular matrix components than normal. Macrophages were transcriptionally heterogenous and did not conform to a binary M1/M2 paradigm. Tumor-DCs had a unique gene expression program compared to PBMC DCs. TME-specific cytotoxic T cells were exhausted with two heterogenous subsets. Helper, cytotoxic T, Treg and NK cells expressed multiple immune checkpoint or costimulatory molecules. Receptor-ligand analysis revealed TME-exclusive inter-cellular communication.Single-cell gene expression studies revealed widespread reprogramming across multiple cellular elements in the GC TME. Cellular remodeling was delineated by changes in cell numbers, transcriptional states and inter-cellular interactions. This characterization facilitates understanding of tumor biology and enables identification of novel targets including for immunotherapy.

View details for DOI 10.1158/1078-0432.CCR-19-3231

View details for PubMedID 32060101
OVERCOMING HIGH NANOPORE BASECALLER ERROR RATES FOR DNA STORAGE VIA BASECALLER-DECODER INTEGRATION AND CONVOLUTIONAL CODES Chandak, S., Neu, J., Tatwawadi, K., Mardia, J., Lau, B., Kubit, M., Hulett, R., Griffin, P., Wootters, M., Weissman, T., Ji, H., IEEE IEEE. 2020: 8822–26

View details for Web of Science ID 000615970409020
A high throughput method for the optimization of digital PCR assays for personalized circulating tumor DNA detection Arce, M. M., Wood-Bouwens, C., Haslem, D., Lau, B. T., Bell, J., Almeda, A., Kubit, M., Moulton, B., Romero, R., St Onge, R. P., Nadauld, L., Ji, H. P. AMER ASSOC CANCER RESEARCH. 2019

View details for DOI 10.1158/1538-7445.AM2019-2278

View details for Web of Science ID 000488279400267
Comprehensive characterization of gastric cancer at single-cell resolution Chen, J., Sathe, A., Grimes, S., Greer, S., Lau, B., Renschler, A., Poultsides, G., Suarez, C., Ji, H. AMER ASSOC CANCER RESEARCH. 2019

View details for DOI 10.1158/1538-7445.SABCS18-151

View details for Web of Science ID 000488129901333
Single cell RNA sequencing reveals multiple adaptive resistance mechanisms to regorafenib in colon cancer Sathe, A., Lau, B. T., Grimes, S., Greer, S., Ji, H. AMER ASSOC CANCER RESEARCH. 2019

View details for DOI 10.1158/1538-7445.SABCS18-2105

View details for Web of Science ID 000488279400102
A functional CRISPR/Cas9 screen identifies kinases that modulate FGFR inhibitor response in gastric cancer ONCOGENESIS Chen, J., Bell, J., Lau, B. T., Whittaker, T., Stapleton, D., Ji, H. P. 2019; 8

View details for DOI 10.1038/s41389-019-0145-z

View details for Web of Science ID 000467678200003
Single-cell transcriptome analysis identifies distinct cell types and niche signaling in a primary gastric organoid model. Scientific reports Chen, J., Lau, B. T., Andor, N., Grimes, S. M., Handy, C., Wood-Bouwens, C., Ji, H. P. 2019; 9 (1): 4536

Abstract

The diverse cellular milieu of the gastric tissue microenvironment plays a critical role in normal tissue homeostasis and tumor development. However, few cell culture model can recapitulate the tissue microenvironment and intercellular signaling in vitro. We used a primary tissue culture system to generate a murine p53 null gastric tissue model containing both epithelium and mesenchymal stroma. To characterize the microenvironment and niche signaling, we used single cell RNA sequencing (scRNA-Seq) to determine the transcriptomes of 4,391 individual cells. Based on specific markers, we identified epithelial cells, fibroblasts and macrophages in initial tissue explants during organoid formation. The majority of macrophages were polarized towards wound healing and tumor promotion M2-type. During the course of time, the organoids maintained both epithelial and fibroblast lineages with the features of immature mouse gastric stomach. We detected a subset of cells in both lineages expressing Lgr5, one of the stem cell markers. We examined the lineage-specific Wnt signaling activation, and identified that Rspo3 was specifically expressed in the fibroblast lineage, providing an endogenous source of the R-spondin to activate Wnt signaling. Our studies demonstrate that this primary tissue culture system enables one to study gastric tissue niche signaling and immune response in vitro.

View details for PubMedID 30872643
Single-cell transcriptome analysis identifies distinct cell types and niche signaling in a primary gastric organoid model SCIENTIFIC REPORTS Chen, J., Lau, B. T., Andor, N., Grimes, S. M., Handy, C., Wood-Bouwens, C., Ji, H. P. 2019; 9

View details for DOI 10.1038/s41598-019-40809-x

View details for Web of Science ID 000461159600013
A functional CRISPR/Cas9 screen identifies kinases that modulate FGFR inhibitor response in gastric cancer. Oncogenesis Chen, J. n., Bell, J. n., Lau, B. T., Whittaker, T. n., Stapleton, D. n., Ji, H. P. 2019; 8 (5): 33

Abstract

Some gastric cancers have FGFR2 amplifications, making them sensitive to FGFR inhibitors. However, cancer cells inevitably develop resistance despite initial response. The underlying resistance mechanism to FGFR inhibition is unclear. In this study, we applied a kinome-wide CRISPR/Cas9 screen to systematically identify kinases that are determinants of sensitivity to a potent FGFR inhibitor AZD4547 in KatoIII cells, a gastric cancer cell line with FGFR2 amplification. In total, we identified 20 kinases, involved in ILK, SRC, and EGFR signaling pathways, as determinants that alter cell sensitivity to FGFR inhibition. We functionally validated the top negatively selected and positively selected kinases, ILK and CSK, from the CRISPR/Cas9 screen using RNA interference. We observed synergistic effects on KatoIII cells as well as three additional gastric cancer cell lines with FGFR2 amplification when AZD4547 was combined with small molecular inhibitors Cpd22 and lapatinib targeting ILK and EGFR/HER2, respectively. Furthermore, we demonstrated that GSK3b is one of the downstream effectors of ILK upon FGFR inhibition. In summary, our study systematically evaluated the kinases and associated signaling pathways modulating cell response to FGFR inhibition, and for the first time, demonstrated that targeting ILK would enhance the effectiveness of AZD4547 treatment of gastric tumors with amplifications of FGFR2.

View details for PubMedID 31076567
Improved read/write cost tradeoff in DNA-based data storage using LDPC codes Chandak, S., Tatwawadi, K., Lau, B., Mardia, J., Kubit, M., Neu, J., Griffin, P., Wootters, M., Weissman, T., Ji, H., IEEE IEEE. 2019: 147–56

View details for Web of Science ID 000535355700022
Covalent 'click chemistry'-based attachment of DNA onto solid phase enables iterative molecular analysis. Analytical chemistry Lau, B. T., Ji, H. P. 2019

Abstract

Molecular analysis of DNA samples with limited quantities can be challenging. Repeatedly sequencing the original DNA molecules from a given sample would overcome many issues related to accurate genetic analysis and mitigate issues with processing small amounts of DNA analyte. Moreover, an iterative, replicated analysis of the same DNA molecule has the potential to improve genetic characterization. Herein, we demonstrate that the use of 'click'-based attachment of DNA sequencing libraries onto an agarose bead support enables repetitive primer extension assays for specific genomic DNA targets such as gene exons. We validated the performance of this assay for evaluating specific genetic alterations in both normal and cancer reference standard DNA samples. We demonstrate the stability of conjugated DNA libraries and related sequencing results over the course of independent serial assays spanning several months from the same set of samples. Finally, we finally applied this method to DNA derived from a tumor sample and demonstrated improved mutation detection accuracy.

View details for PubMedID 30652472
Integrated single-cell DNA and RNA analysis of intratumoral heterogeneity and immune lineages in colorectal and gastric tumor biopsies Lau, B., Andor, N., Sathe, A., Wood-Bouwens, C., Poultsides, G., Ji, H. AMER ASSOC CANCER RESEARCH. 2018

View details for DOI 10.1158/1538-7445.AM2018-4347

View details for Web of Science ID 000468819503011
Characterization of colorectal liver metastasis at single-cell resolution reveals dynamic interplay in the tumor microenvironment Sathe, A., Chen, J., Wood-Bouwens, C., Almeda, A., Lau, B., Grimes, S. M., Poultsides, G. A., Ji, H. AMER ASSOC CANCER RESEARCH. 2018

View details for DOI 10.1158/1538-7445.AM2018-2126

View details for Web of Science ID 000468818904508
Chromosome-scale haplotyping enables comprehensive discovery of cancer rearrangements and germline-related susceptibility mutations Greer, S. U., Lau, B. T., Nadauld, L. D., Ji, H. P. AMER ASSOC CANCER RESEARCH. 2018

View details for DOI 10.1158/1538-7445.AM2018-1280

View details for Web of Science ID 000468818903252
High-quality CNV segments from low-coverage whole genome sequencing from FFPE cancer biopsies based on an evaluation of multiple CNV tools Lee, H., Xia, L., Greer, S., Bell, J., Grimes, S. M., Bouwens, C., Shin, G., Lau, B. C., Johnson, L., Andor, N., Day, K., Miller, M., Escobar, H., Nadauld, L., Ji, H. P., Van Hummelen, P. AMER ASSOC CANCER RESEARCH. 2018

View details for DOI 10.1158/1538-7445.AM2018-438

View details for Web of Science ID 000468818901502
Robust Multiplexed Clustering and Denoising of Digital PCR Assays by Data Gridding ANALYTICAL CHEMISTRY Lau, B. T., Wood-Bouwens, C., Ji, H. P. 2017; 89 (22): 11913–17

Abstract

Digital PCR (dPCR) relies on the analysis of individual partitions to accurately quantify nucleic acid species. The most widely used analysis method requires manual clustering through individual visual inspection. Some automated analysis methods have emerged but do not robustly account for multiplexed targets, low target concentration, and assay noise. In this study, we describe an open source analysis software called Calico that uses "data gridding" to increase the sensitivity of clustering toward small clusters. Our workflow also generates quality score metrics in order to gauge and filter individual assay partitions by how well they were classified. We applied our analysis algorithm to multiplexed droplet-based digital PCR data sets in both EvaGreen and probes-based schemes, and targeted the oncogenic BRAF V600E and KRAS G12D mutations. We demonstrate an automated clustering sensitivity of down to 0.1% mutant fraction and filtering of artifactual assay partitions from low quality DNA samples. Overall, we demonstrate a vastly improved approach to analyzing ddPCR data that can be applied to clinical use, where automation and reproducibility are critical.

View details for PubMedID 29083143
Chromosome-scale mega-haplotypes enable digital karyotyping of cancer aneuploidy NUCLEIC ACIDS RESEARCH Bell, J. M., Lau, B. T., Greer, S. U., Wood-Bouwens, C., Xia, L. C., Connolly, I. D., Gephart, M. H., Ji, H. P. 2017; 45 (19): e162

Abstract

Genomic instability is a frequently occurring feature of cancer that involves large-scale structural alterations. These somatic changes in chromosome structure include duplication of entire chromosome arms and aneuploidy where chromosomes are duplicated beyond normal diploid content. However, the accurate determination of aneuploidy events in cancer genomes is a challenge. Recent advances in sequencing technology allow the characterization of haplotypes that extend megabases along the human genome using high molecular weight (HMW) DNA. For this study, we employed a library preparation method in which sequence reads have barcodes linked to single HMW DNA molecules. Barcode-linked reads are used to generate extended haplotypes on the order of megabases. We developed a method that leverages haplotypes to identify chromosomal segmental alterations in cancer and uses this information to join haplotypes together, thus extending the range of phased variants. With this approach, we identified mega-haplotypes that encompass entire chromosome arms. We characterized the chromosomal arm changes and aneuploidy events in a manner that offers similar information as a traditional karyotype but with the benefit of DNA sequence resolution. We applied this approach to characterize aneuploidy and chromosomal alterations from a series of primary colorectal cancers.

View details for PubMedID 28977555

View details for PubMedCentralID PMC5737808
Single molecule counting and assessment of random molecular tagging errors with transposable giga-scale error-correcting barcodes BMC GENOMICS Lau, B. T., Ji, H. P. 2017; 18: 745

Abstract

RNA-Seq measures gene expression by counting sequence reads belonging to unique cDNA fragments. Molecular barcodes commonly in the form of random nucleotides were recently introduced to improve gene expression measures by detecting amplification duplicates, but are susceptible to errors generated during PCR and sequencing. This results in false positive counts, leading to inaccurate transcriptome quantification especially at low input and single-cell RNA amounts where the total number of molecules present is minuscule. To address this issue, we demonstrated the systematic identification of molecular species using transposable error-correcting barcodes that are exponentially expanded to tens of billions of unique labels.We experimentally showed random-mer molecular barcodes suffer from substantial and persistent errors that are difficult to resolve. To assess our method's performance, we applied it to the analysis of known reference RNA standards. By including an inline random-mer molecular barcode, we systematically characterized the presence of sequence errors in random-mer molecular barcodes. We observed that such errors are extensive and become more dominant at low input amounts.We described the first study to use transposable molecular barcodes and its use for studying random-mer molecular barcode errors. Extensive errors found in random-mer molecular barcodes may warrant the use of error correcting barcodes for transcriptome analysis as input amounts decrease.

View details for PubMedID 28934929
Single-Color Digital PCR Provides High-Performance Detection of Cancer Mutations from Circulating DNA. The Journal of molecular diagnostics : JMD Wood-Bouwens, C., Lau, B. T., Handy, C. M., Lee, H., Ji, H. P. 2017; 19 (5): 697-710

Abstract

We describe a single-color digital PCR assay that detects and quantifies cancer mutations directly from circulating DNA collected from the plasma of cancer patients. This approach relies on a double-stranded DNA intercalator dye and paired allele-specific DNA primer sets to determine an absolute count of both the mutation and wild-type-bearing DNA molecules present in the sample. The cell-free DNA assay uses an input of 1 ng of nonamplified DNA, approximately 300 genome equivalents, and has a molecular limit of detection of three mutation DNA genome-equivalent molecules per assay reaction. When using more genome equivalents as input, we demonstrated a sensitivity of 0.10% for detecting the BRAF V600E and KRAS G12D mutations. We developed several mutation assays specific to the cancer driver mutations of patients' tumors and detected these same mutations directly from the nonamplified, circulating cell-free DNA. This rapid and high-performance digital PCR assay can be configured to detect specific cancer mutations unique to an individual cancer, making it a potentially valuable method for patient-specific longitudinal monitoring.

View details for DOI 10.1016/j.jmoldx.2017.05.003

View details for PubMedID 28818432
Single-Color Digital PCR Provides High-Performance Detection of Cancer Mutations from Circulating DNA JOURNAL OF MOLECULAR DIAGNOSTICS Wood-Bouwens, C., Lau, B. T., Handy, C. M., Lee, H., Ji, H. P. 2017; 19 (5): 697–710

View details for DOI 10.1016/j.jmoldx.2017.05.003

View details for Web of Science ID 000410464600007
CRISPR-Cas9-targeted fragmentation and selective sequencing enable massively parallel microsatellite analysis NATURE COMMUNICATIONS Shin, G., Grimes, S. M., Lee, H., Lau, B. T., Xia, L. C., Ji, H. P. 2017; 8

Abstract

Microsatellites are multi-allelic and composed of short tandem repeats (STRs) with individual motifs composed of mononucleotides, dinucleotides or higher including hexamers. Next-generation sequencing approaches and other STR assays rely on a limited number of PCR amplicons, typically in the tens. Here, we demonstrate STR-Seq, a next-generation sequencing technology that analyses over 2,000 STRs in parallel, and provides the accurate genotyping of microsatellites. STR-Seq employs in vitro CRISPR-Cas9-targeted fragmentation to produce specific DNA molecules covering the complete microsatellite sequence. Amplification-free library preparation provides single molecule sequences without unique molecular barcodes. STR-selective primers enable massively parallel, targeted sequencing of large STR sets. Overall, STR-Seq has higher throughput, improved accuracy and provides a greater number of informative haplotypes compared with other microsatellite analysis approaches. With these new features, STR-Seq can identify a 0.1% minor genome fraction in a DNA mixture composed of different, unrelated samples.

View details for DOI 10.1038/ncomms14291

View details for PubMedID 28169275
CRISPR-Cas9-targeted fragmentation and selective sequencing enable massively parallel microsatellite analysis NATURE COMMUNICATIONS Shin, G., Grimes, S. M., Lee, H., Lau, B. T., Xia, L. C., Ji, H. P. 2017; 8

Abstract

Microsatellites are multi-allelic and composed of short tandem repeats (STRs) with individual motifs composed of mononucleotides, dinucleotides or higher including hexamers. Next-generation sequencing approaches and other STR assays rely on a limited number of PCR amplicons, typically in the tens. Here, we demonstrate STR-Seq, a next-generation sequencing technology that analyses over 2,000 STRs in parallel, and provides the accurate genotyping of microsatellites. STR-Seq employs in vitro CRISPR-Cas9-targeted fragmentation to produce specific DNA molecules covering the complete microsatellite sequence. Amplification-free library preparation provides single molecule sequences without unique molecular barcodes. STR-selective primers enable massively parallel, targeted sequencing of large STR sets. Overall, STR-Seq has higher throughput, improved accuracy and provides a greater number of informative haplotypes compared with other microsatellite analysis approaches. With these new features, STR-Seq can identify a 0.1% minor genome fraction in a DNA mixture composed of different, unrelated samples.

View details for DOI 10.1038/ncomms14291

View details for Web of Science ID 000393379700001

View details for PubMedID 28169275

View details for PubMedCentralID PMC5309709
Linked read sequencing resolves complex genomic rearrangements in gastric cancer metastases. Genome medicine Greer, S. U., Nadauld, L. D., Lau, B. T., Chen, J. n., Wood-Bouwens, C. n., Ford, J. M., Kuo, C. J., Ji, H. P. 2017; 9 (1): 57

Abstract

Genome rearrangements are critical oncogenic driver events in many malignancies. However, the identification and resolution of the structure of cancer genomic rearrangements remain challenging even with whole genome sequencing.To identify oncogenic genomic rearrangements and resolve their structure, we analyzed linked read sequencing. This approach relies on a microfluidic droplet technology to produce libraries derived from single, high molecular weight DNA molecules, 50 kb in size or greater. After sequencing, the barcoded sequence reads provide long range genomic information, identify individual high molecular weight DNA molecules, determine the haplotype context of genetic variants that occur across contiguous megabase-length segments of the genome and delineate the structure of complex rearrangements. We applied linked read sequencing of whole genomes to the analysis of a set of synchronous metastatic diffuse gastric cancers that occurred in the same individual.When comparing metastatic sites, our analysis implicated a complex somatic rearrangement that was present in the metastatic tumor. The oncogenic event associated with the identified complex rearrangement resulted in an amplification of the known cancer driver gene FGFR2. With further investigation using these linked read data, the FGFR2 copy number alteration was determined to be a deletion-inversion motif that underwent tandem duplication, with unique breakpoints in each metastasis. Using a three-dimensional organoid tissue model, we functionally validated the metastatic potential of an FGFR2 amplification in gastric cancer.Our study demonstrates that linked read sequencing is useful in characterizing oncogenic rearrangements in cancer metastasis.

View details for PubMedID 28629429
Haplotyping germline and cancer genomes with high-throughput linked-read sequencing. Nature biotechnology Zheng, G. X., Lau, B. T., Schnall-Levin, M., Jarosz, M., Bell, J. M., Hindson, C. M., Kyriazopoulou-Panagiotopoulou, S., Masquelier, D. A., Merrill, L., Terry, J. M., Mudivarti, P. A., Wyatt, P. W., Bharadwaj, R., Makarewicz, A. J., Li, Y., Belgrader, P., Price, A. D., Lowe, A. J., Marks, P., Vurens, G. M., Hardenbol, P., Montesclaros, L., Luo, M., Greenfield, L., Wong, A., Birch, D. E., Short, S. W., Bjornson, K. P., Patel, P., Hopmans, E. S., Wood, C., Kaur, S., Lockwood, G. K., Stafford, D., Delaney, J. P., Wu, I., Ordonez, H. S., Grimes, S. M., Greer, S., Lee, J. Y., Belhocine, K., Giorda, K. M., Heaton, W. H., McDermott, G. P., Bent, Z. W., Meschi, F., Kondov, N. O., Wilson, R., Bernate, J. A., Gauby, S., Kindwall, A., Bermejo, C., Fehr, A. N., Chan, A., Saxonov, S., Ness, K. D., Hindson, B. J., Ji, H. P. 2016; 34 (3): 303-311

Abstract

Haplotyping of human chromosomes is a prerequisite for cataloguing the full repertoire of genetic variation. We present a microfluidics-based, linked-read sequencing technology that can phase and haplotype germline and cancer genomes using nanograms of input DNA. This high-throughput platform prepares barcoded libraries for short-read sequencing and computationally reconstructs long-range haplotype and structural variant information. We generate haplotype blocks in a nuclear trio that are concordant with expected inheritance patterns and phase a set of structural variants. We also resolve the structure of the EML4-ALK gene fusion in the NCI-H2228 cancer cell line using phased exome sequencing. Finally, we assign genetic aberrations to specific megabase-scale haplotypes generated from whole-genome sequencing of a primary colorectal adenocarcinoma. This approach resolves haplotype information using up to 100 times less genomic DNA than some methods and enables the accurate detection of structural variants.

View details for DOI 10.1038/nbt.3432

View details for PubMedID 26829319

View details for PubMedCentralID PMC4786454
Clonal structure analysis of cancer genomes at single molecule resolution Lau, B., Ji, H. AMER ASSOC CANCER RESEARCH. 2015

View details for DOI 10.1158/1538-7445.AM2015-4889

View details for Web of Science ID 000371597105074
Identification of novel tumor suppressor candidates and characterizing their potential driver role in familial cholangiocarcinoma Greer, S., Nadauld, L. D., Lau, B., Miotke, L., Hopmans, E., Wood, C. M., Bell, J. M., Ji, H. P. AMER ASSOC CANCER RESEARCH. 2015

View details for DOI 10.1158/1538-7445.AM2015-3901

View details for Web of Science ID 000371597102425
Megabase-scale phased haplotypes of genetic aberrations from whole cancer genome sequencing of primary colorectal tumors Lau, B., Bell, J. M., Schnall-Levin, M., Jarosz, M., Hopmans, E., Wood, C. M., Zheng, G. X., Giorda, K., Ji, H. P. AMER ASSOC CANCER RESEARCH. 2015

View details for DOI 10.1158/1538-7445.AM2015-4882

View details for Web of Science ID 000371597105067
Highly sensitive and specific digital quantification of cancer genetic aberrations Miotke, L. K., Lau, B., Rumma, R., Ji, H. AMER ASSOC CANCER RESEARCH. 2014

View details for DOI 10.1158/1538-7445.AM2014-1507

View details for Web of Science ID 000349906901236
A robust and rapid targeted sequencing technology for iterative multiple genomic features in cancer Lau, B., Cushing, A., Ji, H. AMER ASSOC CANCER RESEARCH. 2014

View details for DOI 10.1158/1538-7445.AM2014-3566

View details for Web of Science ID 000349910201063
High sensitivity detection and quantitation of DNA copy number and single nucleotide variants with single color droplet digital PCR. Analytical chemistry Miotke, L., Lau, B. T., Rumma, R. T., Ji, H. P. 2014; 86 (5): 2618-2624

Abstract

In this study, we present a highly customizable method for quantifying copy number and point mutations utilizing a single-color, droplet digital PCR platform. Droplet digital polymerase chain reaction (ddPCR) is rapidly replacing real-time quantitative PCR (qRT-PCR) as an efficient method of independent DNA quantification. Compared to quantative PCR, ddPCR eliminates the needs for traditional standards; instead, it measures target and reference DNA within the same well. The applications for ddPCR are widespread including targeted quantitation of genetic aberrations, which is commonly achieved with a two-color fluorescent oligonucleotide probe (TaqMan) design. However, the overall cost and need for optimization can be greatly reduced with an alternative method of distinguishing between target and reference products using the nonspecific DNA binding properties of EvaGreen (EG) dye. By manipulating the length of the target and reference amplicons, we can distinguish between their fluorescent signals and quantify each independently. We demonstrate the effectiveness of this method by examining copy number in the proto-oncogene FLT3 and the common V600E point mutation in BRAF. Using a series of well-characterized control samples and cancer cell lines, we confirmed the accuracy of our method in quantifying mutation percentage and integer value copy number changes. As another novel feature, our assay was able to detect a mutation comprising less than 1% of an otherwise wild-type sample, as well as copy number changes from cancers even in the context of significant dilution with normal DNA. This flexible and cost-effective method of independent DNA quantification proves to be a robust alternative to the commercialized TaqMan assay.

View details for DOI 10.1021/ac403843j

View details for PubMedID 24483992
New quantitative methods for measuring plasmid loss rates reveal unexpected stability PLASMID Lau, B. C., Malkus, P., Paulsson, J. 2013; 70 (3): 353–61

Abstract

Plasmid loss rate measurements are standard in microbiology and key to understanding plasmid stabilization mechanisms. The conventional assays eliminate selection for plasmids at the beginning of the experiment and screen for the appearance of plasmid-free cells over long-term population growth. However, it has been long appreciated in plasmid biology that the growth rate differential between plasmid-free and plasmid-containing cells at some point overshadows the effect of primary loss events, such that the assays can greatly over-estimate inherent loss rates. The standard solutions to this problem are to either consider the very early phase of loss where the fraction of plasmid-free cells increases linearly, or to measure the growth rate difference either by following the population for longer time or by measuring growth rates separately. Here we mathematically show that in all these cases, seemingly small experimental errors in the growth rate estimates can overshadow the estimates of the loss rates. For many plasmids, loss rates may thus be much lower than previously thought, and for some plasmids, the estimated loss rate may have nothing to do with actual loss rates. We further modify two independent experimental methods to separate inherent losses from growth differences and apply them to the same plasmids. First we use a high-throughput microscopy-based approach to screen for plasmid-free cells at extremely short time scales--tens of minutes rather than tens of generations--and apply it to a par⁻ version of mini-R1. Second we modify a counterselection-based plasmid loss assay inspired by the Luria-Delbrück fluctuation test that completely separates losses from growth, and apply it to various R1 and pSC101 derivatives. Concordant results from the two assays suggest that plasmids are lost at a lower frequency than previously believed. In fact, for par⁻ mini-R1 the observed loss rate of about 10⁻³ per cell and generation seems to be so low as to be inconsistent with what we know about the R1 stabilization mechanisms, suggesting these well characterized plasmids may have some additional and so far unknown stabilization mechanisms, for example improving copy number control or partitioning at cell division.

View details for DOI 10.1016/j.plasmid.2013.07.007

View details for Web of Science ID 000328175600007

View details for PubMedID 24042048

View details for PubMedCentralID PMC3966108
A complete microfluidic screening platform for rational protein crystallization JOURNAL OF THE AMERICAN CHEMICAL SOCIETY Lau, B. C., Baitz, C. A., Dong, X. P., Hansen, C. L. 2007; 129 (3): 454–55

View details for DOI 10.1021/ja065855b

View details for Web of Science ID 000243503700001

View details for PubMedID 17226984

Billy Tsz Cheong Lau

Instructor, Medicine - Oncology

Academic Appointments

Professional Education

Contact

Additional Info

All Publications

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract