Emma Lundberg's Profile | Stanford Profiles

Bio

Dr. Emma Lundberg is an Associate Professor of Bioengineering and Pathology at Stanford University and serves as the Director of the Cell Atlas of the Human Protein Atlas initiative in Sweden, where she is also Professor at KTH Royal Institute of Technology. At the intersection of bioimaging, proteomics, and artificial intelligence, her research aims to define the spatiotemporal organization of the human proteome at both cellular and subcellular levels. Dr. Lundberg aims to develop integrated models of human cells to elucidate how variations in protein localization patterns influence cellular function, ultimately enabling the simulation of cell behavior and a systems-level understanding of how biological information is spatially encoded. The Lundberg Lab is responsible for creating the Subcellular Atlas of the Human Protein Atlas database (https://www.proteinatlas.org/). Dr. Lundberg is dedicated to building virtual cell models to simulate cell behavior, and is passionate about engaging the public in her work through citizen science games and computational challenges.

Dr. Lundberg holds a Master’s degree in Bioengineering and a PhD in Biotechnology from KTH Royal Institute of Technology in Sweden. She has served as Secretary General of the Human Proteome Organization, and is actively involved in advisory roles for numerous open-access databases and cell mapping efforts such as the CZI AI Virtual Cell, Human Cell Atlas consortium, UniProt db, Reactome db, Human Proteome Project and various pharma and biotech companies. In recognition of her leadership and her advocacy for open science, she was twice named among the top 10 under 40 future leaders in biopharma and omics.

Academic Appointments

Associate Professor, Bioengineering
Associate Professor, Pathology
Member, Bio-X
Member, Wu Tsai Human Performance Alliance
Member, Wu Tsai Neurosciences Institute

Administrative Appointments

Chan Zuckerberg Biohub Investigator, Chan Zuckerberg Biohub (2022 - Present)
Director of Graduate Studies, Stanford Bioengineering (2022 - Present)
Steering Committee of Knight Initiative for Brain Resilience, Stanford University (2022 - Present)
Head of Department of Clinical and Cell Proteomics, KTH Royal Institute of Technology (2020 - 2021)
Director of Spatial Proteomics facility, Science for Life Laboratory (2017 - 2022)
Secretary General, Human Proteome Organization (2017 - 2018)
Co-Director, Human Protein Atlas (2008 - Present)

Honors & Awards

McCormick and Gabilan Award, Stanford University (2022)
Göran Gustafsson Award, Göran Gustafsson Foundation (2022)
Royal Microscopy Society Scientific Achievement Award, Royal Microscopy Society (2021)
Anne Heidenthal prize for fluorescent research, Chroma Technology Group (2019)
Ken Standing award for technology development in Life Science, University of Manitoba (2019)
Swedish national association of chemical engineers' annual prize, Swedish national association of chemical engineers (2017)
Wallenberg Academy Fellow Award, Knut and Alice Wallenberg Foundation (2016)

Boards, Advisory Committees, Professional Organizations

AI Advisory Board Member, Chan Zuckerberg Initiative (2024 - Present)
Chair of Scientific Advisory Board, Max Planck Institute of Biochemistry (2024 - Present)
Scientific Advisory Board, EMBL-EBI Bioimaging (2022 - Present)
Scientific Advisory Board, AI for science, AISSAI, CNRS, France (2022 - 2024)
Scientific Advisory Board, Center for Open Bioimage Analysis (NIH P41 center) (2020 - 2024)
Scientific Advisory Board, Wellcome Genome Center Oxford (2020 - 2022)
Scientific Advisory Board, UniProt db (2018 - Present)
Standards and Technologies Working Group, Human Cell Atlas (2018 - Present)
Scientific Advisory Board, Reactome (2018 - 2023)

Professional Education

Ph.D., KTH Royal Institute of Technology, Biotechnology (2008)
M.S., KTH Royal Institute of Technology, Bioengineering (2004)

Contact

Alternate Contact: Devyn James, Administrative Assistant, devynj@stanford.edu

All Publications

Technologies to measure and modulate protein subcellular localization. Nature reviews. Molecular cell biology Leineweber, W., Tei, R., Mäkiniemi, A., Ting, A., Lundberg, E. 2026

Abstract

How proteins localize to specific compartments, function in coordination with other biomolecules and, ultimately, contribute to diverse cellular activities are crucial questions in cell biology. Complicating the answers to these questions are multilocalizing and multifunctional proteins, whose impact on the cell depends on both spatial and temporal contexts. Therefore, contextualizing protein functions based on their subcellular localization is necessary to fully understand cell behaviours. Recent advances in instrumentation and protein labelling techniques are rapidly increasing the availability of tools, technologies and applications that measure and control protein localization and compartment-specific function. In this Review, we first discuss microscopy, mass spectrometry-based correlation profiling and proximity labelling methods that assign localizations to proteins, ranging from cellular compartments to protein-protein interactions. We next examine the available tools for manipulating protein localization and measuring the effects of these manipulations, including localization tags and bifunctional molecules. For each technology, we assess the strengths and weaknesses that ultimately determine their usefulness. We conclude with an outlook on future technological advances in the field of spatial subcellular proteomics and their potential implications for cell biology and clinical applications.

View details for DOI 10.1038/s41580-026-00957-1

View details for PubMedID 41857183

View details for PubMedCentralID 6338729
Subcellular localization as a driver of protein function. Nature reviews. Molecular cell biology Sigaeva, A., Hutchings, C., Cesnik, A., Lilley, K. S., Lundberg, E. 2026

Abstract

Biological functions depend on the spatiotemporal distribution of proteins within cells. Key cellular activities such as signal transduction, metabolism, cell cycle and cell death are driven by the interactions of proteins that are localized in multiple cellular compartments. Such multilocalization can even allow protein with identical sequences to display multifunctionality, a phenomenon known as moonlighting. Despite its biological importance, the relationship between protein localization and function remains underexplored. In this Review, we discuss the known mechanisms of protein localization (including RNA transport, role of proteoforms and molecular interactions) and how subcellular localization controls protein function. Proper regulation of protein localization is crucial for specialized cell and tissue functions, including cell differentiation, polarization and the epithelial-mesenchymal transition. Protein mislocalization can also have important roles in pathological processes, such as in cancer, neurodegeneration and autoimmunity. We end with a discussion of current technological and conceptual challenges in the field of subcellular proteomics and spatial biology. Addressing these challenges will allow us to link the dynamic nature of protein localization and function across biological scales and contexts, with great impact on fundamental cell biology and clinical applications.

View details for DOI 10.1038/s41580-026-00947-3

View details for PubMedID 41709002

View details for PubMedCentralID 6338729
Intrinsic heterogeneity of primary cilia revealed through spatial proteomics. Cell Hansen, J. N., Sun, H., Kahnert, K., Westenius, E., Johannesson, A., Villegas, C., Le, T., Tzavlaki, K., Winsnes, C., Pohjanen, E., Mäkiniemi, A., Fall, J., Ballllosera Navarro, F., Bäckström, A., Lindskog, C., Johansson, F., von Feilitzen, K., Delgado-Vega, A. M., Martinez Casals, A., Mahdessian, D., Uhlén, M., Sheu, S. H., Lindstrand, A., Axelsson, U., Lundberg, E. 2025

Abstract

Primary cilia are critical organelles found on most human cells. Their dysfunction is linked to hereditary ciliopathies with a wide phenotypic spectrum. Despite their significance, the specific roles of cilia in different cell types remain poorly understood due to limitations in analyzing ciliary protein composition. We employed antibody-based spatial proteomics to expand the Human Protein Atlas to primary cilia. Our analysis identified the subciliary locations of 715 proteins across three cell lines, examining 128,156 individual cilia. We found that 69% of the ciliary proteome is cell-type specific, and 78% exhibited single-cilia heterogeneity. Our findings portray cilia as sensors tuning their proteome to effectively sense the environment and compute cellular responses. We reveal 91 cilia proteins and found a genetic candidate variant in CREB3 in one clinical case with features overlapping ciliopathy phenotypes. This open, spatial cilia atlas advances research on cilia and ciliopathies.

View details for DOI 10.1016/j.cell.2025.08.039

View details for PubMedID 41005307
Multimodal cell maps as a foundation for structural and functional genomics. Nature Schaffer, L. V., Hu, M., Qian, G., Moon, K. M., Pal, A., Soni, N., Latham, A. P., Pontano Vaites, L., Tsai, D., Mattson, N. M., Licon, K., Bachelder, R., Cesnik, A., Gaur, I., Le, T., Leineweber, W., Palar, A., Pulido, E., Qin, Y., Zhao, X., Churas, C., Lenkiewicz, J., Chen, J., Ono, K., Pratt, D., Zage, P., Echeverria, I., Sali, A., Harper, J. W., Gygi, S. P., Foster, L. J., Huttlin, E. L., Lundberg, E., Ideker, T. 2025

Abstract

Human cells consist of a complex hierarchy of components, many of which remain unexplored. Here we construct a global map of human subcellular architecture through joint measurement of biophysical interactions and immunofluorescence images for over 5,100 proteins in U2OS osteosarcoma cells. Self-supervised multimodal data integration resolves 275 molecular assemblies spanning the range of 10⁻⁸ to 10⁻⁵ m, which we validate systematically using whole-cell size-exclusion chromatography and annotate using large language models. We explore key applications in structural biology, yielding structures for 111 heterodimeric complexes and an expanded Rag-Ragulator assembly. The map assigns unexpected functions to 975 proteins, including roles for C18orf21 in RNA processing and DPP9 in interferon signalling, and identifies assemblies with multiple localizations or cell type specificity. It decodes paediatric cancer genomes, identifying 21 recurrently mutated assemblies and implicating 102 validated new cancer proteins. The associated Cell Visualization Portal and Mapping Toolkit provide a reference platform for structural and functional cell biology.

View details for DOI 10.1038/s41586-025-08878-3

View details for PubMedID 40205054
How to build the virtual cell with artificial intelligence: Priorities and opportunities. Cell Bunne, C., Roohani, Y., Rosen, Y., Gupta, A., Zhang, X., Roed, M., Alexandrov, T., AlQuraishi, M., Brennan, P., Burkhardt, D. B., Califano, A., Cool, J., Dernburg, A. F., Ewing, K., Fox, E. B., Haury, M., Herr, A. E., Horvitz, E., Hsu, P. D., Jain, V., Johnson, G. R., Kalil, T., Kelley, D. R., Kelley, S. O., Kreshuk, A., Mitchison, T., Otte, S., Shendure, J., Sofroniew, N. J., Theis, F., Theodoris, C. V., Upadhyayula, S., Valer, M., Wang, B., Xing, E., Yeung-Levy, S., Zitnik, M., Karaletsos, T., Regev, A., Lundberg, E., Leskovec, J., Quake, S. R. 2024; 187 (25): 7045-7063

Abstract

Cells are essential to understanding health and disease, yet traditional models fall short of modeling and simulating their function and behavior. Advances in AI and omics offer groundbreaking opportunities to create an AI virtual cell (AIVC), a multi-scale, multi-modal large-neural-network-based model that can represent and simulate the behavior of molecules, cells, and tissues across diverse states. This Perspective provides a vision on their design and how collaborative efforts to build AIVCs will transform biological research by allowing high-fidelity simulations, accelerating discoveries, and guiding experimental studies, offering new opportunities for understanding cellular functions and fostering interdisciplinary collaborations in open science.

View details for DOI 10.1016/j.cell.2024.11.015

View details for PubMedID 39672099
Spatiotemporal dissection of the cell cycle with single-cell proteogenomics. Nature Mahdessian, D., Cesnik, A. J., Gnann, C., Danielsson, F., Stenstrom, L., Arif, M., Zhang, C., Le, T., Johansson, F., Shutten, R., Backstrom, A., Axelsson, U., Thul, P., Cho, N. H., Carja, O., Uhlen, M., Mardinoglu, A., Stadler, C., Lindskog, C., Ayoglu, B., Leonetti, M. D., Ponten, F., Sullivan, D. P., Lundberg, E. 2021; 590 (7847): 649–54

Abstract

The cell cycle, over which cells grow and divide, is a fundamental process of life. Its dysregulation has devastating consequences, including cancer. The cell cycle is driven by precise regulation of proteins in time and space, which creates variability between individual proliferating cells. To our knowledge, no systematic investigations of such cell-to-cell proteomic variability exist. Here we present a comprehensive, spatiotemporal map of human proteomic heterogeneity by integrating proteomics at subcellular resolution with single-cell transcriptomics and precise temporal measurements of individual cells in the cell cycle. We show that around one-fifth of the human proteome displays cell-to-cell variability, identify hundreds of proteins with previously unknown associations with mitosis and the cell cycle, and provide evidence that several of these proteins have oncogenic functions. Our results show that cell cycle progression explains less than half of all cell-to-cell variability, and that most cycling proteins are regulated post-translationally, rather than by transcriptomic cycling. These proteins are disproportionately phosphorylated by kinases that regulate cell fate, whereas non-cycling proteins that vary between cells are more likely to be modified by kinases that regulate metabolism. This spatially resolved proteomic map of the cell cycle is integrated into the Human Protein Atlas and will serve as a resource for accelerating molecular studies of the human cell cycle and cell proliferation.

View details for DOI 10.1038/s41586-021-03232-9

View details for PubMedID 33627808
A multi-scale map of cell structure fusing protein images and interactions. Nature Qin, Y., Huttlin, E. L., Winsnes, C. F., Gosztyla, M. L., Wacheul, L., Kelly, M. R., Blue, S. M., Zheng, F., Chen, M., Schaffer, L. V., Licon, K., Bäckström, A., Vaites, L. P., Lee, J. J., Ouyang, W., Liu, S. N., Zhang, T., Silva, E., Park, J., Pitea, A., Kreisberg, J. F., Gygi, S. P., Ma, J., Harper, J. W., Yeo, G. W., Lafontaine, D. L., Lundberg, E., Ideker, T. 2021

Abstract

The cell is a multi-scale structure with modular organization across at least four orders of magnitude. Two central approaches for mapping this structure, protein fluorescent imaging and protein biophysical association, each generate extensive datasets, but of distinct qualities and resolutions that are typically treated separately. Here we integrate immunofluorescence images in the Human Protein Atlas with affinity purifications in BioPlex to create a unified hierarchical map of human cell architecture. Integration is achieved by configuring each approach as a general measure of protein distance, then calibrating the two measures using machine learning. The map, known as the multi-scale integrated cell (MuSIC 1.0), resolves 69 subcellular systems, of which approximately half are to our knowledge undocumented. Accordingly, we perform 134 additional affinity purifications and validate subunit associations for the majority of systems. The map reveals a pre-ribosomal RNA processing assembly and accessory factors, which we show govern rRNA maturation, and functional roles for SRRM1 and FAM120C in chromatin and RPS3A in splicing. By integration across scales, MuSIC increases the resolution of imaging while giving protein interactions a spatial dimension, paving the way to incorporate diverse types of data in proteome-wide cell maps.

View details for DOI 10.1038/s41586-021-04115-9

View details for PubMedID 34819669
Spatial proteomics: a powerful discovery tool for cell biology NATURE REVIEWS MOLECULAR CELL BIOLOGY Lundberg, E., Borner, G. H. H. 2019; 20 (5): 285–302

View details for DOI 10.1038/s41580-018-0094-y

View details for Web of Science ID 000465500200008
Deep learning is combined with massive-scale citizen science to improve large-scale image classification NATURE BIOTECHNOLOGY Sullivan, D. P., Winsnes, C. F., Akesson, L., Hjelmare, M., Wiking, M., Schutten, R., Campbell, L., Leifsson, H., Rhodes, S., Nordgren, A., Smith, K., Revaz, B., Finnbogason, B., Szantner, A., Lundberg, E. 2018; 36 (9): 820-+

Abstract

Pattern recognition and classification of images are key challenges throughout the life sciences. We combined two approaches for large-scale classification of fluorescence microscopy images. First, using the publicly available data set from the Cell Atlas of the Human Protein Atlas (HPA), we integrated an image-classification task into a mainstream video game (EVE Online) as a mini-game, named Project Discovery. Participation by 322,006 gamers over 1 year provided nearly 33 million classifications of subcellular localization patterns, including patterns that were not previously annotated by the HPA. Second, we used deep learning to build an automated Localization Cellular Annotation Tool (Loc-CAT). This tool classifies proteins into 29 subcellular localization patterns and can deal efficiently with multi-localization proteins, performing robustly across different cell types. Combining the annotations of gamers and deep learning, we applied transfer learning to create a boosted learner that can characterize subcellular protein distribution with F1 score of 0.72. We found that engaging players of commercial computer games provided data that augmented deep learning and enabled scalable and readily improved image classification.

View details for PubMedID 30125267
A subcellular map of the human proteome. Science (New York, N.Y.) Thul, P. J., Åkesson, L., Wiking, M., Mahdessian, D., Geladaki, A., Ait Blal, H., Alm, T., Asplund, A., Björk, L., Breckels, L. M., Bäckström, A., Danielsson, F., Fagerberg, L., Fall, J., Gatto, L., Gnann, C., Hober, S., Hjelmare, M., Johansson, F., Lee, S., Lindskog, C., Mulder, J., Mulvey, C. M., Nilsson, P., Oksvold, P., Rockberg, J., Schutten, R., Schwenk, J. M., Sivertsson, Å., Sjöstedt, E., Skogs, M., Stadler, C., Sullivan, D. P., Tegel, H., Winsnes, C., Zhang, C., Zwahlen, M., Mardinoglu, A., Pontén, F., von Feilitzen, K., Lilley, K. S., Uhlén, M., Lundberg, E. 2017; 356 (6340)

Abstract

Resolving the spatial distribution of the human proteome at a subcellular level can greatly increase our understanding of human biology and disease. Here we present a comprehensive image-based map of subcellular protein distribution, the Cell Atlas, built by integrating transcriptomics and antibody-based immunofluorescence microscopy with validation by mass spectrometry. Mapping the in situ localization of 12,003 human proteins at a single-cell level to 30 subcellular structures enabled the definition of the proteomes of 13 major organelles. Exploration of the proteomes revealed single-cell variations in abundance or spatial distribution and localization of about half of the proteins to multiple compartments. This subcellular map can be used to refine existing protein-protein interaction networks and provides an important resource to deconvolute the highly complex architecture of the human cell.

View details for DOI 10.1126/science.aal3321

View details for PubMedID 28495876
Proteomics. Tissue-based map of the human proteome. Science (New York, N.Y.) Uhlén, M., Fagerberg, L., Hallström, B. M., Lindskog, C., Oksvold, P., Mardinoglu, A., Sivertsson, Å., Kampf, C., Sjöstedt, E., Asplund, A., Olsson, I., Edlund, K., Lundberg, E., Navani, S., Szigyarto, C. A., Odeberg, J., Djureinovic, D., Takanen, J. O., Hober, S., Alm, T., Edqvist, P. H., Berling, H., Tegel, H., Mulder, J., Rockberg, J., Nilsson, P., Schwenk, J. M., Hamsten, M., von Feilitzen, K., Forsberg, M., Persson, L., Johansson, F., Zwahlen, M., von Heijne, G., Nielsen, J., Pontén, F. 2015; 347 (6220): 1260419

Abstract

Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.

View details for DOI 10.1126/science.1260419

View details for PubMedID 25613900
A framework for the exploration of subcellular compartmentalization of RNA-binding proteins. Nature communications Guo, X., Hu, J., Kanwal, S., Yuan, J., Tariq, M., Zheng, J., Sun, M., Lu, Y., Wang, J., Jiang, M., Wang, A., Castells-Garcia, A., Zheng, X., Peng, B., Wang, D., Wei, X., Yang, T., Volpe, G., Wu, L., Mazid, M. A., Li, W., Lai, Y., Qin, D., Aguilo, F., Zhou, Y., Liu, C., Cosma, M. P., Xu, X., Lundberg, E., Mulder, J., Hutchins, A. P., Maxwell, P. H., Di Croce, L., Zhang, X., Esteban, M. A., Lv, Y. 2026

Abstract

The ability of RNA-binding proteins to form complexes with other biomolecules underpins a broad range of structural properties and functions. Understanding the subcellular distribution of RNA-binding proteins and their interacting partners in the steady state and upon perturbation can therefore shed light on these aspects. Here, we present the compartmentalized RNA-Binding Protein (or coRBP) map, an experimental resource and analytical pipeline to study subcellular RNA-binding proteins through multimodal dataset integration and machine learning. Using this approach, we generate a dataset of 1,768 known and putative RNA-binding proteins distributed in a broad panel of subcellular compartments and delineate their intermolecular and intercompartmental relationships. We also establish a hierarchy of RNA-binding protein-containing complexes at multiple scales across the cell, which suggests additional functions for multiple RNA-binding proteins. Furthermore, we investigate changes in RNA-binding protein complex composition and subcellular distribution in response to C9ORF72-associated amyotrophic lateral sclerosis/frontotemporal dementia dipeptide repeats and DNA damage stress. The coRBP map provides a resource to study the roles of RNA-binding proteins in homeostasis and disease.

View details for DOI 10.1038/s41467-026-71511-y

View details for PubMedID 42014727
A high-resolution spatial map of cilia-associated proteins in the human fallopian tube. Nature communications Hikmet, F., Digre, A., Hansen, J. N., Schon, S. B., Lundberg, E., Olovsson, M., Uhlén, M., Méar, L., Lindskog, C. 2026; 17 (1)

Abstract

Molecular alterations in the fallopian tubes play a pivotal role in the development of cancer and reproductive disorders, yet their molecular landscape at the protein level remains poorly defined. Here, we map key fallopian tube proteins at single-cell resolution utilizing an integrated transcriptomics and proteomics approach. Based on RNA-seq analysis, we identify 310 genes with elevated expression in the fallopian tube, the majority of which are associated with motile cilia function. We spatially characterize 133 of the corresponding proteins in the fallopian tube and other human tissues with motile cilia to subcellular structures of ciliated cells, validating the findings with single-cell RNA-seq and mass-spectrometry data. Eleven proteins previously only studied on the transcript level without information in cilia databases are further analyzed in a hydrosalpinx patient, showing a thinner epithelium, lower density of FOXJ1 expression, and reduced expression of FHAD1, RIIAD1, and C2orf81. Our high-resolution spatial map aids in dissecting the pathways underlying infertility and diseases linked to cilia-specific functions.

View details for DOI 10.1038/s41467-026-71692-6

View details for PubMedID 42010243

View details for PubMedCentralID 10391316
Cell shapes decode molecular phenotypes in image-based spatial proteomics. Cell systems Le, T., Leineweber, W. D., Viana, M. P., Cesnik, A., Hansen, J. N., Ouyang, W., Rafelski, S. M., Lundberg, E. 2026: 101589

Abstract

Cellular and tissue structures arise from a few cell shapes, which undergo transformations based on biophysical constraints. Despite links between signaling pathways and cellular geometry, whole-proteome orchestration in association with cell shape is underexplored. In this study, over 1 million single cells stained for 11,998 proteins across 11 cell lines in the Human Protein Atlas were analyzed for organelle, pathway, and single-protein levels in association with cellular shapespace. We found that cell and nuclear shapes across cell lines exist in a shared continuum. The subcellular organelle topology varies across cell lines but remains consistent within each cell line's shapespace. At the single-protein level, cells of different shapes in the same cell-cycle phase might be preparing for different fates, and many non-cell-cycle proteins expressed shape-based abundance variation. Using a shape-based coordinate framework, we analyzed the distribution shift of protein spatial localization under drug perturbation.

View details for DOI 10.1016/j.cels.2026.101589

View details for PubMedID 42013840
Generative machine learning unlocks the first proteome-wide image of human cells. bioRxiv : the preprint server for biology Sun, H., Kahnert, K., Hansen, J. N., Leineweber, W., Li, M., Feng, W., Ballllosera, F., Axelsson, U., Ouyang, W., Lundberg, E. 2026

Abstract

The spatial organization of proteins within cells governs virtually all cellular functions. Yet, current imaging technologies can simultaneously visualize only tens of proteins, orders of magnitude below the thousands that populate a single human cell. Here, we present ProtiCelli, a deep generative model that simulates microscopy images for 12,800 human proteins from just three cellular landmark stains. Trained on 1.23 million images from the Human Protein Atlas, ProtiCelli outperforms existing methods in reconstruction accuracy and textural fidelity, and generalizes to unseen cell types and drug perturbations absent from training. We demonstrate that ProtiCelli-generated images preserve hierarchical subcellular organization, recapitulate known protein-protein interaction landscapes, and resolve compartment-specific functions of moonlighting proteins at the single-cell level. Remarkably, the model infers drug-induced changes in protein expression and localization from cell morphology alone, predicts cell cycle stage without dedicated cell cycle markers, and enables unsupervised segmentation of subcellular compartments as well as spatial decomposition of gene sets into functional regions. Ultimately, we leverage ProtiCelli to generate Proteome2Cell, an unprecedented dataset of 30.7 million simulated images creating 2,400 "virtual cells" across 12 human cell lines. These proteome-scale images enable the construction of hierarchical single-cell models that distinguish conserved from dynamic protein architectures. Integration of Proteome2Cell into the Human Protein Atlas democratizes the exploration of these "virtual cells". By computationally bridging the experimental scalability gap, ProtiCelli establishes a foundation for spatial virtual cell modeling and paves an avenue for transforming spatial proteomics from cataloging proteins to simulating complete cellular systems.

View details for DOI 10.64898/2026.03.31.715748

View details for PubMedID 41959450

View details for PubMedCentralID PMC13060211
Large-scale mapping of environmental-genetic interactions illustrates the dynamic nature of cell-cycle and DNA repair regulation. Molecular cell Herken, B. W., Wong, G. T., Mäkiniemi, A., Lundberg, E., Norman, T. M., Gilbert, L. A. 2026; 86 (4): 757-773.e5

Abstract

Cells integrate exogenous and endogenous signals to grow, repair, or die. This is likely achieved through dynamic functional associations between genes, but measuring these relationships at scale is non-trivial. Here, we evaluate genetic associations in response to cell-cycle interruption, genotoxic perturbation, and nutrient deprivation using conditional genetic interaction (GI) mapping in human cells. In five maps measuring ∼250,000 GIs or higher-order environmental interactions, we discover widespread rewiring of relationships between genes, complexes, and ontologies across conditions. Specific bioprocesses drive the rewiring signal in each environmental state, as highlighted in our findings that the TIP60 and PP2A complexes radically alter their interaction profiles after inhibition of ATR. This resource reveals numerous genetic relationships for the fields of DNA damage signaling, DNA repair, and cell-cycle control and explores their context specificity. Our work advances a framework for using GI maps to explore environmental rewiring.

View details for DOI 10.1016/j.molcel.2026.01.025

View details for PubMedID 41720076
RNA origin of sex-biased immunity MOLECULAR THERAPY NUCLEIC ACIDS Chang, H. Y., Chung, L., Davis, M. M., Fiorentino, D., Lee, J., Lundberg, E., Utz, P. J. 2026; 37 (1)

View details for DOI 10.1016/j.omtn.2026.102853

View details for Web of Science ID 001694959900001
Molecular pixelation of the CAR T cell surface proteome. bioRxiv : the preprint server for biology Cesnik, A., Takacsi-Nagy, O., Le, T., Roth, T. L., Satpathy, A. T., Lundberg, E. 2026

Abstract

Immunotherapies using CAR T cells are revolutionizing B-cell acute lymphoblastic leukemia treatments. However, the majority of patients remain unresponsive, and chronic stimulation of T cells is a common contributor that reduces effector function and persistence. We apply Molecular Pixelation, a recently developed single-cell technology for characterizing cellular surface proteomes, to determine characteristic topological surface-based proteomic signatures of CAR T cell exhaustion. We analyze 76 surface proteins on 8504 CAR T cells at a single-cell level, collected from three donors and either stimulated once or repeatedly, six times over two weeks. The abundances, polarizations, and colocalizations of surface proteins can each distinguish CAR T cells that were stimulated acutely or chronically, and all but one marker with polarization changes increased in polarization. These data also reveal disrupted adhesion signatures of protein colocalization in the peripheral supramolecular activation complex (pSMAC) and increased CD37/CD82 colocalization after chronic stimulation. These Molecular Pixelation results convey new spatial signatures for proteomic polarization and colocalization on the cell surface that represent new cell-state axes for immunology and systems biology.

View details for DOI 10.64898/2026.01.30.702970

View details for PubMedID 41676457

View details for PubMedCentralID PMC12889555
Ageing promotes microglial accumulation of slow-degrading synaptic proteins. Nature Guldner, I. H., Wagner, V. P., Moran-Losada, P., Shi, S. M., Golub, S. W., Hevler, J. F., Chen, K., Meese, B. T., Ghoochani, A., Pulido, E., Oh, H. S., Le Guen, Y., Lu, N., Wong, P. S., To, N. S., Garceau, D., Guo, Z., Luo, J., Bertozzi, C. R., Lundberg, E., Abu-Remaileh, M., Sasner, M., Keller, A., Yang, A. C., Cheung, T. H., Wyss-Coray, T. 2026

Abstract

Neurodegenerative diseases affect 1 in 12 people globally and remain incurable. Central to their pathogenesis is a loss of neuronal protein maintenance and the accumulation of protein aggregates with ageing. Here we engineered bioorthogonal tools that enabled us to tag the nascent neuronal proteome and study its turnover with ageing, its propensity to aggregate and its interaction with microglia. We show that neuronal protein half-life approximately doubles on average between 4-month-old and 24-month-old mice, with the stability of individual proteins differing among brain regions. Furthermore, we describe the aged neuronal 'aggregome', which encompasses 1,726 proteins, nearly half of which show reduced degradation with age. The aggregome includes well-known proteins linked to diseases and numerous proteins previously not associated with neurodegeneration. Notably, we demonstrate that neuronal proteins accumulate in aged microglia, with 54% also displaying reduced degradation and/or aggregation with age. Among these proteins, synaptic proteins are highly enriched, which suggests that there is a cascade of events that emerge from impaired synaptic protein turnover and aggregation to the disposal of these proteins, possibly through microglial engulfment of synapses. These findings reveal the substantial loss of neuronal proteome maintenance with ageing, which could be causal for age-related synapse loss and cognitive decline.

View details for DOI 10.1038/s41586-025-09987-9

View details for PubMedID 41565824

View details for PubMedCentralID 3836174
Engineered calcium-regulated affinity protein for efficient internalization and lysosomal toxin delivery. Proceedings of the National Academy of Sciences of the United States of America Jonsson, M., Moller, M., Schierholz, L., Dorka, N., Tegel, H., Lundberg, E., Uhlen, M., Wolf-Watz, M., Brismar, H., Hober, S. 2025; 122 (48): e2509081122

Abstract

The emerging strategy of protein-drug conjugates (PDCs) for targeted cancer therapy holds great potential to improve treatment efficacy by specifically targeting cancer biomarkers and delivering toxic payloads directly to tumor cells, minimizing off-target toxicity. The success of this approach depends on the internalization and retention of the payload in target cells. This study introduces a method using a small protein domain engineered for conditional target affinity, enabling lysosomal trafficking independent of the biological fate of the receptor. Specifically, we describe the development of an EGF receptor binder, CaRAEGFR, with calcium-regulated affinity (CaRA), meaning the target binding strength is tailored by the available calcium concentration. This allows for endosomal dissociation, as calcium levels are lower in endosomes than in the bloodstream. Affinity measurements and structural modeling reveal the molecular basis of the calcium-modulated affinity. Live-cell imaging demonstrates efficient internalization and lysosomal trafficking of the calcium-dependent domain, while the EGF receptor is recycled to the membrane. When used as a drug carrier, CaRAEGFR effectively delivers the toxin to the lysosomes, resulting in potent cytotoxicity with an IC50 of 0.8 nM in EGFR-expressing cancer cells.

View details for DOI 10.1073/pnas.2509081122

View details for PubMedID 41289384
SubCell: Proteome-aware vision foundation models for microscopy capture single-cell biology. bioRxiv : the preprint server for biology Gupta, A., Wefers, Z., Kahnert, K., Hansen, J. N., Misra, M. K., Leineweber, W., Cesnik, A., Lu, D., Axelsson, U., Ballllosera, F., Altman, R. B., Karaletsos, T., Lundberg, E. 2025

Abstract

Cell morphology and subcellular protein organization provide important insights into cellular function and behavior. These features of cells can be studied using large-scale protein fluorescence microscopy, and machine learning has become a powerful tool to interpret the resulting images for biological insights. Here, we introduce SubCell, a suite of self-supervised deep learning models for fluorescence microscopy designed to accurately capture cellular morphology, protein localization, cellular organization, and biological function beyond what humans can readily perceive. These models were trained on the proteome-wide image collection from the Human Protein Atlas with a novel proteome-aware learning objective. SubCell outperforms state-of-the-art methods across a variety of tasks relevant to single-cell biology and generalizes to other fluorescence microscopy datasets without any fine-tuning. Additionally, we construct the first proteome-wide hierarchical map of proteome organization that is directly learned from image data. This vision-based multiscale cell map defines cellular subsystems with high resolution of protein complexes, reveals proteins with similar functions, and distinguishes dynamic and stable behaviors within cellular compartments. Finally, SubCell enables a rich multimodal protein representation when integrated with a protein sequence model, allowing for a more comprehensive capture of gene function than either vision-only or sequence-only models alone. In conclusion, SubCell creates deep, image-driven representations of cellular architecture that are applicable across diverse biological contexts and datasets.

View details for DOI 10.1101/2024.12.06.627299

View details for PubMedID 41278937

View details for PubMedCentralID PMC12636579
Spatiotemporal gene expression and cellular dynamics of the developing human heart. Nature genetics Lázár, E., Mauron, R., Andrusivová, Ž., Foyer, J., He, M., Larsson, L., Shakari, N., Salas, S. M., Avenel, C., Sariyar, S., Hansen, J. N., Vicari, M., Czarnewski, P., Braun, E., Li, X., Bergmann, O., Sylvén, C., Lundberg, E., Linnarsson, S., Nilsson, M., Sundström, E., Adameyko, I., Lundeberg, J. 2025

Abstract

Heart development relies on topologically orchestrated cellular transitions and interactions, many of which remain poorly characterized in humans. Here, we combined unbiased spatial and single-cell transcriptomics with imaging-based validation across postconceptional weeks 5.5 to 14 to uncover the molecular landscape of human early cardiogenesis. We present a high-resolution transcriptomic map of the developing human heart, revealing the spatial arrangements of 31 coarse-grained and 72 fine-grained cell states organized into distinct functional niches. Our findings illuminate key insights into the formation of the cardiac pacemaker-conduction system, heart valves and atrial septum, and uncover unexpected diversity among cardiac mesenchymal cells. We also trace the emergence of autonomic innervation and provide the first spatial account of chromaffin cells in the fetal heart. Our study, supported by an open-access spatially centric interactive viewer, offers a unique resource to explore the cellular and molecular blueprint of human heart development, offering links to genetic causes of heart disease.

View details for DOI 10.1038/s41588-025-02352-6

View details for PubMedID 41162788

View details for PubMedCentralID 7078965
Latent plasticity of the human pancreas across development, health, and disease. bioRxiv : the preprint server for biology Mereu, E., Balboa, D., Liebig, J., Gonzalez-Herrero, A., Martinez-Casals, A., Mardamshina, M., Mollandin, F., Schicktanz, F., Tosti, L., Vandenbempt, V., Avrahami, D., Bernardo, E., Björklund, F., Chua, R. L., Engelse, M., García-Hurtado, J., Groen, N., Hanegraaf, M., Iañez, P., Jechow, K., Konukiewitz, B., Lawerenz, C., Marchese, D., Muraro, M. J., Pellegrini, S., Sordi, V., Sudy, A., Taron, U., Ten, F. W., Trefzer, T., Twardziok, S., van Agen, M., Carlotti, F., de Koning, E., Ferrer, J., Glaser, B., Heyn, H., Lundberg, E., Piemonti, L., Steiger, K., van Oudenaarden, A., Weichert, W., Conrad, C., Eils, R. 2025

Abstract

The pancreas plays a central role in major human diseases, yet our understanding of its cellular diversity and plasticity remains incomplete. Here, we present a single-cell multiomics atlas of the human pancreas, profiling over four million cells and nuclei from 57 donors across fetal development, adult homeostasis, and type 2 diabetes (T2D). Integrating sc/snRNA-seq, snATAC-seq, VASA-seq, spatial transcriptomics (Xenium), and multiplexed proteomics (CODEX), we resolve gene expression, chromatin accessibility, and spatial organization at high resolution. We identify transcriptionally plastic centroacinar-like cells (pCACs) in adults with fetal-like features, delineate endocrine and exocrine lineage trajectories during development, and uncover HNF1A-defined beta cell epigenetic states. In T2D, we observe shifts in beta cell subtypes and altered regulatory programs. Glucose perturbation of healthy islets reveals cell-type-specific adaptation and stress responses. This atlas provides a foundational framework to understand pancreas biology and the role of cellular plasticity in regeneration and disease.

View details for DOI 10.1101/2025.10.01.679230

View details for PubMedID 41256699

View details for PubMedCentralID PMC12622017
Flexible and robust cell-type annotation for highly multiplexed tissue images. Cell systems Sun, H., Yu, S., Casals, A. M., Bäckström, A., Lu, Y., Lindskog, C., Ruffalo, M., Lundberg, E., Murphy, R. F. 2025: 101374

Abstract

Identifying cell types in highly multiplexed images is essential for understanding tissue spatial organization. Current cell-type annotation methods often rely on extensive reference images and manual adjustments. In this work, we present a tool, the Robust Image-Based Cell Annotator (RIBCA), that enables accurate, automated, unbiased, and fine-grained cell-type annotation for images with a wide range of antibody panels without requiring additional model training or human intervention. Our tool has successfully annotated over 3 million cells, revealing the spatial organization of various cell types across more than 40 different human tissues. It is open source and features a modular design, allowing for easy extension to additional cell types.

View details for DOI 10.1016/j.cels.2025.101374

View details for PubMedID 40925369
Image based subcellular mapping of the protein landscape of SARS-CoV-2 infected cells for target-centric drug repurposing. Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie Tampere, M., H Le, T., Asp, E., Kalman, A., Kaimal, J. M., Njenda, D., Backstrom, A., Axelsson, U., Xu, H., Ouyang, W., Axelsson, H., Marabita, F., Moussaud-Lamodiere, E., Sepulveda, C. O., Seashore-Ludlow, B., Vernersson, C., Mirazimi, A., Lundberg, E., Ostling, P., Stadler, C. 2025; 191: 118447

Abstract

The COVID-19 pandemic has resulted in millions of deaths and affected socioeconomic structure worldwide, and the search for new antivirals and treatments is still ongoing. In the search for new drug targets and to increase our understanding of the disease, we applied large-scale immunofluorescence profiling to explore host cell response to SARS-CoV-2 infection. Among the 602 host proteins studied in this host response profiling, changes in abundance and subcellular localization were observed for 97 proteins, with 45 proteins showing increased abundance and 10 reduced abundance. 20 proteins displayed changed localization upon infection and an additional 22 proteins displayed altered abundance and localization, together contributing to diverse reshuffling of the host cell protein landscape during infection. We then selected existing and approved small-molecule drugs (n = 123) against our identified host response proteins and identified one compound, elesclomol, that significantly reduced antiviral activity. Our study introduces a novel, targeted and systematic approach based on host protein profiling, to identify new targets for drug repurposing. The dataset of > 100,000 immunofluorescence images from this study is published as a resource available for further studies. AUTHOR SUMMARY: In this study we have evaluated a new approach for identifying drugs that could be used as antiviral drugs, in this case demonstrated for SARS-CoV-2. By mining the literature for reported interactions between SARS-CoV-2 viral components and host cell proteins, we identified a few hundred host proteins suggested to interact with the virus upon infection. To explore these viral-host interaction proteins further, we developed an image-based assay using immunofluorescence and confocal microscopy to visualize the host proteins within infected and non-infected cells. This was possible due to the proteome-wide collection of antibodies generated within the Human Protein Atlas project, with the aim to systematically map the human proteome in cells and across tissues. The host proteins that altered their location or abundance level upon infection were regarded as putative targets for drug repurposing and we subsequently tested 123 drugs that were targeting a subset of these host proteins. Applying these drugs on two different cell types infected with SARS-CoV-2 revealed a non-toxic antiviral effect for one compound that can be explored further as a treatment regimen for SARS-CoV-2 infection. The approach is novel since it combines a targeted approach for drug repurposing screening, giving insight into mechanism of action from start. As such it has the potential to accelerate drug repurposing or identification of targets for new drugs.

View details for DOI 10.1016/j.biopha.2025.118447

View details for PubMedID 40819539
Cell shapes decode molecular phenotypes in image-based spatial proteomics. bioRxiv : the preprint server for biology Le, T., Leineweber, W. D., Viana, M. P., Cesnik, A., Hansen, J. N., Ouyang, W., Rafelski, S. M., Lundberg, E. 2025

Abstract

The diversity of cellular and tissue structures can arise from a few basic cell shapes, which undergo various transformations based on biophysical constraints on cytoskeletal organization. While cellular geometry has been linked with selected biological processes such as polarity, signaling or morphogenesis, the orchestration of the whole proteome in association to cell shape is still poorly understood. In this study, using more than 1 million images of single cells stained for 11,998 proteins across 10 cell lines in the Human Protein Atlas database, we performed an integrated analysis of organelle, pathway and single protein levels in association to a 2D cellular shapespace. We found that cell and nuclear shapes across cell lines exist in a shared continuum. We also found that the subcellular organelle topology varies across cell lines, but remains robust within each cell line's shapespace. At the single protein level, we found that cells of different shapes in the same cell cycle phase might be preparing for different fates, and that many non-cell cycle proteins expressed shape-based abundance variation. Using the same coordinate framework defined by shape, we could analyze the distribution shift of protein spatial localization under drug perturbation.

View details for DOI 10.1101/2025.05.13.653868

View details for PubMedID 40463127
Streamlining Multiplexed Tissue Image Analysis with PIPSigmaX: An Integrated Automated Pipeline for Image Processing and EXploration for Diverse Tissue Types. bioRxiv : the preprint server for biology Mardamshina, M., Navarro, F. B., Casals, A. M., Avenel, C., Wahlby, C., Lundberg, E. 2025

Abstract

Spatial proteomics via multiplexed tissue imaging is transforming how we study biology, enabling researchers to investigate dozens of markers in a single tissue section and explore how cells behave in their native habitat. While imaging technologies have advanced rapidly, data analyses remain a bottleneck. To address this, we developed PIPSigmaX (Pipeline for Image Processing and EXploration), a user-friendly, end-to-end open-source software designed to make complex image analysis approachable, even for those with little or no programming skills. PIPSigmaX combines robust automation with an intuitive graphical user interface, guiding users through each step of the analysis, from image preprocessing and membrane-aware cell segmentation to signal quantification and spatial data exploration. Each feature includes built-in explanations, recommendations, and quality controls to help users make confident choices throughout the process. PIPSigmaX is compatible with a wide range of multiplexed imaging platforms, and its outputs integrate seamlessly with visualization tools like TissUUmaps and QuPath. Also, it supports downstream applications by enabling direct export of selected cell coordinates for laser microdissection. This functionality facilitates precise isolation of target cell populations for deep proteomic or transcriptomic profiling. With PIPSigmaX, researchers can extract meaningful biological insights from multiplexed images more easily and robustly, helping to bridge the gap between powerful imaging technologies and real-world scientific discovery.

View details for DOI 10.1101/2025.05.04.652145

View details for PubMedID 40654620
Enabling global image data sharing in the life sciences. Nature methods Bajcsy, P., Bhattiprolu, S., Börner, K., Cimini, B. A., Collinson, L., Ellenberg, J., Fiolka, R., Giger, M., Goscinski, W., Hartley, M., Hotaling, N., Horwitz, R., Jug, F., Kemmer, I., Kreshuk, A., Lundberg, E., Mathur, A., Narayan, K., Onami, S., Plant, A. L., Prior, F., Swedlow, J. R., Taylor, A., Keppler, A. 2025

Abstract

Despite the importance of imaging in biological and medical research, a large body of informative and precious image data never sees the light of day. To ensure scientific rigor as well as the reuse of data for scientific discovery, image data need to be made FAIR (findable, accessible, interoperable and reusable). Image data experts are working together globally to agree on common data formats, metadata, ontologies and supporting tools toward image data FAIRification. With this Perspective, we call on public funders to join these efforts to support their national scientists. What researchers most urgently need are openly accessible resources for image data storage that are operated under long-term commitments by their funders. Although existing resources in Australia, Japan and Europe are already collaborating to enable global image data sharing, these efforts will fall short unless more countries invest in operating and federating their own open data resources. This will allow us to harvest the enormous potential of existing image data, preventing substantial loss of unrealized value from past investments in imaging acquisition infrastructure.

View details for DOI 10.1038/s41592-024-02585-z

View details for PubMedID 40155720

View details for PubMedCentralID 5536224
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research. ArXiv Burgess, J., Nirschl, J. J., Bravo-Sánchez, L., Lozano, A., Gupte, S. R., Galaz-Montoya, J. G., Zhang, Y., Su, Y., Bhowmik, D., Coman, Z., Hasan, S. M., Johannesson, A., Leineweber, W. D., Nair, M. G., Yarlagadda, R., Zuraski, C., Chiu, W., Cohen, S., Hansen, J. N., Leonetti, M. D., Liu, C., Lundberg, E., Yeung-Levy, S. 2025

Abstract

Scientific research demands sophisticated reasoning over multimodal data, a challenge especially prevalent in biology. Despite recent advances in multimodal large language models (MLLMs) for AI-assisted research, existing multimodal reasoning benchmarks only target up to college-level difficulty, while research-level benchmarks emphasize lower-level perception, falling short of the complex multimodal reasoning needed for scientific discovery. To bridge this gap, we introduce MicroVQA, a visual-question answering (VQA) benchmark designed to assess three reasoning capabilities vital in research workflows: expert image understanding, hypothesis generation, and experiment proposal. MicroVQA consists of 1,042 multiple-choice questions (MCQs) curated by biology experts across diverse microscopy modalities, ensuring VQA samples represent real scientific practice. In constructing the benchmark, we find that standard MCQ generation methods induce language shortcuts, motivating a new two-stage pipeline: an optimized LLM prompt structures question-answer pairs into MCQs; then, an agent-based 'RefineBot' updates them to remove shortcuts. Benchmarking on state-of-the-art MLLMs reveals a peak performance of 53%; models with smaller LLMs only slightly underperform top models, suggesting that language-based reasoning is less challenging than multimodal reasoning; and tuning with scientific articles enhances performance. Expert analysis of chain-of-thought responses shows that perception errors are the most frequent, followed by knowledge errors and then overgeneralization errors. These insights highlight the challenges in multimodal scientific reasoning, showing MicroVQA is a valuable resource advancing AI-driven biomedical research. MicroVQA is available here, project here.

View details for PubMedID 40166749

View details for PubMedCentralID PMC11957224
Human BioMolecular Atlas Program (HuBMAP): 3D Human Reference Atlas construction and usage. Nature methods Borner, K., Blood, P. D., Silverstein, J. C., Ruffalo, M., Satija, R., Teichmann, S. A., Pryhuber, G. J., Misra, R. S., Purkerson, J. M., Fan, J., Hickey, J. W., Molla, G., Xu, C., Zhang, Y., Weber, G. M., Jain, Y., Qaurooni, D., Kong, Y., HRA Team, Bueckle, A., Herr, B. W., Abramson, J., Anderson, D., Ardlie, K., Arends, M. J., Aronow, B. J., Bajema, R., Baldock, R. A., Barnowski, R., Barwinska, D., Bernard, A., Betancur, D., Bidanta, S., Bjorklund, F., Bolin, A., Boppana, A., Boulter, L., Browne, K., Brusko, M. A., Burger, A., Campbell-Thompson, M., Cao-Berg, I., Caron, A. R., Carroll, M., Chadwick, C., Chen, H., Chen, L., de Bono, B., Deutsch, G., Ding, S., Donahue, S., El-Achkar, T. M., Eskaros, A., Falo, L. J., Farrow, M., Ferkowicz, M. J., Fisher, S. A., Gee, J. C., Germain, R. N., Ginda, M., Ginty, F., Gitomer, S. A., Goldstone, M. B., Gustilo, K. S., Hagood, J. S., Halushka, M. K., Haniffa, M. A., Hanna, P., Hardi, J., He, Y. O., Honick, B. J., Houghton, D., Itkin, M., Jain, S., Jardine, L., Jiang, Z. G., Ju, Y., Karunamurthy, A., Kelleher, N. L., Kendall, T. J., Kruse, A. R., Laronda, M. M., Laurent, L. C., Laurenti, E., Lee, S., Lein, E., Li, C., Li, Z., Lin, S., Lin, Y., Lindsay, S. A., Longacre, T. A., Lundberg, E., Maier, L., Malhotra, R., Martinez Casals, A., Masci, A. M., Mathews, C. E., McDonough, E., McLaughlin, J. A., Menon, R., Menon, V., Miller, J. A., Morgan, R., Muller, W., Murphy, R. F., Musen, M. A., Nakshatri, H., Nawijn, M. C., Neumann, E. K., Nigra, D. J., O'Neill, K., Parast, M. M., Patel, U., Pei, L., Phatnani, H., Phillips, G. A., Pouch, A. M., Powers, A. C., Puerto, J. F., Puig-Barbe, A., Quardokus, E. M., Radtke, A. J., Rajbhandari, P., Record, E. G., Roberts, D. J., Ropelewski, A. J., Rowe, D., Ruschman, N. L., Saunders, D. C., Scheuermann, R. H., Schey, K. L., Schilling, B., Schlehlein, H., Schwenk, M., Scibek, R., Seifert, R. P., Shirey, B., Shivkumar, K., Siletti, K., Simmons, J. A., Singhal, D., Snyder, M., Spraggins, J. M., Stanley, V., Strand, D. W., Sunshine, J. C., Surrette, C., Suzuki, A., Tata, P. R., Taylor, D. M., Theriault, T., Theriault, T., Thomas, J. E., Tsui, E. L., Uranic, J., Valerius, M. T., Van Valen, D., Vezina, C. M., Vlachos, I. S., Wang, F., Wang, X., Wasserfall, C. H., Welling, J. S., Werlein, C., Winfree, S., Wright, D. M., Yao, L., Yuan, Z., Zhang, T. 2025

Abstract

The Human BioMolecular Atlas Program (HuBMAP) aims to construct a 3D Human Reference Atlas (HRA) of the healthy adult body. Experts from 20+ consortia collaborate to develop a Common Coordinate Framework (CCF), knowledge graphs and tools that describe the multiscale structure of the human body (from organs and tissues down to cells, genes and biomarkers) and to use the HRA to characterize changes that occur with aging, disease and other perturbations. HRA v.2.0 covers 4,499 unique anatomical structures, 1,195 cell types and 2,089 biomarkers (such as genes, proteins and lipids) from 33 ASCT+B tables and 65 3D Reference Objects linked to ontologies. New experimental data can be mapped into the HRA using (1) cell type annotation tools (for example, Azimuth), (2) validated antibody panels or (3) by registering tissue data spatially. This paper describes HRA user stories, terminology, data formats, ontology validation, unified analysis workflows, user interfaces, instructional materials, application programming interfaces, flexible hybrid cloud infrastructure and previews atlas usage applications.

View details for DOI 10.1038/s41592-024-02563-5

View details for PubMedID 40082611
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research. Burgess, J., Nirschl, J. J., Bravo-Sanchez, L., Lozano, A., Gupte, S., Galaz-Montoya, J. G., Zhang, Y., Su, Y., Bhowmik, D., Coman, Z., Hasan, S. M., Johannesson, A., Leineweber, W. D., Nair, M. G., Yarlagadda, R., Zuraski, C., Chiu, W., Cohen, S., Hansen, J. N., Leonetti, M. D., Liu, C., Lundberg, E., Yeung-Levy, S., IEEE COMPUTER SOC. 2025: 19552-19564

View details for DOI 10.1109/CVPR52734.2025.01821

View details for Web of Science ID 001601158200144
A perturbation cell atlas of human induced pluripotent stem cells. bioRxiv : the preprint server for biology Nourreddine, S., Doctor, Y., Dailamy, A., Forget, A., Lee, Y. H., Chinn, B., Khaliq, H., Polacco, B., Muralidharan, M., Pan, E., Zhang, Y., Sigaeva, A., Hansen, J. N., Gao, J., Parker, J. A., Obernier, K., Clark, T., Chen, J. Y., Metallo, C., Lundberg, E., Ideker, T., Krogan, N., Mali, P. 2024

Abstract

Towards comprehensively investigating the genotype-phenotype relationships governing the human pluripotent stem cell state, we generated an expressed genome-scale CRISPRi Perturbation Cell Atlas in KOLF2.1J human induced pluripotent stem cells (hiPSCs) mapping transcriptional and fitness phenotypes associated with 11,739 targeted genes. Using the transcriptional phenotypes, we created a minimum distortion embedding map of the pluripotent state, demonstrating rich recapitulation of protein complexes, such as strong co-clustering of MRPL, BAF, SAGA, and Ragulator family members. Additionally, we uncovered transcriptional regulators that are uncoupled from cell fitness, discovering potential novel pluripotency (JOSD1, RNF7) and metabolic factors (ZBTB41). We validated these findings via phenotypic, protein-interaction, and metabolic tracing assays. Finally, we propose a contrastive human-cell engineering framework (CHEF), a machine learning architecture that learns from perturbation cell atlases to predict perturbation recipes that achieve desired transcriptional states. Taken together, our study presents a comprehensive resource for interrogating the regulatory networks governing pluripotency.

View details for DOI 10.1101/2024.11.03.621734

View details for PubMedID 39574586

View details for PubMedCentralID PMC11580897
High-parametric protein maps reveal the spatial organization in early-developing human lung. Nature communications Sariyar, S., Sountoulidis, A., Hansen, J. N., Marco Salas, S., Mardamshina, M., Martinez Casals, A., Ballllosera Navarro, F., Andrusivova, Z., Li, X., Czarnewski, P., Lundeberg, J., Linnarsson, S., Nilsson, M., Sundström, E., Samakovlis, C., Lundberg, E., Ayoglu, B. 2024; 15 (1): 9381

Abstract

The respiratory system, including the lungs, is essential for terrestrial life. While recent research has advanced our understanding of lung development, much still relies on animal models and transcriptome analyses. In this study conducted within the Human Developmental Cell Atlas (HDCA) initiative, we describe the protein-level spatiotemporal organization of the lung during the first trimester of human gestation. Using high-parametric tissue imaging with a 30-plex antibody panel, we analyzed human lung samples from 6 to 13 post-conception weeks, generating data from over 2 million cells across five developmental timepoints. We present a resource detailing spatially resolved cell type composition of the developing human lung, including proliferative states, immune cell patterns, spatial arrangement traits, and their temporal evolution. This represents an extensive single-cell resolved protein-level examination of the developing human lung and provides a valuable resource for further research into the developmental roots of human respiratory health and disease.

View details for DOI 10.1038/s41467-024-53752-x

View details for PubMedID 39477961

View details for PubMedCentralID PMC11525936
How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities. ArXiv Bunne, C., Roohani, Y., Rosen, Y., Gupta, A., Zhang, X., Roed, M., Alexandrov, T., AlQuraishi, M., Brennan, P., Burkhardt, D. B., Califano, A., Cool, J., Dernburg, A. F., Ewing, K., Fox, E. B., Haury, M., Herr, A. E., Horvitz, E., Hsu, P. D., Jain, V., Johnson, G. R., Kalil, T., Kelley, D. R., Kelley, S. O., Kreshuk, A., Mitchison, T., Otte, S., Shendure, J., Sofroniew, N. J., Theis, F., Theodoris, C. V., Upadhyayula, S., Valer, M., Wang, B., Xing, E., Yeung-Levy, S., Zitnik, M., Karaletsos, T., Regev, A., Lundberg, E., Leskovec, J., Quake, S. R. 2024

Abstract

The cell is arguably the most fundamental unit of life and is central to understanding biology. Accurate modeling of cells is important for this understanding as well as for determining the root causes of disease. Recent advances in artificial intelligence (AI), combined with the ability to generate large-scale experimental data, present novel opportunities to model cells. Here we propose a vision of leveraging advances in AI to construct virtual cells, high-fidelity simulations of cells and cellular systems under different conditions that are directly learned from biological data across measurements and scales. We discuss desired capabilities of such AI Virtual Cells, including generating universal representations of biological entities across scales, and facilitating interpretable in silico experiments to predict and understand their behavior using Virtual Instruments. We further address the challenges, opportunities and requirements to realize this vision including data needs, evaluation strategies, and community standards and engagement to ensure biological accuracy and broad utility. We envision a future where AI Virtual Cells help identify new drug targets, predict cellular responses to perturbations, as well as scale hypothesis exploration. With open science collaborations across the biomedical ecosystem that includes academia, philanthropy, and the biopharma and AI industries, a comprehensive predictive understanding of cell mechanisms and interactions has come into reach.

View details for PubMedID 39398201

View details for PubMedCentralID PMC11468656
Flexible and robust cell type annotation for highly multiplexed tissue images. bioRxiv : the preprint server for biology Sun, H., Yu, S., Casals, A. M., Bäckström, A., Lu, Y., Lindskog, C., Lundberg, E., Murphy, R. F. 2024

Abstract

Identifying cell types in highly multiplexed images is essential for understanding tissue spatial organization. Current cell type annotation methods often rely on extensive reference images and manual adjustments. In this work, we present a tool, Robust Image-Based Cell Annotator (RIBCA), that enables accurate, automated, unbiased, and fine-grained cell type annotation for images with a wide range of antibody panels, without requiring additional model training or human intervention. Our tool has successfully annotated over 1 million cells, revealing the spatial organization of various cell types across more than 40 different human tissues. It is open-source and features a modular design, allowing for easy extension to additional cell types.

View details for DOI 10.1101/2024.09.12.612510

View details for PubMedID 39345395

View details for PubMedCentralID PMC11429614
Early AI Lifecycle Co-Reasoning: Ethics Through Integrated and Diverse Team Science. The American journal of bioethics : AJOB Pacia, D. M., Ravitsky, V., Hansen, J. N., Lundberg, E., Schulz, W., Bélisle-Pipon, J. C. 2024; 24 (9): 86-88

View details for DOI 10.1080/15265161.2024.2377106

View details for PubMedID 39226006
Open-source, high-throughput targeted in-situ transcriptomics for developmental and tissue biology. Development (Cambridge, England) Lee, H., Mattsson Langseth, C., Marco Salas, S., Sariyar, S., Metousis, A., Rueda Alana, E., Bekiari, C., Lundberg, E., Garcia-Moreno, F., Grillo, M., Nilsson, M. 2024

Abstract

Multiplexed spatial profiling of mRNAs has recently gained traction as a tool to explore the cellular diversity and the architecture of tissues. We propose a sensitive, open-source, simple and flexible method for the generation of in-situ expression maps of hundreds of genes. We exploit direct ligation of padlock probes on mRNAs, coupled with rolling circle amplification and hybridization-based in situ combinatorial barcoding, to achieve high detection efficiency, high throughput and large multiplexing. We validate the method across a number of species, and show its use in combination with orthogonal methods such as antibody staining, highlighting its potential value for developmental and tissue biology studies. Finally, we provide an end-to-end computational workflow that covers the steps of probe design, image processing, data extraction, cell segmentation, clustering and annotation of cell types. By enabling easier access to high-throughput spatially resolved transcriptomics, we hope to encourage a diversity of applications and the exploration of a wide range of biological questions.

View details for DOI 10.1242/dev.202448

View details for PubMedID 39099456
Deconwolf enables high-performance deconvolution of widefield fluorescence microscopy images. Nature methods Wernersson, E., Gelali, E., Girelli, G., Wang, S., Castillo, D., Mattsson Langseth, C., Verron, Q., Nguyen, H. Q., Chattoraj, S., Martinez Casals, A., Blom, H., Lundberg, E., Nilsson, M., Marti-Renom, M. A., Wu, C., Crosetto, N., Bienko, M. 2024

Abstract

Microscopy-based spatially resolved omic methods are transforming the life sciences. However, these methods rely on high numerical aperture objectives and cannot resolve crowded molecular targets, limiting the amount of extractable biological information. To overcome these limitations, here we develop Deconwolf, an open-source, user-friendly software for high-performance deconvolution of widefield fluorescence microscopy images, which efficiently runs on laptop computers. Deconwolf enables accurate quantification of crowded diffraction limited fluorescence dots in DNA and RNA fluorescence in situ hybridization images and allows robust detection of individual transcripts in tissue sections imaged with ×20 air objectives. Deconvolution of in situ spatial transcriptomics images with Deconwolf increased the number of transcripts identified more than threefold, while the application of Deconwolf to images obtained by fluorescence in situ sequencing of barcoded Oligopaint probes drastically improved chromosome tracing. Deconwolf greatly facilitates the use of deconvolution in many bioimaging applications.

View details for DOI 10.1038/s41592-024-02294-7

View details for PubMedID 38844629
Decrypting lysine deacetylase inhibitor action and protein modifications by dose-resolved proteomics. Cell reports Chang, Y. C., Gnann, C., Steimbach, R. R., Bayer, F. P., Lechner, S., Sakhteman, A., Abele, M., Zecha, J., Trendel, J., The, M., Lundberg, E., Miller, A. K., Kuster, B. 2024; 43 (6): 114272

Abstract

Lysine deacetylase inhibitors (KDACis) are approved drugs for cutaneous T cell lymphoma (CTCL), peripheral T cell lymphoma (PTCL), and multiple myeloma, but many aspects of their cellular mechanism of action (MoA) and substantial toxicity are not well understood. To shed more light on how KDACis elicit cellular responses, we systematically measured dose-dependent changes in acetylation, phosphorylation, and protein expression in response to 21 clinical and pre-clinical KDACis. The resulting 862,000 dose-response curves revealed, for instance, limited cellular specificity of histone deacetylase (HDAC) 1, 2, 3, and 6 inhibitors; strong cross-talk between acetylation and phosphorylation pathways; localization of most drug-responsive acetylation sites to intrinsically disordered regions (IDRs); an underappreciated role of acetylation in protein structure; and a shift in EP300 protein abundance between the cytoplasm and the nucleus. This comprehensive dataset serves as a resource for the investigation of the molecular mechanisms underlying KDACi action in cells and can be interactively explored online in ProteomicsDB.

View details for DOI 10.1016/j.celrep.2024.114272

View details for PubMedID 38795348
Cell Maps for Artificial Intelligence: AI-Ready Maps of Human Cell Architecture from Disease-Relevant Cell Lines. bioRxiv : the preprint server for biology Clark, T., Mohan, J., Schaffer, L., Obernier, K., Al Manir, S., Churas, C. P., Dailamy, A., Doctor, Y., Forget, A., Hansen, J. N., Hu, M., Lenkiewicz, J., Levinson, M. A., Marquez, C., Nourreddine, S., Niestroy, J., Pratt, D., Qian, G., Thaker, S., Bélisle-Pipon, J. C., Brandt, C., Chen, J., Ding, Y., Fodeh, S., Krogan, N., Lundberg, E., Mali, P., Payne-Foster, P., Ratcliffe, S., Ravitsky, V., Sali, A., Schulz, W., Ideker, T. 2024

Abstract

This article describes the Cell Maps for Artificial Intelligence (CM4AI) project and its goals, methods, standards, current datasets, software tools, status, and future directions. CM4AI is the Functional Genomics Data Generation Project in the U.S. National Institutes of Health's (NIH) Bridge2AI program. Its overarching mission is to produce ethical, AI-ready datasets of cell architecture, inferred from multimodal data collected for human cell lines, to enable transformative biomedical AI research.

View details for DOI 10.1101/2024.05.21.589311

View details for PubMedID 38826258

View details for PubMedCentralID PMC11142054
Mapping the Multiscale Proteomic Organization of Cellular and Disease Phenotypes. Annual review of biomedical data science Cesnik, A., Schaffer, L. V., Gaur, I., Jain, M., Ideker, T., Lundberg, E. 2024

Abstract

While the primary sequences of human proteins have been cataloged for over a decade, determining how these are organized into a dynamic collection of multiprotein assemblies, with structures and functions spanning biological scales, is an ongoing venture. Systematic and data-driven analyses of these higher-order structures are emerging, facilitating the discovery and understanding of cellular phenotypes. At present, knowledge of protein localization and function has been primarily derived from manual annotation and curation in resources such as the Gene Ontology, which are biased toward richly annotated genes in the literature. Here, we envision a future powered by data-driven mapping of protein assemblies. These maps can capture and decode cellular functions through the integration of protein expression, localization, and interaction data across length scales and timescales. In this review, we focus on progress toward constructing integrated cell maps that accelerate the life sciences and translational research.

View details for DOI 10.1146/annurev-biodatasci-102423-113534

View details for PubMedID 38748859
Bento: a toolkit for subcellular analysis of spatial transcriptomics data. Genome biology Mah, C. K., Ahmed, N., Lopez, N. A., Lam, D. C., Pong, A., Monell, A., Kern, C., Han, Y., Prasad, G., Cesnik, A. J., Lundberg, E., Zhu, Q., Carter, H., Yeo, G. W. 2024; 25 (1): 82

Abstract

The spatial organization of molecules in a cell is essential for their functions. While current methods focus on discerning tissue architecture, cell-cell interactions, and spatial expression patterns, they are limited to the multicellular scale. We present Bento, a Python toolkit that takes advantage of single-molecule information to enable spatial analysis at the subcellular scale. Bento ingests molecular coordinates and segmentation boundaries to perform three analyses: defining subcellular domains, annotating localization patterns, and quantifying gene-gene colocalization. We demonstrate Bento on MERFISH, seqFISH+, Molecular Cartography, and Xenium datasets. Bento is part of the open-source Scverse ecosystem, enabling integration with other single-cell analysis tools.

View details for DOI 10.1186/s13059-024-03217-7

View details for PubMedID 38566187
Macromolecular condensation organizes nucleolar sub-phases to set up a pH gradient. Cell King, M. R., Ruff, K. M., Lin, A. Z., Pant, A., Farag, M., Lalmansingh, J. M., Wu, T., Fossat, M. J., Ouyang, W., Lew, M. D., Lundberg, E., Vahey, M. D., Pappu, R. V. 2024

Abstract

Nucleoli are multicomponent condensates defined by coexisting sub-phases. We identified distinct intrinsically disordered regions (IDRs), including acidic (D/E) tracts and K-blocks interspersed by E-rich regions, as defining features of nucleolar proteins. We show that the localization preferences of nucleolar proteins are determined by their IDRs and the types of RNA or DNA binding domains they encompass. In vitro reconstitutions and studies in cells showed how condensation, which combines binding and complex coacervation of nucleolar components, contributes to nucleolar organization. D/E tracts of nucleolar proteins contribute to lowering the pH of co-condensates formed with nucleolar RNAs in vitro. In cells, this sets up a pH gradient between nucleoli and the nucleoplasm. By contrast, juxta-nucleolar bodies, which have different macromolecular compositions, featuring protein IDRs with very different charge profiles, have pH values that are equivalent to or higher than the nucleoplasm. Our findings show that distinct compositional specificities generate distinct physicochemical properties for condensates.

View details for DOI 10.1016/j.cell.2024.02.029

View details for PubMedID 38503281
Harmonizing the Generation and Pre-publication Stewardship of FAIR Image data. ArXiv Bialy, N., Alber, F., Andrews, B., Angelo, M., Beliveau, B., Bintu, L., Boettiger, A., Boehm, U., Brown, C. M., Maina, M. B., Chambers, J. J., Cimini, B. A., Eliceiri, K., Errington, R., Faklaris, O., Gaudreault, N., Germain, R. N., Goscinski, W., Grunwald, D., Halter, M., Hanein, D., Hickey, J. W., Lacoste, J., Laude, A., Lundberg, E., Ma, J., Malacrida, L., Moore, J., Nelson, G., Neumann, E. K., Nitschke, R., Onami, S., Pimentel, J. A., Plant, A. L., Radtke, A. J., Sabata, B., Schapiro, D., Schöneberg, J., Spraggins, J. M., Sudar, D., Adrien Maria Vierdag, W. M., Volkmann, N., Wählby, C., Wang, S. S., Yaniv, Z., Strambio-De-Castillia, C. 2024

Abstract

Together with the molecular knowledge of genes and proteins, biological images promise to significantly enhance the scientific understanding of complex cellular systems and to advance predictive and personalized therapeutic products for human health. For this potential to be realized, quality-assured image data must be shared among labs at a global scale to be compared, pooled, and reanalyzed, thus unleashing untold potential beyond the original purpose for which the data was generated. There are two broad sets of requirements to enable image data sharing in the life sciences. One set of requirements is articulated in the companion White Paper entitled "Enabling Global Image Data Sharing in the Life Sciences," which is published in parallel and addresses the need to build the cyberinfrastructure for sharing the digital array data (arXiv:2401.13023 [q-bio.OT], https://doi.org/10.48550/arXiv.2401.13023). In this White Paper, we detail a broad set of requirements, which involves collecting, managing, presenting, and propagating contextual information essential to assess the quality, understand the content, interpret the scientific implications, and reuse image data in the context of the experimental details. We start by providing an overview of the main lessons learned to date through international community activities, which have recently made considerable progress toward generating community standard practices for imaging Quality Control (QC) and metadata. We then provide a clear set of recommendations for amplifying this work. The driving goal is to address remaining challenges, and democratize access to common practices and tools for a spectrum of biomedical researchers, regardless of their expertise, access to resources, and geographical location.

View details for DOI 10.1242/jcs.254151

View details for PubMedID 38351940

View details for PubMedCentralID PMC10862930
Xist ribonucleoproteins promote female sex-biased autoimmunity. Cell Dou, D. R., Zhao, Y., Belk, J. A., Zhao, Y., Casey, K. M., Chen, D. C., Li, R., Yu, B., Srinivasan, S., Abe, B. T., Kraft, K., Hellström, C., Sjöberg, R., Chang, S., Feng, A., Goldman, D. W., Shah, A. A., Petri, M., Chung, L. S., Fiorentino, D. F., Lundberg, E. K., Wutz, A., Utz, P. J., Chang, H. Y. 2024; 187 (3): 733-749.e16

Abstract

Autoimmune diseases disproportionately affect females more than males. The XX sex chromosome complement is strongly associated with susceptibility to autoimmunity. Xist long non-coding RNA (lncRNA) is expressed only in females to randomly inactivate one of the two X chromosomes to achieve gene dosage compensation. Here, we show that the Xist ribonucleoprotein (RNP) complex comprising numerous autoantigenic components is an important driver of sex-biased autoimmunity. Inducible transgenic expression of a non-silencing form of Xist in male mice introduced Xist RNP complexes and sufficed to produce autoantibodies. Male SJL/J mice expressing transgenic Xist developed more severe multi-organ pathology in a pristane-induced lupus model than wild-type males. Xist expression in males reprogrammed T and B cell populations and chromatin states to more resemble wild-type females. Human patients with autoimmune diseases displayed significant autoantibodies to multiple components of XIST RNP. Thus, a sex-specific lncRNA scaffolds ubiquitous RNP components to drive sex-biased immunity.

View details for DOI 10.1016/j.cell.2023.12.037

View details for PubMedID 38306984
Tools for assembling the cell: Towards the era of cell structural bioinformatics Hu, M., Zhang, X., Latham, A., Sali, A., Ideker, T., Lundberg, E. edited by Hunter, L., Altman, R. B., Ritchie, M. D., Murray, T., Klein, T. E. WORLD SCIENTIFIC PUBL CO PTE LTD. 2024: 661-665

Abstract

Cells consist of large components, such as organelles, that recursively factor into smaller systems, such as condensates and protein complexes, forming a dynamic multi-scale structure of the cell. Recent technological innovations have paved the way for systematic interrogation of subcellular structures, yielding unprecedented insights into their roles and interactions. In this workshop, we discuss progress, challenges, and collaboration to marshal various computational approaches toward assembling an integrated structural map of the human cell.

View details for Web of Science ID 001258333100051

View details for PubMedID 38160316
Single Cell Spatial Biology for Precision Cancer Medicine. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing Gentles, A. J., Nirmal, A. J., Heiser, L. M., Lundberg, E., Newman, A. M. 2023; 28: 549-553

Abstract

In cancer, complex ecosystems of interacting cell types play fundamental roles in tumor development, progression, and response to therapy. However, the cellular organization, community structure, and spatially defined microenvironments of human tumors remain poorly understood. With the emergence of new technologies for high-throughput spatial profiling of complex tissue specimens, it is now possible to identify clinically significant spatial features with high granularity. In this PSB workshop, we will highlight recent advances in this area and explore how single cell spatial profiling can advance precision cancer medicine.

View details for PubMedID 36541010
Xist Ribonucleoproteins Promote Female Sex-biased Autoimmunity Dou, D., Zhao, Y., Belk, J., Zhao, Y., Casey, K., Chen, D., Li, R., Yu, B., Srinivasan, S., Abe, B., Kraft, K., Hellstroem, C., Sjoeberg, R., Chang, S., Feng, A., Goldman, D., Shah, A., Petri, M., Chung, L., Fiorentino, D., Lundberg, E., Wutz, A., Utz, P., Chang, H. WILEY. 2023: 25-26

View details for Web of Science ID 001190014300018
Segmenting functional tissue units across human organs using community-driven development of generalizable machine learning algorithms. Nature communications Jain, Y., Godwin, L. L., Joshi, S., Mandarapu, S., Le, T., Lindskog, C., Lundberg, E., Börner, K. 2023; 14 (1): 4656

Abstract

The development of a reference atlas of the healthy human body requires automated image segmentation of major anatomical structures across multiple organs based on spatial bioimages generated from various sources with differences in sample preparation. We present the setup and results of the Hacking the Human Body machine learning algorithm development competition hosted by the Human Biomolecular Atlas (HuBMAP) and the Human Protein Atlas (HPA) teams on the Kaggle platform. We create a dataset containing 880 histology images with 12,901 segmented structures, engaging 1175 teams from 78 countries in community-driven, open-science development of machine learning models. Tissue variations in the dataset pose a major challenge to the teams which they overcome by using color normalization techniques and combining vision transformers with convolutional models. The best model will be productized in the HuBMAP portal to process tissue image datasets at scale in support of Human Reference Atlas construction.

View details for DOI 10.1038/s41467-023-40291-0

View details for PubMedID 37537179

View details for PubMedCentralID PMC10079270
Organ Mapping Antibody Panels: a community resource for standardized multiplexed tissue imaging. Nature methods Quardokus, E. M., Saunders, D. C., McDonough, E., Hickey, J. W., Werlein, C., Surrette, C., Rajbhandari, P., Casals, A. M., Tian, H., Lowery, L., Neumann, E. K., Björklund, F., Neelakantan, T. V., Croteau, J., Wiblin, A. E., Fisher, J., Livengood, A. J., Dowell, K. G., Silverstein, J. C., Spraggins, J. M., Pryhuber, G. S., Deutsch, G., Ginty, F., Nolan, G. P., Melov, S., Jonigk, D., Caldwell, M. A., Vlachos, I. S., Muller, W., Gehlenborg, N., Stockwell, B. R., Lundberg, E., Snyder, M. P., Germain, R. N., Camarillo, J. M., Kelleher, N. L., Börner, K., Radtke, A. J. 2023

Abstract

Multiplexed antibody-based imaging enables the detailed characterization of molecular and cellular organization in tissues. Advances in the field now allow high-parameter data collection (>60 targets); however, considerable expertise and capital are needed to construct the antibody panels employed by these methods. Organ mapping antibody panels are community-validated resources that save time and money, increase reproducibility, accelerate discovery and support the construction of a Human Reference Atlas.

View details for DOI 10.1038/s41592-023-01846-7

View details for PubMedID 37468619

View details for PubMedCentralID PMC10335836
Building the next generation of virtual cells to understand cellular biology. Biophysical journal Johnson, G. T., Agmon, E., Akamatsu, M., Lundberg, E., Lyons, B., Ouyang, W., Quintero-Carmona, O. A., Rafelski, S., Horwitz, R. 2023

Abstract

Cell science has made significant progress by focusing on understanding individual cellular processes through reductionist approaches. However, the sheer volume of knowledge collected presents challenges in integrating this information across different scales of space and time to comprehend cellular behaviors, as well as making the data and methods more accessible for the community to tackle complex biological questions. This Perspective proposes the creation of next-generation virtual cells, which are dynamic 3D models that integrate information from diverse sources, including simulations, biophysical models, image-based models, and evidence-based knowledge graphs. These virtual cells would provide statistically accurate and holistic views of real cells, bridging the gap between theoretical concepts and experimental data, and facilitating productive new collaborations among researchers across related fields.

View details for DOI 10.1016/j.bpj.2023.04.006

View details for PubMedID 37050874
A topographic atlas defines developmental origins of cell heterogeneity in the human embryonic lung. Nature cell biology Sountoulidis, A., Marco Salas, S., Braun, E., Avenel, C., Bergenstråhle, J., Theelke, J., Vicari, M., Czarnewski, P., Liontos, A., Abalo, X., Andrusivová, Ž., Mirzazadeh, R., Asp, M., Li, X., Hu, L., Sariyar, S., Martinez Casals, A., Ayoglu, B., Firsova, A., Michaëlsson, J., Lundberg, E., Wählby, C., Sundström, E., Linnarsson, S., Lundeberg, J., Nilsson, M., Samakovlis, C. 2023

Abstract

The lung contains numerous specialized cell types with distinct roles in tissue function and integrity. To clarify the origins and mechanisms generating cell heterogeneity, we created a comprehensive topographic atlas of early human lung development. Here we report 83 cell states and several spatially resolved developmental trajectories and predict cell interactions within defined tissue niches. We integrated single-cell RNA sequencing and spatially resolved transcriptomics into a web-based, open platform for interactive exploration. We show distinct gene expression programmes, accompanying sequential events of cell differentiation and maturation of the secretory and neuroendocrine cell types in proximal epithelium. We define the origin of airway fibroblasts associated with airway smooth muscle in bronchovascular bundles and describe a trajectory of Schwann cell progenitors to intrinsic parasympathetic neurons controlling bronchoconstriction. Our atlas provides a rich resource for further research and a reference for defining deviations from homeostatic and repair mechanisms leading to pulmonary diseases.

View details for DOI 10.1038/s41556-022-01064-x

View details for PubMedID 36646791
Segmenting functional tissue units across human organs using community-driven development of generalizable machine learning algorithms. bioRxiv : the preprint server for biology Jain, Y., Godwin, L. L., Joshi, S., Mandarapu, S., Le, T., Lindskog, C., Lundberg, E., Borner, K. 2023

Abstract

The development of a reference atlas of the healthy human body requires automated image segmentation of major anatomical structures across multiple organs based on spatial bioimages generated from various sources with differences in sample preparation. We present the setup and results of the "Hacking the Human Body" machine learning algorithm development competition hosted by the Human Biomolecular Atlas (HuBMAP) and the Human Protein Atlas (HPA) teams on the Kaggle platform. We showcase how 1,175 teams from 78 countries engaged in community-driven, open-science code development that resulted in machine learning models which successfully segment anatomical structures across five organs using histology images from two consortia and that will be productized in the HuBMAP data portal to process large datasets at scale in support of Human Reference Atlas construction. We discuss the benchmark data created for the competition, major challenges faced by the participants, and the winning models and strategies.

View details for DOI 10.1101/2023.01.05.522764

View details for PubMedID 36711953
Analysis of the Human Protein Atlas Weakly Supervised Single-Cell Classification competition. Nature methods Le, T., Winsnes, C. F., Axelsson, U., Xu, H., Mohanakrishnan Kaimal, J., Mahdessian, D., Dai, S., Makarov, I. S., Ostankovich, V., Xu, Y., Benhamou, E., Henkel, C., Solovyev, R. A., Banić, N., Bošnjak, V., Bošnjak, A., Miličević, A., Ouyang, W., Lundberg, E. 2022

Abstract

While spatial proteomics by fluorescence imaging has quickly become an essential discovery tool for researchers, fast and scalable methods to classify and embed single-cell protein distributions in such images are lacking. Here, we present the design and analysis of the results from the competition Human Protein Atlas - Single-Cell Classification hosted on the Kaggle platform. This represents a crowd-sourced competition to develop machine learning models trained on limited annotations to label single-cell protein patterns in fluorescent images. The particular challenges of this competition include class imbalance, weak labels and multi-label classification, prompting competitors to apply a wide range of approaches in their solutions. The winning models serve as the first subcellular omics tools that can annotate single-cell locations, extract single-cell features and capture cellular dynamics.

View details for DOI 10.1038/s41592-022-01606-z

View details for PubMedID 36175767
Imaging cell biology. Nature cell biology Andrews, B., Chang, J. B., Collinson, L., Li, D., Lundberg, E., Mahamid, J., Manley, S., Mhlanga, M., Nakano, A., Schöneberg, J., Van Valen, D., Wu, T., Zaritsky, A. 2022

View details for DOI 10.1038/s41556-022-00960-6

View details for PubMedID 35896733
The emerging landscape of spatial profiling technologies. Nature reviews. Genetics Moffitt, J. R., Lundberg, E., Heyn, H. 2022

Abstract

Improved scale, multiplexing and resolution are establishing spatial nucleic acid and protein profiling methods as a major pillar for cellular atlas building of complex samples, from tissues to full organisms. Emerging methods yield omics measurements at resolutions covering the nano- to microscale, enabling the charting of cellular heterogeneity, complex tissue architectures and dynamic changes during development and disease. We present an overview of the developing landscape of in situ spatial genome, transcriptome and proteome technologies, exemplify their impact on cell biology and translational research, and discuss current challenges for their community-wide adoption. Among many transformative applications, we envision that spatial methods will map entire organs and enable next-generation pathology.

View details for DOI 10.1038/s41576-022-00515-3

View details for PubMedID 35859028
New views of old proteins: clarifying the enigmatic proteome. Molecular & cellular proteomics : MCP Burnum-Johnson, K. E., Conrads, T. P., Drake, R. R., Herr, A. E., Iyengar, R., Kelly, R. T., Lundberg, E., MacCoss, M. J., Naba, A., Nolan, G. P., Pevzner, P. A., Rodland, K. D., Sechi, S., Slavov, N., Spraggins, J. M., Van Eyk, J. E., Vidal, M., Vogel, C., Walt, D. R., Kelleher, N. L. 2022: 100254

Abstract

All human diseases involve proteins, yet our current tools to characterize and quantify them are limited. To better elucidate proteins across space, time, and molecular composition, we provide a >10 year projection for technologies to meet the challenges that protein biology presents. With a broad perspective, we discuss grand opportunities to transition the science of proteomics into a more propulsive enterprise. Extrapolating recent trends, we describe a next generation of approaches to define, quantify and visualize the multiple dimensions of the proteome, thereby transforming our understanding and interactions with human disease in the coming decade.

View details for DOI 10.1016/j.mcpro.2022.100254

View details for PubMedID 35654359
Deep Visual Proteomics defines single-cell identity and heterogeneity. Nature biotechnology Mund, A., Coscia, F., Kriston, A., Hollandi, R., Kovacs, F., Brunner, A., Migh, E., Schweizer, L., Santos, A., Bzorek, M., Naimy, S., Rahbek-Gjerdrum, L. M., Dyring-Andersen, B., Bulkescher, J., Lukas, C., Eckert, M. A., Lengyel, E., Gnann, C., Lundberg, E., Horvath, P., Mann, M. 2022

Abstract

Despite the availability of imaging-based and mass-spectrometry-based methods for spatial proteomics, a key challenge remains connecting images with single-cell-resolution protein abundance measurements. Here, we introduce Deep Visual Proteomics (DVP), which combines artificial-intelligence-driven image analysis of cellular phenotypes with automated single-cell or single-nucleus laser microdissection and ultra-high-sensitivity mass spectrometry. DVP links protein abundance to complex cellular or subcellular phenotypes while preserving spatial context. By individually excising nuclei from cell culture, we classified distinct cell states with proteomic profiles defined by known and uncharacterized proteins. In an archived primary melanoma tissue, DVP identified spatially resolved proteome changes as normal melanocytes transition to fully invasive melanoma, revealing pathways that change in a spatial manner as cancer progresses, such as mRNA splicing dysregulation in metastatic vertical growth that coincides with reduced interferon signaling and antigen presentation. The ability of DVP to retain precise spatial proteomic information in the tissue context has implications for the molecular profiling of clinical samples.

View details for DOI 10.1038/s41587-022-01302-5

View details for PubMedID 35590073
Understudied proteins: opportunities and challenges for functional proteomics. Nature methods Kustatscher, G., Collins, T., Gingras, A., Guo, T., Hermjakob, H., Ideker, T., Lilley, K. S., Lundberg, E., Marcotte, E. M., Ralser, M., Rappsilber, J. 2022

View details for DOI 10.1038/s41592-022-01454-x

View details for PubMedID 35534633
An open invitation to the Understudied Proteins Initiative. Nature biotechnology Kustatscher, G., Collins, T., Gingras, A., Guo, T., Hermjakob, H., Ideker, T., Lilley, K. S., Lundberg, E., Marcotte, E. M., Ralser, M., Rappsilber, J. 2022

View details for DOI 10.1038/s41587-022-01316-z

View details for PubMedID 35534555
The Blood Proteoform Atlas: A reference map of proteoforms in human hematopoietic cells. Science (New York, N.Y.) Melani, R. D., Gerbasi, V. R., Anderson, L. C., Sikora, J. W., Toby, T. K., Hutton, J. E., Butcher, D. S., Negrao, F., Seckler, H. S., Srzentic, K., Fornelli, L., Camarillo, J. M., LeDuc, R. D., Cesnik, A. J., Lundberg, E., Greer, J. B., Fellers, R. T., Robey, M. T., DeHart, C. J., Forte, E., Hendrickson, C. L., Abbatiello, S. E., Thomas, P. M., Kokaji, A. I., Levitsky, J., Kelleher, N. L. 2022; 375 (6579): 411-418

Abstract

Human biology is tightly linked to proteins, yet most measurements do not precisely determine alternatively spliced sequences or posttranslational modifications. Here, we present the primary structures of ~30,000 unique proteoforms, nearly 10 times more than in previous studies, expressed from 1690 human genes across 21 cell types and plasma from human blood and bone marrow. The results, compiled in the Blood Proteoform Atlas (BPA), indicate that proteoforms better describe protein-level biology and are more specific indicators of differentiation than their corresponding proteins, which are more broadly expressed across cell types. We demonstrate the potential for clinical application, by interrogating the BPA in the context of liver transplantation and identifying cell and proteoform signatures that distinguish normal graft function from acute rejection and other causes of graft dysfunction.

View details for DOI 10.1126/science.aaz5284

View details for PubMedID 35084980
The new era of quantitative cell imaging-challenges and opportunities. Molecular cell Bagheri, N., Carpenter, A. E., Lundberg, E., Plant, A. L., Horwitz, R. 2022; 82 (2): 241-247

Abstract

Quantitative optical microscopy, an emerging, transformative approach to single-cell biology, has seen dramatic methodological advancements over the past few years. However, its impact has been hampered by challenges in the areas of data generation, management, and analysis. Here we outline these technical and cultural challenges and provide our perspective on the trajectory of this field, ushering in a new era of quantitative, data-driven microscopy. We also contrast it to the three decades of enormous advances in the field of genomics that have significantly enhanced the reproducibility and wider adoption of a plethora of genomic approaches.

View details for DOI 10.1016/j.molcel.2021.12.024

View details for PubMedID 35063094
Spatial mapping of protein composition and tissue organization: a primer for multiplexed antibody-based imaging. Nature methods Hickey, J. W., Neumann, E. K., Radtke, A. J., Camarillo, J. M., Beuschel, R. T., Albanese, A., McDonough, E., Hatler, J., Wiblin, A. E., Fisher, J., Croteau, J., Small, E. C., Sood, A., Caprioli, R. M., Angelo, R. M., Nolan, G. P., Chung, K., Hewitt, S. M., Germain, R. N., Spraggins, J. M., Lundberg, E., Snyder, M. P., Kelleher, N. L., Saka, S. K. 2021

Abstract

Tissues and organs are composed of distinct cell types that must operate in concert to perform physiological functions. Efforts to create high-dimensional biomarker catalogs of these cells have been largely based on single-cell sequencing approaches, which lack the spatial context required to understand critical cellular communication and correlated structural organization. To probe in situ biology with sufficient depth, several multiplexed protein imaging methods have been recently developed. Though these technologies differ in strategy and mode of immunolabeling and detection tags, they commonly utilize antibodies directed against protein biomarkers to provide detailed spatial and functional maps of complex tissues. As these promising antibody-based multiplexing approaches become more widely adopted, new frameworks and considerations are critical for training future users, generating molecular tools, validating antibody panels, and harmonizing datasets. In this Perspective, we provide essential resources, key considerations for obtaining robust and reproducible imaging data, and specialized knowledge from domain experts and technology developers.

View details for DOI 10.1038/s41592-021-01316-y

View details for PubMedID 34811556
DeepImageJ: A user-friendly environment to run deep learning models in ImageJ. Nature methods Gómez-de-Mariscal, E., García-López-de-Haro, C., Ouyang, W., Donati, L., Lundberg, E., Unser, M., Muñoz-Barrutia, A., Sage, D. 2021; 18 (10): 1192-1195

Abstract

DeepImageJ is a user-friendly solution that enables the generic use of pre-trained deep learning models for biomedical image analysis in ImageJ. The deepImageJ environment gives access to the largest bioimage repository of pre-trained deep learning models (BioImage Model Zoo). Hence, nonexperts can easily perform common image processing tasks in life-science research with deep learning-based tools including pixel and object classification, instance segmentation, denoising or virtual staining. DeepImageJ is compatible with existing state-of-the-art solutions and is equipped with utility tools for developers to include new models. Very recently, several training frameworks have adopted the deepImageJ format to deploy their work in one of the most widely used software packages in the field (ImageJ). Beyond its direct use, we expect deepImageJ to contribute to the broader dissemination and reuse of deep learning models in life sciences applications and bioimage informatics.

View details for DOI 10.1038/s41592-021-01262-9

View details for PubMedID 34594030
A roadmap for the Human Developmental Cell Atlas NATURE Haniffa, M., Taylor, D., Linnarsson, S., Aronow, B. J., Bader, G. D., Barker, R. A., Camara, P. G., Camp, J., Chedotal, A., Copp, A., Etchevers, H. C., Giacobini, P., Gottgens, B., Guo, G., Hupalowska, A., James, K. R., Kirby, E., Kriegstein, A., Lundeberg, J., Marioni, J. C., Meyer, K. B., Niakan, K. K., Nilsson, M., Olabi, B., Pe'er, D., Regev, A., Rood, J., Rozenblatt-Rosen, O., Satija, R., Teichmann, S. A., Treutlein, B., Vento-Tormo, R., Webb, S., Human Cell Atlas Dev Biol Network 2021; 597 (7875): 196-205

Abstract

The Human Developmental Cell Atlas (HDCA) initiative, which is part of the Human Cell Atlas, aims to create a comprehensive reference map of cells during development. This will be critical to understanding normal organogenesis, the effect of mutations, environmental factors and infectious agents on human development, congenital and childhood disorders, and the cellular basis of ageing, cancer and regenerative medicine. Here we outline the HDCA initiative and the challenges of mapping and modelling human development using state-of-the-art technologies to create a reference atlas across gestation. Similar to the Human Genome Project, the HDCA will integrate the output from a growing community of scientists who are mapping human development into a unified atlas. We describe the early milestones that have been achieved and the use of human stem-cell-derived cultures, organoids and animal models to inform the HDCA, especially for prenatal tissues that are hard to acquire. Finally, we provide a roadmap towards a complete atlas of human development.

View details for DOI 10.1038/s41586-021-03620-1

View details for Web of Science ID 000695818300009

View details for PubMedID 34497388

View details for PubMedCentralID 6675628
Which image-based phenotypes are most promising for using AI to understand cellular functions and why? CELL SYSTEMS Lundberg, E., Funke, J., Bakal, C., Uhlmann, V., Gerlich, D., Walter, T., Carpenter, A., Coelho, L. 2021; 12 (5): 384-387

View details for DOI 10.1016/j.cels.2021.04.012

View details for Web of Science ID 000654208900002

View details for PubMedID 34015259
Pycro-Manager: open-source software for customized and reproducible microscope control. Nature methods Pinkard, H., Stuurman, N., Ivanov, I. E., Anthony, N. M., Ouyang, W., Li, B., Yang, B., Tsuchida, M. A., Chhun, B., Zhang, G., Mei, R., Anderson, M., Shepherd, D. P., Hunt-Isaak, I., Dunn, R. L., Jahr, W., Kato, S., Royer, L. A., Thiagarajah, J. R., Eliceiri, K. W., Lundberg, E., Mehta, S. B., Waller, L. 2021

View details for DOI 10.1038/s41592-021-01087-6

View details for PubMedID 33674797
Building a high-quality Human Cell Atlas NATURE BIOTECHNOLOGY Rozenblatt-Rosen, O., Shin, J. W., Rood, J. E., Hupalowska, A., Regev, A., Heyn, H., Human Cell Atlas Stand Technology 2021; 39 (2): 149-153

View details for DOI 10.1038/s41587-020-00812-4

View details for Web of Science ID 000612071100001

View details for PubMedID 33500565
Illuminating Non-genetic Cellular Heterogeneity with Imaging-Based Spatial Proteomics. Trends in cancer Gnann, C., Cesnik, A. J., Lundberg, E. 2021

Abstract

Cellular heterogeneity is an important biological phenomenon observed across space and time in human tissues. Imaging-based spatial proteomic technologies can provide fruitful new readouts of phenotypic states for individual cells at subcellular resolution, which may help unravel the roles of non-genetic cellular heterogeneity in tumorigenesis and drug resistance.

View details for DOI 10.1016/j.trecan.2020.12.006

View details for PubMedID 33436349
Illuminating nongenetic cellular heterogeneity with spatial proteomics Trends in Cancer Gnann, C., Cesnik, A. J., Lundberg, E. 2021; 7 (4): 278-282

Abstract

Cellular heterogeneity is an important biological phenomenon observed across space and time in human tissues. Imaging-based spatial proteomic technologies can provide fruitful new readouts of phenotypic states for individual cells at subcellular resolution, which may help unravel the roles of non-genetic cellular heterogeneity in tumorigenesis and drug resistance.

View details for DOI 10.1016/j.trecan.2020.12.006
LifeTime and improving European healthcare through cell-based interceptive medicine NATURE Rajewsky, N., Almouzni, G., Gorski, S. A., Aerts, S., Amit, I., Bertero, M. G., Bock, C., Bredenoord, A. L., Cavalli, G., Chiocca, S., Clevers, H., De Strooper, B., Eggert, A., Ellenberg, J., Fernandez, X. M., Figlerowicz, M., Gasser, S. M., Hubner, N., Kjems, J., Knoblich, J. A., Krabbe, G., Lichter, P., Linnarsson, S., Marine, J., Marioni, J. C., Marti-Renom, M. A., Netea, M. G., Nickel, D., Nollmann, M., Novak, H. R., Parkinson, H., Piccolo, S., Pinheiro, I., Pombo, A., Popp, C., Reik, W., Roman-Roman, S., Rosenstiel, P., Schultze, J. L., Stegle, O., Tanay, A., Testa, G., Thanos, D., Theis, F. J., Torres-Padilla, M., Valencia, A., Vallot, C., van Oudenaarden, A., Vidal, M., Voet, T., LifeTime Community Working Groups 2020; 587 (7834): 377-386

Abstract

Here we describe the LifeTime Initiative, which aims to track, understand and target human cells during the onset and progression of complex diseases, and to analyse their response to therapy at single-cell resolution. This mission will be implemented through the development, integration and application of single-cell multi-omics and imaging, artificial intelligence and patient-derived experimental disease models during the progression from health to disease. The analysis of large molecular and clinical datasets will identify molecular mechanisms, create predictive computational models of disease progression, and reveal new drug targets and therapies. The timely detection and interception of disease embedded in an ethical and patient-centred vision will be achieved through interactions across academia, hospitals, patient associations, health data management systems and industry. The application of this strategy to key medical challenges in cancer, neurological and neuropsychiatric disorders, and infectious, chronic inflammatory and cardiovascular diseases at the single-cell level will usher in cell-based interceptive medicine in Europe over the next decade.

View details for DOI 10.1038/s41586-020-2715-9

View details for Web of Science ID 000588618000001

View details for PubMedID 32894860

View details for PubMedCentralID PMC7656507
Mapping the nucleolar proteome reveals a spatiotemporal organization related to intrinsic protein disorder MOLECULAR SYSTEMS BIOLOGY Stenstrom, L., Mahdessian, D., Gnann, C., Cesnik, A. J., Ouyang, W., Leonetti, M. D., Uhlen, M., Cuylen-Haering, S., Thul, P. J., Lundberg, E. 2020; 16 (8)

View details for Web of Science ID 000567939700005
Mapping the nucleolar proteome reveals a spatiotemporal organization related to intrinsic protein disorder. Molecular systems biology Stenstrom, L., Mahdessian, D., Gnann, C., Cesnik, A. J., Ouyang, W., Leonetti, M. D., Uhlen, M., Cuylen-Haering, S., Thul, P. J., Lundberg, E. 2020; 16 (8): e9469

Abstract

The nucleolus is essential for ribosome biogenesis and is involved in many other cellular functions. We performed a systematic spatiotemporal dissection of the human nucleolar proteome using confocal microscopy. In total, 1,318 nucleolar proteins were identified; 287 were localized to fibrillar components, and 157 were enriched along the nucleoplasmic border, indicating a potential fourth nucleolar subcompartment: the nucleoli rim. We found 65 nucleolar proteins (36 uncharacterized) to relocate to the chromosomal periphery during mitosis. Interestingly, we observed temporal partitioning into two recruitment phenotypes: early (prometaphase) and late (after metaphase), suggesting phase-specific functions. We further show that the expression of MKI67 is critical for this temporal partitioning. We provide the first proteome-wide analysis of intrinsic protein disorder for the human nucleolus and show that nucleolar proteins in general, and mitotic chromosome proteins in particular, have significantly higher intrinsic disorder level compared to cytosolic proteins. In summary, this study provides a comprehensive and essential resource of spatiotemporal expression data for the nucleolar proteome as part of the Human Protein Atlas.

View details for DOI 10.15252/msb.20209469

View details for PubMedID 32744794
A Sample Preparation Protocol for High Throughput Immunofluorescence of Suspension Cells on an Adherent Surface JOURNAL OF HISTOCHEMISTRY & CYTOCHEMISTRY Backstrom, A., Kugel, L., Gnann, C., Xu, H., Aslan, J. E., Lundberg, E., Stadler, C. 2020; 68 (7): 473-489

Abstract

Imaging is a powerful approach for studying protein expression and has the advantage over other methodologies of providing spatial information in situ at the single-cell level. Using immunofluorescence and confocal microscopy, detailed information on the subcellular distribution of proteins can be obtained. While adherent cells of different tissue origin are relatively easy to prepare for imaging applications, non-adherent cells of hematopoietic origin present a challenge due to their poor attachment to surfaces and subsequent loss of a substantial fraction of the cells. Still, these cell types represent an important part of the human proteome and express genes that are not expressed in adherent cell types. In the era of cell mapping efforts, overcoming the challenge with suspension cells for imaging applications would enable systematic profiling of hematopoietic cells. In this work, we successfully established an immunofluorescence protocol for preparation of suspension cell lines, peripheral blood mononucleated cells (PBMC) and human platelets on an adherent surface. The protocol is based on a multi-well plate format with automated sample preparation, allowing for robust high throughput imaging applications. In combination with confocal microscopy, the protocol enables systematic exploration of protein localization to all major subcellular structures.

View details for DOI 10.1369/0022155420935403

View details for Web of Science ID 000542278500001

View details for PubMedID 32564662

View details for PubMedCentralID PMC7350080
Analysis of the Human Protein Atlas Image Classification competition (vol 16, pg 1254, 2019) NATURE METHODS Ouyang, W., Winsnes, C. F., Hjelmare, M., Cesnik, A. J., Akesson, L., Xu, H., Sullivan, D. P., Dai, S., Lan, J., Jinmo, P., Galib, S. M., Henkel, C., Hwang, K., Poplavskiy, D., Tunguz, B., Wolfinger, R. D., Gu, Y., Li, C., Xie, J., Buslov, D., Fironov, S., Kiselev, A., Panchenko, D., Cao, X., Wei, R., Wu, Y., Zhu, X., Tseng, K., Gao, Z., Ju, C., Yi, X., Zheng, H., Kappel, C., Lundberg, E. 2020; 17 (1): 115

View details for DOI 10.1038/s41592-019-0725-z

View details for Web of Science ID 000508582900046
A high-stringency blueprint of the human proteome. Nature communications Adhikari, S., Nice, E. C., Deutsch, E. W., Lane, L., Omenn, G. S., Pennington, S. R., Paik, Y., Overall, C. M., Corrales, F. J., Cristea, I. M., Van Eyk, J. E., Uhlen, M., Lindskog, C., Chan, D. W., Bairoch, A., Waddington, J. C., Justice, J. L., LaBaer, J., Rodriguez, H., He, F., Kostrzewa, M., Ping, P., Gundry, R. L., Stewart, P., Srivastava, S., Srivastava, S., Nogueira, F. C., Domont, G. B., Vandenbrouck, Y., Lam, M. P., Wennersten, S., Vizcaino, J. A., Wilkins, M., Schwenk, J. M., Lundberg, E., Bandeira, N., Marko-Varga, G., Weintraub, S. T., Pineau, C., Kusebauch, U., Moritz, R. L., Ahn, S. B., Palmblad, M., Snyder, M. P., Aebersold, R., Baker, M. S. 2020; 11 (1): 5301

Abstract

The Human Proteome Organization (HUPO) launched the Human Proteome Project (HPP) in 2010, creating an international framework for global collaboration, data sharing, quality assurance and enhancing accurate annotation of the genome-encoded proteome. During the subsequent decade, the HPP established collaborations, developed guidelines and metrics, and undertook reanalysis of previously deposited community data, continuously increasing the coverage of the human proteome. On the occasion of the HPP's tenth anniversary, we here report a 90.4% complete high-stringency human proteome blueprint. This knowledge is essential for discerning molecular processes in health and disease, as we demonstrate by highlighting potential roles the human proteome plays in our understanding, diagnosis and treatment of cancers, cardiovascular and infectious diseases.

View details for DOI 10.1038/s41467-020-19045-9

View details for PubMedID 33067450
Spatial Characterization of the Human Centrosome Proteome Opens up New Horizons for a Small but Versatile Organelle. Proteomics Danielsson, F., Mahdessian, D., Axelsson, U., Sullivan, D., Uhlén, M., Andersen, J. S., Thul, P. J., Lundberg, E. 2020: e1900361

Abstract

After a century of research, the human centrosome continues to fascinate. Based on immunofluorescence and confocal microscopy, we present an extensive inventory of the protein components of the human centrosome, and the centriolar satellites, with the important contribution of over 300 novel proteins localizing to these compartments. We identify a network of candidate centrosome proteins involved in ubiquitination, including six interaction partners of the Kelch-like protein 21, and an additional network of protein phosphatases, together supporting the suggested role of the centrosome as an interactive hub for cell signaling. Analysis of multi-localization across cellular organelles analyzed within the Human Protein Atlas project shows how multi-localizing proteins are particularly overrepresented in centriolar satellites, supporting the dynamic nature and wide range of functions for this compartment. In summary, the spatial dissection of the human centrosome and centriolar satellites described here provides a comprehensive knowledgebase for further exploration of their proteomes. Significance Statement: Exploring the constituents of organellar proteomes lays a foundation for detailed understanding of the cellular functions that take place in these settings. We have come to understand that the molecular environments in which cellular processes take place are complex and highly dynamic, with multiple modes of cross-talk within and between cellular compartments. One goal of the Human Protein Atlas project is a systematic mapping of the expression and subcellular localization of all proteins in human cells, using RNA sequencing combined with immunofluorescence and high-resolution confocal microscopy. Here we present an overview and further analysis of the proteins that localize to centrosomes and centriolar satellites, expanding these organellar proteomes with more than 300 candidate proteins. This provides new insights into the characteristics and functions of these compartments, and provides an important knowledge-base for further studies of cellular processes that take place at centrosomes and in centriolar satellites.

View details for DOI 10.1002/pmic.201900361

View details for PubMedID 32558245
Spatial proteomics: a powerful discovery tool for cell biology. Nature reviews. Molecular cell biology Lundberg, E., Borner, G. H. 2019

Abstract

Protein subcellular localization is tightly controlled and intimately linked to protein function in health and disease. Capturing the spatial proteome - that is, the localizations of proteins and their dynamics at the subcellular level - is therefore essential for a complete understanding of cell biology. Owing to substantial advances in microscopy, mass spectrometry and machine learning applications for data analysis, the field is now mature for proteome-wide investigations of spatial cellular regulation. Studies of the human proteome have begun to reveal a complex architecture, including single-cell variations, dynamic protein translocations, changing interaction networks and proteins localizing to multiple compartments. Furthermore, several studies have successfully harnessed the power of comparative spatial proteomics as a discovery tool to unravel disease mechanisms. We are at the beginning of an era in which spatial proteomics finally integrates with cell biology and medical research, thereby paving the way for unbiased systems-level insights into cellular processes. Here, we discuss current methods for spatial proteomics using imaging or mass spectrometry and specifically highlight global comparative applications. The aim of this Review is to survey the state of the field and also to encourage more cell biologists to apply spatial proteomics approaches.

View details for PubMedID 30659282
The human secretome. Science signaling Uhlén, M., Karlsson, M. J., Hober, A., Svensson, A. S., Scheffel, J., Kotol, D., Zhong, W., Tebani, A., Strandberg, L., Edfors, F., Sjöstedt, E., Mulder, J., Mardinoglu, A., Berling, A., Ekblad, S., Dannemeyer, M., Kanje, S., Rockberg, J., Lundqvist, M., Malm, M., Volk, A. L., Nilsson, P., Månberg, A., Dodig-Crnkovic, T., Pin, E., Zwahlen, M., Oksvold, P., von Feilitzen, K., Häussler, R. S., Hong, M. G., Lindskog, C., Ponten, F., Katona, B., Vuu, J., Lindström, E., Nielsen, J., Robinson, J., Ayoglu, B., Mahdessian, D., Sullivan, D., Thul, P., Danielsson, F., Stadler, C., Lundberg, E., Bergström, G., Gummesson, A., Voldborg, B. G., Tegel, H., Hober, S., Forsström, B., Schwenk, J. M., Fagerberg, L., Sivertsson, Å. 2019; 12 (609)

Abstract

The proteins secreted by human cells (collectively referred to as the secretome) are important not only for the basic understanding of human biology but also for the identification of potential targets for future diagnostics and therapies. Here, we present a comprehensive analysis of proteins predicted to be secreted in human cells, which provides information about their final localization in the human body, including the proteins actively secreted to peripheral blood. The analysis suggests that a large number of the proteins of the secretome are not secreted out of the cell, but instead are retained intracellularly, whereas another large group of proteins were identified that are predicted to be retained locally at the tissue of expression and not secreted into the blood. Proteins detected in the human blood by mass spectrometry-based proteomics and antibody-based immunoassays are also presented with estimates of their concentrations in the blood. The results are presented in an updated version 19 of the Human Protein Atlas in which each gene encoding a secretome protein is annotated to provide an open-access knowledge resource of the human secretome, including body-wide expression data, spatial localization data down to the single-cell and subcellular levels, and data about the presence of proteins that are detectable in the blood.

View details for DOI 10.1126/scisignal.aaz0274

View details for PubMedID 31772123
Voices in methods development. Nature methods Anikeeva, P., Boyden, E., Brangwynne, C., Cissé, I. I., Fiehn, O., Fromme, P., Gingras, A. C., Greene, C. S., Heard, E., Hell, S. W., Hillman, E., Jensen, G. J., Karchin, R., Kiessling, L. L., Kleinstiver, B. P., Knight, R., Kukura, P., Lancaster, M. A., Loman, N., Looger, L., Lundberg, E., Luo, Q., Miyawaki, A., Myers, E. W., Nolan, G. P., Picotti, P., Reik, W., Sauer, M., Shalek, A. K., Shendure, J., Slavov, N., Tanay, A., Troyanskaya, O., van Valen, D., Wang, H. W., Yi, C., Yin, P., Zernicka-Goetz, M., Zhuang, X. 2019; 16 (10): 945–51

View details for DOI 10.1038/s41592-019-0585-6

View details for PubMedID 31562479
ImJoy: an open-source computational platform for the deep learning era. Nature methods Ouyang, W., Mueller, F., Hjelmare, M., Lundberg, E., Zimmer, C. 2019; 16 (12): 1199–1200

View details for DOI 10.1038/s41592-019-0627-0

View details for PubMedID 31780825
Analysis of the Human Protein Atlas Image Classification competition. Nature methods Ouyang, W., Winsnes, C. F., Hjelmare, M., Cesnik, A. J., Åkesson, L., Xu, H., Sullivan, D. P., Dai, S., Lan, J., Jinmo, P., Galib, S. M., Henkel, C., Hwang, K., Poplavskiy, D., Tunguz, B., Wolfinger, R. D., Gu, Y., Li, C., Xie, J., Buslov, D., Fironov, S., Kiselev, A., Panchenko, D., Cao, X., Wei, R., Wu, Y., Zhu, X., Tseng, K. L., Gao, Z., Ju, C., Yi, X., Zheng, H., Kappel, C., Lundberg, E. 2019; 16 (12): 1254–61

Abstract

Pinpointing subcellular protein localizations from microscopy images is easy to the trained eye, but challenging to automate. Based on the Human Protein Atlas image collection, we held a competition to identify deep learning solutions to solve this task. Challenges included training on highly imbalanced classes and predicting multiple labels per image. Over 3 months, 2,172 teams participated. Despite convergence on popular networks and training techniques, there was considerable variety among the solutions. Participants applied strategies for modifying neural networks and loss functions, augmenting data and using pretrained networks. The winning models far outperformed our previous effort at multi-label classification of protein localization patterns by ~20%. These models can be used as classifiers to annotate new images, feature extractors to measure pattern similarity or pretrained networks for a wide range of biological applications.

View details for DOI 10.1038/s41592-019-0658-6

View details for PubMedID 31780840
Experimental validation of predicted cancer genes using FRET. Methods and applications in fluorescence Guala, D., Bernhem, K., Blal, H. A., Jans, D., Lundberg, E., Brismar, H., Sonnhammer, E. L. 2018; 6 (3): 035007

Abstract

Huge amounts of data are generated in genome wide experiments, designed to investigate diseases with complex genetic causes. Follow up of all potential leads produced by such experiments is currently cost prohibitive and time consuming. Gene prioritization tools alleviate these constraints by directing further experimental efforts towards the most promising candidate targets. Recently a gene prioritization tool called MaxLink was shown to outperform other widely used state-of-the-art prioritization tools in a large scale in silico benchmark. An experimental validation of predictions made by MaxLink has however been lacking. In this study we used Fluorescence Resonance Energy Transfer, an established experimental technique for detection of protein-protein interactions, to validate potential cancer genes predicted by MaxLink. Our results provide confidence in the use of MaxLink for selection of new targets in the battle with polygenic diseases.

View details for DOI 10.1088/2050-6120/aab932

View details for PubMedID 29570091
Seeing More: A Future of Augmented Microscopy. Cell Sullivan, D. P., Lundberg, E. 2018; 173 (3): 546-548

Abstract

Microscope images are information rich. In this issue of Cell, Christiansen et al. show that label-free images of cells can be used to predict fluorescent labels representing cell type, state, and organelle distribution using a deep-learning framework. This paves the way for computationally multiplexed assays derived from inexpensive label-free microscopy.

View details for DOI 10.1016/j.cell.2018.04.003

View details for PubMedID 29677507
Transcriptome profiling of the interconnection of pathways involved in malignant transformation and response to hypoxia. Oncotarget Danielsson, F., Fasterius, E., Sullivan, D., Hases, L., Sanli, K., Zhang, C., Mardinoglu, A., Al-Khalili, C., Huss, M., Uhlén, M., Williams, C., Lundberg, E. 2018; 9 (28): 19730-19744

Abstract

In tumor tissues, hypoxia is a commonly observed feature resulting from rapidly proliferating cancer cells outgrowing their surrounding vasculature network. Transformed cancer cells are known to exhibit phenotypic alterations, enabling continuous proliferation despite a limited oxygen supply. The four-step isogenic BJ cell model enables studies of defined steps of tumorigenesis: the normal, immortalized, transformed, and metastasizing stages. By transcriptome profiling under atmospheric and moderate hypoxic (3% O2) conditions, we observed that despite being highly similar, the four cell lines of the BJ model responded strikingly differently to hypoxia. Besides corroborating many of the known responses to hypoxia, we demonstrate that the transcriptome adaptation to moderate hypoxia resembles the process of malignant transformation. The transformed cells displayed a distinct capability of metabolic switching, reflected in reversed gene expression patterns for several genes involved in oxidative phosphorylation and glycolytic pathways. By profiling the stage-specific responses to hypoxia, we identified ASS1 as a potential prognostic marker in hypoxic tumors. This study demonstrates the usefulness of the BJ cell model for highlighting the interconnection of pathways involved in malignant transformation and hypoxic response.

View details for DOI 10.18632/oncotarget.24808

View details for PubMedID 29731978

View details for PubMedCentralID PMC5929421
CEP128 Localizes to the Subdistal Appendages of the Mother Centriole and Regulates TGF-β/BMP Signaling at the Primary Cilium. Cell reports Mönnich, M., Borgeskov, L., Breslin, L., Jakobsen, L., Rogowski, M., Doganli, C., Schrøder, J. M., Mogensen, J. B., Blinkenkjær, L., Harder, L. M., Lundberg, E., Geimer, S., Christensen, S. T., Andersen, J. S., Larsen, L. A., Pedersen, L. B. 2018; 22 (10): 2584-2592

Abstract

The centrosome is the main microtubule-organizing center in animal cells and comprises a mother and daughter centriole surrounded by pericentriolar material. During formation of primary cilia, the mother centriole transforms into a basal body that templates the ciliary axoneme. Ciliogenesis depends on mother centriole-specific distal appendages, whereas the role of subdistal appendages in ciliary function is unclear. Here, we identify CEP128 as a centriole subdistal appendage protein required for regulating ciliary signaling. Loss of CEP128 did not grossly affect centrosomal or ciliary structure but caused impaired transforming growth factor-β/bone morphogenetic protein (TGF-β/BMP) signaling in zebrafish and at the primary cilium in cultured mammalian cells. This phenotype is likely the result of defective vesicle trafficking at the cilium as ciliary localization of RAB11 was impaired upon loss of CEP128, and quantitative phosphoproteomics revealed that CEP128 loss affects TGF-β1-induced phosphorylation of multiple proteins that regulate cilium-associated vesicle trafficking.

View details for DOI 10.1016/j.celrep.2018.02.043

View details for PubMedID 29514088
GeneGini: Assessment via the Gini Coefficient of Reference "Housekeeping" Genes and Diverse Human Transporter Expression Profiles. Cell systems O'Hagan, S., Wright Muelas, M., Day, P. J., Lundberg, E., Kell, D. B. 2018; 6 (2): 230-244.e1

Abstract

The expression levels of SLC or ABC membrane transporter transcripts typically differ 100- to 10,000-fold between different tissues. The Gini coefficient characterizes such inequalities and here is used to describe the distribution of the expression of each transporter among different human tissues and cell lines. Many transporters exhibit extremely high Gini coefficients even for common substrates, indicating considerable specialization consistent with divergent evolution. The expression profiles of SLC transporters in different cell lines behave similarly, although Gini coefficients for ABC transporters tend to be larger in cell lines than in tissues, implying selection. Transporter genes are significantly more heterogeneously expressed than the members of most non-transporter gene classes. Transcripts with the stablest expression have a low Gini index and often differ significantly from the "housekeeping" genes commonly used for normalization in transcriptomics/qPCR studies. PCBP1 has a low Gini coefficient, is reasonably expressed, and is an excellent novel reference gene. The approach, referred to as GeneGini, provides rapid and simple characterization of expression-profile distributions and improved normalization of genome-wide expression-profiling data.

View details for DOI 10.1016/j.cels.2018.01.003

View details for PubMedID 29428416

View details for PubMedCentralID PMC5840522
How many human proteoforms are there? Nature chemical biology Aebersold, R., Agar, J. N., Amster, I. J., Baker, M. S., Bertozzi, C. R., Boja, E. S., Costello, C. E., Cravatt, B. F., Fenselau, C., Garcia, B. A., Ge, Y., Gunawardena, J., Hendrickson, R. C., Hergenrother, P. J., Huber, C. G., Ivanov, A. R., Jensen, O. N., Jewett, M. C., Kelleher, N. L., Kiessling, L. L., Krogan, N. J., Larsen, M. R., Loo, J. A., Ogorzalek Loo, R. R., Lundberg, E., MacCoss, M. J., Mallick, P., Mootha, V. K., Mrksich, M., Muir, T. W., Patrie, S. M., Pesavento, J. J., Pitteri, S. J., Rodriguez, H., Saghatelian, A., Sandoval, W., Schlüter, H., Sechi, S., Slavoff, S. A., Smith, L. M., Snyder, M. P., Thomas, P. M., Uhlén, M., Van Eyk, J. E., Vidal, M., Walt, D. R., White, F. M., Williams, E. R., Wohlschlager, T., Wysocki, V. H., Yates, N. A., Young, N. L., Zhang, B. 2018; 14 (3): 206–14

Abstract

Despite decades of accumulated knowledge about proteins and their post-translational modifications (PTMs), numerous questions remain regarding their molecular composition and biological function. One of the most fundamental queries is the extent to which the combinations of DNA-, RNA- and PTM-level variations explode the complexity of the human proteome. Here, we outline what we know from current databases and measurement strategies including mass spectrometry-based proteomics. In doing so, we examine prevailing notions about the number of modifications displayed on human proteins and how they combine to generate the protein diversity underlying health and disease. We frame central issues regarding determination of protein-level variation and PTMs, including some paradoxes present in the field today. We use this framework to assess existing data and to ask the question, "How many distinct primary structures of proteins (proteoforms) are created from the 20,300 human genes?" We also explore prospects for improving measurements to better regularize protein-level biology and efficiently associate PTMs to function and phenotype.

View details for PubMedID 29443976
Comparative cell cycle transcriptomics reveals synchronization of developmental transcription factor networks in cancer cells. PloS one Boström, J., Sramkova, Z., Salašová, A., Johard, H., Mahdessian, D., Fedr, R., Marks, C., Medalová, J., Souček, K., Lundberg, E., Linnarsson, S., Bryja, V., Sekyrova, P., Altun, M., Andäng, M. 2017; 12 (12): e0188772

Abstract

The cell cycle coordinates core functions such as replication and cell division. However, cell-cycle-regulated transcription in the control of non-core functions, such as cell identity maintenance through specific transcription factors (TFs) and signalling pathways remains unclear. Here, we provide a resource consisting of mapped transcriptomes in unsynchronized HeLa and U2OS cancer cells sorted for cell cycle phase by Fucci reporter expression. We developed a novel algorithm for data analysis that enables efficient visualization and data comparisons and identified cell cycle synchronization of Notch signalling and TFs associated with development. Furthermore, the cell cycle synchronizes with the circadian clock, providing a possible link between developmental transcriptional networks and the cell cycle. In conclusion we find that cell cycle synchronized transcriptional patterns are temporally compartmentalized and more complex than previously anticipated, involving genes, which control cell identity and development.

View details for DOI 10.1371/journal.pone.0188772

View details for PubMedID 29228002

View details for PubMedCentralID PMC5724894
The Human Cell Atlas ELIFE Regev, A., Teichmann, S. A., Lander, E. S., Amit, I., Benoist, C., Birney, E., Bodenmiller, B., Campbell, P., Carninci, P., Clatworthy, M., Clevers, H., Deplancke, B., Dunham, I., Eberwine, J., Eils, R., Enard, W., Farmer, A., Fugger, L., Gottgens, B., Hacohen, N., Haniffa, M., Hemberg, M., Kim, S., Klenerman, P., Kriegstein, A., Lein, E. D., Linnarsson, S., Lundberg, E., Lundeberg, J., Majumder, P., Marioni, J. C., Merad, M., Mhlanga, M., Nawijn, M., Netea, M., Nolan, G., Pe'er, D., Phillipakis, A., Ponting, C. P., Quake, S., Reik, W., Rozenblatt-Rosen, O., Sanes, J., Satija, R., Schumacher, T. N., Shalek, A., Shapiro, E., Sharma, P., Shin, J. W., Stegle, O., Stratton, M., Stubbington, M. J. T., Theis, F. J., Uhlen, M., Van Oudenaarden, A., Wagner, A., Watt, F., Weissman, J., Wold, B., Xavier, R., Yosef, N., HUMAN CELL ATLAS MEETING 2017; 6

Abstract

The recent advent of methods for high-throughput single-cell molecular profiling has catalyzed a growing sense in the scientific community that the time is ripe to complete the 150-year-old effort to identify all cell types in the human body. The Human Cell Atlas Project is an international collaborative effort that aims to define all human cell types in terms of distinctive molecular profiles (such as gene expression profiles) and to connect this information with classical cellular descriptions (such as location and morphology). An open comprehensive reference map of the molecular state of cells in healthy human tissues would propel the systematic study of physiological states, developmental trajectories, regulatory circuitry and interactions of cells, and also provide a framework for understanding cellular dysregulation in human disease. Here we describe the idea, its potential utility, early proofs-of-concept, and some design considerations for the Human Cell Atlas, including a commitment to open data, code, and community.

View details for PubMedID 29206104
Progress on the HUPO Draft Human Proteome: 2017 Metrics of the Human Proteome Project. Journal of proteome research Omenn, G. S., Lane, L., Lundberg, E. K., Overall, C. M., Deutsch, E. W. 2017; 16 (12): 4281-4287

Abstract

The Human Proteome Organization (HUPO) Human Proteome Project (HPP) continues to make progress on its two overall goals: (1) completing the protein parts list, with an annual update of the HUPO draft human proteome, and (2) making proteomics an integrated complement to genomics and transcriptomics throughout biomedical and life sciences research. neXtProt version 2017-01-23 has 17 008 confident protein identifications (Protein Existence [PE] level 1) that are compliant with the HPP Guidelines v2.1 ( https://hupo.org/Guidelines ), up from 13 664 in 2012-12 and 16 518 in 2016-04. Remaining to be found by mass spectrometry and other methods are 2579 "missing proteins" (PE2+3+4), down from 2949 in 2016. PeptideAtlas 2017-01 has 15 173 canonical proteins, accounting for nearly all of the 15 290 PE1 proteins based on MS data. These resources have extensive data on PTMs, single amino acid variants, and splice isoforms. The Human Protein Atlas v16 has 10 492 highly curated protein entries with tissue and subcellular spatial localization of proteins and transcript expression. Organ-specific popular protein lists have been generated for broad use in quantitative targeted proteomics using SRM-MS or DIA-SWATH-MS studies of biology and disease.

View details for DOI 10.1021/acs.jproteome.7b00375

View details for PubMedID 28853897

View details for PubMedCentralID PMC5872831
A comprehensive structural, biochemical and biological profiling of the human NUDIX hydrolase family NATURE COMMUNICATIONS Carreras-Puigvert, J., Zitnik, M., Jemth, A., Carter, M., Unterlass, J. E., Hallstrom, B., Loseva, O., Karem, Z., Calderon-Montano, J., Lindskog, C., Edqvist, P., Matuszewski, D. J., Blal, H., Berntsson, R. P. A., Haggblad, M., Martens, U., Studham, M., Lundgren, B., Wahlby, C., Sonnhammer, E. L. L., Lundberg, E., Stenmark, P., Zupan, B., Helleday, T. 2017; 8: 1541

Abstract

The NUDIX enzymes are involved in cellular metabolism and homeostasis, as well as mRNA processing. Although highly conserved throughout all organisms, their biological roles and biochemical redundancies remain largely unclear. To address this, we globally resolve their individual properties and inter-relationships. We purify 18 of the human NUDIX proteins and screen 52 substrates, providing a substrate redundancy map. Using crystal structures, we generate sequence alignment analyses revealing four major structural classes. To a certain extent, their substrate preference redundancies correlate with structural classes, thus linking structure and activity relationships. To elucidate interdependence among the NUDIX hydrolases, we pairwise deplete them generating an epistatic interaction map, evaluate cell cycle perturbations upon knockdown in normal and cancer cells, and analyse their protein and mRNA expression in normal and cancer tissues. Using a novel FUSION algorithm, we integrate all data creating a comprehensive NUDIX enzyme profile map, which will prove fundamental to understanding their biological functionality.

View details for PubMedID 29142246
Proteomic analysis of cell cycle progression in asynchronous cultures, including mitotic subphases, using PRIMMUS. eLife Ly, T., Whigham, A., Clarke, R., Brenes-Murillo, A. J., Estes, B., Mahdessian, D., Lundberg, E., Wadsworth, P., Lamond, A. I. 2017; 6

Abstract

The temporal regulation of protein abundance and post-translational modifications is a key feature of cell division. Recently, we analysed gene expression and protein abundance changes during interphase under minimally perturbed conditions (Ly et al., 2014, 2015). Here, we show that by using specific intracellular immunolabelling protocols, FACS separation of interphase and mitotic cells, including mitotic subphases, can be combined with proteomic analysis by mass spectrometry. Using this PRIMMUS (PRoteomic analysis of Intracellular iMMUnolabelled cell Subsets) approach, we now compare protein abundance and phosphorylation changes in interphase and mitotic fractions from asynchronously growing human cells. We identify a set of 115 phosphorylation sites increased during G2, termed 'early risers'. This set includes phosphorylation of S738 on TPX2, which we show is important for TPX2 function and mitotic progression. Further, we use PRIMMUS to provide the first proteome-wide analysis of protein abundance remodeling between prophase, prometaphase and anaphase.

View details for DOI 10.7554/eLife.27574

View details for PubMedID 29052541

View details for PubMedCentralID PMC5650473
A pathology atlas of the human cancer transcriptome. Science (New York, N.Y.) Uhlen, M., Zhang, C., Lee, S., Sjöstedt, E., Fagerberg, L., Bidkhori, G., Benfeitas, R., Arif, M., Liu, Z., Edfors, F., Sanli, K., von Feilitzen, K., Oksvold, P., Lundberg, E., Hober, S., Nilsson, P., Mattsson, J., Schwenk, J. M., Brunnström, H., Glimelius, B., Sjöblom, T., Edqvist, P. H., Djureinovic, D., Micke, P., Lindskog, C., Mardinoglu, A., Ponten, F. 2017; 357 (6352)

Abstract

Cancer is one of the leading causes of death, and there is great interest in understanding the underlying molecular mechanisms involved in the pathogenesis and progression of individual tumors. We used systems-level approaches to analyze the genome-wide transcriptome of the protein-coding genes of 17 major cancer types with respect to clinical outcome. A general pattern emerged: Shorter patient survival was associated with up-regulation of genes involved in cell growth and with down-regulation of genes involved in cellular differentiation. Using genome-scale metabolic models, we show that cancer patients have widespread metabolic heterogeneity, highlighting the need for precise and personalized medicine for cancer treatment. All data are presented in an interactive open-access database (www.proteinatlas.org/pathology) to allow genome-wide exploration of the impact of individual proteins on clinical outcomes.

View details for DOI 10.1126/science.aan2507

View details for PubMedID 28818916
RhoA knockout fibroblasts lose tumor-inhibitory capacity in vitro and promote tumor growth in vivo. Proceedings of the National Academy of Sciences of the United States of America Alkasalias, T., Alexeyenko, A., Hennig, K., Danielsson, F., Lebbink, R. J., Fielden, M., Turunen, S. P., Lehti, K., Kashuba, V., Madapura, H. S., Bozoky, B., Lundberg, E., Balland, M., Guvén, H., Klein, G., Gad, A. K., Pavlova, T. 2017; 114 (8): E1413-E1421

Abstract

Fibroblasts are a main player in the tumor-inhibitory microenvironment. Upon tumor initiation and progression, fibroblasts can lose their tumor-inhibitory capacity and promote tumor growth. The molecular mechanisms that underlie this switch have not been defined completely. Previously, we identified four proteins overexpressed in cancer-associated fibroblasts and linked to Rho GTPase signaling. Here, we show that knocking out the Ras homolog family member A (RhoA) gene in normal fibroblasts decreased their tumor-inhibitory capacity, as judged by neighbor suppression in vitro and accompanied by promotion of tumor growth in vivo. This also induced PC3 cancer cell motility and increased colony size in 2D cultures. RhoA knockout in fibroblasts induced vimentin intermediate filament reorganization, accompanied by reduced contractile force and increased stiffness of cells. There was also loss of wide F-actin stress fibers and large focal adhesions. In addition, we observed a significant loss of α-smooth muscle actin, which indicates a difference between RhoA knockout fibroblasts and classic cancer-associated fibroblasts. In 3D collagen matrix, RhoA knockout reduced fibroblast branching and meshwork formation and resulted in more compactly clustered tumor-cell colonies in coculture with PC3 cells, which might boost tumor stem-like properties. Coculturing RhoA knockout fibroblasts and PC3 cells induced expression of proinflammatory genes in both. Inflammatory mediators may induce tumor cell stemness. Network enrichment analysis of transcriptomic changes, however, revealed that the Rho signaling pathway per se was significantly triggered only after coculturing with tumor cells. Taken together, our findings in vivo and in vitro indicate that Rho signaling governs the inhibitory effects by fibroblasts on tumor-cell growth.

View details for DOI 10.1073/pnas.1621161114

View details for PubMedID 28174275

View details for PubMedCentralID PMC5338371
Antibody Validation in Bioimaging Applications Based on Endogenous Expression of Tagged Proteins. Journal of proteome research Skogs, M., Stadler, C., Schutten, R., Hjelmare, M., Gnann, C., Björk, L., Poser, I., Hyman, A., Uhlén, M., Lundberg, E. 2017; 16 (1): 147-155

Abstract

Antibodies are indispensable research tools, yet the scientific community has not adopted standardized procedures to validate their specificity. Here we present a strategy to systematically validate antibodies for immunofluorescence (IF) applications using gene tagging. We have assessed the on- and off-target binding capabilities of 197 antibodies using 108 cell lines expressing EGFP-tagged target proteins at endogenous levels. Furthermore, we assessed batch-to-batch effects for 35 target proteins, showing that both the on- and off-target binding patterns vary significantly between antibody batches and that the proposed strategy serves as a reliable procedure for ensuring reproducibility upon production of new antibody batches. In summary, we present a systematic scheme for antibody validation in IF applications using endogenous expression of tagged proteins. This is an important step toward a reproducible approach for context- and application-specific antibody validation and improved reliability of antibody-based experiments and research data.

View details for DOI 10.1021/acs.jproteome.6b00821

View details for PubMedID 27723985
The endosomal transcriptional regulator RNF11 integrates degradation and transport of EGFR. The Journal of cell biology Scharaw, S., Iskar, M., Ori, A., Boncompain, G., Laketa, V., Poser, I., Lundberg, E., Perez, F., Beck, M., Bork, P., Pepperkok, R. 2016; 215 (4): 543-558

Abstract

Stimulation of cells with epidermal growth factor (EGF) induces internalization and partial degradation of the EGF receptor (EGFR) by the endo-lysosomal pathway. For continuous cell functioning, EGFR plasma membrane levels are maintained by transporting newly synthesized EGFRs to the cell surface. The regulation of this process is largely unknown. In this study, we find that EGF stimulation specifically increases the transport efficiency of newly synthesized EGFRs from the endoplasmic reticulum to the plasma membrane. This coincides with an up-regulation of the inner coat protein complex II (COPII) components SEC23B, SEC24B, and SEC24D, which we show to be specifically required for EGFR transport. Up-regulation of these COPII components requires the transcriptional regulator RNF11, which localizes to early endosomes and appears additionally in the cell nucleus upon continuous EGF stimulation. Collectively, our work identifies a new regulatory mechanism that integrates the degradation and transport of EGFR in order to maintain its physiological levels at the plasma membrane.

View details for DOI 10.1083/jcb.201601090

View details for PubMedID 27872256

View details for PubMedCentralID PMC5119934
Metrics for the Human Proteome Project 2016: Progress on Identifying and Characterizing the Human Proteome, Including Post-Translational Modifications. Journal of proteome research Omenn, G. S., Lane, L., Lundberg, E. K., Beavis, R. C., Overall, C. M., Deutsch, E. W. 2016; 15 (11): 3951-3960

Abstract

The HUPO Human Proteome Project (HPP) has two overall goals: (1) stepwise completion of the protein parts list, the draft human proteome, including confidently identifying and characterizing at least one protein product from each protein-coding gene, with increasing emphasis on sequence variants, post-translational modifications (PTMs), and splice isoforms of those proteins; and (2) making proteomics an integrated counterpart to genomics throughout the biomedical and life sciences community. PeptideAtlas and GPMDB reanalyze all major human mass spectrometry data sets available through ProteomeXchange with standardized protocols and stringent quality filters; neXtProt curates and integrates mass spectrometry and other findings to present the most up-to-date authoritative compendium of the human proteome. The HPP Guidelines for Mass Spectrometry Data Interpretation version 2.1 were applied to manuscripts submitted for this 2016 C-HPP-led special issue [www.thehpp.org/guidelines]. The Human Proteome presented as neXtProt version 2016-02 has 16,518 confident protein identifications (Protein Existence [PE] Level 1), up from 13,664 at 2012-12, 15,646 at 2013-09, and 16,491 at 2014-10. There are 485 proteins that would have been PE1 under the Guidelines v1.0 from 2012 but now have insufficient evidence due to the agreed-upon more stringent Guidelines v2.0 to reduce false positives. neXtProt and PeptideAtlas now both require two non-nested, uniquely mapping (proteotypic) peptides of at least 9 aa in length. There are 2,949 missing proteins (PE2+3+4) as the baseline for submissions for this fourth annual C-HPP special issue of Journal of Proteome Research. PeptideAtlas has 14,629 canonical (plus 1187 uncertain and 1755 redundant) entries. GPMDB has 16,190 EC4 entries, and the Human Protein Atlas has 10,475 entries with supportive evidence. neXtProt, PeptideAtlas, and GPMDB are rich resources of information about post-translational modifications (PTMs), single amino acid variants (SAAVs), and splice isoforms. Meanwhile, the Biology- and Disease-driven (B/D)-HPP has created comprehensive SRM resources, generated popular protein lists to guide targeted proteomics assays for specific diseases, and launched an Early Career Researchers initiative.

View details for DOI 10.1021/acs.jproteome.6b00511

View details for PubMedID 27487407

View details for PubMedCentralID PMC5129622
Gene-specific correlation of RNA and protein levels in human cells and tissues. Molecular systems biology Edfors, F., Danielsson, F., Hallström, B. M., Käll, L., Lundberg, E., Pontén, F., Forsström, B., Uhlén, M. 2016; 12 (10): 883

Abstract

An important issue for molecular biology is to establish whether transcript levels of a given gene can be used as proxies for the corresponding protein levels. Here, we have developed a targeted proteomics approach for a set of human non-secreted proteins based on parallel reaction monitoring to measure, at steady-state conditions, absolute protein copy numbers across human tissues and cell lines and compared these levels with the corresponding mRNA levels using transcriptomics. The study shows that the transcript and protein levels do not correlate well unless a gene-specific RNA-to-protein (RTP) conversion factor independent of the tissue type is introduced, thus significantly enhancing the predictability of protein copy numbers from RNA levels. The results show that the RTP ratio varies significantly with a few hundred copies per mRNA molecule for some genes to several hundred thousands of protein copies per mRNA molecule for others. In conclusion, our data suggest that transcriptome analysis can be used as a tool to predict the protein copy numbers per cell, thus forming an attractive link between the field of genomics and proteomics.
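
The idea of a gene-specific RNA-to-protein (RTP) conversion factor can be sketched in a few lines. This is an illustration only: taking the median protein-to-mRNA ratio across samples is an assumption made here, not necessarily the estimator used in the study, and all function names and numbers are hypothetical.

```python
from statistics import median


def rtp_factor(mrna_levels, protein_copies):
    """Estimate one gene's RTP factor as the median protein/mRNA ratio.

    mrna_levels and protein_copies are paired measurements of the same
    gene across tissues or cell lines; samples with zero mRNA are skipped.
    """
    ratios = [p / m for m, p in zip(mrna_levels, protein_copies) if m > 0]
    if not ratios:
        raise ValueError("no sample with non-zero mRNA level")
    return median(ratios)


def predict_protein_copies(mrna_level, factor):
    """Predict protein copies from an mRNA level using a fixed gene-specific factor."""
    return mrna_level * factor


# Hypothetical measurements for one gene in three tissues.
mrna = [10.0, 20.0, 40.0]
protein = [1000.0, 2200.0, 3600.0]
factor = rtp_factor(mrna, protein)          # ratios are 100, 110, 90
print(predict_protein_copies(5.0, factor))  # prediction for a fourth tissue
```

Because the factor is estimated per gene and reused across tissues, a single measurement series per gene is enough to turn new transcript levels into protein copy-number estimates, which is the tissue-independence the abstract describes.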

View details for DOI 10.15252/msb.20167144

View details for PubMedID 27951527

View details for PubMedCentralID PMC5081484
A proposal for validation of antibodies NATURE METHODS Uhlen, M., Bandrowski, A., Carr, S., Edwards, A., Ellenberg, J., Lundberg, E., Rimm, D. L., Rodriguez, H., Hiltke, T., Snyder, M., Yamamoto, T. 2016; 13 (10): 823-?

View details for DOI 10.1038/NMETH.3995

View details for Web of Science ID 000385194600015

View details for PubMedID 27595404
Voices of biotech. Nature biotechnology Amit, I., Baker, D., Barker, R., Berger, B., Bertozzi, C., Bhatia, S., Biffi, A., Demichelis, F., Doudna, J., Dowdy, S. F., Endy, D., Helmstaedter, M., Junca, H., June, C., Kamb, S., Khvorova, A., Kim, D., Kim, J., Krishnan, Y., Lakadamyali, M., Lappalainen, T., Lewin, S., Liao, J., Loman, N., Lundberg, E., Lynd, L., Martin, C., Mellman, I., Miyawaki, A., Mummery, C., Nelson, K., Paz, J., Peralta-Yahya, P., Picotti, P., Polyak, K., Prather, K., Qin, J., Quake, S., Regev, A., Rogers, J. A., Shetty, R., Sommer, M., Stevens, M., Stolovitzky, G., Takahashi, M., Tang, F., Teichmann, S., Torres-Padilla, M., Tripathi, L., Vemula, P., Verdine, G., Vollmer, F., Wang, J., Ying, J. Y., Zhang, F., Zhang, T. 2016; 34 (3): 270-275

View details for DOI 10.1038/nbt.3502

View details for PubMedID 26963549
Introducing the Affinity Binder Knockdown Initiative - A public-private partnership for validation of affinity reagents. EuPA open proteomics Alm, T., Lundberg, E., Uhlén, M. 2016; 10: 56-58

Abstract

The newly launched Affinity Binder Knockdown Initiative encourages antibody suppliers and users to join this public-private partnership, which uses crowdsourcing to collect characterization data on antibodies. Researchers are asked to share validation data from experiments where gene-editing techniques (such as siRNA or CRISPR) have been used to verify antibody binding. The initiative is launched under the aegis of Antibodypedia, a database designed to allow comparisons and scoring of publicly available antibodies towards human protein targets. What is known about an antibody is the foundation of the scoring and ranking system in Antibodypedia.

View details for DOI 10.1016/j.euprot.2016.01.002

View details for PubMedID 29900101

View details for PubMedCentralID PMC5988587
Towards a functional definition of the mitochondrial human proteome. EuPA open proteomics Fasano, M., Alberio, T., Babu, M., Lundberg, E., Urbani, A. 2016; 10: 24-27

Abstract

The mitochondrial human proteome project (mt-HPP) was initiated by the Italian HPP group as a part of both the chromosome-centric initiative (C-HPP) and the "biology and disease driven" initiative (B/D-HPP). In recent years several reports highlighted how mitochondrial biology and disease are regulated by specific interactions with non-mitochondrial proteins. Thus, it is of great relevance to extend our present view of the mitochondrial proteome not only to those proteins that are encoded by or transported to mitochondria, but also to their interactors that take part in mitochondria functionality. Here, we propose a graphical representation of the functional mitochondrial proteome by retrieving mitochondrial proteins from the NeXtProt database and adding to the network their interactors as annotated in the IntAct database. Notably, the network may represent a reference to map all the proteins that are currently being identified in mitochondrial proteomics studies.

View details for DOI 10.1016/j.euprot.2016.01.004

View details for PubMedID 29900096

View details for PubMedCentralID PMC5988588
Systems Proteomics View of the Endogenous Human Claudin Protein Family JOURNAL OF PROTEOME RESEARCH Liu, F., Koval, M., Ranganathan, S., Fanayan, S., Hancock, W. S., Lundberg, E. K., Beavis, R. C., Lane, L., Duek, P., McQuade, L., Kelleher, N. L., Baker, M. S. 2016; 15 (2): 339-359

Abstract

Claudins are the major transmembrane protein components of tight junctions in human endothelia and epithelia. Tissue-specific expression of claudin members suggests that this protein family is not only essential for sustaining the role of tight junctions in cell permeability control but also vital in organizing cell contact signaling by protein-protein interactions. How this protein family is collectively processed and regulated is key to understanding the role of junctional proteins in preserving cell identity and tissue integrity. The focus of this review is to first provide a brief overview of the functional context, on the basis of the extensive body of claudin biology research that has been thoroughly reviewed, for endogenous human claudin members and then ascertain existing and future proteomics techniques that may be applicable to systematically characterizing the chemical forms and interacting protein partners of this protein family in human. The ability to elucidate claudin-based signaling networks may provide new insight into cell development and differentiation programs that are crucial to tissue stability and manipulation.

View details for DOI 10.1021/acs.jproteome.5b00769

View details for Web of Science ID 000369771700001

View details for PubMedID 26680015

View details for PubMedCentralID PMC4777318
The folate-coupled enzyme MTHFD2 is a nuclear protein and promotes cell proliferation. Scientific reports Gustafsson Sheppard, N., Jarl, L., Mahdessian, D., Strittmatter, L., Schmidt, A., Madhusudan, N., Tegnér, J., Lundberg, E. K., Asplund, A., Jain, M., Nilsson, R. 2015; 5: 15029

Abstract

Folate metabolism is central to cell proliferation and a target of commonly used cancer chemotherapeutics. In particular, the mitochondrial folate-coupled metabolism is thought to be important for proliferating cancer cells. The enzyme MTHFD2 in this pathway is highly expressed in human tumors and broadly required for survival of cancer cells. Although the enzymatic activity of the MTHFD2 protein is well understood, little is known about its larger role in cancer cell biology. We here report that MTHFD2 is co-expressed with two distinct gene sets, representing amino acid metabolism and cell proliferation, respectively. Consistent with a role for MTHFD2 in cell proliferation, MTHFD2 expression was repressed in cells rendered quiescent by deprivation of growth signals (serum) and rapidly re-induced by serum stimulation. Overexpression of MTHFD2 alone was sufficient to promote cell proliferation independent of its dehydrogenase activity, even during growth restriction. In addition to its known mitochondrial localization, we found MTHFD2 to have a nuclear localization and co-localize with DNA replication sites. These findings suggest a previously unknown role for MTHFD2 in cancer cell proliferation, adding to its known function in mitochondrial folate metabolism.

View details for DOI 10.1038/srep15029

View details for PubMedID 26461067

View details for PubMedCentralID PMC4602236
Metrics for the Human Proteome Project 2015: Progress on the Human Proteome and Guidelines for High-Confidence Protein Identification. Journal of proteome research Omenn, G. S., Lane, L., Lundberg, E. K., Beavis, R. C., Nesvizhskii, A. I., Deutsch, E. W. 2015; 14 (9): 3452-60

Abstract

Remarkable progress continues on the annotation of the proteins identified in the Human Proteome and on finding credible proteomic evidence for the expression of "missing proteins". Missing proteins are those with no previous protein-level evidence or insufficient evidence to make a confident identification upon reanalysis in PeptideAtlas and curation in neXtProt. Enhanced with several major new data sets published in 2014, the human proteome presented as neXtProt, version 2014-09-19, has 16,491 unique confident proteins (PE level 1), up from 13,664 at 2012-12 and 15,646 at 2013-09. That leaves 2948 missing proteins from genes classified having protein existence level PE 2, 3, or 4, as well as 616 dubious proteins at PE 5. Here, we document the progress of the HPP and discuss the importance of assessing the quality of evidence, confirming automated findings and considering alternative protein matches for spectra and peptides. We provide guidelines for proteomics investigators to apply in reporting newly identified proteins.

View details for DOI 10.1021/acs.jproteome.5b00499

View details for PubMedID 26155816

View details for PubMedCentralID PMC4755311
Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project. Journal of proteome research Horvatovich, P., Lundberg, E. K., Chen, Y. J., Sung, T. Y., He, F., Nice, E. C., Goode, R. J., Yu, S., Ranganathan, S., Baker, M. S., Domont, G. B., Velasquez, E., Li, D., Liu, S., Wang, Q., He, Q. Y., Menon, R., Guan, Y., Corrales, F. J., Segura, V., Casal, J. I., Pascual-Montano, A., Albar, J. P., Fuentes, M., Gonzalez-Gonzalez, M., Diez, P., Ibarrola, N., Degano, R. M., Mohammed, Y., Borchers, C. H., Urbani, A., Soggiu, A., Yamamoto, T., Salekdeh, G. H., Archakov, A., Ponomarenko, E., Lisitsa, A., Lichti, C. F., Mostovenko, E., Kroes, R. A., Rezeli, M., Végvári, Á., Fehniger, T. E., Bischoff, R., Vizcaíno, J. A., Deutsch, E. W., Lane, L., Nilsson, C. L., Marko-Varga, G., Omenn, G. S., Jeong, S. K., Lim, J. S., Paik, Y. K., Hancock, W. S. 2015; 14 (9): 3415-31

Abstract

This paper summarizes the recent activities of the Chromosome-Centric Human Proteome Project (C-HPP) consortium, which develops new technologies to identify yet-to-be annotated proteins (termed "missing proteins") in biological samples that lack sufficient experimental evidence at the protein level for confident protein identification. The C-HPP also aims to identify new protein forms that may be caused by genetic variability, post-translational modifications, and alternative splicing. Proteogenomic data integration forms the basis of the C-HPP's activities; therefore, we have summarized some of the key approaches and their roles in the project. We present new analytical technologies that improve the chemical space and lower detection limits coupled to bioinformatics tools and some publicly available resources that can be used to improve data analysis or support the development of analytical assays. Most of this paper's content has been compiled from posters, slides, and discussions presented in the series of C-HPP workshops held during 2014. All data (posters, presentations) used are available at the C-HPP Wiki (http://c-hpp.webhosting.rug.nl/) and in the Supporting Information.

View details for DOI 10.1021/pr5013009

View details for PubMedID 26076068
Analysis of the Human Prostate-Specific Proteome Defined by Transcriptomics and Antibody-Based Profiling Identifies TMEM79 and ACOXL as Two Putative, Diagnostic Markers in Prostate Cancer. PloS one O'Hurley, G., Busch, C., Fagerberg, L., Hallström, B. M., Stadler, C., Tolf, A., Lundberg, E., Schwenk, J. M., Jirström, K., Bjartell, A., Gallagher, W. M., Uhlén, M., Pontén, F. 2015; 10 (8): e0133449

Abstract

To better understand prostate function and disease, it is important to define and explore the molecular constituents that signify the prostate gland. The aim of this study was to define the prostate specific transcriptome and proteome, in comparison to 26 other human tissues. Deep sequencing of mRNA (RNA-seq) and immunohistochemistry-based protein profiling were combined to identify prostate specific gene expression patterns and to explore tissue biomarkers for potential clinical use in prostate cancer diagnostics. We identified 203 genes with elevated expression in the prostate, 22 of which showed more than five-fold higher expression levels compared to all other tissue types. In addition to previously well-known proteins we identified two poorly characterized proteins, TMEM79 and ACOXL, with potential to differentiate between benign and cancerous prostatic glands in tissue biopsies. In conclusion, we have applied a genome-wide analysis to identify the prostate specific proteome using transcriptomics and antibody-based protein profiling to identify genes with elevated expression in the prostate. Our data provides a starting point for further functional studies to explore the molecular repertoire of normal and diseased prostate including potential prostate cancer markers such as TMEM79 and ACOXL.
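
The "more than five-fold higher expression compared to all other tissue types" criterion in this abstract can be written out as a simple check. The sketch below assumes that the comparison is made against the highest-expressing other tissue; the function names, the handling of zero expression, and the example values are hypothetical and not taken from the study.

```python
def fold_over_other_tissues(expression, tissue):
    """Ratio of a gene's expression in `tissue` to its highest level elsewhere.

    `expression` maps tissue name -> expression level for one gene.
    """
    target = expression[tissue]
    others = [level for name, level in expression.items() if name != tissue]
    if not others:
        raise ValueError("need at least one other tissue for comparison")
    highest_other = max(others)
    if highest_other == 0:
        # Expressed only in the target tissue (or nowhere at all).
        return float("inf") if target > 0 else 0.0
    return target / highest_other


def is_tissue_enriched(expression, tissue, min_fold=5.0):
    """True if expression in `tissue` exceeds every other tissue by more than min_fold."""
    return fold_over_other_tissues(expression, tissue) > min_fold


# Hypothetical expression values for two genes (arbitrary units).
gene_a = {"prostate": 60.0, "liver": 10.0, "kidney": 4.0}
gene_b = {"prostate": 30.0, "liver": 10.0, "kidney": 4.0}
print(is_tissue_enriched(gene_a, "prostate"), is_tissue_enriched(gene_b, "prostate"))
```

Comparing against the maximum of the other tissues, rather than their mean, is the stricter reading of "all other tissue types": one high-expressing tissue is enough to disqualify a gene.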

View details for DOI 10.1371/journal.pone.0133449

View details for PubMedID 26237329

View details for PubMedCentralID PMC4523174
The human liver-specific proteome defined by transcriptomics and antibody-based profiling. FASEB journal : official publication of the Federation of American Societies for Experimental Biology Kampf, C., Mardinoglu, A., Fagerberg, L., Hallström, B. M., Edlund, K., Lundberg, E., Pontén, F., Nielsen, J., Uhlen, M. 2014; 28 (7): 2901-14

Abstract

Human liver physiology and the genetic etiology of the liver diseases can potentially be elucidated through the identification of proteins with enriched expression in the liver. Here, we combined data from RNA sequencing (RNA-Seq) and antibody-based immunohistochemistry across all major human tissues to explore the human liver proteome with enriched expression, as well as the cell type-enriched expression in hepatocyte and bile duct cells. We identified in total 477 protein-coding genes with elevated expression in the liver: 179 genes have higher expression as compared to all the other analyzed tissues; 164 genes have elevated transcript levels in the liver shared with at least one other tissue type; and an additional 134 genes have a mild level of increased expression in the liver. We identified the precise localization of these proteins through antibody-based protein profiling and the subcellular localization of these proteins through immunofluorescent-based profiling. We also identified the biological processes and metabolic functions associated with these proteins, investigated their contribution in the occurrence of liver diseases, and identified potential targets for their treatment. Our study demonstrates the use of RNA-Seq and antibody-based immunohistochemistry for characterizing the human liver proteome, as well as the use of tissue-specific proteins in identification of novel drug targets and discovery of biomarkers.

View details for DOI 10.1096/fj.14-250555

View details for PubMedID 24648543
Immunoproteomics using polyclonal antibodies and stable isotope-labeled affinity-purified recombinant proteins. Molecular & cellular proteomics : MCP Edfors, F., Boström, T., Forsström, B., Zeiler, M., Johansson, H., Lundberg, E., Hober, S., Lehtiö, J., Mann, M., Uhlen, M. 2014; 13 (6): 1611-24

Abstract

The combination of immuno-based methods and mass spectrometry detection has great potential in the field of quantitative proteomics. Here, we describe a new method (immuno-SILAC) for the absolute quantification of proteins in complex samples based on polyclonal antibodies and stable isotope-labeled recombinant protein fragments to allow affinity enrichment prior to mass spectrometry analysis and accurate quantification. We took advantage of the antibody resources publicly available from the Human Protein Atlas project covering more than 80% of all human protein-coding genes. Epitope mapping revealed that a majority of the polyclonal antibodies recognized multiple linear epitopes, and based on these results, a semi-automated method was developed for peptide enrichment using polyclonal antibodies immobilized on protein A-coated magnetic beads. A protocol based on the simultaneous multiplex capture of more than 40 protein targets showed that approximately half of the antibodies enriched at least one functional peptide detected in the subsequent mass spectrometry analysis. The approach was further developed to also generate quantitative data via the addition of heavy isotope-labeled recombinant protein fragment standards prior to trypsin digestion. Here, we show that we were able to use small amounts of antibodies (50 ng per target) in this manner for efficient multiplex analysis of quantitative levels of proteins in a human HeLa cell lysate. The results suggest that polyclonal antibodies generated via immunization of recombinant protein fragments could be used for the enrichment of target peptides to allow for rapid mass spectrometry analysis taking advantage of a substantial reduction in sample complexity. The possibility of building up a proteome-wide resource for immuno-SILAC assays based on publicly available antibody resources is discussed.

View details for DOI 10.1074/mcp.M113.034140

View details for PubMedID 24722731

View details for PubMedCentralID PMC4047479
RNA- and antibody-based profiling of the human proteome with focus on chromosome 19. Journal of proteome research Stadler, C., Fagerberg, L., Sivertsson, Å., Oksvold, P., Zwahlen, M., Hallström, B. M., Lundberg, E., Uhlén, M. 2014; 13 (4): 2019-27

Abstract

An important part of the Human Proteome Project is to characterize the protein complement of the genome with antibody-based profiling. Within the framework of this effort, a new version 12 of the Human Protein Atlas (www.proteinatlas.org) has been launched, including transcriptomics data for 27 tissues and 44 cell lines to complement the protein expression data from antibody-based profiling. Besides the extensive addition of transcriptomics data, the Human Protein Atlas now contains antibody-based protein profiles for 82% of the 20 329 putative protein-coding genes. The comprehensive data resulting from RNA-seq analysis and antibody-based profiling performed within the Human Protein Atlas as well as information from UniProt were used to generate evidence summary scores for each of the 20 329 genes, of which 94% now have experimental evidence at least at transcript level. The evidence scores for all individual genes are displayed with regards to both RNA- and antibody-based protein profiles, including chromosome-centric visualizations. An analysis of the human chromosome 19 shows that ∼43% of the genes are expressed at the transcript level in all 27 tissues analyzed, suggesting a "house-keeping" function, while 12% of the genes show a more tissue-specific pattern with enriched expression in one of the analyzed tissues only.

View details for DOI 10.1021/pr401156g

View details for PubMedID 24579871
A chromosome-centric analysis of antibodies directed toward the human proteome using Antibodypedia. Journal of proteome research Alm, T., von Feilitzen, K., Lundberg, E., Sivertsson, Å., Uhlén, M. 2014; 13 (3): 1669-76

Abstract

Antibodies are crucial for the study of human proteins and have been defined as one of the three pillars in the human chromosome-centric Human Proteome Project (C-HPP). In this article the chromosome-centric structure has been used to analyze the availability of antibodies as judged by the presence within the portal Antibodypedia, a database designed to allow comparisons and scoring of publicly available antibodies toward human protein targets. This public database displays antibody data from more than one million antibodies toward human protein targets. A summary of the content in this knowledge resource reveals that there exist more than 10 antibodies to over 70% of all the putative human genes, evenly distributed over the 24 human chromosomes. The analysis also shows that at present, less than 10% of the putative human protein-coding genes (n = 1882) predicted from the genome sequence lack antibodies, suggesting that focused efforts from the antibody-based and mass spectrometry-based proteomic communities should be encouraged to pursue the analysis of these missing proteins. We show that Antibodypedia may be used to track the development of available and validated antibodies to the individual chromosomes, and thus the database is an attractive tool to identify proteins with no or few antibodies yet generated.

View details for DOI 10.1021/pr4011525

View details for PubMedID 24533432
Molecular- and Organelle-Based Predictive Paradigm Underlying Recovery by Left Ventricular Assist Device Support CIRCULATION-HEART FAILURE Liem, D. A., Nsair, A., Setty, S. P., Cadeiras, M., Wang, D., MacLellan, R., Lotz, C., Lin, A. J., Tabaraki, J., Li, H., Ge, J., Odeberg, J., Ponten, F., Larson, E., Mulder, J., Lundberg, E., Weiss, J. N., Uhlen, M., Ping, P., Deng, M. C. 2014; 7 (2): 359-366

View details for DOI 10.1161/CIRCHEARTFAILURE.113.000250

View details for Web of Science ID 000333759600015

View details for PubMedID 24643888

View details for PubMedCentralID PMC4397259
Antibody performance in western blot applications is context-dependent. Biotechnology journal Algenäs, C., Agaton, C., Fagerberg, L., Asplund, A., Björling, L., Björling, E., Kampf, C., Lundberg, E., Nilsson, P., Persson, A., Wester, K., Pontén, F., Wernérus, H., Uhlén, M., Ottosson Takanen, J., Hober, S. 2014; 9 (3): 435-45

Abstract

An important concern for the use of antibodies in various applications, such as western blot (WB) or immunohistochemistry (IHC), is specificity. This calls for systematic validations using well-designed conditions. Here, we have analyzed 13 000 antibodies using western blot with lysates from human cell lines, tissues, and plasma. Standardized stratification showed that 45% of the antibodies yielded supportive staining, and the rest either no staining (12%) or protein bands of wrong size (43%). A comparative study of WB and IHC showed that the performance of antibodies is application-specific, although a correlation between no WB staining and weak IHC staining could be seen. To investigate the influence of protein abundance on the apparent specificity of the antibody, new WB analyses were performed for 1369 genes that gave unsupportive WBs in the initial screening using cell lysates with overexpressed full-length proteins. Then, more than 82% of the antibodies yielded a specific band corresponding to the full-length protein. Hence, the vast majority of the antibodies (90%) used in this study specifically recognize the target protein when present at sufficiently high levels. This demonstrates the context- and application-dependence of antibody validation and emphasizes that caution is needed when annotating binding reagents as specific or cross-reactive. WB is one of the most commonly used methods for validation of antibodies. Our data implicate that solely using one platform for antibody validation might give misleading information and therefore at least one additional method should be used to verify the achieved data.

View details for DOI 10.1002/biot.201300341

View details for PubMedID 24403002
Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Molecular & cellular proteomics : MCP Fagerberg, L., Hallström, B. M., Oksvold, P., Kampf, C., Djureinovic, D., Odeberg, J., Habuka, M., Tahmasebpoor, S., Danielsson, A., Edlund, K., Asplund, A., Sjöstedt, E., Lundberg, E., Szigyarto, C. A., Skogs, M., Takanen, J. O., Berling, H., Tegel, H., Mulder, J., Nilsson, P., Schwenk, J. M., Lindskog, C., Danielsson, F., Mardinoglu, A., Sivertsson, A., von Feilitzen, K., Forsberg, M., Zwahlen, M., Olsson, I., Navani, S., Huss, M., Nielsen, J., Ponten, F., Uhlén, M. 2014; 13 (2): 397-406

Abstract

Global classification of the human proteins with regards to spatial expression patterns across organs and tissues is important for studies of human biology and disease. Here, we used a quantitative transcriptomics analysis (RNA-Seq) to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues and combined this analysis with antibody-based profiling of the same tissues. To present the data, we launch a new version of the Human Protein Atlas that integrates RNA and protein expression data corresponding to ∼80% of the human protein-coding genes with access to the primary data for both the RNA and the protein analysis on an individual gene level. We present a classification of all human protein-coding genes with regards to tissue-specificity and spatial expression pattern. The integrative human expression map can be used as a starting point to explore the molecular constituents of the human body.

View details for DOI 10.1074/mcp.M113.035600

View details for PubMedID 24309898

View details for PubMedCentralID PMC3916642
Analysis of the Human Tissue-specific Expression by Genome-wide Integration of Transcriptomics and Antibody-based Proteomics. Molecular & cellular proteomics : MCP Fagerberg, L., Hallström, B. M., Oksvold, P., Kampf, C., Djureinovic, D., Odeberg, J., Habuka, M., Tahmasebpoor, S., Danielsson, A., Edlund, K., Asplund, A., Sjöstedt, E., Lundberg, E., Szigyarto, C. A., Skogs, M., Takanen, J. O., Berling, H., Tegel, H., Mulder, J., Nilsson, P., Schwenk, J. M., Lindskog, C., Danielsson, F., Mardinoglu, A., Sivertsson, Å., von Feilitzen, K., Forsberg, M., Zwahlen, M., Olsson, I., Navani, S., Huss, M., Nielsen, J., Ponten, F., Uhlén, M. 2014; 13 (2): 397-406

Abstract

Global classification of the human proteins with regards to spatial expression patterns across organs and tissues is important for studies of human biology and disease. Here, we used a quantitative transcriptomics analysis (RNA-Seq) to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues and combined this analysis with antibody-based profiling of the same tissues. To present the data, we launch a new version of the Human Protein Atlas that integrates RNA and protein expression data corresponding to ∼80% of the human protein-coding genes with access to the primary data for both the RNA and the protein analysis on an individual gene level. We present a classification of all human protein-coding genes with regards to tissue-specificity and spatial expression pattern. The integrative human expression map can be used as a starting point to explore the molecular constituents of the human body.

View details for DOI 10.1074/mcp.M113.035600

View details for PubMedID 33498127
Metrics for the Human Proteome Project 2013-2014 and strategies for finding missing proteins. Journal of proteome research Lane, L., Bairoch, A., Beavis, R. C., Deutsch, E. W., Gaudet, P., Lundberg, E., Omenn, G. S. 2014; 13 (1): 15-20

Abstract

One year ago the Human Proteome Project (HPP) leadership designated the baseline metrics for the Human Proteome Project to be based on neXtProt with a total of 13,664 proteins validated at protein evidence level 1 (PE1) by mass spectrometry, antibody-capture, Edman sequencing, or 3D structures. Corresponding chromosome-specific data were provided from PeptideAtlas, GPMdb, and Human Protein Atlas. This year, the neXtProt total is 15,646 and the other resources, which are inputs to neXtProt, have high-quality identifications and additional annotations for 14,012 in PeptideAtlas, 14,869 in GPMdb, and 10,976 in HPA. We propose to remove 638 genes from the denominator that are "uncertain" or "dubious" in Ensembl, UniProt/SwissProt, and neXtProt. That leaves 3844 "missing proteins", currently having no or inadequate documentation, to be found from a new denominator of 19,490 protein-coding genes. We present those tabulations and web links and discuss current strategies to find the missing proteins.

View details for DOI 10.1021/pr401144x

View details for PubMedID 24364385

View details for PubMedCentralID PMC3928647
Contribution of antibody-based protein profiling to the human Chromosome-centric Proteome Project (C-HPP). Journal of proteome research Fagerberg, L., Oksvold, P., Skogs, M., Algenäs, C., Lundberg, E., Pontén, F., Sivertsson, A., Odeberg, J., Klevebring, D., Kampf, C., Asplund, A., Sjöstedt, E., Al-Khalili Szigyarto, C., Edqvist, P. H., Olsson, I., Rydberg, U., Hudson, P., Ottosson Takanen, J., Berling, H., Björling, L., Tegel, H., Rockberg, J., Nilsson, P., Navani, S., Jirström, K., Mulder, J., Schwenk, J. M., Zwahlen, M., Hober, S., Forsberg, M., von Feilitzen, K., Uhlén, M. 2013; 12 (6): 2439-48

Abstract

A gene-centric Human Proteome Project has been proposed to characterize the human protein-coding genes in a chromosome-centered manner to understand human biology and disease. Here, we report on the protein evidence for all genes predicted from the genome sequence based on manual annotation from literature (UniProt), antibody-based profiling in cells, tissues and organs and analysis of the transcript profiles using next generation sequencing in human cell lines of different origins. We estimate that there is good evidence for protein existence for 69% (n = 13985) of the human protein-coding genes, while 23% have only evidence on the RNA level and 7% still lack experimental evidence. Analysis of the expression patterns shows few tissue-specific proteins and approximately half of the genes expressed in all the analyzed cells. The status for each gene with regards to protein evidence is visualized in a chromosome-centric manner as part of a new version of the Human Protein Atlas (www.proteinatlas.org).

View details for DOI 10.1021/pr300924j

View details for PubMedID 23276153
Initial quantitative proteomic map of 28 mouse tissues using the SILAC mouse. Molecular & cellular proteomics : MCP Geiger, T., Velic, A., Macek, B., Lundberg, E., Kampf, C., Nagaraj, N., Uhlen, M., Cox, J., Mann, M. 2013; 12 (6): 1709-22

Abstract

Identifying the building blocks of mammalian tissues is a precondition for understanding their function. In particular, global and quantitative analysis of the proteome of mammalian tissues would point to tissue-specific mechanisms and place the function of each protein in a whole-organism perspective. We performed proteomic analyses of 28 mouse tissues using high-resolution mass spectrometry and used a mix of mouse tissues labeled via stable isotope labeling with amino acids in cell culture as a "spike-in" internal standard for accurate protein quantification across these tissues. We identified a total of 7,349 proteins and quantified 6,974 of them. Bioinformatic data analysis showed that physiologically related tissues clustered together and that highly expressed proteins represented the characteristic tissue functions. Tissue specialization was reflected prominently in the proteomic profiles and is apparent already in their hundred most abundant proteins. The proportion of strictly tissue-specific proteins appeared to be small. However, even proteins with household functions, such as those in ribosomes and spliceosomes, can have dramatic expression differences among tissues. We describe a computational framework with which to correlate proteome profiles with physiological functions of the tissue. Our data will be useful to the broad scientific community as an initial atlas of protein expression of a mammalian species.

View details for DOI 10.1074/mcp.M112.024919

View details for PubMedID 23436904

View details for PubMedCentralID PMC3675825
A texture based pattern recognition approach to distinguish melanoma from non-melanoma cells in histopathological tissue microarray sections. PloS one Rexhepaj, E., Agnarsdóttir, M., Bergman, J., Edqvist, P. H., Bergqvist, M., Uhlén, M., Gallagher, W. M., Lundberg, E., Ponten, F. 2013; 8 (5): e62070

Abstract

Immunohistochemistry is a routine practice in clinical cancer diagnostics and also an established technology for tissue-based research regarding biomarker discovery efforts. Tedious manual assessment of immunohistochemically stained tissue needs to be fully automated to take full advantage of the potential for high throughput analyses enabled by tissue microarrays and digital pathology. Such automated tools also need to be reproducible for different experimental conditions and biomarker targets. In this study we present a novel supervised melanoma specific pattern recognition approach that is fully automated and quantitative. Melanoma samples were immunostained for the melanocyte specific target, Melan-A. Images representing immunostained melanoma tissue were then digitally processed to segment regions of interest, highlighting Melan-A positive and negative areas. Color deconvolution was applied to each region of interest to separate the channel containing the immunohistochemistry signal from the hematoxylin counterstaining channel. A support vector machine melanoma classification model was learned from a discovery melanoma patient cohort (n = 264) and subsequently validated on an independent cohort of melanoma patient tissue sample images (n = 157). Here we propose a novel method that takes advantage of utilizing an immunohistochemical marker highlighting melanocytes to fully automate the learning of a general melanoma cell classification model. The presented method can be applied on any protein of interest and thus provides a tool for quantification of immunohistochemistry-based protein expression in melanoma.

View details for DOI 10.1371/journal.pone.0062070

View details for PubMedID 23690928

View details for PubMedCentralID PMC3656869
Majority of differentially expressed genes are down-regulated during malignant transformation in a four-stage model. Proceedings of the National Academy of Sciences of the United States of America Danielsson, F., Skogs, M., Huss, M., Rexhepaj, E., O'Hurley, G., Klevebring, D., Pontén, F., Gad, A. K., Uhlén, M., Lundberg, E. 2013; 110 (17): 6853-8

Abstract

The transformation of normal cells to malignant, metastatic tumor cells is a multistep process caused by the sequential acquirement of genetic changes. To identify these changes, we compared the transcriptomes and levels and distribution of proteins in a four-stage cell model of isogenically matched normal, immortalized, transformed, and metastatic human cells, using deep transcriptome sequencing and immunofluorescence microscopy. The data show that ∼6% (n = 1,357) of the human protein-coding genes are differentially expressed across the stages in the model. Interestingly, the majority of these genes are down-regulated, linking malignant transformation to dedifferentiation. The up-regulated genes are mainly components that control cellular proliferation, whereas the down-regulated genes consist of proteins exposed on or secreted from the cell surface. As many of the identified gene products control basic cellular functions that are defective in cancers, the data provide candidates for follow-up studies to investigate their functional roles in tumor formation. When we further compared the expression levels of four of the identified proteins in clinical cancer cohorts, similar differences were observed between benign and cancer cells, as in the cell model. This shows that this comprehensive demonstration of the molecular changes underlying malignant transformation is a relevant model to study the process of tumor formation.

View details for DOI 10.1073/pnas.1216436110

View details for PubMedID 23569271

View details for PubMedCentralID PMC3637701
Immunofluorescence and fluorescent-protein tagging show high correlation for protein localization in mammalian cells. Nature methods Stadler, C., Rexhepaj, E., Singan, V. R., Murphy, R. F., Pepperkok, R., Uhlén, M., Simpson, J. C., Lundberg, E. 2013; 10 (4): 315-23

Abstract

Imaging techniques such as immunofluorescence (IF) and the expression of fluorescent protein (FP) fusions are widely used to investigate the subcellular distribution of proteins. Here we report a systematic analysis of >500 human proteins comparing the localizations obtained in live versus fixed cells using FPs and IF, respectively. We identify systematic discrepancies between IF and FPs as well as between FP tagging at the N and C termini. The analysis shows that for 80% of the proteins, IF and FPs yield the same subcellular distribution, and the locations of 250 previously unlocalized proteins were determined by the overlap between the two methods. Approximately 60% of proteins localize to multiple organelles for both methods, indicating a complex subcellular protein organization. These results show that both IF and FP tagging are reliable techniques and demonstrate the usefulness of an integrative approach for a complete investigation of the subcellular human proteome.

View details for DOI 10.1038/nmeth.2377

View details for PubMedID 23435261
Centrosome isolation and analysis by mass spectrometry-based proteomics. Methods in enzymology Jakobsen, L., Schrøder, J. M., Larsen, K. M., Lundberg, E., Andersen, J. S. 2013; 525: 371-93

Abstract

Centrioles are microtubule-based scaffolds that are essential for the formation of centrosomes, cilia, and flagella with important functions throughout the cell cycle, in physiology and during development. The ability to purify centriole-containing organelles on a large scale, combined with advances in protein identification using mass spectrometry-based proteomics, have revealed multiple centriole-associated proteins that are conserved during evolution in eukaryotes. Despite these advances, the molecular basis for the plethora of processes coordinated by cilia and centrosomes is not fully understood. Considering the complexity and dynamics of centriole-related proteomes and the first-pass analyses reported so far, it is likely that further insight might come from more thorough proteome analyses under various cellular and physiological conditions. To this end, we here describe methods to isolate centrosomes from human cells and strategies to selectively identify and study the properties of the associated proteins using quantitative mass spectrometry-based proteomics.

View details for DOI 10.1016/B978-0-12-397944-5.00018-3

View details for PubMedID 23522479
RNA deep sequencing as a tool for selection of cell lines for systematic subcellular localization of all human proteins. Journal of proteome research Danielsson, F., Wiking, M., Mahdessian, D., Skogs, M., Ait Blal, H., Hjelmare, M., Stadler, C., Uhlén, M., Lundberg, E. 2013; 12 (1): 299-307

Abstract

One of the major challenges of a chromosome-centric proteome project is to explore in a systematic manner the potential proteins identified from the chromosomal genome sequence, but not yet characterized on a protein level. Here, we describe the use of RNA deep sequencing to screen human cell lines for RNA profiles and to use this information to select cell lines suitable for characterization of the corresponding gene product. In this manner, the subcellular localization of proteins can be analyzed systematically using antibody-based confocal microscopy. We demonstrate the usefulness of selecting cell lines with high expression levels of RNA transcripts to increase the likelihood of high quality immunofluorescence staining and subsequent successful subcellular localization of the corresponding protein. The results show a path to combine transcriptomics with affinity proteomics to characterize the proteins in a gene- or chromosome-centric manner.

View details for DOI 10.1021/pr3009308

View details for PubMedID 23227862
A Chromosome-centric Human Proteome Project (C-HPP) to Characterize the Sets of Proteins Encoded in Chromosome 17 JOURNAL OF PROTEOME RESEARCH Liu, S., Im, H., Bairoch, A., Cristofanilli, M., Chen, R., Deutsch, E. W., Dalton, S., Fenyo, D., Fanayan, S., Gates, C., Gaudet, P., Hincapie, M., Hanash, S., Kim, H., Jeong, S., Lundberg, E., Mias, G., Menon, R., Mu, Z., Nice, E., Paik, Y., Uhlen, M., Wells, L., Wu, S., Yan, F., Zhang, F., Zhang, Y., Snyder, M., Omenn, G. S., Beavis, R. C., Hancock, W. S. 2013; 12 (1): 45-57

Abstract

We report progress assembling the parts list for chromosome 17 and illustrate the various processes that we have developed to integrate available data from diverse genomic and proteomic knowledge bases. As primary resources, we have used GPMDB, neXtProt, PeptideAtlas, Human Protein Atlas (HPA), and GeneCards. All sites share the common resource of Ensembl for the genome modeling information. We have defined the chromosome 17 parts list with the following information: 1169 protein-coding genes, the numbers of proteins confidently identified by various experimental approaches as documented in GPMDB, neXtProt, PeptideAtlas, and HPA, examples of typical data sets obtained by RNASeq and proteomic studies of epithelial derived tumor cell lines (disease proteome) and a normal proteome (peripheral mononuclear cells), reported evidence of post-translational modifications, and examples of alternative splice variants (ASVs). We have constructed a list of the 59 "missing" proteins as well as 201 proteins that have inconclusive mass spectrometric (MS) identifications. In this report we have defined a process to establish a baseline for the incorporation of new evidence on protein identification and characterization as well as related information from transcriptome analyses. This initial list of "missing" proteins will guide the selection of appropriate samples for discovery studies as well as antibody reagents. Also we have illustrated the significant diversity of protein variants (including post-translational modifications, PTMs) using regions on chromosome 17 that contain important oncogenes. We emphasize the need for mandated deposition of proteomics data in public databases, the further development of improved PTM, ASV, and single nucleotide variant (SNV) databases, and the construction of Web sites that can integrate and regularly update such information.
In addition, we describe the distribution of both clustered and scattered sets of protein families on the chromosome. Since chromosome 17 is rich in cancer-associated genes, we have focused the clustering of cancer-associated genes in such genomic regions and have used the ERBB2 amplicon as an example of the value of a proteogenomic approach in which one integrates transcriptomic with proteomic information and captures evidence of coexpression through coordinated regulation.

View details for DOI 10.1021/pr300985j

View details for Web of Science ID 000313156300007

View details for PubMedID 23259914
Automated analysis and reannotation of subcellular locations in confocal images from the Human Protein Atlas. PloS one Li, J., Newberg, J. Y., Uhlén, M., Lundberg, E., Murphy, R. F. 2012; 7 (11): e50514

Abstract

The Human Protein Atlas contains immunofluorescence images showing subcellular locations for thousands of proteins. These are currently annotated by visual inspection. In this paper, we describe automated approaches to analyze the images and their use to improve annotation. We began by training classifiers to recognize the annotated patterns. By ranking proteins according to the confidence of the classifier, we generated a list of proteins that were strong candidates for reexamination. In parallel, we applied hierarchical clustering to group proteins and identified proteins whose annotations were inconsistent with the remainder of the proteins in their cluster. These proteins were reexamined by the original annotators, and a significant fraction had their annotations changed. The results demonstrate that automated approaches can provide an important complement to visual annotation.

View details for DOI 10.1371/journal.pone.0050514

View details for PubMedID 23226299

View details for PubMedCentralID PMC3511558
Estimating microtubule distributions from 2D immunofluorescence microscopy images reveals differences among human cultured cell lines. PloS one Li, J., Shariff, A., Wiking, M., Lundberg, E., Rohde, G. K., Murphy, R. F. 2012; 7 (11): e50292

Abstract

Microtubules are filamentous structures that are involved in several important cellular processes, including cell division, cellular structure and mechanics, and intracellular transportation. Little is known about potential differences in microtubule distributions within and across cell lines. Here we describe a method to estimate information pertaining to 3D microtubule distributions from 2D fluorescence images. Our method allows for quantitative comparisons of microtubule distribution parameters (number of microtubules, mean length) between different cell lines. Among eleven cell lines compared, some showed differences that could be accounted for by differences in the total amount of tubulin per cell while others showed statistically significant differences in the balance between number and length of microtubules. We also observed that some cell lines that visually appear different in their microtubule distributions are quite similar when the model parameters are considered. The method is expected to be generally useful for comparing microtubule distributions between cell lines and for a given cell line after various perturbations. The results are also expected to enable analysis of the differences in gene expression underlying the observed differences in microtubule distributions among cell types.

View details for DOI 10.1371/journal.pone.0050292

View details for PubMedID 23209697

View details for PubMedCentralID PMC3508979
Comprehensive analysis of the genome transcriptome and proteome landscapes of three tumor cell lines. Genome medicine Akan, P., Alexeyenko, A., Costea, P. I., Hedberg, L., Solnestam, B. W., Lundin, S., Hällman, J., Lundberg, E., Uhlén, M., Lundeberg, J. 2012; 4 (11): 86

Abstract

We here present a comparative genome, transcriptome and functional network analysis of three human cancer cell lines (A431, U251MG and U2OS), and investigate their relation to protein expression. Gene copy numbers significantly influenced corresponding transcript levels; their effect on protein levels was less pronounced. We focused on genes with altered mRNA and/or protein levels to identify those active in tumor maintenance. We provide comprehensive information for the three genomes and demonstrate the advantage of integrative analysis for identifying tumor-related genes amidst numerous background mutations by relating genomic variation to expression/protein abundance data and use gene networks to reveal implicated pathways.

View details for DOI 10.1186/gm387

View details for PubMedID 23158748

View details for PubMedCentralID PMC3580420
Comparison of total and cytoplasmic mRNA reveals global regulation by nuclear retention and miRNAs. BMC genomics Solnestam, B. W., Stranneheim, H., Hällman, J., Käller, M., Lundberg, E., Lundeberg, J., Akan, P. 2012; 13: 574

Abstract

The majority of published gene-expression studies have used RNA isolated from whole cells, overlooking the potential impact of including the nuclear transcriptome in the analyses. In this study, mRNA fractions from the cytoplasm and from whole cells (total RNA) were prepared from three human cell lines and sequenced using massively parallel sequencing. For all three cell lines, of about 15000 detected genes, approximately 400 to 1400 genes were detected in different amounts in the cytoplasmic and total RNA fractions. Transcripts detected at higher levels in the total RNA fraction had longer coding sequences and a higher number of miRNA target sites. Transcripts detected at higher levels in the cytoplasmic fraction were shorter or contained shorter untranslated regions. Nuclear retention of transcripts and mRNA degradation via the miRNA pathway might contribute to this differential detection of genes. The consequence of the differential detection was further investigated by comparison to proteomics data. Interestingly, the expression profiles of cytoplasmic and total RNA correlated equally well with protein abundance levels, indicating regulation at a higher level. We conclude that expression levels derived from the total RNA fraction can be regarded as an appropriate estimate of the amount of mRNAs present in a given cell population, independent of the coding sequence length or UTRs.

View details for DOI 10.1186/1471-2164-13-574

View details for PubMedID 23110385

View details for PubMedCentralID PMC3495644
Proteomic Analysis Reveals Drug Accessible Cell Surface N-Glycoproteins of Primary and Established Glioblastoma Cell Lines. Journal of proteome research Bock, T., Moest, H., Omasits, U., Dolski, S., Lundberg, E., Frei, A., Hofmann, A., Bausch-Fluck, D., Jacobs, A., Krayenbuehl, N., Uhlen, M., Aebersold, R., Frei, K., Wollscheid, B. 2012; 11 (10): 4885-4893

Abstract

Glioblastoma is the most common primary brain tumor in adults with low average survival time after diagnosis. In order to improve glioblastoma treatment, new drug-accessible targets need to be identified. Cell surface glycoproteins are prime drug targets due to their accessibility at the surface of cancer cells. To overcome the limited availability of suitable antibodies for cell surface protein detection, we performed a comprehensive mass spectrometric investigation of the glioblastoma surfaceome. Our combined cell surface capturing analysis of primary ex vivo glioblastoma cell lines in combination with established glioblastoma cell lines revealed 633 N-glycoproteins, which vastly extends the known data of surfaceome drug targets at subcellular resolution. We provide direct evidence of common glioblastoma cell surface glycoproteins and an approximate estimate of their abundances, information that could not be derived from genomic and/or transcriptomic glioblastoma studies. Apart from our pharmaceutically valuable repertoire of already and potentially drug-accessible cell surface glycoproteins, we built a mass-spectrometry-based toolbox enabling directed, sensitive, and repetitive glycoprotein measurements for clinical follow-up studies. The included Skyline Glioblastoma SRM assay library provides an elevated starting point for parallel testing of the abundance level of the detected glioblastoma surfaceome members in future drug perturbation experiments.

View details for DOI 10.1021/pr300360a

View details for Web of Science ID 000309441000011

View details for PubMedID 22909291
A tool to facilitate clinical biomarker studies--a tissue dictionary based on the Human Protein Atlas. BMC medicine Kampf, C., Bergman, J., Oksvold, P., Asplund, A., Navani, S., Wiking, M., Lundberg, E., Uhlén, M., Ponten, F. 2012; 10: 103

Abstract

The complexity of tissue and the alterations that distinguish normal from cancer remain a challenge for translating results from tumor biological studies into clinical medicine. This has generated an unmet need to exploit the findings from studies based on cell lines and model organisms to develop, validate and clinically apply novel diagnostic, prognostic and treatment predictive markers. As one step to meet this challenge, the Human Protein Atlas project has been set up to produce antibodies towards human protein targets corresponding to all human protein coding genes and to map protein expression in normal human tissues, cancer and cells. Here, we present a dictionary based on microscopy images created as an amendment to the Human Protein Atlas. The aim of the dictionary is to facilitate the interpretation and use of the image-based data available in the Human Protein Atlas, but also to serve as a tool for training and understanding tissue histology, pathology and cell biology. The dictionary contains three main parts, normal tissues, cancer tissues and cells, and is based on high-resolution images at different magnifications of full tissue sections stained with H & E. The cell atlas is centered on immunofluorescence and confocal microscopy images, using different color channels to highlight the organelle structure of a cell. Here, we explain how this dictionary can be used as a tool to aid clinicians and scientists in understanding the use of tissue histology and cancer pathology in diagnostics and biomarker studies.

View details for DOI 10.1186/1741-7015-10-103

View details for PubMedID 22971420

View details for PubMedCentralID PMC3523031
Systematic validation of antibody binding and protein subcellular localization using siRNA and confocal microscopy. Journal of proteomics Stadler, C., Hjelmare, M., Neumann, B., Jonasson, K., Pepperkok, R., Uhlén, M., Lundberg, E. 2012; 75 (7): 2236-51

Abstract

We have developed a platform for validation of antibody binding and protein subcellular localization data obtained from immunofluorescence using siRNA technology combined with automated confocal microscopy and image analysis. By combining the siRNA technology with automated sample preparation, automated imaging and quantitative image analysis, a high-throughput assay has been set up to enable confirmation of accurate protein binding and localization in a systematic manner. Here, we describe the analysis and validation of the subcellular location of 65 human proteins, targeted by 75 antibodies and silenced by 130 siRNAs. A large fraction (80%) of the subcellular locations, including locations of several previously uncharacterized proteins, could be confirmed by the significant down-regulation of the antibody signal after the siRNA silencing. A quantitative analysis was set up using automated image analysis to facilitate studies of targets found in more than one compartment. The results obtained using the platform demonstrate that siRNA silencing in combination with quantitative image analysis of antibody signals in different compartments of the cells is an attractive approach for ensuring accurate protein localization as well as antibody binding using immunofluorescence. With a large fraction of the human proteome still unexplored, we suggest this approach to be of great importance in the continued work of mapping the human proteome on a subcellular level.

View details for DOI 10.1016/j.jprot.2012.01.030

View details for PubMedID 22361696
Identification of autophagosome-associated proteins and regulators by quantitative proteomic analysis and genetic screens. Molecular & cellular proteomics : MCP Dengjel, J., Høyer-Hansen, M., Nielsen, M. O., Eisenberg, T., Harder, L. M., Schandorff, S., Farkas, T., Kirkegaard, T., Becker, A. C., Schroeder, S., Vanselow, K., Lundberg, E., Nielsen, M. M., Kristensen, A. R., Akimov, V., Bunkenborg, J., Madeo, F., Jäättelä, M., Andersen, J. S. 2012; 11 (3): M111.014035

Abstract

Autophagy is one of the major intracellular catabolic pathways, but little is known about the composition of autophagosomes. To study the associated proteins, we isolated autophagosomes from human breast cancer cells using two different biochemical methods and three stimulus types: amino acid deprivation or rapamycin or concanamycin A treatment. The autophagosome-associated proteins were dependent on stimulus, but a core set of proteins was stimulus-independent. Remarkably, proteasomal proteins were abundant among the stimulus-independent common autophagosome-associated proteins, and the activation of autophagy significantly decreased the cellular proteasome level and activity supporting interplay between the two degradation pathways. A screen of yeast strains defective in the orthologs of the human genes encoding for a common set of autophagosome-associated proteins revealed several regulators of autophagy, including subunits of the retromer complex. The combined spatiotemporal proteomic and genetic data sets presented here provide a basis for further characterization of autophagosome biogenesis and cargo selection.

View details for DOI 10.1074/mcp.M111.014035

View details for PubMedID 22311637

View details for PubMedCentralID PMC3316729
Antibody-based protein profiling of the human chromosome 21. Molecular & cellular proteomics : MCP Uhlén, M., Oksvold, P., Älgenäs, C., Hamsten, C., Fagerberg, L., Klevebring, D., Lundberg, E., Odeberg, J., Pontén, F., Kondo, T., Sivertsson, Å. 2012; 11 (3): M111.013458

Abstract

The Human Proteome Project has been proposed to create a knowledge-based resource based on a systematical mapping of all human proteins, chromosome by chromosome, in a gene-centric manner. With this background, we here describe the systematic analysis of chromosome 21 using an antibody-based approach for protein profiling using both confocal microscopy and immunohistochemistry, complemented with transcript profiling using next generation sequencing data. We also describe a new approach for protein isoform analysis using a combination of antibody-based probing and isoelectric focusing. The analysis has identified several genes on chromosome 21 with no previous evidence on the protein level, and the isoform analysis indicates that a large fraction of human proteins have multiple isoforms. A chromosome-wide matrix is presented with status for all chromosome 21 genes regarding subcellular localization, tissue distribution, and molecular characterization of the corresponding proteins. The path to generate a chromosome-specific resource, including integrated data from complementary assay platforms, such as mass spectrometry and gene tagging analysis, is discussed.

View details for DOI 10.1074/mcp.M111.013458

View details for PubMedID 22042635

View details for PubMedCentralID PMC3316724
Characterization of MRFAP1 turnover and interactions downstream of the NEDD8 pathway. Molecular & cellular proteomics : MCP Larance, M., Kirkwood, K. J., Xirodimas, D. P., Lundberg, E., Uhlen, M., Lamond, A. I. 2012; 11 (3): M111.014407

Abstract

The NEDD8-Cullin E3 ligase pathway plays an important role in protein homeostasis, in particular the degradation of cell cycle regulators and transcriptional control networks. To characterize NEDD8-cullin target proteins, we performed a quantitative proteomic analysis of cells treated with MLN4924, a small molecule inhibitor of the NEDD8 conjugation pathway. MRFAP1 and its interaction partner, MORF4L1, were among the most up-regulated proteins after NEDD8 inhibition in multiple human cell lines. We show that MRFAP1 has a fast turnover rate in the absence of MLN4924 and is degraded via the ubiquitin-proteasome system. The increased abundance of MRFAP1 after MLN4924 treatment results from a decreased rate of degradation. Characterization of the binding partners of both MRFAP1 and MORF4L1 revealed a complex protein-protein interaction network. MRFAP1 bound to a number of E3 ubiquitin ligases, including CUL4B, but not to components of the NuA4 complex, including MRGBP, which bound to MORF4L1. These data indicate that MRFAP1 may regulate the ability of MORF4L1 to interact with chromatin-modifying enzymes by binding to MORF4L1 in a mutually exclusive manner with MRGBP. Analysis of MRFAP1 expression in human tissues by immunostaining with a MRFAP1-specific antibody revealed that it was detectable in only a small number of tissues, in particular testis and brain. Strikingly, analysis of the seminiferous tubules of the testis showed the highest nuclear staining in the spermatogonia and much weaker staining in the spermatocytes and spermatids. MRGBP was inversely correlated with MRFAP1 expression in these cell types, consistent with an exchange of MORF4L1 interaction partners as cells progress through meiosis in the testis. These data highlight an important new arm of the NEDD8-cullin pathway.

View details for DOI 10.1074/mcp.M111.014407

View details for PubMedID 22038470

View details for PubMedCentralID PMC3316733
Systematic analysis of protein pools, isoforms, and modifications affecting turnover and subcellular localization. Molecular & cellular proteomics : MCP Ahmad, Y., Boisvert, F. M., Lundberg, E., Uhlen, M., Lamond, A. I. 2012; 11 (3): M111.013680

Abstract

In higher eukaryotes many genes encode protein isoforms whose properties and biological roles are often poorly characterized. Here we describe systematic approaches for detection of either distinct isoforms, or separate pools of the same isoform, with differential biological properties. Using information from ion intensities we have estimated protein abundance levels and using rates of change in stable isotope labeling with amino acids in cell culture isotope ratios we measured turnover rates and subcellular distribution for the HeLa cell proteome. Protein isoforms were detected using three data analysis strategies that evaluate differences between stable isotope labeling with amino acids in cell culture isotope ratios for specific groups of peptides within the total set of peptides assigned to a protein. The candidate approach compares stable isotope labeling with amino acids in cell culture isotope ratios for predicted isoform-specific peptides, with ratio values for peptides shared by all the isoforms. The rule of thirds approach compares the mean isotope ratio values for all peptides in each of three equal segments along the linear length of the protein, assessing differences between segment values. The three in a row approach compares mean isotope ratio values for each sequential group of three adjacent peptides, assessing differences with the mean value for all peptides assigned to the protein. Protein isoforms were also detected and their properties evaluated by fractionating cell extracts on one-dimensional SDS-PAGE prior to trypsin digestion and MS analysis and independently evaluating isotope ratio values for the same peptides isolated from different gel slices. The effect of protein phosphorylation on turnover rates was analyzed by comparing mean turnover values calculated for all peptides assigned to a protein, either including, or excluding, values for cognate phosphopeptides. 
Collectively, these experimental and analytical approaches provide a framework for expanding the functional annotation of the genome.
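
The "rule of thirds" and "three in a row" comparisons described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The input layout (a list of hypothetical `(position, ratio)` pairs plus a `protein_length`), the function names, and the use of a sliding window for the groups of three adjacent peptides are all assumptions made for illustration.

```python
from statistics import mean


def rule_of_thirds(peptides, protein_length):
    """Mean isotope ratio in each of three equal segments of the protein.

    `peptides` is a list of (position, ratio) pairs, where `position` is a
    0-based residue index smaller than `protein_length` (assumed layout).
    Segments without peptides are reported as None.
    """
    segments = [[], [], []]
    for position, ratio in peptides:
        index = min(3 * position // protein_length, 2)
        segments[index].append(ratio)
    return [mean(values) if values else None for values in segments]


def three_in_a_row(peptides):
    """Difference between each window of three adjacent peptides and the
    mean ratio of all peptides assigned to the protein.

    A sliding window is assumed here; the abstract does not say whether
    the groups overlap.
    """
    ordered = [ratio for _, ratio in sorted(peptides)]
    overall = mean(ordered)
    return [mean(ordered[i:i + 3]) - overall for i in range(len(ordered) - 2)]


# Illustrative values only: the last third of the protein behaves differently.
example = [(0, 1.0), (10, 1.0), (40, 1.0), (50, 1.0), (80, 4.0), (90, 4.0)]
print(rule_of_thirds(example, protein_length=100))
print(three_in_a_row(example))
```

A segment or window whose mean departs clearly from the rest would flag the protein as a candidate for distinct isoforms or pools. Any threshold for "clearly" is left open here because the abstract does not give one.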

View details for DOI 10.1074/mcp.M111.013680

View details for PubMedID 22002106

View details for PubMedCentralID PMC3316725
A Protein Epitope Signature Tag (PrEST) library allows SILAC-based absolute quantification and multiplexed determination of protein copy numbers in cell lines. Molecular & cellular proteomics : MCP Zeiler, M., Straube, W. L., Lundberg, E., Uhlen, M., Mann, M. 2012; 11 (3): O111.009613

Abstract

Mass spectrometry-based proteomics increasingly relies on relative or absolute quantification. In relative quantification, stable isotope based methods often allow mixing at early stages of sample preparation, whereas for absolute quantification this has generally required recombinant expression of full length, labeled protein standards. Here we make use of a very large library of Protein Epitope Signature Tags (PrESTs) that has been developed in the course of the Human Protein Atlas Project. These PrESTs are expressed recombinantly in E. coli and they consist of a short and unique region of the protein of interest as well as purification and solubility tags. We first quantify a highly purified, stable isotope labeling of amino acids in cell culture (SILAC)-labeled version of the solubility tag and use it to determine the precise amount of each PrEST by its SILAC ratios. The PrESTs are then spiked into cell lysates and the SILAC ratios of PrEST peptides to peptides from endogenous target proteins yield their cellular quantities. The procedure can readily be multiplexed, as we demonstrate by simultaneously determining the copy number of 40 proteins in HeLa cells. Among the proteins analyzed, the cytoskeletal protein vimentin was found to be most abundant with 20 million copies per cell, while the transcription factor and oncogene FOS only had 6000 copies. Direct quantification of the absolute amount of single proteins is possible via a SILAC experiment in which labeled cell lysate is mixed both with the heavy labeled solubility tag and with the corresponding PrEST. The SILAC-PrEST combination allows accurate and streamlined quantification of the absolute or relative amount of proteins of interest in a wide variety of applications.
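
The arithmetic behind turning a SILAC ratio against a spiked-in standard into a copy number can be sketched as follows. This is a generic illustration, not the published workflow. The function name, the parameter names, and the input numbers are hypothetical, and a single endogenous-to-PrEST ratio per protein is assumed.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def copies_per_cell(prest_spiked_mol, endogenous_to_prest_ratio, cell_count):
    """Estimate protein copies per cell from a spiked-in standard.

    prest_spiked_mol: known amount of PrEST standard added to the lysate (mol)
    endogenous_to_prest_ratio: SILAC ratio of endogenous peptide signal to
        PrEST peptide signal (assumed to be one summary value per protein)
    cell_count: number of cells that went into the lysate
    """
    endogenous_mol = prest_spiked_mol * endogenous_to_prest_ratio
    return endogenous_mol * AVOGADRO / cell_count


# Illustrative numbers only, not taken from the paper:
# 100 fmol of standard, ratio of 2.0, one million cells.
example = copies_per_cell(prest_spiked_mol=100e-15,
                          endogenous_to_prest_ratio=2.0,
                          cell_count=1_000_000)
print(f"{example:.0f} copies per cell")
```

The estimate scales linearly with the measured ratio, so any error in the amount of spiked standard carries through directly. This is presumably why the abstract describes first determining the precise amount of each PrEST.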

View details for DOI 10.1074/mcp.O111.009613

View details for PubMedID 21964433

View details for PubMedCentralID PMC3316735
Generation of monospecific antibodies based on affinity capture of polyclonal antibodies. Protein science : a publication of the Protein Society Hjelm, B., Forsström, B., Igel, U., Johannesson, H., Stadler, C., Lundberg, E., Ponten, F., Sjöberg, A., Rockberg, J., Schwenk, J. M., Nilsson, P., Johansson, C., Uhlén, M. 2011; 20 (11): 1824-35

Abstract

A method is described to generate and validate antibodies based on mapping the linear epitopes of a polyclonal antibody followed by sequential epitope-specific capture using synthetic peptides. Polyclonal antibodies directed towards four proteins RBM3, SATB2, ANLN, and CNDP1, potentially involved in human cancers, were selected and antibodies to several non-overlapping epitopes were generated and subsequently validated by Western blot, immunohistochemistry, and immunofluorescence. For all four proteins, a dramatic difference in functionality could be observed for these monospecific antibodies directed to the different epitopes. In each case, at least one antibody was obtained with full functionality across all applications, while other epitope-specific fractions showed no or little functionality. These results present a path forward to use the mapped binding sites of polyclonal antibodies to generate epitope-specific antibodies, providing an attractive approach for large-scale efforts to characterize the human proteome by antibodies.

View details for DOI 10.1002/pro.716

View details for PubMedID 21898641

View details for PubMedCentralID PMC3267947
Mapping the subcellular protein distribution in three human cell lines. Journal of proteome research Fagerberg, L., Stadler, C., Skogs, M., Hjelmare, M., Jonasson, K., Wiking, M., Abergh, A., Uhlén, M., Lundberg, E. 2011; 10 (8): 3766-77

Abstract

The subcellular locations of proteins are closely related to their function and constitute an essential aspect for understanding the complex machinery of living cells. A systematic effort has been initiated to map the protein distribution in three functionally different cell lines with the aim to provide a subcellular localization index for at least one representative protein from all human protein-encoding genes. Here, we present the results of more than 3500 proteins mapped to 16 subcellular compartments. The results indicate a ubiquitous protein expression with a majority of the proteins found in all three cell lines and a large portion localized to two or more compartments. The inter-relationships between the subcellular compartments are visualized in a protein-compartment network based on all detected proteins. Hierarchical clustering was performed to determine how closely related the organelles are in terms of protein constituents and compare the proteins detected in each cell type. Our results show distinct organelle proteomes, well conserved across the cell types, and demonstrate that biochemically similar organelles are grouped together.

View details for DOI 10.1021/pr200379a

View details for PubMedID 21675716
SATB2 in combination with cytokeratin 20 identifies over 95% of all colorectal carcinomas. The American journal of surgical pathology Magnusson, K., de Wit, M., Brennan, D. J., Johnson, L. B., McGee, S. F., Lundberg, E., Naicker, K., Klinger, R., Kampf, C., Asplund, A., Wester, K., Gry, M., Bjartell, A., Gallagher, W. M., Rexhepaj, E., Kilpinen, S., Kallioniemi, O. P., Belt, E., Goos, J., Meijer, G., Birgisson, H., Glimelius, B., Borrebaeck, C. A., Navani, S., Uhlén, M., O'Connor, D. P., Jirström, K., Pontén, F. 2011; 35 (7): 937-48

Abstract

The special AT-rich sequence-binding protein 2 (SATB2), a nuclear matrix-associated transcription factor and epigenetic regulator, was identified as a tissue type-specific protein when screening protein expression patterns in human normal and cancer tissues using an antibody-based proteomics approach. In this respect, the SATB2 protein shows a selective pattern of expression and, within cells of epithelial lineages, SATB2 expression is restricted to glandular cells lining the lower gastrointestinal tract. The expression of SATB2 protein is primarily preserved in cancer cells of colorectal origin, indicating that SATB2 could function as a clinically useful diagnostic marker to distinguish colorectal cancer (CRC) from other types of cancer. The aim of this study was to further explore and validate the specific expression pattern of SATB2 as a clinical biomarker and to compare SATB2 with the well-known cytokeratin 20 (CK20). Immunohistochemistry was used to analyze the extent of SATB2 expression in tissue microarrays with tumors from 9 independent cohorts of patients with primary and metastatic CRCs (n=1882). Our results show that SATB2 is a sensitive and highly specific marker for CRC with distinct positivity in 85% of all CRCs, and that SATB2 and/or CK20 was positive in 97% of CRCs. In conclusion, the specific expression of SATB2 in a large majority of CRCs suggests that SATB2 can be used as an important complementary tool for the differential diagnosis of carcinoma of unknown primary origin.

View details for DOI 10.1097/PAS.0b013e31821c3dae

View details for PubMedID 21677534
Novel asymmetrically localizing components of human centrosomes identified by complementary proteomics methods. The EMBO journal Jakobsen, L., Vanselow, K., Skogs, M., Toyoda, Y., Lundberg, E., Poser, I., Falkenby, L. G., Bennetzen, M., Westendorf, J., Nigg, E. A., Uhlen, M., Hyman, A. A., Andersen, J. S. 2011; 30 (8): 1520-35

Abstract

Centrosomes in animal cells are dynamic organelles with a proteinaceous matrix of pericentriolar material assembled around a pair of centrioles. They organize the microtubule cytoskeleton and the mitotic spindle apparatus. Mature centrioles are essential for biogenesis of primary cilia that mediate key signalling events. Despite recent advances, the molecular basis for the plethora of processes coordinated by centrosomes is not fully understood. We have combined protein identification and localization, using PCP-SILAC mass spectrometry, BAC transgeneOmics, and antibodies to define the constituents of human centrosomes. From a background of non-specific proteins, we distinguished 126 known and 40 candidate centrosomal proteins, of which 22 were confirmed as novel components. An antibody screen covering 4000 genes revealed an additional 113 candidates. We illustrate the power of our methods by identifying a novel set of five proteins preferentially associated with mother or daughter centrioles, comprising genes implicated in cell polarity. Pulsed labelling demonstrates a remarkable variation in the stability of centrosomal protein complexes. These spatiotemporal proteomics data provide leads to the further functional characterization of centrosomal proteins.

View details for DOI 10.1038/emboj.2011.63

View details for PubMedID 21399614

View details for PubMedCentralID PMC3102290
What determines specific cell functions? Lab on a chip Lundberg, E., Svahn, H. 2011; 11 (12): 2039-2041

View details for DOI 10.1039/c1lc90018h

View details for Web of Science ID 000291166000010

View details for PubMedID 21384036
Selection and characterisation of affibody molecules inhibiting the interaction between Ras and Raf in vitro. New biotechnology Grimm, S., Lundberg, E., Yu, F., Shibasaki, S., Vernet, E., Skogs, M., Nygren, P. Å., Gräslund, T. 2010; 27 (6): 766-73

Abstract

Development of molecules with the ability to selectively inhibit particular protein-protein interactions is important in providing tools for understanding cell biology. In this work, we describe efforts to select small Ras- and Raf-specific three-helix bundle affibody binding proteins capable of inhibiting the interaction between H-Ras and Raf-1, from a combinatorial library displayed on bacteriophage. Target-specific variants with typically high nanomolar or low micromolar affinities (K(D)) could be selected successfully against both proteins, as shown by dot blot, ELISA and real-time biospecific interaction analyses. Affibody molecule variants selected against H-Ras were shown to bind epitopes overlapping each other at a site that differed from that at which H-Ras interacts with Raf-1. In contrast, an affibody molecule isolated during selection against Raf-1 was shown to effectively inhibit the interaction between H-Ras and Raf-1 in a dose-dependent manner. Possible intracellular applications of the selected affibody molecules are discussed.

View details for DOI 10.1016/j.nbt.2010.07.016

View details for PubMedID 20674812
Defining the transcriptome and proteome in three functionally different human cell lines. Molecular systems biology Lundberg, E., Fagerberg, L., Klevebring, D., Matic, I., Geiger, T., Cox, J., Algenäs, C., Lundeberg, J., Mann, M., Uhlen, M. 2010; 6: 450

Abstract

An essential question in human biology is how cells and tissues differ in gene and protein expression and how these differences delineate specific biological function. Here, we have performed a global analysis of both mRNA and protein levels based on sequence-based transcriptome analysis (RNA-seq), SILAC-based mass spectrometry analysis and antibody-based confocal microscopy. The study was performed in three functionally different human cell lines and based on the global analysis, we estimated the fractions of mRNA and protein that are cell specific or expressed at similar/different levels in the cell lines. A highly ubiquitous RNA expression was found with >60% of the gene products detected in all cells. The changes of mRNA and protein levels in the cell lines using SILAC and RNA ratios show high correlations, even though the genome-wide dynamic range is substantially higher for the proteins as compared with the transcripts. Large general differences in abundance for proteins from various functional classes are observed and, in general, the cell-type specific proteins are low abundant and highly enriched for cell-surface proteins. Thus, this study shows a path to characterize the transcriptome and proteome in human cells from different origins.

View details for DOI 10.1038/msb.2010.106

View details for PubMedID 21179022

View details for PubMedCentralID PMC3018165
Analysis of transcript and protein overlap in a human osteosarcoma cell line. BMC genomics Klevebring, D., Fagerberg, L., Lundberg, E., Emanuelsson, O., Uhlén, M., Lundeberg, J. 2010; 11: 684

Abstract

An interesting field of research in genomics and proteomics is to compare the overlap between the transcriptome and the proteome. Recently, the tools to analyse gene and protein expression on a whole-genome scale have been improved, including the availability of the new generation sequencing instruments and high-throughput antibody-based methods to analyze the presence and localization of proteins. In this study, we used massive transcriptome sequencing (RNA-seq) to investigate the transcriptome of a human osteosarcoma cell line and compared the expression levels with protein data obtained in situ from antibody-based immunohistochemistry (IHC) and immunofluorescence microscopy (IF). A large-scale analysis based on 2749 genes was performed, corresponding to approximately 13% of the protein coding genes in the human genome. We found the presence of both RNA and proteins for a large fraction of the analyzed genes, with 60% of the analyzed human genes detected by all three methods. Only 34 genes (1.2%) were not detected on the transcriptional or protein level with any method. Our data suggest that the majority of the human genes are expressed at detectable transcript or protein levels in this cell line. Since the reliability of antibodies depends on possible cross-reactivity, we compared the RNA and protein data using antibodies with different reliability scores based on various criteria, including Western blot analysis. Gene products detected in all three platforms generally have good antibody validation scores, while those detected only by antibodies, but not by RNA sequencing, generally consist of more low-scoring antibodies. This suggests that some antibodies are staining the cells in an unspecific manner, and that assessment of transcript presence by RNA-seq can provide guidance for validation of the corresponding antibodies.

View details for DOI 10.1186/1471-2164-11-684

View details for PubMedID 21126332

View details for PubMedCentralID PMC3014981
Towards a knowledge-based Human Protein Atlas. Nature biotechnology Uhlen, M., Oksvold, P., Fagerberg, L., Lundberg, E., Jonasson, K., Forsberg, M., Zwahlen, M., Kampf, C., Wester, K., Hober, S., Wernerus, H., Björling, L., Ponten, F. 2010; 28 (12): 1248-50

View details for DOI 10.1038/nbt1210-1248

View details for PubMedID 21139605
Creation of an antibody-based subcellular protein atlas. Proteomics Lundberg, E., Uhlén, M. 2010; 10 (22): 3984-96

Abstract

An important part for understanding the complex machinery of living cells is to know the spatial distribution of proteins all the way from organ to organelle levels. An equally important part of proteomics is to map the subcellular distribution of all human proteins. Here, we discuss methodologies for systematic subcellular profiling with emphasis on the antibody-based approach performed as a part of the Human Protein Atlas project. The considerations made when creating the subcellular protein atlas and critical parameters of this approach are discussed.

View details for DOI 10.1002/pmic.201000125

View details for PubMedID 20648481
Subcellular distribution and expression of prenylated Rab acceptor 1 domain family, member 2 (PRAF2) in malignant glioma: Influence on cell survival and migration. Cancer science Borsics, T., Lundberg, E., Geerts, D., Koomoa, D. T., Koster, J., Wester, K., Bachmann, A. S. 2010; 101 (7): 1624-31

Abstract

Our previous studies revealed that the expression of the 19-kDa protein prenylated Rab acceptor 1 domain family, member 2 (PRAF2) is elevated in cancer tissues of the breast, colon, lung, and ovary, when compared to noncancerous tissues of paired samples. PRAF2 mRNA expression also correlated with several genetic and clinical features and is a candidate prognostic marker in the pediatric cancer neuroblastoma. The PRAF2-related proteins, PRAF1 and PRAF3, play multiple roles in cellular processes, including endo/exocytic vesicle trafficking and glutamate uptake. PRAF2 shares a high sequence homology with these family members, but its function remains unknown. In this study, we examined PRAF2 mRNA and protein expression in 20 different human cancer types using Affymetrix microarray and human tissue microarray (TMA) analyses, respectively. In addition, we investigated the subcellular distribution of PRAF2 by immunofluorescence microscopy and cell fractionation studies. PRAF2 mRNA and protein expression was elevated in several cancer tissues with highest levels in malignant glioma. At the molecular level, we detected native PRAF2 in small, vesicle-like structures throughout the cytoplasm as well as in and around cell nuclei of U-87 malignant glioma cells. We further found that monomeric and dimeric forms of PRAF2 are associated with different cell compartments, suggesting possible functional differences. Importantly, PRAF2 down-regulation by RNA interference significantly reduced the cell viability, migration, and invasiveness of U-87 cells. This study shows that PRAF2 expression is elevated in various tumors with exceptionally high expression in malignant gliomas, and PRAF2 therefore presents a candidate molecular target for therapeutic intervention.

View details for DOI 10.1111/j.1349-7006.2010.01570.x

View details for PubMedID 20412121
A single fixation protocol for proteome-wide immunofluorescence localization studies. Journal of proteomics Stadler, C., Skogs, M., Brismar, H., Uhlén, M., Lundberg, E. 2010; 73 (6): 1067-78

Abstract

Immunofluorescence microscopy is a valuable tool for analyzing protein expression and localization at a subcellular level, thus providing information regarding a protein's function, interaction partners and role in cellular processes. When performing sample fixation, parameters such as differences in accessibility of proteins present in various cellular compartments, as well as the chemical composition of the protein to be studied, need to be taken into account. However, in systematic and proteome-wide efforts, a need exists for standard fixation protocol(s) that work well for the majority of all proteins independent of subcellular localization. Here, we report on a study with the goal to find a standardized protocol based on the analysis of 18 human proteins localized in 11 different organelles and subcellular structures. Six fixation protocols were tested based on either dehydration by alcohols (methanol, ethanol or iso-propanol) or cross-linking by paraformaldehyde followed by detergent permeabilization (Triton X-100 or saponin) in three human cell lines. Our results show that cross-linking is essential for proteome-wide localization studies and that cross-linking using paraformaldehyde followed by Triton X-100 permeabilization can successfully be used as a single fixation protocol for systematic studies.

View details for DOI 10.1016/j.jprot.2009.10.012

View details for PubMedID 19896565
Selection of affibody molecules to the ligand-binding site of the insulin-like growth factor-1 receptor. Biotechnology and applied biochemistry Li, J., Lundberg, E., Vernet, E., Larsson, B., Höidén-Guthenberg, I., Gräslund, T. 2010; 55 (2): 99-109

Abstract

Affibody molecules binding to the site of hormone interaction in IGF-1R (insulin-like growth factor-1 receptor) were successfully selected by phage-display technology employing a competitive-elution strategy during biopanning, whereby release of receptor-bound phagemids was accomplished by competition with IGF-1 (insulin-like growth factor-1). In non-competitive selections, the elution of receptor-bound phagemids was performed by imidazole or low-pH incubation, which also resulted in the isolation of affibody molecules that could bind to the receptor. An ELISA-based assay showed that the affibody molecules generated by IGF-1 competition during elution, in addition to affibody molecules generated in the non-competitive selections, could compete with IGF-1 for binding to the receptor. The affinities of the isolated variants to IGF-1R-overexpressing MCF-7 cells were determined and ranged from high nanomolar to 2.3 nM. The most promising variant, Z4:40, was shown to recognize IGF-1R efficiently in several different contexts: in analyses based on flow cytometry, fluorescence microscopy and receptor pull-down from cell extracts. In addition, when Z4:40 was added to the medium of MCF-7 cells that were dependent on IGF-1 for efficient growth, it was found to have a dose-dependent growth-inhibitory effect on the cells. Applications of affibody-based reagents for quantitative and qualitative analyses of IGF-1R status, as well as applications of affibody-based reagents for therapy, are discussed.

View details for DOI 10.1042/BA20090226

View details for PubMedID 20088825
A global view of protein expression in human cells, tissues, and organs. Molecular systems biology Pontén, F., Gry, M., Fagerberg, L., Lundberg, E., Asplund, A., Berglund, L., Oksvold, P., Björling, E., Hober, S., Kampf, C., Navani, S., Nilsson, P., Ottosson, J., Persson, A., Wernérus, H., Wester, K., Uhlén, M. 2009; 5: 337

Abstract

Defining the protein profiles of tissues and organs is critical to understanding the unique characteristics of the various cell types in the human body. In this study, we report on an anatomically comprehensive analysis of 4842 protein profiles in 48 human tissues and 45 human cell lines. A detailed analysis of over 2 million manually annotated, high-resolution, immunohistochemistry-based images showed a high fraction (>65%) of expressed proteins in most cells and tissues, with very few proteins (<2%) detected in any single cell type. Similarly, confocal microscopy in three human cell lines detected expression of more than 70% of the analyzed proteins. Despite this ubiquitous expression, hierarchical clustering analysis, based on global protein expression patterns, shows that the analyzed cells can be still subdivided into groups according to the current concepts of histology and cellular differentiation. This study suggests that tissue specificity is achieved by precise regulation of protein levels in space and time, and that different tissues in the body acquire their unique characteristics by controlling not which proteins are expressed but how much of each is produced.

View details for DOI 10.1038/msb.2009.93

View details for PubMedID 20029370

View details for PubMedCentralID PMC2824494
Affibody-mediated retention of the epidermal growth factor receptor in the secretory compartments leads to inhibition of phosphorylation in the kinase domain. New biotechnology Vernet, E., Lundberg, E., Friedman, M., Rigamonti, N., Klausing, S., Nygren, P. A., Gräslund, T. 2009; 25 (6): 417-23

Abstract

Abnormal activity of the epidermal growth factor receptor (EGFR) is associated with various cancer-related processes and motivates the search for strategies that can selectively block EGFR signalling. In this study, functional knockdown of EGFR was achieved through expression of an affibody construct, (ZEGFR:1907)(2-)KDEL, with high affinity for EGFR and extended with the amino acids KDEL to make it resident in the secretory compartments. Expression of (ZEGFR:1907)(2-)KDEL resulted in 80% reduction of the cell surface level of EGFR, and fluorescent staining for EGFR and the (ZEGFR:1907)(2-)KDEL construct showed overlapping intracellular localisation. Immunocapture of EGFR from cell lysates showed that an intracellular complex between EGFR and the affibody construct had been formed, further indicating a specific interaction between the affibody construct and EGFR. Surface depletion of EGFR led to a dramatic decrease in the amount of kinase domain phosphorylated EGFR, coincident with a significant decrease in the proliferation rate.

View details for DOI 10.1016/j.nbt.2009.02.001

View details for PubMedID 19552886
Selective expression of Syntaxin-7 protein in benign melanocytes and malignant melanoma. Journal of proteome research Strömberg, S., Agnarsdóttir, M., Magnusson, K., Rexhepaj, E., Bolander, A., Lundberg, E., Asplund, A., Ryan, D., Rafferty, M., Gallagher, W. M., Uhlen, M., Bergqvist, M., Ponten, F. 2009; 8 (4): 1639-46

Abstract

To search for proteins expressed in human melanocytes and melanoma, we employed an antibody-based proteomics strategy to screen for protein expression in tissue microarrays containing normal tissues, cancer tissues and cell lines. Syntaxin-7 (STX7) was identified as a novel protein, not previously characterized in cells of melanocytic lineage, displaying a cell type-specific protein expression pattern. In tumor tissues, STX7 was expressed in malignant melanoma and lymphoma. The protein was further characterized regarding subcellular localization, specificity, tissue distribution pattern and potential as a diagnostic and prognostic marker using cell lines and tissue microarrays containing normal skin, melanocytic nevi and primary and metastatic melanoma. STX7 was expressed in normal melanocytes, various benign melanocytic nevi, atypical nevi and malignant melanoma. Analysis in two independent melanoma cohorts demonstrated STX7 expression in nearly all investigated tumors, although at varying levels (> 90% positive tumors). The expression level of STX7 protein was inversely correlated to tumor stage, suggesting that decreased expression of STX7 is associated with more aggressive tumors. In conclusion, we present protein profiling data for a novel protein showing high sensitivity and specificity for cells of the melanocytic lineage. The presented antibody-based proteomics approach can be used as an effective strategy to identify novel tumor markers and evaluate their potential clinical relevance.

View details for DOI 10.1021/pr800745e

View details for PubMedID 19714869
Automated Analysis of Human Protein Atlas Immunofluorescence Images. Proceedings. IEEE International Symposium on Biomedical Imaging Newberg, J. Y., Li, J., Rao, A., Pontén, F., Uhlén, M., Lundberg, E., Murphy, R. F. 2009; 5193229: 1023-1026

Abstract

The Human Protein Atlas is a rich source of location proteomics data. In this work, we present an automated approach for processing and classifying major subcellular patterns in the Atlas images. We demonstrate that two different classification frameworks (support vector machine and random forest) are effective at determining subcellular locations; we can analyze over 3500 Atlas images with a high degree of accuracy, up to 87.5% for all of the samples and 98.5% when only considering samples in whose classification assignments we are most confident. Moreover, the features obtained in both of these frameworks are observed to be highly consistent and generalizable. Additionally, we observe that the features relating the proteins to cell markers are especially important in automated learning approaches.

View details for DOI 10.1109/ISBI.2009.5193229

View details for PubMedID 20628548

View details for PubMedCentralID PMC2901900
Selection and characterization of Affibody ligands to the transcription factor c-Jun. Biotechnology and applied biochemistry Lundberg, E., Brismar, H., Gräslund, T. 2009; 52 (Pt 1): 17-27

Abstract

c-Jun is a highly oncogenic transcription factor involved in the development of different types of cancer. In the present study we have generated c-Jun-binding-affinity proteins from a phage-displayed library of so-called 'Affibody ligands', developed by combinatorial engineering of a non-immunoglobulin-based scaffold protein. Homodimeric c-Jun protein was recombinantly produced in Escherichia coli and, prior to selection, the quality of the target protein was investigated by binding analyses, which indicated specific binding to a double-stranded DNA hairpin construct containing a c-Jun response element, but not to a control sequence. Isolated Affibody variants from the phage selection were expressed in E. coli, purified by affinity chromatography and their interaction with c-Jun was analysed. In biosensor analyses, one Affibody ligand, denoted Z(cJun518), was shown to interact with immobilized c-Jun protein with an apparent dissociation constant of 5 microM. By constructing a head-to-tail homodimeric version of Z(cJun518), its apparent affinity for c-Jun could be increased threefold, suggesting co-operativity effects in the binding to the immobilized c-Jun protein. Further characterization of the Z(cJun518) Affibody molecule demonstrated, in both affinity-capture and Western-blotting experiments, its ability to interact selectively with c-Jun, even when the c-Jun target was present in a complex protein background consisting of a bacterial cell lysate. Z(cJun518) could also be used to stain the c-Jun-overexpressing cell line C8161 visualized by confocal fluorescence microscopy. Results from competition experiments indicated that the binding epitope on c-Jun for the Z(cJun518) Affibody molecule was separate from the binding sites of both a polyclonal antibody raised against the unstructured N-terminal domain and a double-stranded DNA hairpin containing a c-Jun response element. The potential intracellular use of Affibody ligands directed against transcription factors and other oncogenic factors is discussed.

View details for DOI 10.1042/BA20070178

View details for PubMedID 18260830
The correlation between cellular size and protein expression levels--normalization for global protein profiling. Journal of proteomics Lundberg, E., Gry, M., Oksvold, P., Kononen, J., Andersson-Svahn, H., Pontén, F., Uhlén, M., Asplund, A. 2008; 71 (4): 448-60

Abstract

An automated image analysis system was used for protein quantification of 1862 human proteins in 47 cancer cell lines and 12 clinical cell samples using cell microarrays and immunohistochemistry. The analysis suggests that most proteins are expressed in a cell size dependent manner, and that normalization is required for comparative protein quantification in order to correct for the inherent bias of cell size and systematic ambiguities associated with immunohistochemistry. Two reference standards were evaluated, and normalized protein expression values were found to allow for protein profiling across a panel of morphologically diverse cells, revealing putative patterns of over- and underexpression. Using this approach, proteins with stable expression as well as cell-line specific expression were identified. The results demonstrate the value of large-scale, automated proteome analysis using immunohistochemistry, in revealing functional correlations and establishing methods to interpret and mine proteomic data.

View details for DOI 10.1016/j.jprot.2008.06.014

View details for PubMedID 18656560
A genecentric Human Protein Atlas for expression profiles based on antibodies. Molecular & cellular proteomics : MCP Berglund, L., Björling, E., Oksvold, P., Fagerberg, L., Asplund, A., Szigyarto, C. A., Persson, A., Ottosson, J., Wernérus, H., Nilsson, P., Lundberg, E., Sivertsson, A., Navani, S., Wester, K., Kampf, C., Hober, S., Pontén, F., Uhlén, M. 2008; 7 (10): 2019-27

Abstract

An attractive path forward in proteomics is to experimentally annotate the human protein complement of the genome in a genecentric manner. Using antibodies, it might be possible to design protein-specific probes for a representative protein from every protein-coding gene and to subsequently use the antibodies for systematic analysis of cellular distribution and subcellular localization of proteins in normal and disease tissues. A new version (4.0) of the Human Protein Atlas has been developed in a genecentric manner with the inclusion of all human genes and splice variants predicted from genome efforts together with a visualization of each protein with characteristics such as predicted membrane regions, signal peptide, and protein domains and new plots showing the uniqueness (sequence similarity) of every fraction of each protein toward all other human proteins. The new version is based on tissue profiles generated from 6120 antibodies with more than five million immunohistochemistry-based images covering 5067 human genes, corresponding to approximately 25% of the human genome. Version 4.0 includes a putative list of members in various protein classes, both functional classes, such as kinases, transcription factors, G-protein-coupled receptors, etc., and project-related classes, such as candidate genes for cancer or cardiovascular diseases. The exact antigen sequence for the internally generated antibodies has also been released together with a visualization of the application-specific validation performed for each antibody, including a protein array assay, Western blot analysis, immunohistochemistry, and, for a large fraction, immunofluorescence-based confocal microscopy. New search functionalities have been added to allow complex queries regarding protein expression profiles, protein classes, and chromosome location. The new version of the protein atlas thus is a resource for many areas of biomedical research, including protein science and biomarker discovery.

View details for DOI 10.1074/mcp.R800013-MCP200

View details for PubMedID 18669619
Affinity-based entrapment of the HER2 receptor in the endoplasmic reticulum using an affibody molecule. Journal of immunological methods Vernet, E., Konrad, A., Lundberg, E., Nygren, P. A., Gräslund, T. 2008; 338 (1-2): 1-6

Abstract

Interference with the export of cell surface receptors can be performed through co-expression of specific affinity molecules designed for entrapment in the endoplasmic reticulum during the export process. We describe the investigation of a small (6 kDa) non-immunoglobulin-based HER2 receptor binding affibody molecule (Z(HER2:00477)), for use in affinity mediated entrapment of the HER2 receptor in the ER. Constructs encoding Z(HER2:00477) or a control affibody protein, with or without ER-retention peptide extensions (KDEL), were expressed in the HER2 over-expressing cell line SKOV-3. Intracellular expression of the full-length affibody constructs could be confirmed by probing cell extracts by Western blotting. Confocal immunofluorescence microscopy experiments showed extensive co-localization of the HER2 receptor and Z(HER2:00477)-KDEL in the ER, whereas the use of a KDEL-extended control affibody molecule resulted in distinct and separate signals from cell surface-localized HER2 receptor and ER-localized affibody protein. This indicated a capability of the Z(HER2:00477)-KDEL fusion protein to functionally interfere with the export process of HER2 receptor in a specific manner. Using flow cytometry and cell proliferation analyses, it could be shown that expression of the Z(HER2:00477)-KDEL fusion construct in the SKOV-3 cell line resulted in both a marked reduction in cell surface level of HER2 receptors and a significant increase in the cell population doubling time. Expression of the Z(HER2:00477)-KDEL fusion protein in additional cell lines of different origin and with different expression levels of endogenous HER2 receptor compared to SKOV-3 also resulted in depletion of the cell surface levels of HER2 receptor. This indicated a general ability of the Z(HER2:00477)-KDEL fusion protein to functionally interfere with the export process of HER2.

View details for DOI 10.1016/j.jim.2008.06.005

View details for PubMedID 18671978
Toward a confocal subcellular atlas of the human proteome. Molecular & cellular proteomics : MCP Barbe, L., Lundberg, E., Oksvold, P., Stenius, A., Lewin, E., Björling, E., Asplund, A., Pontén, F., Brismar, H., Uhlén, M., Andersson-Svahn, H. 2008; 7 (3): 499-508

Abstract

Information on protein localization on the subcellular level is important to map and characterize the proteome and to better understand cellular functions of proteins. Here we report on a pilot study of 466 proteins in three human cell lines aimed to allow large scale confocal microscopy analysis using protein-specific antibodies. Approximately 3000 high resolution images were generated, and more than 80% of the analyzed proteins could be classified in one or multiple subcellular compartment(s). The localizations of the proteins showed, in many cases, good agreement with the Gene Ontology localization prediction model. This is the first large scale antibody-based study to localize proteins into subcellular compartments using antibodies and confocal microscopy. The results suggest that this approach might be a valuable tool in conjunction with predictive models for protein localization.

View details for DOI 10.1074/mcp.M700325-MCP200

View details for PubMedID 18029348
A novel method for reproducible fluorescent labeling of small amounts of antibodies on solid phase. Journal of immunological methods Lundberg, E., Sundberg, M., Gräslund, T., Uhlén, M., Svahn, H. A. 2007; 322 (1-2): 40-9

Abstract

Fluorescently labeled antibodies are very important tools in cell biology, providing for specific and quantitative detection of antigens. To date, fluorophore labeling of antibodies has been performed in solution and has been limited by low-throughput methods requiring a substantial amount of pure antibody sample at a high concentration. We have developed a novel solid-phase labeling protocol for small amounts (i.e. micrograms) of antibodies with fluorescent dyes. Protein A affinity medium was used as solid support in a micropipette tip format. This solid-phase approach, including the advantage of the strong and specific interaction between Protein A and antibodies, allows for simultaneous purification, labeling and concentration of the antibody sample, making it possible to start with impure antibody samples at low concentrations. We have optimized the protocol with regard to reaction pH, time, temperature and amount of amine reactive dye. In addition, we have evaluated the stability and activity of the labeled antibodies. To evaluate the reproducibility and robustness of this method we labeled eight antibodies with amine reactive fluorescent dyes followed by evaluation of antibody specificity on protein arrays. Interestingly, this gave an extremely high conformity in the degree of labeling, showing the robustness of the method. The solid-phase method also gave predictable and reproducible results and by varying the amount of reactive dye, the desired degree of labeling can easily be achieved. Antibodies labeled using this solid-phase method were similar in stability and activity to antibodies labeled in solution. This novel solid-phase antibody labeling method may also be applicable for other conjugation chemistries and labels, and has potential for high-throughput applications.

View details for DOI 10.1016/j.jim.2007.01.023

View details for PubMedID 17383674
Site-specifically conjugated anti-HER2 Affibody molecules as one-step reagents for target expression analyses on cells and xenograft samples. Journal of immunological methods Lundberg, E., Höidén-Guthenberg, I., Larsson, B., Uhlén, M., Gräslund, T. 2007; 319 (1-2): 53-63

Abstract

Affibody molecules are a class of small and robust affinity proteins that can be generated to interact with a variety of antigens, thus having the potential to provide useful tools for biotechnological research and diagnostic applications. In this study, we have investigated Affibody-based reagents interacting specifically with the tyrosine kinase receptor HER2. A head-to-tail dimeric construct was site-specifically conjugated with different fluorescent and enzymatic groups resulting in reagents that were used for detection and quantification. The amount of cell surface expressed HER2 on eleven (11) well characterized cell lines was quantified relative to each other by flow cytometry and shown to correlate well with results from parallel analyses of HER2 mRNA levels measured by real-time PCR. Further, immunofluorescence microscopy studies of the cell lines and immunohistochemical analyses of cryosections of HER2 expressing SKOV-3 xenografts showed strong staining of the plasma membrane of tumor cells with little background staining. Full-length HER2 protein could also be efficiently recovered from a cell extract by an immunoprecipitation procedure, using an Affibody ligand-based resin. These novel non-IgG derived reagents could be used to detect and quantify HER2 expression. By adapting the methods for use with Affibody molecules binding to other cell surface receptors, it is anticipated that also these receptors can be detected and quantified in a similar manner.

View details for DOI 10.1016/j.jim.2006.10.013

View details for PubMedID 17196217
