Stanford Advisors


All Publications


  • Multimodal learning enables chat-based exploration of single-cell data NATURE BIOTECHNOLOGY Schaefer, M., Peneder, P., Malzl, D., Lombardo, S., Peycheva, M., Burton, J., Hakobyan, A., Sharma, V., Krausgruber, T., Sin, C., Menche, J., Tomazou, E. M., Bock, C. 2025

    Abstract

    Single-cell sequencing characterizes biological samples at unprecedented scale and detail, but data interpretation remains challenging. Here, we present CellWhisperer, an artificial intelligence (AI) model and software tool for chat-based interrogation of gene expression. We establish a multimodal embedding of transcriptomes and their textual annotations, using contrastive learning on 1 million RNA sequencing profiles with AI-curated descriptions. This embedding informs a large language model that answers user-provided questions about cells and genes in natural-language chats. We benchmark CellWhisperer's performance for zero-shot prediction of cell types and other biological annotations and demonstrate its use for biological discovery in a meta-analysis of human embryonic development. We integrate a CellWhisperer chat box with the CELLxGENE browser, allowing users to interactively explore gene expression through a combined graphical and chat interface. In summary, CellWhisperer leverages large community-scale data repositories to connect transcriptomes and text, thereby enabling interactive exploration of single-cell RNA-sequencing data with natural-language chats.

    View details for DOI 10.1038/s41587-025-02857-9

    View details for Web of Science ID 001611684000001

    View details for PubMedID 41219484

    View details for PubMedCentralID 7453005

  • Systematic discovery of CRISPR-boosted CAR T cell immunotherapies NATURE Datlinger, P., Pankevich, E. V., Arnold, C. D., Pranckevicius, N., Lin, J., Romanovskaia, D., Schaefer, M., Piras, F., Orts, A., Nemc, A., Biesaga, P. N., Chan, M., Neuwirth, T., Artemov, A. V., Li, W., Ladstatter, S., Krausgruber, T., Bock, C. 2025

    Abstract

    Chimeric antigen receptor (CAR) T cell therapy has shown remarkable success in treating blood cancers, but CAR T cell dysfunction remains a common cause of treatment failure1. Here we present CELLFIE, a CRISPR screening platform for enhancing CAR T cells across multiple clinical objectives. We performed genome-wide screens in human primary CAR T cells, with readouts capturing key aspects of T cell biology, including proliferation, target cell recognition, activation, apoptosis and fratricide, and exhaustion. Screening hits were prioritized using a new in vivo CROP-seq2 method in a xenograft model of human leukaemia, establishing several gene knockouts that boost CAR T cell efficacy. Most notably, we discovered that RHOG knockout is a potent and unexpected CAR T cell enhancer, both individually and together with FAS knockout, which was validated across multiple in vivo models, CAR designs and sample donors, and in patient-derived cells. Demonstrating the versatility of the CELLFIE platform, we also conducted combinatorial CRISPR screens to identify synergistic gene pairs and saturation base-editing screens to characterize RHOG variants. In summary, we discovered, validated and biologically characterized CRISPR-boosted CAR T cells that outperform standard CAR T cells in widely used benchmarks, establishing a foundational resource for optimizing cell-based immunotherapies.

    View details for DOI 10.1038/s41586-025-09507-9

    View details for Web of Science ID 001577752400001

    View details for PubMedID 40993398

    View details for PubMedCentralID 10965011

  • GPT-4 as a biomedical simulator. Computers in biology and medicine Schaefer, M., Reichl, S., Ter Horst, R., Nicolas, A. M., Krausgruber, T., Piras, F., Stepper, P., Bock, C., Samwald, M. 2024; 178: 108796

    Abstract

    BACKGROUND: Computational simulation of biological processes can be a valuable tool for accelerating biomedical research, but usually requires extensive domain knowledge and manual adaptation. Large language models (LLMs) such as GPT-4 have proven surprisingly successful for a wide range of tasks. This study provides proof-of-concept for the use of GPT-4 as a versatile simulator of biological systems.METHODS: We introduce SimulateGPT, a proof-of-concept for knowledge-driven simulation across levels of biological organization through structured prompting of GPT-4. We benchmarked our approach against direct GPT-4 inference in blinded qualitative evaluations by domain experts in four scenarios and in two quantitative scenarios with experimental ground truth. The qualitative scenarios included mouse experiments with known outcomes and treatment decision support in sepsis. The quantitative scenarios included prediction of gene essentiality in cancer cells and progression-free survival in cancer patients.RESULTS: In qualitative experiments, biomedical scientists rated SimulateGPT's predictions favorably over direct GPT-4 inference. In quantitative experiments, SimulateGPT substantially improved classification accuracy for predicting the essentiality of individual genes and increased correlation coefficients and precision in the regression task of predicting progression-free survival.CONCLUSION: This proof-of-concept study suggests that LLMs may enable a new class of biomedical simulators. Such text-based simulations appear well suited for modeling and understanding complex living systems that are difficult to describe with physics-based first-principles simulations, but for which extensive knowledge is available as written text. Finally, we propose several directions for further development of LLM-based biomedical simulators, including augmentation through web search retrieval, integrated mathematical modeling, and fine-tuning on experimental data.

    View details for DOI 10.1016/j.compbiomed.2024.108796

    View details for PubMedID 38909448