Artificial intelligence-enhancement of flow cytometry data accelerates the identification of measurable residual chronic lymphocytic leukemia

Leukemia, May 2026

Flow cytometry (FC) is essential for detecting measurable residual disease (MRD) in chronic lymphocytic leukemia (CLL), but its use is limited by the expertise and time required for manual analysis. We developed an artificial intelligence (AI) pipeline, Clustering/Classification of All Events, Dimensionality reduction, Downsampling, and Aberrancy Scaling (CCADDAS), to automatically enhance raw FC files, streamlining CLL MRD detection using a single-tube 10-color panel. FC files from 166 MRD-positive and 61 MRD-negative cases were processed in a cloud environment. Automated steps included error correction (FlowCut), clustering (PARC), dimensionality reduction (UMAP), anomaly detection against negative controls, and cluster-informed downsampling that preserved rare MRD events. A deep neural network trained on expert-defined normal subsets enabled automated gating. AI-enhanced files were analyzed in standard FC software, yielding results highly concordant with conventional expert review (R² = 0.98). Downsampling reduced cellularity by 85% and file size by 78%, while retaining low-level MRD events. An AI-generated aberrancy scale distinguished CLL MRD from background B cells with excellent performance (AUC = 0.98). Manual analysis time decreased from 9.0 to 0.9 min per case (90% reduction). CCADDAS provides a largely unsupervised, software-agnostic method that accelerates and simplifies CLL MRD detection without compromising test performance compared to conventional analysis, enabling broader adoption of FC-based MRD testing.

Article PDF cannot be displayed. You can download it here:

https://www.nature.com/articles/s41375-026-02986-3.pdf

Artificial intelligence-enhancement of flow cytometry data accelerates the identification of measurable residual chronic lymphocytic leukemia

Leukemia ARTICLE www.nature.com/leu OPEN Artificial intelligence-enhancement of flow cytometry data accelerates the identification of measurable residual chronic lymphocytic leukemia ✉ April Chiu 1,3 , Jansen N. Seheult1,3, Min Shi 1, Dragan Jevremovic 1, Clarissa E. Jordan1, Mathew J. Weybright Michael M. Timm1, Gregory E. Otteson1, Horatiu Olteanu 1, Sameer A. Parikh 2 and Pedro Horna 1 1 , 1234567890();,: © The Author(s) 2026 Flow cytometry (FC) is essential for detecting measurable residual disease (MRD) in chronic lymphocytic leukemia (CLL), but its use is limited by the expertise and time required for manual analysis. We developed an artificial intelligence (AI) pipeline, Clustering/ Classification of All Events, Dimensionality reduction, Downsampling, and Aberrancy Scaling (CCADDAS), to automatically enhance raw FC files, streamlining CLL MRD detection using a single-tube 10-color panel. FC files from 166 MRD-positive and 61 MRDnegative cases were processed in a cloud environment. Automated steps included error correction (FlowCut), clustering (PARC), dimensionality reduction (UMAP), anomaly detection against negative controls, and cluster-informed downsampling that preserved rare MRD events. A deep neural network trained on expert-defined normal subsets enabled automated gating. AI-enhanced files were analyzed in standard FC software, yielding results highly concordant with conventional expert review (R² = 0.98). Downsampling reduced cellularity by 85% and file size by 78%, while retaining low-level MRD events. An AI-generated aberrancy scale distinguished CLL MRD from background B cells with excellent performance (AUC = 0.98). Manual analysis time decreased from 9.0 to 0.9 min per case (90% reduction). CCADDAS provides a largely unsupervised, software-agnostic method that accelerates and simplifies CLL MRD detection without compromising test performance compared to conventional analysis, enabling broader adoption of FC-based MRD testing. Leukemia; https://doi.org/10.1038/s41375-026-02986-3 INTRODUCTION Chronic lymphocytic leukemia (CLL) is a lymphoid neoplasm characterized by accumulation of clonal mature B-cells in the peripheral blood, bone marrow, lymph nodes, and/or spleen [1]. CLL is currently the most common leukemia in the Western Hemisphere, with approximately 24,000 new cases diagnosed in the United States in 2025 [2]. Although patients with CLL typically present with indolent disease and can be managed by observation alone, most eventually require therapeutic intervention due to progressive or symptomatic disease. CLL-directed therapeutic options have dramatically expanded over the last decade and continue to do so, which consist of chemoimmunotherapy (including CD20 targeting monoclonal antibodies such as rituximab and obinutuzumab), agents targeting B-cell receptor signaling (e.g., ibrutinib), B-cell lymphoma 2 inhibitors (BCL2i, such as venetoclax), and chimeric antigen receptor T-cell (CART) therapy [3]. While many patients do achieve complete remission (CR) according to the 2018 International Workshop on Chronic Lymphocytic Leukemia (iwCLL) criteria [4], residual disease may still be detectable, termed measurable (or minimal) residual disease (MRD), by using highly sensitive assays. MRD detection in patients with CLL following therapy is of great significance as this has been established as an independent prognostic factor for progression free survival and overall survival [5–7]. A negative MRD status in CLL patients is increasingly used as a surrogate endpoint to establish efficacy and consideration for cessation of therapy [8–12]. CLL MRD can be assessed by a variety of methodologies including flow cytometry (FC), allele-specific oligonucleotide quantitative polymerase chain reaction (ASO-qPCR), next generation sequencing (NGS), droplet digital PCR (ddPCR), and cell-free DNA analysis [13]. Of these, FC is the most assessable, fastest, and likely most cost effective; and therefore most widely used. Furthermore, FC does not require prior knowledge of diseasespecific immunoglobulin sequences or immunophenotypic abnormalities at diagnosis. However, while modern FC platforms are capable of achieving the recommended assay sensitivity of at least 0.01% (10−4), which is recommended by both 2018 iwCLL [4] and 2026 NCCN guidelines [14], molecular methods have the advantage of detecting even lower MRD levels (up to 10−6) which may have prognostic significance in some settings. In addition, conventional FC MRD testing is typically performed by manual gating and analysis of cell populations on two-dimensional dot plots, and heavily relies upon technical expertise of clinical laboratory staff. Contributing to this highly manual and timeconsuming process is the large amount of data analyzed, requiring 1 Department of Laboratory Medicine and Pathology, Division of Hematopathology, Mayo Clinic, Rochester, MN, USA. 2Department of Medicine, Division of Hematology, Mayo Clinic, Rochester, MN, USA. 3These authors contributed equally: April Chiu, Jansen N. Seheult. ✉email: Received: 29 December 2025 Revised: 3 April 2026 Accepted: 15 May 2026 A. Chiu et al. 2 robust computational processing power and advanced software analytical capabilities compared to non-MRD-based FC analysis. Therefore, the availability of MRD testing by FC is generally confined to large academic centers and reference laboratories. To overcome these limitations, we implemented a cloud-based computational artificial intelligence (AI) pipeline recently developed by our group [15], which simplifies and accelerates the identification and quantification of MRD without compromising test performance compared to conventional analysis. This pipeline, CCADDAS (Clustering and Classification of All Events, Dimensionality reduction, Downsampling, and Aberrancy Scaling), transforms raw data files produced by the FC analyzer into markedly smaller, AI-annotated, and software-agnostic files, and provides a measure of aberrancy in comparison to negative controls. We applied this pipeline on a large cohort of FC data files produced from our 10-color CLL MRD clinical assay, and compared the pipeline’s performance against that of conventional FC CLL MRD analysis. METHODS Patient and sample selection For training of the pipeline, 29 negative control samples (15 bone marrow aspirates and 14 peripheral bloods) were selected; including 24 CLL MRD negative cases analyzed at Mayo Clinic, Rochester, MN, between January 2021 and July 2023; and 5 normal marrow specimens procured from hip replacements and analyzed between January and May 2021. The selection of controls was based on adequate representation of benign normal subsets as assessed by expert analysts. A separate research cohort included 227 specimens (74 bone marrow aspirate and 153 peripheral blood) received for CLL MRD flow cytometric analysis at Mayo Clinic in Rochester, MN, between April 2023 and February 2024; including 166 specimens from 122 patients reported as MRD- (...truncated)


This is a preview of a remote PDF: https://www.nature.com/articles/s41375-026-02986-3.pdf
Article home page: https://www.nature.com/articles/s41375-026-02986-3

April Chiu, Jansen N. Seheult, Min Shi, Dragan Jevremovic, Clarissa E. Jordan, Mathew J. Weybright, Michael M. Timm, Gregory E. Otteson, Horatiu Olteanu, Sameer A. Parikh, Pedro Horna. Artificial intelligence-enhancement of flow cytometry data accelerates the identification of measurable residual chronic lymphocytic leukemia, Leukemia, 2026, pp. 1-11, DOI: 10.1038/s41375-026-02986-3