TY - JOUR
T1 - Deep generative classification of blood cell morphology
AU - BloodCounts! consortium
AU - Deltadahl, Simon
AU - Gilbey, Julian
AU - Van Laer, Christine
AU - Boeckx, Nancy
AU - Leers, Mathie P.G.
AU - Freeman, Tanya
AU - Aiken, Laura
AU - Farren, Timothy
AU - Smith, Matthew
AU - Zeina, Mohamad
AU - MacDonald, Stephen
AU - Gleghorn, Daniel
AU - Rudd, James H.F.
AU - Ouwehand, Willem H.
AU - Amenga-Etego, Lucas
AU - Sarpong, Kwabena
AU - Awandare, Gordon A.
AU - Trompeter, Sara
AU - Gupta, Rajeev
AU - Secka, Ousman
AU - Bah, Bubacarr
AU - D’Alessandro, Umberto
AU - de Wit, Norbert
AU - Henskens, Yvonne
AU - Koh, Mickey
AU - Williams, Sophie
AU - Kar, Sujoy
AU - Asselbergs, Folkert
AU - Schut, Martijn
AU - Rudd, James H.F.
AU - Piazzese, Concetta
AU - Taylor, Joseph
AU - Gleadall, Nicholas
AU - Schönlieb, Carola Bibiane
AU - Sivapalaratnam, Suthesh
AU - Roberts, Michael
AU - Nachev, Parashkev
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/11
Y1 - 2025/11
N2 - Blood cell morphology assessment via light microscopy constitutes a cornerstone of haematological diagnostics, providing crucial insights into diverse pathological conditions. This complex task demands expert interpretation owing to subtle morphological variations, biological heterogeneity and technical imaging factors that obstruct automated approaches. Conventional machine learning methods using discriminative models struggle with domain shifts, intraclass variability and rare morphological variants, constraining their clinical utility. We introduce CytoDiffusion, a diffusion-based generative classifier that faithfully models the distribution of blood cell morphology, combining accurate classification with robust anomaly detection, resistance to distributional shifts, interpretability, data efficiency and uncertainty quantification that surpasses clinical experts. Our approach outperforms state-of-the-art discriminative models in anomaly detection (area under the curve, 0.990 versus 0.916), resistance to domain shifts (0.854 versus 0.738 accuracy) and performance in low-data regimes (0.962 versus 0.924 balanced accuracy). In particular, CytoDiffusion generates synthetic blood cell images that expert haematologists cannot distinguish from real ones (accuracy, 0.523; 95% confidence interval: [0.505, 0.542]), demonstrating good command of the underlying distribution. Furthermore, we enhance model explainability through directly interpretable counterfactual heat maps. Our comprehensive evaluation framework establishes a multidimensional benchmark for medical image analysis in haematology, ultimately enabling improved diagnostic accuracy in clinical settings.
AB - Blood cell morphology assessment via light microscopy constitutes a cornerstone of haematological diagnostics, providing crucial insights into diverse pathological conditions. This complex task demands expert interpretation owing to subtle morphological variations, biological heterogeneity and technical imaging factors that obstruct automated approaches. Conventional machine learning methods using discriminative models struggle with domain shifts, intraclass variability and rare morphological variants, constraining their clinical utility. We introduce CytoDiffusion, a diffusion-based generative classifier that faithfully models the distribution of blood cell morphology, combining accurate classification with robust anomaly detection, resistance to distributional shifts, interpretability, data efficiency and uncertainty quantification that surpasses clinical experts. Our approach outperforms state-of-the-art discriminative models in anomaly detection (area under the curve, 0.990 versus 0.916), resistance to domain shifts (0.854 versus 0.738 accuracy) and performance in low-data regimes (0.962 versus 0.924 balanced accuracy). In particular, CytoDiffusion generates synthetic blood cell images that expert haematologists cannot distinguish from real ones (accuracy, 0.523; 95% confidence interval: [0.505, 0.542]), demonstrating good command of the underlying distribution. Furthermore, we enhance model explainability through directly interpretable counterfactual heat maps. Our comprehensive evaluation framework establishes a multidimensional benchmark for medical image analysis in haematology, ultimately enabling improved diagnostic accuracy in clinical settings.
UR - https://www.scopus.com/pages/publications/105023956294
U2 - 10.1038/s42256-025-01122-7
DO - 10.1038/s42256-025-01122-7
M3 - Article
AN - SCOPUS:105023956294
SN - 2522-5839
VL - 7
SP - 1791
EP - 1803
JO - Nature Machine Intelligence
JF - Nature Machine Intelligence
IS - 11
ER -