Decoding the Radio Sky: Bayesian ensemble learning and SVD-based feature extraction for automated radio galaxy classification

Theophilus Ansah-Narh, Jordan Lontsi Tedongmo, Joseph Bremang Tandoh, Nia Imara, Ezekiel Nii Noye Nortey

Research output: Contribution to journalArticlepeer-review

Abstract

The classification of radio galaxies is central to understanding galaxy evolution, active galactic nuclei dynamics, and the large-scale structure of the universe. However, traditional manual techniques are inadequate for processing the massive, heterogeneous datasets generated by modern radio surveys. In this study, we present a probabilistic machine learning framework that integrates Singular Value Decomposition (SVD) for feature extraction with Bayesian ensemble learning to achieve robust, scalable radio galaxy classification. The SVD approach effectively reduces dimensionality while preserving key morphological structures, enabling efficient representation of galaxy features. To mitigate class imbalance and avoid the introduction of artefacts, we incorporate a Local Neighbourhood Encoding strategy tailored to the astrophysical distribution of galaxy types. The resulting features are used to train and optimise several baseline classifiers: Logistic Regression, Support Vector Machines, LightGBM, and Multi-Layer Perceptrons within bagging, boosting, and stacking ensembles governed by a Bayesian weighting scheme. Our results demonstrate that Bayesian ensembles outperform their traditional counterparts across all metrics, with the Bayesian stacking model achieving a classification accuracy of 99.0% and an F1-score of 0.99 across Compact, Bent, Fanaroff–Riley Type I (FR-I), and Type II (FR-II) sources. Interpretability is enhanced through SHAP analysis, which highlights the principal components most associated with morphological distinctions. Beyond improving classification performance, our framework facilitates uncertainty quantification, paving the way for more reliable integration into next-generation survey pipelines. This work contributes a reproducible and interpretable methodology for automated galaxy classification in the era of data-intensive radio astronomy.

Original languageEnglish
Article number101018
JournalAstronomy and Computing
Volume54
DOIs
Publication statusPublished - Jan 2026

Keywords

  • Bayesian ensemble learning
  • Class imbalance correction
  • Machine learning in astronomy
  • Radio galaxy classification
  • SHAP interpretability
  • Singular Value Decomposition

Fingerprint

Dive into the research topics of 'Decoding the Radio Sky: Bayesian ensemble learning and SVD-based feature extraction for automated radio galaxy classification'. Together they form a unique fingerprint.

Cite this