TY - JOUR
T1 - Sub-population identification of multimorbidity in sub-Saharan African populations
AU - Oshingbesan, Adebayo
AU - Kamp, Michelle
AU - Mpangase, Phelelani Thokozani
AU - Adetunji, Kayode
AU - Iddi, Samuel
AU - Nderitu, Daniel Maina
AU - Akumu, Tanya
AU - Achilonu, Okechinyere
AU - Kisiangani, Isaac
AU - Mathema, Theophilous
AU - Tadesse, Girmaw
AU - Gomez-Olive, F. Xavier
AU - Kabudula, Chodziwadziwa Whiteson
AU - Hazelhurst, Scott
AU - Asiki, Gershim
AU - Ramsay, Michele
AU - Speakman, Skyler
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - This work provides three contributions that straddle the medical literature on multimorbidity and the data science community with an interest on exploratory analysis of health-related research data. First, we propose a definition for multimorbidity as the co-occurrence of (at least) two disease diagnoses from a pre-determined list. This interpretation adds to a growing body of working definitions emerging from the literature. Second, we apply this novel outcome of-interest to two sub-Saharan populations located in Nairobi, Kenya and Agincourt, South Africa. The source data for this analysis was collected as part of the Africa Wits-INDEPTH Partnership for Genomic Studies project. Third, we stratify this outcome-of-interest across all possible sub-populations and identify sub-populations with anomalously high (or low) rates of multimorbidity. Critically, the automatic stratification approach emphasizes efficient, disciplined exploratory-based analysis as a complementary alternative to more commonly-used confirmation analysis methods. Our results show that high-risk sub-populations identified in one part of the continent transfer to the other location (and vice-versa) with the equivalent sub-population at the other location also experiencing higher rates of multimorbidity. Second, we discover a real-world scenario where a more-at risk sub-population existed beyond the simpler sub-populations traditionally stratified by age and sex. This is in contrast to existing literature which commonly stratifies disease diagnoses by sex when reporting results. Patterns in diseases, and healthcare more generally, are likely more nuanced than manual approaches may be able to describe. This work helps introduce public health researchers to data science methods that scale to the size and complexity of modern day datasets.
AB - This work provides three contributions that straddle the medical literature on multimorbidity and the data science community with an interest on exploratory analysis of health-related research data. First, we propose a definition for multimorbidity as the co-occurrence of (at least) two disease diagnoses from a pre-determined list. This interpretation adds to a growing body of working definitions emerging from the literature. Second, we apply this novel outcome of-interest to two sub-Saharan populations located in Nairobi, Kenya and Agincourt, South Africa. The source data for this analysis was collected as part of the Africa Wits-INDEPTH Partnership for Genomic Studies project. Third, we stratify this outcome-of-interest across all possible sub-populations and identify sub-populations with anomalously high (or low) rates of multimorbidity. Critically, the automatic stratification approach emphasizes efficient, disciplined exploratory-based analysis as a complementary alternative to more commonly-used confirmation analysis methods. Our results show that high-risk sub-populations identified in one part of the continent transfer to the other location (and vice-versa) with the equivalent sub-population at the other location also experiencing higher rates of multimorbidity. Second, we discover a real-world scenario where a more-at risk sub-population existed beyond the simpler sub-populations traditionally stratified by age and sex. This is in contrast to existing literature which commonly stratifies disease diagnoses by sex when reporting results. Patterns in diseases, and healthcare more generally, are likely more nuanced than manual approaches may be able to describe. This work helps introduce public health researchers to data science methods that scale to the size and complexity of modern day datasets.
KW - Africa
KW - Exploratory analysis
KW - Multimorbidity
KW - Subset scanning
KW - Survey data
UR - http://www.scopus.com/inward/record.url?scp=105003258121&partnerID=8YFLogxK
U2 - 10.1038/s41598-025-96569-4
DO - 10.1038/s41598-025-96569-4
M3 - Article
AN - SCOPUS:105003258121
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 13992
ER -