TY - JOUR
T1 - AICpred
T2 - Machine Learning-Based Prediction of Potential Anti-Inflammatory Compounds Targeting TLR4-MyD88 Binding Mechanism
AU - Fry-Nartey, Lucindah N.
AU - Akafia, Cyril
AU - Nkonu, Ursula S.
AU - Baiden, Spencer B.
AU - Dorvi, Ignatus Nunana
AU - Agyenkwa-Mawuli, Kwasi
AU - Agyapong, Odame
AU - Hayford, Claude Fiifi
AU - Wilson, Michael D.
AU - Miller, Whelton A.
AU - Kwofie, Samuel K.
N1 - Publisher Copyright:
© 2025 by the authors.
PY - 2025/1
Y1 - 2025/1
N2 - Toll-like receptor 4 (TLR4) has been implicated in the production of uncontrolled inflammation within the body, known as the cytokine storm. Studies that employ machine learning (ML) in the prediction of potential inhibitors of TLR4 are limited. This study introduces AICpred, a robust, free, user-friendly, and easily accessible machine learning-based web application for predicting inhibitors against TLR4 by targeting the TLR4-myeloid differentiation primary response 88 (MyD88) interaction. MyD88 is a crucial adaptor protein in the TLR4-induced hyper-inflammation pathway. Predictive models were trained using random forest, adaptive boosting (AdaBoost), eXtreme gradient boosting (XGBoost), k-nearest neighbours (KNN), and decision tree models. To handle imbalance within the training data, resampling techniques such as random under-sampling, synthetic minority oversampling technique, and the random selection of 5000 instances of the majority class were employed. A 10-fold cross-validation strategy was used to evaluate model performance based on metrics including accuracy, balanced accuracy, and recall. The XGBoost model demonstrated superior performance with accuracy, balanced accuracy, and recall scores of 0.994, 0.958, and 0.917, respectively, on the test. The AdaBoost and decision tree models also excelled with accuracies ranging from 0.981 to 0.992, balanced accuracies between 0.921 and 0.944, and recall scores between 0.845 and 0.891 on both training and test datasets. The XGBoost model was deployed as AICpred and was used to screen compounds that have been reported to have positive effects on mitigating the hyperinflammation-associated cytokine storm, which is a key factor in COVID-19. The models predicted Baricitinib, Ibrutinib, Nezulcitinib, MCC950, and Acalabrutinib as anti-TLR4 compounds with prediction probability above 0.90. Additionally, compounds known to inhibit TLR4, including TAK-242 (Resatorvid) and benzisothiazole derivative (M62812), were predicted as bioactive agents within the applicability domain with probabilities above 0.80. Computationally inferred compounds using AICpred can be explored as potential starting skeletons for therapeutic agents against hyperinflammation. These predictions must be consolidated with experimental screening to enhance further optimisation of the compounds. AICpred is the first of its kind targeting the inhibition of TLR4-MyD88 binding and is freely available at http://197.255.126.13:8080.
AB - Toll-like receptor 4 (TLR4) has been implicated in the production of uncontrolled inflammation within the body, known as the cytokine storm. Studies that employ machine learning (ML) in the prediction of potential inhibitors of TLR4 are limited. This study introduces AICpred, a robust, free, user-friendly, and easily accessible machine learning-based web application for predicting inhibitors against TLR4 by targeting the TLR4-myeloid differentiation primary response 88 (MyD88) interaction. MyD88 is a crucial adaptor protein in the TLR4-induced hyper-inflammation pathway. Predictive models were trained using random forest, adaptive boosting (AdaBoost), eXtreme gradient boosting (XGBoost), k-nearest neighbours (KNN), and decision tree models. To handle imbalance within the training data, resampling techniques such as random under-sampling, synthetic minority oversampling technique, and the random selection of 5000 instances of the majority class were employed. A 10-fold cross-validation strategy was used to evaluate model performance based on metrics including accuracy, balanced accuracy, and recall. The XGBoost model demonstrated superior performance with accuracy, balanced accuracy, and recall scores of 0.994, 0.958, and 0.917, respectively, on the test. The AdaBoost and decision tree models also excelled with accuracies ranging from 0.981 to 0.992, balanced accuracies between 0.921 and 0.944, and recall scores between 0.845 and 0.891 on both training and test datasets. The XGBoost model was deployed as AICpred and was used to screen compounds that have been reported to have positive effects on mitigating the hyperinflammation-associated cytokine storm, which is a key factor in COVID-19. The models predicted Baricitinib, Ibrutinib, Nezulcitinib, MCC950, and Acalabrutinib as anti-TLR4 compounds with prediction probability above 0.90. Additionally, compounds known to inhibit TLR4, including TAK-242 (Resatorvid) and benzisothiazole derivative (M62812), were predicted as bioactive agents within the applicability domain with probabilities above 0.80. Computationally inferred compounds using AICpred can be explored as potential starting skeletons for therapeutic agents against hyperinflammation. These predictions must be consolidated with experimental screening to enhance further optimisation of the compounds. AICpred is the first of its kind targeting the inhibition of TLR4-MyD88 binding and is freely available at http://197.255.126.13:8080.
KW - anti-inflammatory
KW - cytokine storm
KW - inflammatory
KW - inhibitors
KW - machine learning
KW - toll-like receptor 4 (TLR4)
KW - web application
UR - http://www.scopus.com/inward/record.url?scp=85215667520&partnerID=8YFLogxK
U2 - 10.3390/info16010034
DO - 10.3390/info16010034
M3 - Article
AN - SCOPUS:85215667520
SN - 2078-2489
VL - 16
JO - Information (Switzerland)
JF - Information (Switzerland)
IS - 1
M1 - 34
ER -