Unveiling etiology and mortality risks in community-acquired pneumonia: A machine learning approach

Authors

  • Alaa Ali Department of Clinical Pharmacy and Therapeutics, Applied Science Private University, Amman, Jordan
  • Ahmad R. Alsayed Department of Clinical Pharmacy and Therapeutics, Applied Science Private University, Amman, Jordan
  • Nesrin Seder Department of Pharmaceutical Chemistry and Pharmacognosy, Applied Science Private University, Amman, Jordan
  • Yazun Jarrar Department of Basic Medical Sciences, Faculty of Medicine, Al-Balqa Applied University, Al-Salt, Jordan
  • Raed H. Altabanjeh Department of Clinical Pharmacy and Therapeutics, Applied Science Private University, Amman, Jordan
  • Mamoon Zihlif Department of Internal Medicine, Section of Pulmonary, Islamic Hospital, Amman, Jordan
  • Osama Abu Ata Department of Internal Medicine, Section of Infectious Diseases, Islamic Hospital, Amman, Jordan
  • Anas Samara Department of Software Engineering, Bethlehem University, Bethlehem, Palestine
  • Malek Zihlif Department of Pharmacology, School of Medicine, The University of Jordan, Amman, Jordan

DOI:

https://doi.org/10.17305/bb.2025.12378

Keywords:

Community-acquired pneumonia, CAP, machine learning, ML, mortality prediction, risk assessment, clinical predictors, SHAP analysis, logistic regression

Abstract

Community-acquired pneumonia (CAP) is associated with high mortality, and accurate diagnosis and risk prediction are essential for improving patient outcomes. Traditional diagnostic methods have limitations, prompting the use of machine learning (ML) to enhance diagnostic precision and treatment strategies. This study aims to develop ML models to predict CAP etiology and mortality using clinical data to enable early intervention. A retrospective cohort study was conducted on 251 adult CAP patients admitted to two Jordanian hospitals between March 2021 and February 2024. Various clinical data were analyzed using ML techniques, including linear regression, random forest, SHapley Additive exPlanations (SHAP), lasso regression, mutual information analysis, logistic regression, and correlation analysis. Key predictors of CAP survival included zinc, vitamin C, enoxaparin, and insulin bolus. Mutual information analysis identified neutrophils, alanine transaminase, mean corpuscular volume, hemoglobin, and platelets as significant mortality predictors, while lasso regression highlighted meropenem, arterial blood gases, PCO₂, and platelet count. Logistic regression confirmed intensive care unit (ICU) stay, pH, pulmonary severity index, white blood cell (WBC) count, and bicarbonate levels as crucial variables. Interestingly, lymphocyte count emerged as the strongest predictor of bacterial CAP, conflicting with established knowledge that associates neutrophils with bacterial infections. However, findings related to HCO₃, blood urea nitrogen, and WBC levels were consistent with clinical expectations. SHAP analysis highlighted basophils and fever as key predictors. Further investigation is needed to resolve conflicting findings and optimize predictive models. ML offers promising applications for CAP prognosis but requires refinement to address discrepancies and improve reliability in clinical decision-making.

Citations

Downloads

Download data is not yet available.
Unveiling etiology and mortality risks in community-acquired pneumonia: A machine learning approach

Published

02-07-2025

Issue

Section

Thematic issue: AI/ML in diseases

Categories

How to Cite

1.
Unveiling etiology and mortality risks in community-acquired pneumonia: A machine learning approach. Biomol Biomed [Internet]. 2025 Jul. 2 [cited 2025 Jul. 3];. Available from: https://www.bjbms.org/ojs/index.php/bjbms/article/view/12378