BACKGROUND AND AIMS
ST-elevation myocardial infarction (STEMI) is a leading cause of mortality, with significant variations in risk profiles among the Asian population. Traditional risk scores, such as the Thrombolysis in Myocardial Infarction (TIMI) score, were developed based on Western cohorts and have limited predictive accuracy for Asian patients. Machine learning models have shown superior performance but remain underutilised due to a lack of accessible and interpretable tools.
PURPOSE
This study aims to develop a web-based application that integrates ensemble learning, explainable AI, and a chatbot for STEMI mortality risk prediction.1 The system provides transparent assessments, enhancing clinical decision-making for Asian healthcare providers.
METHODS
A dataset of 42,348 STEMI cases (2006–2019) from the National Cardiovascular Disease Database (NCVD) registry with 54 clinical features was used. Data preprocessing included handling missing values, outlier detection, and feature normalisation. Feature selection was performed using recursive feature elimination and expert input.
Ensemble models, including random forest, gradient boosting, XGBoost, and stack ensemble, were trained and validated using 10-fold cross-validation. SHapley Additive exPlanations (SHAP) provided interpretability of model predictions, with a user-friendly interface displaying summary plots and individual risk factor explanations.
A chatbot, powered by a fine-tuned large language model, was developed for real-time guidance. The chatbot was localised with region-specific clinical guidelines, terminology, and frequently asked questions to improve usability for Asian healthcare providers. The LangChain framework (LangChain, Inc., San Francisco, California, USA) facilitated seamless knowledge retrieval, enhancing chatbot interactions.
A prospective data collection feature enabled continuous model validation and refinement by incorporating new patient data over time.
RESULTS
The ensemble models outperformed traditional risk scoring methods, achieving an area under the curve score of 0.96, a recall score of 0.89, and a precision of 0.61 for in-hospital mortality prediction, compared to the TIMI risk score (area under the curve: 0.81). SHAP analysis identified key predictors, providing interpretable insights into risk stratification. The chatbot improved accessibility, offering real-time assistance for clinicians in risk assessment. The prospective data collection feature ensured ongoing model updates, maintaining predictive accuracy and clinical relevance.
CONCLUSION
This study presents a web-based STEMI risk prediction tool that integrates ensemble learning, explainable AI, and a chatbot, tailored for the Asian population. The system enhances predictive accuracy, interpretability, and usability, bridging the gap between advanced machine learning models and clinical practice. By combining AI-driven risk assessment with a user-friendly interface, this tool provides a scalable solution for improving STEMI outcomes in Asian healthcare settings.