Basar, RemziOcak, OznurErturk, Alperde la Roche, Marcelle2025-10-112025-10-1120252078-2489https://doi.org/10.3390/info16080639https://hdl.handle.net/20.500.12684/21565Cardiovascular diseases (CVDs) remain the leading cause of death globally, underscoring the urgent need for data-driven early diagnostic tools. This study proposes a multilayer artificial neural network (ANN) model for heart disease prediction, developed using a real-world clinical dataset comprising 13,981 patient records. Implemented on the Orange data mining platform, the ANN was trained using backpropagation and validated through 10-fold cross-validation. Dimensionality reduction via principal component analysis (PCA) enhanced computational efficiency, while Shapley additive explanations (SHAP) were used to interpret model outputs. Despite achieving 83.4% accuracy and high specificity, the model exhibited poor sensitivity to disease cases, identifying only 76 of 2233 positive samples, with a Matthews correlation coefficient (MCC) of 0.058. Comparative benchmarks showed that random forest and support vector machines significantly outperformed the ANN in terms of discrimination (AUC up to 91.6%). SHAP analysis revealed serum creatinine, diabetes, and hemoglobin levels to be the dominant predictors. To address the current study's limitations, future work will explore LIME, Grad-CAM, and ensemble techniques like XGBoost to improve interpretability and balance. This research emphasizes the importance of explainability, data representativeness, and robust evaluation in the development of clinically reliable AI tools for heart disease detection.en10.3390/info16080639info:eu-repo/semantics/openAccessheart disease predictionartificial neural network (ANN)machine learningSHAPmedical diagnosticsensemble learningclass imbalanceLeveraging Machine Learning Techniques to Predict Cardiovascular Heart DiseaseArticle1682-s2.0-105014328245WOS:001557688000001N/AN/A