EXPLORATORY EVALUATION OF MACHINE LEARNING ALGORITHMS IN SICKLE CELL GENOTYPE DETECTION FROM HAEMORHEOLOGICAL PARAMETERS DATASET OBTAINED IN NIGERIA
DOI:
https://doi.org/10.5281/zenodo.18104588Keywords:
Machine learning, sickle cell disease, haemorheological parameters, genotype prediction, Random Forest, Support Vector Machine, Neural NetworkAbstract
Sickle cell disease (SCD) patients have characteristic abnormal haemoglobins that cause red blood cells to become sickle-like in shape, leading to various complications. Early detection is desirous, yet existing diagnostic methods require high cost and deep learning curves. This study evaluated the potential of three machine learning (ML) algorithms—Random Forest, Support Vector Machine (SVM), and a Neural Network—in detecting sickle cell genotypes (SS, AS, AA) from a Nigerian dataset of 54 participants using haemorheological parameters. We employed a stratified 5-fold cross-validation methodology to ensure reliable performance evaluation. The Random Forest and SVM models achieved the highest mean accuracy at 90.9% ± 5.8%. Feature importance analysis confirmed Packed Cell Volume (PCV) as the most discriminative parameter, followed by Plasma Viscosity (PV) and Age. While all models demonstrated high sensitivity in identifying sickle cell anaemia (SS), they consistently failed to correctly classify the sickle cell trait (AS), a critical limitation highlighted by the validation. Our findings suggest that ML leveraging routine lab parameters is a promising screening tool for sickle cell disease, but is not yet viable for comprehensive genotype classification due to challenges with small dataset size and class imbalance. Future works need to focus on acquiring larger, more balanced datasets to improve the detection of the AS trait.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Rahman Abiodun Olalekan, Ilesanmi Paul IGE, Oludare Alani AGBEYANGI, Daniel Paditeiye REUBEN

This work is licensed under a Creative Commons Attribution 4.0 International License.
This is an open-access journal which means that all content is freely available without charge to the user or his/her institution. Users are allowed to read, download, copy, distribute, print, search, or link to the full texts of the articles, or use them for any other lawful purpose, without asking prior permission from the publisher or the author.
The Authors own the copyright of the articles.
