Wilcox, Colin and Giagos, Vasileios and Djahel, Soufiene (2023) A Neighborhood-Similarity-Based Imputation Algorithm for Healthcare Data Sets: A Comparative Study. Electronics, 12 (23). p. 4809. DOI https://doi.org/10.3390/electronics12234809
Wilcox, Colin and Giagos, Vasileios and Djahel, Soufiene (2023) A Neighborhood-Similarity-Based Imputation Algorithm for Healthcare Data Sets: A Comparative Study. Electronics, 12 (23). p. 4809. DOI https://doi.org/10.3390/electronics12234809
Wilcox, Colin and Giagos, Vasileios and Djahel, Soufiene (2023) A Neighborhood-Similarity-Based Imputation Algorithm for Healthcare Data Sets: A Comparative Study. Electronics, 12 (23). p. 4809. DOI https://doi.org/10.3390/electronics12234809
Abstract
The increasing computerisation of medical services has highlighted inconsistencies in the way in which patients’ historic medical data were recorded. Differences in process and practice between medical services and facilities have led to many incomplete and inaccurate medical histories being recorded. To create a single point of truth going forward, it is necessary to correct these inconsistencies. A common way to do this has been to use imputation techniques to predict missing data values based on the known values in the data set. In this paper, we propose a neighborhood similarity measure-based imputation technique and analyze its achieved prediction accuracy in comparison with a number of traditional imputation methods using both an incomplete anonymized diabetes medical data set and a number of simulated data sets as the sources of our data. The aim is to determine whether any improvement could be made in the accuracy of predicting a diabetes diagnosis using the known outcomes of the diabetes patients’ data set. The obtained results have proven the effectiveness of our proposed approach compared to other state-of-the-art single-pass imputation techniques.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | healthcare; imputation algorithms; incomplete data; neighborhood similarity |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Mathematics, Statistics and Actuarial Science, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 27 Feb 2025 16:17 |
Last Modified: | 27 Feb 2025 16:17 |
URI: | http://repository.essex.ac.uk/id/eprint/37298 |
Available files
Filename: electronics-12-04809.pdf
Licence: Creative Commons: Attribution 4.0