Fawley, Richard J and Cordeiro de Amorim, Renato (2026) Shapley-Inspired Feature Weighting in k-means with No Additional Hyperparameters. Expert Systems With Applications. p. 133406. DOI https://doi.org/10.1016/j.eswa.2026.133406
Fawley, Richard J and Cordeiro de Amorim, Renato (2026) Shapley-Inspired Feature Weighting in k-means with No Additional Hyperparameters. Expert Systems With Applications. p. 133406. DOI https://doi.org/10.1016/j.eswa.2026.133406
Fawley, Richard J and Cordeiro de Amorim, Renato (2026) Shapley-Inspired Feature Weighting in k-means with No Additional Hyperparameters. Expert Systems With Applications. p. 133406. DOI https://doi.org/10.1016/j.eswa.2026.133406
Abstract
Clustering algorithms often assume all features contribute equally to the data structure, an assumption that usually fails in high-dimensional or noisy settings. Feature weighting methods can address this, but most require additional parameter tuning. We propose SHARK (Shapley Reweighted k-means), a feature-weighted clustering algorithm motivated by the use of Shapley values from cooperative game theory to quantify feature relevance, which requires no additional parameters beyond those in k-means. We prove that the k-means objective can be decomposed into a sum of per-feature Shapley values, providing an axiomatic foundation for unsupervised feature relevance and reducing Shapley computation from exponential to polynomial time. SHARK iteratively re-weights features by the inverse of their Shapley contribution, emphasising informative dimensions and down-weighting irrelevant ones, and is equivalent to replacing the arithmetic mean of feature dispersions with their harmonic mean. Experiments on synthetic and real-world data sets show that SHARK consistently matches or outperforms existing methods, achieving superior robustness and accuracy, particularly in scenarios where noise may be present. Software: https://github.com/rickfawley/SHARK
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | feature weighting; clustering; Shapley values; unsupervised feature selection; noisy data |
| Subjects: | Z Bibliography. Library Science. Information Resources > ZR Rights Retention |
| Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
| SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
| Depositing User: | Unnamed user with email elements@essex.ac.uk |
| Date Deposited: | 26 Jun 2026 12:14 |
| Last Modified: | 26 Jun 2026 12:15 |
| URI: | http://repository.essex.ac.uk/id/eprint/43464 |
Available files
Filename: Shapley_Weighting_Paper__Revised_ (3).pdf
Licence: Creative Commons: Attribution 4.0