Zhang, Xudong and Cordeiro de Amorim, Renato (2026) Scalable unsupervised feature selection via weight stability. Information Sciences, 755. p. 123807. DOI https://doi.org/10.1016/j.ins.2026.123807
Zhang, Xudong and Cordeiro de Amorim, Renato (2026) Scalable unsupervised feature selection via weight stability. Information Sciences, 755. p. 123807. DOI https://doi.org/10.1016/j.ins.2026.123807
Zhang, Xudong and Cordeiro de Amorim, Renato (2026) Scalable unsupervised feature selection via weight stability. Information Sciences, 755. p. 123807. DOI https://doi.org/10.1016/j.ins.2026.123807
Abstract
Unsupervised feature selection is critical for improving clustering performance in high-dimensional data, where irrelevant features can obscure meaningful structure. In this work, we propose the Minkowski weighted k-means++, a novel initialisation strategy for the Minkowski Weighted k-means. Our initialisation selects centroids probabilistically using feature relevance estimates derived from the data itself. Building on this, we propose two new feature selection algorithms, FS-MWK++, which aggregates feature weights across a range of Minkowski exponents identifying stable and informative features, and SFS-MWK++, a scalable variant based on subsampling. We support our approach with a theoretical analysis, demonstrating that, under explicit assumptions on noise features and cluster structure, relevant features are assigned consistently higher weights than noise features across a range of Minkowski exponents. Our software can be found at https://github.com/xzhang4-ops1/FSMWK.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | unsupervised feature selection, clustering, noisy data |
| Subjects: | Z Bibliography. Library Science. Information Resources > ZR Rights Retention |
| Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
| SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
| Depositing User: | Unnamed user with email elements@essex.ac.uk |
| Date Deposited: | 26 Jun 2026 10:29 |
| Last Modified: | 26 Jun 2026 10:29 |
| URI: | http://repository.essex.ac.uk/id/eprint/43444 |
Available files
Filename: Unsupervised_feature_selection__Arxiv_.pdf
Licence: Creative Commons: Attribution 4.0