Fuzzy C-means clustering-based multi-label feature selection via weighted neighborhood mutual information

•FCM cluster centers are initialized via WOA, and updated in iteratives to optimize the search.•The fusion of WOA and FCM converges quickly and effectively avoids falling into local optimum.•An association matrix is constructed through fuzzy synthesis to develop label enhancement.•Weighted neighborh...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Information sciences Ročník 718; s. 122389
Hlavní autoři: Sun, Lin, Guo, Jiaqi, Wu, Xuejiao, Xu, Jiucheng
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier Inc 01.11.2025
Témata:
ISSN:0020-0255
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:•FCM cluster centers are initialized via WOA, and updated in iteratives to optimize the search.•The fusion of WOA and FCM converges quickly and effectively avoids falling into local optimum.•An association matrix is constructed through fuzzy synthesis to develop label enhancement.•Weighted neighborhood mutual information can handle the unbalanced label and redundancy.•Experiments show superior classification effect of the designed method on multi-label data. As an effective soft computing model, fuzzy C-means (FCM) clustering can efficiently deal with uncertainty problems; however, it is usually sensitive to the selection of the initial cluster center and may fall into local optimization for classification; and existing multi-label feature selection algorithms ignore the potential relationship between labels and samples. To solve these problems, this article develops hybrid soft computing methodologies for feature selection employing FCM clustering and weighted neighborhood-based mutual information on multi-label classification tasks. First, cluster centroids are initialized stochastically via the whale optimization algorithm (WOA), and then the cluster center is updated in conjunction with the FCM clustering during the iterative process. By fusing soft computing methods of WOA and FCM clustering, this combination converges quickly and effectively avoids falling into the local optimal solution. Secondly, based on the membership degrees of each sample derived from FCM clustering, an association matrix is established, followed by a fuzzy synthesis operation to formulate a label enhancement strategy. Thirdly, with the advantage of handling unbalanced labels via neighborhood mutual information, feature weight is applied to calculate the weighted neighborhood of samples by calculating maximum information coefficient, and the redundancy among candidate and selected features is quantified through the application of weighted neighborhood-based mutual information. Finally, a feature selection algorithm utilizing label enhancement and weighted neighborhood-based mutual information is developed, and those competitive experiments will be carried out across 11 multi-label datasets. These results demonstrate that this methodology significantly enhances the classification efficacy on multi-label datasets.
ISSN:0020-0255
DOI:10.1016/j.ins.2025.122389