A novel hybrid binary whale optimization algorithm with chameleon hunting mechanism for wrapper feature selection in QSAR classification model:A drug-induced liver injury case study
High dimensionality is one of the main challenges in Quantitative Structure-Activity Relationship (QSAR) classification modeling, and feature selection as an effective dimensionality reduction method plays an important role in machine learning, particularly in fields such as chemometrics. In this pa...
Gespeichert in:
| Veröffentlicht in: | Expert systems with applications Jg. 234; S. 121015 |
|---|---|
| Hauptverfasser: | , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
Elsevier Ltd
30.12.2023
|
| Schlagworte: | |
| ISSN: | 0957-4174, 1873-6793 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | High dimensionality is one of the main challenges in Quantitative Structure-Activity Relationship (QSAR) classification modeling, and feature selection as an effective dimensionality reduction method plays an important role in machine learning, particularly in fields such as chemometrics. In this paper, for feature selection in QSAR classification modeling, a hybrid whale optimization algorithm (WOA) with a chameleon hunting mechanism (HWOA-CHM) is proposed, and its binary version is used to find the best subset for wrapper feature selection in the QSAR classification model. First, a chaos weighting factor is introduced and used as a perturbation factor to increase the diversity of populations. Second, a retractable transformation strategy is designed to prevent the HWOA-CHM from falling into a local optimum. Third, the chameleon predation mechanism is introduced to improve the convergence accuracy of the HWOA-CHM. The performance of HWOA-CHM is evaluated and compared with state-of-the-art classical algorithms and well-known WOA variants. Then, a binary HWOA-CHM (BHWOA-CHM) was designed to solve the feature selection, the BHWOA-CHM is validated using the UCI machine learning repository and compared with binary version WOA, and well-known WOA variants in terms of accuracy, number of features, and time. Finally, BHWOA-CHM was used to solve the high-dimensional feature selection problem in the drug-induced liver injury classification model. It has shown excellent results in terms of feature selection compared to other methods. The proposed method effectively improves the robustness of QSAR predictions while reducing the complexity of the feature sets, demonstrating its potential for improving the accuracy of QSAR models.
•A new wrapper-based feature selection approach based on HWOA-CHM is proposed.•The iterative map is used to enhance the diversity in the search process.•A retractable transformation is presented to prevent the HWOA-CHM from falling into local optima.•The chameleon predation mechanism is introduced to improve the convergence accuracy.•BHWOA-CHM is competitive in QSAR classification models compared to other algorithms. |
|---|---|
| ISSN: | 0957-4174 1873-6793 |
| DOI: | 10.1016/j.eswa.2023.121015 |