A three-stage wavelength selection algorithm for near-infrared spectroscopy calibration

[Display omitted] •A three-stage wavelength selection method is constructed by combining CC and SWR.•Wavelength selection results meet requirements of MLR modeling method.•The three-stage wavelength selection method exhibits certain superiority over other methods. The near-infrared spectral data is...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy Jg. 324; S. 125029
Hauptverfasser: Feng, Xi-Yao, Chen, Zheng-Guang, Yi, Shu-Juan, Wang, Peng-Hui
Format: Journal Article
Sprache:Englisch
Veröffentlicht: England Elsevier B.V 05.01.2025
Schlagworte:
ISSN:1386-1425, 1873-3557, 1873-3557
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •A three-stage wavelength selection method is constructed by combining CC and SWR.•Wavelength selection results meet requirements of MLR modeling method.•The three-stage wavelength selection method exhibits certain superiority over other methods. The near-infrared spectral data is highly high dimensional and contains redundant information, it is necessary to identify the most representative characteristic wavelengths before modeling to improve model accuracy and reliability. At present, there are many methods for selecting the characteristic wavelengths of NIR spectroscopy, but the collinearity among wavelengths is still a main issue that leads to poor model effects. Therefore, this study proposes a three-stage wavelength selection algorithm (Stage III) to reduce redundancy in NIR spectral data and collinearity between wavelength variables, resulting in a simpler and more accurate predictive model. The research uses a public NIR data set of corn samples as its subject. Initially, the wavelengths with the higher correlation coefficients are chosen after calculating the relationship coefficients between every wavelength vector and the concentration vector. On this basis, the correlation coefficients between the vectors of each wavelength point are calculated, and those wavelength points with smaller correlation coefficients with other wavelength points are selected. Ultimately, the stepwise regression analysis selects the wavelengths that provide substantial value to the model as the variables for modeling, leading to the development of a multiple linear regression model. The results show that the model using the three-stage wavelength selection algorithm outperforms those using the full spectrum, Stages I and Stage II, and the coefficient of determination of the test set of the Stage III-MLR model achieved an accuracy of 0.9360. Instead of the successive projections algorithm (SPA), uninformative variable elimination (UVE), and competitive adaptive reweighted sampling (CARS), Stage III is better in the model prediction accuracy. Therefore, the three-stage wavelength selection algorithm is an effective wavelength selection algorithm that can effectively model NIR spectroscopy, reduce the collinearity between the wavelength variables, simplify the complexity of the model, and improve the prediction precision of the model.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1386-1425
1873-3557
1873-3557
DOI:10.1016/j.saa.2024.125029