Weighted CART with spatially split rules: A new kernel-based approach in spatial classification trees.

Uložené v:
Podrobná bibliografia
Názov: Weighted CART with spatially split rules: A new kernel-based approach in spatial classification trees.
Autori: Alami, Tahereh1 (AUTHOR), Doostparast, Mahdi1 (AUTHOR) doustparast@um.ac.ir
Zdroj: Communications in Statistics: Theory & Methods. 2025, Vol. 54 Issue 22, p7336-7361. 26p.
Predmety: MACHINE learning, SPATIAL data structures, SPATIAL variation, GEOGRAPHIC spatial analysis
Abstrakt: Machine learning algorithms are commonly utilized for independent observations and may not be efficient for analyzing spatial data sets since dependency is present in all directions. In spatial data sets, due to spatial correlations between observations, the independence assumption is violated; thus, some modifications are necessary for accurate analysis. In this article, the well-known CART algorithm is adapted to the spatial domain by proposing two new approaches, namely kernel-based weighted CART and kernel-based weighted CART with spatially split rules. The first approach suggests kernel-based weights as a new method for weighting the observations according to their spatial locations. The weights are inversely proportional to the density of the data location. We also use the Voronoi and Kriging weights, all of which assign less weight to clustered data than others. The second approach is defined based on spatial entropy as an impurity criterion in the weighted CART to achieve a more accurate algorithm. Findings are evaluated by conducting simulation studies and analyzing a real data set to highlight the advantages and limitations of the proposed approaches. The numerical results indicate that the proposed W-CART-SSR algorithm with class-based kernel weight outperforms other algorithms in terms of accuracy, tree structure, and implementation time. [ABSTRACT FROM AUTHOR]
Copyright of Communications in Statistics: Theory & Methods is the property of Taylor & Francis Ltd and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Databáza: Business Source Index
Popis
Abstrakt:Machine learning algorithms are commonly utilized for independent observations and may not be efficient for analyzing spatial data sets since dependency is present in all directions. In spatial data sets, due to spatial correlations between observations, the independence assumption is violated; thus, some modifications are necessary for accurate analysis. In this article, the well-known CART algorithm is adapted to the spatial domain by proposing two new approaches, namely kernel-based weighted CART and kernel-based weighted CART with spatially split rules. The first approach suggests kernel-based weights as a new method for weighting the observations according to their spatial locations. The weights are inversely proportional to the density of the data location. We also use the Voronoi and Kriging weights, all of which assign less weight to clustered data than others. The second approach is defined based on spatial entropy as an impurity criterion in the weighted CART to achieve a more accurate algorithm. Findings are evaluated by conducting simulation studies and analyzing a real data set to highlight the advantages and limitations of the proposed approaches. The numerical results indicate that the proposed W-CART-SSR algorithm with class-based kernel weight outperforms other algorithms in terms of accuracy, tree structure, and implementation time. [ABSTRACT FROM AUTHOR]
ISSN:03610926
DOI:10.1080/03610926.2025.2473608