Analyzing Household Expenditures with Generalized Random Forests
Uložené v:
| Názov: | Analyzing Household Expenditures with Generalized Random Forests |
|---|---|
| Autori: | Isnanda, Eriski, Notodiputro, Khairil Anwar, Sadik, Kusman |
| Zdroj: | CAUCHY: Jurnal Matematika Murni dan Aplikasi; Vol 10, No 1 (2025): CAUCHY: JURNAL MATEMATIKA MURNI DAN APLIKASI; 166-179 ; 2477-3344 ; 2086-0382 |
| Informácie o vydavateľovi: | Mathematics Department, Universitas Islam Negeri Maulana Malik Ibrahim Malang |
| Rok vydania: | 2025 |
| Zbierka: | Jurnal Universitas Islam Negeri Maulana Malik Ibrahim Malang |
| Predmety: | Sains, Statistika dan Sains Data, generalized linear mixed model, generalized random forest, household per capita expenditure, random forest, winsorization |
| Popis: | This study investigates the performance of Generalized Random Forest (GRF), which has been known to be useful in understanding heterogeneous treatment effects (HTE) and non-linear relationships in high-dimensional data. In this paper the performance of GRF was compared with Random Forest (RF), Generalized Linear Mixed Model (GLMM) as continuation of previous study conducted by Athey (2019). The data utilized in this study is from the National Socioeconomic Survey (SUSENAS) to predict household per capita expenditure in West Java, Indonesia. The models are evaluated based on their ability to handle outliers using Winsorization. The results show that RF performed the best, yielding the smallest MSE values, followed by GRF with reasonably good performance, and GLMM with the highest MSE, indicating its limitations in handling non-linear data patterns. These findings indicate that RF is the most accurate method for modeling per capita expenditure in West Java, with recommendations for further research to develop hybrid methods or use more specific random effects in GLMM |
| Druh dokumentu: | article in journal/newspaper |
| Popis súboru: | application/pdf |
| Jazyk: | English |
| Relation: | http://ejournal.uin-malang.ac.id/index.php/Math/article/view/30104/pdf |
| DOI: | 10.18860/cauchy.v10i1.30104 |
| Dostupnosť: | http://ejournal.uin-malang.ac.id/index.php/Math/article/view/30104 https://doi.org/10.18860/cauchy.v10i1.30104 |
| Rights: | Copyright (c) 2025 Eriski Isnanda, Khairil Anwar Notodiputro, Kusman Sadik ; https://creativecommons.org/licenses/by-sa/4.0 |
| Prístupové číslo: | edsbas.314DBE23 |
| Databáza: | BASE |
| Abstrakt: | This study investigates the performance of Generalized Random Forest (GRF), which has been known to be useful in understanding heterogeneous treatment effects (HTE) and non-linear relationships in high-dimensional data. In this paper the performance of GRF was compared with Random Forest (RF), Generalized Linear Mixed Model (GLMM) as continuation of previous study conducted by Athey (2019). The data utilized in this study is from the National Socioeconomic Survey (SUSENAS) to predict household per capita expenditure in West Java, Indonesia. The models are evaluated based on their ability to handle outliers using Winsorization. The results show that RF performed the best, yielding the smallest MSE values, followed by GRF with reasonably good performance, and GLMM with the highest MSE, indicating its limitations in handling non-linear data patterns. These findings indicate that RF is the most accurate method for modeling per capita expenditure in West Java, with recommendations for further research to develop hybrid methods or use more specific random effects in GLMM |
|---|---|
| DOI: | 10.18860/cauchy.v10i1.30104 |
Nájsť tento článok vo Web of Science