Wijayanti, Ella Budi and Setiadi, De Rosal Ignatius Moses and Setyoko, Bimo Haryo (2024) Dataset Analysis and Feature Characteristics to Predict Rice Production based on eXtreme Gradient Boosting. Journal of Computing Theories and Applications, 1 (3). pp. 299-310. ISSN 3024-9104
Abstract
Rice plays a vital role as the main food source for almost half of the global population, contributing more than 21% of the total calories humans need. Production predictions are important for determining import-export policies. This research proposes the XGBoost method to predict rice harvests globally using FAO and World Bank datasets. Feature analysis, removal of duplicate data, and parameter tuning were carried out to support the performance of the XGBoost method. The results showed excellent performance based on the coefficient of determination (R²), which reached 0.99. Evaluation of model performance using metrics such as MSE and MAE, measured by k-fold validation, shows that XGBoost predicts crop yields more accurately than other regression methods such as Random Forest (RF), Gradient Boost (GB), Bagging Regressor (BR), and K-Nearest Neighbor (KNN). In addition, an ablation study was carried out by comparing the performance of each model across various feature sets and against state-of-the-art methods. The results demonstrate the superiority of the proposed XGBoost method. With consistent results and better performance, this model can effectively support agricultural sustainability, especially rice production.
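The evaluation procedure named in the abstract (duplicate removal followed by k-fold validation with MSE and MAE) can be illustrated with a minimal, standard-library sketch. Everything concrete here is an assumption for illustration only: the data are synthetic, the `(area, production)` column meaning is hypothetical, R² is included as a common companion metric, and a one-feature least-squares line stands in for the regressor. It is not XGBoost and not the authors' pipeline.

```python
from statistics import mean


def mse(y_true, y_pred):
    """Mean squared error."""
    return mean((t - p) ** 2 for t, p in zip(y_true, y_pred))


def mae(y_true, y_pred):
    """Mean absolute error."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def drop_duplicates(rows):
    """Remove exact duplicate rows while keeping first-seen order."""
    seen, unique = set(), []
    for row in rows:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique


def fit_line(xs, ys):
    """Stand-in model (NOT XGBoost): ordinary least squares on one feature."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    intercept = y_bar - slope * x_bar
    return lambda x: slope * x + intercept


def k_fold_scores(rows, k=5):
    """Hold out each of k interleaved folds once; score the model on it."""
    folds = [rows[i::k] for i in range(k)]
    scores = []
    for i, test in enumerate(folds):
        train = [r for j, fold in enumerate(folds) if j != i for r in fold]
        model = fit_line([x for x, _ in train], [y for _, y in train])
        y_true = [y for _, y in test]
        y_pred = [model(x) for x, _ in test]
        scores.append({
            "mse": mse(y_true, y_pred),
            "mae": mae(y_true, y_pred),
            "r2": r2(y_true, y_pred),
        })
    return scores


# Synthetic, hypothetical (area, production) pairs with three duplicated rows.
rows = [(float(x), 4.0 * x + 1.0) for x in range(1, 21)]
rows += rows[:3]
rows = drop_duplicates(rows)

scores = k_fold_scores(rows, k=5)
avg = {m: mean(s[m] for s in scores) for m in ("mse", "mae", "r2")}
print(avg)
```

Averaging each metric over the folds, as in `avg`, gives one figure per metric that can be compared across candidate regressors; in practice the stand-in model would be replaced by the regressors being compared.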
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Depositing User: | dl fts |
Date Deposited: | 29 Nov 2024 00:51 |
Last Modified: | 29 Nov 2024 01:26 |
URI: | https://dl.futuretechsci.org/id/eprint/43 |