Comparing machine learning approaches for predicting spatially explicit life cycle global warming and eutrophication impacts from corn production

XX Romeiko, Z Guo, Y Pang, EK Lee, X Zhang - Sustainability, 2020 - mdpi.com
Sustainability, 2020mdpi.com
Agriculture ranks as one of the top contributors to global warming and nutrient pollution.
Quantifying life cycle environmental impacts from agricultural production serves as a
scientific foundation for forming effective remediation strategies. However, methods capable
of accurately and efficiently calculating spatially explicit life cycle global warming (GW) and
eutrophication (EU) impacts at the county scale over a geographic region are lacking. The
objective of this study was to determine the most efficient and accurate model for estimating …
Agriculture ranks as one of the top contributors to global warming and nutrient pollution. Quantifying life cycle environmental impacts from agricultural production serves as a scientific foundation for forming effective remediation strategies. However, methods capable of accurately and efficiently calculating spatially explicit life cycle global warming (GW) and eutrophication (EU) impacts at the county scale over a geographic region are lacking. The objective of this study was to determine the most efficient and accurate model for estimating spatially explicit life cycle GW and EU impacts at the county scale, with corn production in the U.S.’s Midwest region as a case study. This study compared the predictive accuracies and efficiencies of five distinct supervised machine learning (ML) algorithms, testing various sample sizes and feature selections. The results indicated that the gradient boosting regression tree model built with approximately 4000 records of monthly weather features yielded the highest predictive accuracy with cross-validation (CV) values of 0.8 for the life cycle GW impacts. The gradient boosting regression tree model built with nearly 6000 records of monthly weather features showed the highest predictive accuracy with CV values of 0.87 for the life cycle EU impacts based on all modeling scenarios. Moreover, predictive accuracy was improved at the cost of simulation time. The gradient boosting regression tree model required the longest training time. ML algorithms demonstrated to be one million times faster than the traditional process-based model with high predictive accuracy. This indicates that ML can serve as an alternative surrogate of process-based models to estimate life-cycle environmental impacts, capturing large geographic areas and timeframes.
MDPI
以上显示的是最相近的搜索结果。 查看全部搜索结果