收藏设为首页 广告服务联系我们在线留言

干旱区地理 ›› 2023, Vol. 46 ›› Issue (10): 1643-1653.doi: 10.12118/j.issn.1000-6060.2023.034

• 生物与土壤 • 上一篇    下一篇

基于XGBoost模型的湟水流域耕地土壤养分遥感反演

刘尊方1(),雷浩川1(),盛海彦2   

  1. 1.青海大学地质工程系,青海 西宁 810016
    2.青海大学农牧学院,青海 西宁 810016
  • 收稿日期:2023-01-19 修回日期:2023-04-17 出版日期:2023-10-25 发布日期:2023-11-10
  • 通讯作者: 雷浩川(1973-),男,博士,讲师,主要从事遥感与GIS应用等方面的研究. E-mail: 56242188@qq.com
  • 作者简介:刘尊方(1999-),男,硕士研究生,主要从事土壤定量遥感等方面的研究. E-mail: lzunfang@163.com
  • 基金资助:
    国家自然科学基金项目(U20A20115);青海大学创新创业工坊项目(GF-20230005)

Remote sensing inversion of soil nutrient on farmland in Huangshui River Basin based on XGBoost model

LIU Zunfang1(),LEI Haochuan1(),SHENG Haiyan2   

  1. 1. Department of Geological Engineering, Qinghai University, Xining 810016, Qinghai, China
    2. College of Agriculture and Animal Husbandry, Qinghai University, Xining 810016, Qinghai, China
  • Received:2023-01-19 Revised:2023-04-17 Online:2023-10-25 Published:2023-11-10

摘要:

湟水流域是河湟谷地重要的组成部分,协同环境因素预测土壤养分空间分布对农业土壤养分管理尤为重要。土壤养分反演研究中对于参数对模型结果的影响和模型适用性的研究较少。选取研究区地形因子、土壤pH及光谱反射率共28个因子,结合贝叶斯优化算法构建人工神经网络(ANN)、支持向量机(SVM)和极端梯度提升(XGBoost)3种机器学习模型预测耕地土壤养分空间分布,计算决定系数(R2)、均方根误差(RMSE)和相对分析误差(RPD)评价3种模型的精度。结果表明:(1) 基于贝叶斯优化超参数的XGBoost模型对全氮(TN)含量预测精度优于其他模型(R2=0.893,RMSE=0.359,RPD=2.470),预测土壤有机质(SOM)、速效磷(AP)和速效钾(AK)含量时,XGBoost模型验证集R2分别为0.801、0.509、0.442。(2) 对比3种模型的寻优次数和误差发现,BOA-XGBoost模型参数优化次数少、效率高,具有更好的鲁棒性。对于不同的养分,ANN和SVM模型预测精度存在差异,SVM模型预测SOM含量时精度更高(RPD=1.580),而ANN模型预测TN时精度最佳(RPD=2.460)。基于贝叶斯算法进行超参数优化构建的XGBoost模型预测精度高,可以达到良好的预测效果,可为湟水流域精准农业施肥提供参考。

关键词: 土壤养分, XGBoost, 空间分布, 环境因子, 湟水流域

Abstract:

The Huangshui River Basin is an important part of the Huangshui Valley. Additionally, collaborative environmental factors that predict the spatial distribution of soil nutrients are particularly important for managing soil nutrients. Moreover, less attention is paid to the effect of model parameters on the results obtained from soil nutrient inversion studies. In this study, the Huangshui River Basin in Qinghai Province (China) was selected as the study area, and 28 factors, including elevation, aspect, slope, plane curvature, section curvature, relief degree of land surface, topographic wetness index, soil pH, and spectral reflectance, were selected. In addition, these factors were combined with the Bayesian optimization algorithm (BOA) to construct artificial neural network (ANN), support vector machine (SVM), and extreme gradient boosting (XGBoost) machine learning models for predicting the spatial distribution of four soil nutrients in farmlands: soil organic matter (SOM), total nitrogen (TN), available phosphorus (AP), and available potassium (AK), respectively. Further, the prediction accuracy of these three models was evaluated based on the model coefficient of determination (R2), root-mean-square error (RMSE), and relative percent deviation (RPD). The results revealed that: (1) All four soil nutrients exhibited a moderate degree of variability, with TN showing the highest variability of 69.481%. The XGBoost model based on the Bayesian optimized hyperparameter combination was better than other models in predicting the TN content (R2, RMSE, and RPD were 0.893, 0.359, and 2.470, respectively). The R2 values of the XGBoost model validation set for estimating the SOM, AK, and AP contents were 0.801, 0.509, and 0.442, respectively, and the corresponding RPD values were 2.152, 1.210, and 1.274, respectively. Moreover, this model exhibited a better prediction capability. (2) The comparison of the number of optimizations and errors of the three models revealed that the BOA-XGBoost model exhibited minimum number of parameter optimizations, higher efficiency, and better robustness. The ANN and SVM models demonstrated different prediction accuracies for different nutrients; additionally, the SVM model predicted the SOM content with high accuracy (RPD=1.580), while the ANN model predicted TN efficiently (RPD=2.460). Based on Landsat 8 remote sensing images, the XGBoost inversion model developed by combining 28 factors of the Huangshui River Basin was found to be more suitable for application in soil nutrient inversion research; furthermore, it can more accurately describe the spatial distribution pattern of the soil nutrient inversion in the Huangshui River Basin, better ensure precise agriculture fertilization, improve the fertilizer utilization rate and crop yield, and provide a reference for precise agriculture fertilization in the Huangshui River Basin.

Key words: soil nutrient, XGBoost, spatial distribution, environmental factor, Huangshui River Basin