基于特征筛选算法的数字土壤制图研究
作者:
基金项目:

国家自然科学基金项目(42171056,41877001)资助


Research on Digital Soil Mapping Based on Feature Selection Algorithm
Author:
Fund Project:

National Natural Science Foundation of China (Nos. 42171056,41877001)

  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献 [22]
  • |
  • 相似文献 [20]
  • | | |
  • 文章评论
    摘要:

    平缓地带数字土壤制图中,环境协变量的选择是提高制图精度的关键。已有研究证明遥感影像可作为推理制图的辅助因子,而如何确定环境因子推理制图时各自的权重已成为现阶段研究的重点。选取湖北省麻城市乘马岗镇为研究区,采用3种特征筛选方法进行有效环境变量筛选,探索参与平原-丘陵混合区域制图的因子并确定其重要性,依据选择的相对稳定的指标,进一步探索提高土壤类型制图准确性的途径。根据141个野外独立样点的检验结果表明:在推理制图中,遥感因子在平原区域的重要性程度高于丘陵区域,且遥感因子中归一化植被指数(NDVI)和均值(Mean)较为稳定;基于递归特征算法的按地形推理制图精度最高为75.89%,分别高于ReliefF算法和基于Tree的特征筛选算法13.48%和4.97%;此外3种特征筛选算法制图结果中,按地形因子分区制图的精度均高于整体区域制图。因此,遥感因子作为辅助手段参与推理过程可有效提高制图精度。本研究采用的特征挖掘与机器学习算法对提升土壤制图精度具有一定的理论意义。

    Abstract:

    ObjectiveTraditional digital soil mapping methods are unable to produce detailed soil maps within a reasonable cost and time. Digital soil mapping is a powerful technique, which is popular and widely used by scholars coupled with environmental covariates to map soil types or properties. The selection of environmental covariates is the key to ensuring the accuracy of mapping. Previous studies have proven that remote-sensing images can be used as auxiliary factors for reasoning mapping. Remote sensing data can provide rich soil landscape information, which is consistent with the core idea of using grids to express spatial changes of soil features in digital soil mapping. Moreover, remote sensing technology can obtain real-time information quickly. However, there are few relevant studies on how principal components and texture information of remote sensing factors contribute to the reasoning process. Thus, determining the weight of remote sensing factors in the reasoning process is the key content of this study, which is tested by the reliability of testing mapping results.MethodChengmagang Town, Macheng City, Hubei Province was selected as the study area. Using Chinese soil classification and soil type map with a spacing of 10 meters, which were extracted from the contour data and remote sensing image using a variety of feature selection algorithms to effective screening of variables, this study conducted the soil digital mapping by reasoning machine learning algorithms. Specifically, the recursive feature elimination screening algorithm, ReliefF algorithm and tree-based feature screening algorithm were used to rank all environmental factors in the whole area, plain and hilly areas of the study area, respectively. Then, it screened the effective environmental variables of environmental factors and analyzed the weight of remote sensing factors in the reasoning process. The factors involved in plant-hill region mapping were explored and their importance was determined. According to the selected relatively stable indicators, the gradient boosting decision tree model after parameter tuning of the Bayesian optimization algorithm based on TPE was used for modeling. Also, the mapping accuracy results after different feature screening algorithms were compared between the whole region and the terrain region to further explore ways to improve the accuracy of soil type mapping.ResultThe soil type inference map was verified by 141 independent field sampling sites. The results showed that the importance of remote sensing factors in the plain area was higher than that in the hilly area and the NDVI and Mean values of the remote sensing factors were relatively stable. The highest accuracy of topographical inference mapping based on the recursive feature algorithm was 75.89%, which was higher than the 13.48% and 4.97% of the ReliefF algorithm and tree-based feature screening algorithm, respectively. In addition, among the mapping results of the three feature screening algorithms, the accuracy of the mapping based on terrain factors was higher than that of the overall region mapping. It suggests that remote sensing factors as an auxiliary means to participate in the reasoning process can effectively improve mapping accuracy.ConclusionThis study uses a feature selection algorithm to select features with a strong correlation with soil types as auxiliary variables in the machine learning model. The method is efficient and cost-effective for soil type prediction. Compared to other methods, the soil type mapping method based on machine learning is advantageous and the feature mining and machine learning algorithms have theoretical significance and practical value.

    参考文献
    [1] Carré F, McBratney A B, Mayr T, et al. Digital soil assessments: Beyond DSM[J]. Geoderma, 2007, 142(1/2): 69—79.
    [2] Huang W, Xu W, Wang S Q, et al. Extraction of knowledge about soil-environment relationship based on an uncertainty model[J]. Acta Pedologica Sinica, 2018, 55(1): 54—63. 黄魏, 许伟, 汪善勤, 等. 基于不确定性模型的土壤——环境关系知识获取方法的研究[J]. 土壤学报, 2018, 55(1): 54—63.
    [3] Zhu A X, Li B L, Pei T, et al. Model and method of fine digital soil survey[M]. Beijing: Science Press, 2008: 21—26. 朱阿兴, 李宝林, 裴韬, 等. 精细数字土壤普查模型与方法[M]. 北京: 科学出版社, 2008: 21—26.
    [4] Huang W, Luo Y, Wang S Q, et al. Knowledge of soil-landscape model obtain from a soil map and mapping[J]. Acta Pedologica Sinica, 2016, 53(1): 72—80. 黄魏, 罗云, 汪善勤, 等. 基于传统土壤图的土壤—环境关系获取及推理制图研究[J]. 土壤学报, 2016, 53(1): 72—80.
    [5] Hao C K, Sun X L, Wang H L. Application of generalized linear statistical model to digital soil mapping in typical subtropical hilly region[J]. Journal of Soil Science, 2023, 60(4): 993—1006. 郝辰恺, 孙孝林, 王会利. 广义线性地统计模型在典型亚热带丘陵区数字土壤制图中的应用[J]. 土壤学报, 2023, 60(4): 993—1006.
    [6] Liu F, Geng X Y, Zhu A X, et al. Soil texture mapping over low relief areas using land surface feedback dynamic patterns extracted from MODIS[J]. Geoderma, 2012, 171/172: 44—52.
    [7] Guo S X, Zhu A X, Meng L K, et al. Unification of soil feedback patterns under different evaporation conditions to improve soil differentiation over flat area[J]. International Journal of Applied Earth Observation and Geoinformation, 2016, 49: 126—137.
    [8] Xu Y M, Li B, Shen X B, et al. Digital soil mapping of soil total nitrogen based on Landsat 8, Sentinel 2, and WorldView-2 images in smallholder farms in Yellow River Basin, China[J]. Environmental Monitoring and Assessment, 2022, 194(4): 282.
    [9] Zhang M W, Liu H J, Zhang M N, et al. Mapping soil organic matter and analyzing the prediction accuracy of typical cropland soil types on the northern Songnen plain[J]. Remote Sensing, 2021, 13(24): 5162.
    [10] Duan M Q, Zhang X G. Using remote sensing to identify soil types based on multiscale image texture features[J]. Computers and Electronics in Agriculture, 2021, 187: 106272.
    [11] Zhang Z H, Ding J L, Wang J Z, et al. Digital soil properties mapping by ensembling soil-environment relationship and machine learning in arid regions[J]. Scientia Agricultura Sinica, 2020, 53(3): 563—573. 张振华, 丁建丽, 王敬哲, 等. 集成土壤-环境关系与机器学习的干旱区土壤属性数字制图[J]. 中国农业科学, 2020, 53(3): 563—573.
    [12] Shi X, Long R, Dekett R, et al. Integrating different types of knowledge for digital soil mapping[J]. Soil Science Society of America Journal, 2009, 73(5): 1682—1692.
    [13] Cheng W, Zhu A X, Qin C Z, et al. Updating conventional soil maps by mining soil-environment relationships from individual soil polygons[J]. Journal of Integrative Agriculture, 2019, 18(2): 265—278.
    [14] Zhang S M, Xu M X, Zhang Z X, et al. Methods of sampling soil organic carbon in farmlands with different landform types on the loess plateau[J]. Journal of Natural Resources, 2018, 33(4): 634—643. 张圣民, 许明祥, 张志霞, 等. 黄土高原不同地貌类型区农田土壤有机碳采样布点方法研究[J]. 自然资源学报, 2018, 33(4): 634—643.
    [15] Qi F. Knowledge discovery from area-class resource maps: Data preprocessing for noise reduction[J]. Transactions in GIS, 2004, 8(3): 297—308.
    [16] Demarchi L, Kania A, Ciężkowski W, et al. Recursive feature elimination and random forest classification of natura 2000 grasslands in lowland river valleys of Poland based on airborne hyperspectral and LiDAR data fusion[J]. Remote Sensing, 2020, 12(11): 1842.
    [17] Kononenko I. Estimating attributes: Analysis and extensions of RELIEF[C]//Machine Learning: ECML-94, 1994: 171—182.
    [18] Ren J S, Wang R X, Liu G, et al. Partitioned relief-F method for dimensionality reduction of hyperspectral images[J]. Remote Sensing, 2020, 12(7): 1104.
    [19] Wang Z G, Wang G C, Zhang Y G, et al. Quantification of the effect of soil erosion factors on soil nutrients at a small watershed in the Loess Plateau, Northwest China[J]. Journal of Soils and Sediments, 2020, 20(2): 745—755.
    [20] Guo F, Xu Z, Ma H H, et al. Estimating chromium concentration in arable soil based on the optimal principal components by hyperspectral data[J]. Ecological Indicators, 2021, 133: 108400.
    [21] Wang T, Wang X F, Liu J Z. The prediction model of moisture content of young Aquilaria sinensis leaves based on RFE_RF algorithm[J]. Journal of Nanjing Forestry University(Natural Sciences Edition), 2022, 46(4): 177—184. 王甜, 王雪峰, 刘嘉政. 基于RFE_RF算法的幼龄沉香叶片含水率预估模型[J]. 南京林业大学学报(自然科学版), 2022, 46(4): 177—184.
    [22] Chen X Y, Yang K, Wang J S. Extraction of impervious surface in mountainous city combined with sentinel images and feature optimization[J]. Software Guide, 2022, 21(4): 214—219. 陈鑫亚, 杨昆, 王加胜. 结合Sentinel影像与特征优选的山地城市不透水面提取[J]. 软件导刊, 2022, 21(4): 214—219.
    引证文献
    网友评论
    网友评论
    分享到微博
    发 布
引用本文

张晓婷,黄魏,傅佩红,孟可,王苏放.基于特征筛选算法的数字土壤制图研究[J].土壤学报,2024,61(3):635-647. DOI:10.11766/trxb202208090441 ZHANG Xiaoting, HUANG Wei, FU Peihong, MENG Ke, WANG Sufang. Research on Digital Soil Mapping Based on Feature Selection Algorithm[J]. Acta Pedologica Sinica,2024,61(3):635-647.

复制
分享
文章指标
  • 点击次数:310
  • 下载次数: 1685
  • HTML阅读次数: 1020
  • 引用次数: 0
历史
  • 收稿日期:2022-08-09
  • 最后修改日期:2023-05-12
  • 录用日期:2023-10-20
  • 在线发布日期: 2023-10-23
  • 出版日期: 2024-05-15
文章二维码