Mapping soil organic matter and identifying potential controls in the farmland of Southern China: Integration of multi‐source data, machine learning and geostatistics
Abstract
Abstract Soil organic matter (SOM) plays a critical role in terrestrial ecosystem functioning and is closely related to many global issues like soil fertility, soil health and climate regulation. Therefore, obtaining accurate information on the spatial distribution of SOM and its potential controlling factors is of global interest. However, this remains a great challenge since SOM is affected by numerous natural and anthropogenic factors and usually showed strong heterogeneity. In this study, we collected a total of 16,580 surface soil (0–20 cm) samples from the farmland throughout Jiangxi Province. And the Random Forest (RF), Cubist and gradient‐boosted models were compared and used to define the factor which is most associated with SOM. Then the ordinary kriging (OK) and machine learning‐ordinary co‐kriging (ML‐COK) were used to map SOM. We found that on average, 30.86 g kg −1 SOM was present in farmland soil of Jiangxi Province. Anthropogenic activities strongly affected SOM level, with five of the top 10 most important factors are anthropogenic related. The straw return amount was proved to have the largest importance (31.46%) for modelling SOM and a significant ( p < 0.001) positive relationship between SOM content and the amount of straw returned to farmland was detected. Additionally, returning straw improved crop production. Soil derived from the Quaternary Subred Sand has the highest SOM content (37.82 g kg −1 ). Crop rotation also improved SOM content and the rice‐bean rotation system has the highest SOM content (34.27 g kg −1 ). With the best performance, the RF algorithm ( R 2 = 0.49, RMSE = 6.77 g kg −1 ) was selected to identify the primary control of SOM and integrated with COK, which we termed as ML‐COK, to map the SOM in the farmland of Jiangxi Province. ML‐COK outperformed OK method for mapping the SOM in farmland of Jiangxi Province with R 2 of 0.351 and Lin's concordance correlation coefficient of 0.549. Farmland distributed in the central part of the province had high SOM content. In contrast, farmland in the north, south and east parts had relatively low SOM. Our study offers new insight for mapping soil properties, identifying potential factors driving variation in SOM, and also provides valuable information for making more reasonable and environmentally friendly farmland management measures.