节点文献

基于可见近红外光谱检测土壤养分及仪器开发

Soil Nutrition Content Detection and Instrument Development Based on Visible Near Infrared Spectrum Technology

【作者】 刘雪梅

【导师】 柳建设;

【作者基本信息】 东华大学 , 环境科学与工程, 2014, 博士

【摘要】 实施精细农业需要清晰地了解土壤的空间变异特性以及实时的营养状况,数字农业的发展也对土壤养分的测定迫切地提出了精确的时间和效率上的要求。土壤有机质、全氮、碱解氮、速效磷和速效钾是植物健康成长所必须的营养成分,这些土壤指标参数是土壤养分管理和测土配方施肥的重要对象,目前这些指标检测实验室和土肥站一直沿用常规检测方法。这些检测方法需要昂贵的检测设备和对检测人员要求较高,且存在指标检测效率低,检测样品数量小和成本高等问题,是实施精细农业管理的一个重要障碍因素。光谱分析技术作为一种快速、无损、简便的绿色测量方法和分析技术,在土壤养分的测定方面扮演着越来越重要的角色。近红外光谱检测技术具有一系列的优点,如快速、无需样品制备和成本低等优点。近红外光谱能够反映土壤如有机质和全氮等养分信息,使得近红外光谱检测技术在农业与农业环境检测中得到了广泛应用,近红外光谱检测能力主要依靠其对C-H、0-H和N-H功能键的能量吸收进而反映相应土壤养分含量等信息。土壤有机质、氮、磷、钾是农作物生长的主要养分,是土壤养分管理和测土配方施肥的重要对象,随着测土配方施肥技术的大规模推广,迫切需要一种低成本、可靠的土壤养分快速检测方法。本文比较研究了多种不同建模方法对土壤养分检测效果,将获得的原始光谱数据用于进行主成分分析(PCA)得到的前6个主成分(PCs)和偏最小二乘回归(PLSR)建模得到的6个潜变量(LVs),分别作为BP传播神经网络(BPNN)和最小二乘支持向量机(LS-SVM)的输入,共建立了6个模型,分别为PCR、PLSR、BPNN-PCs、BPNN-LVs、LS-SVM-PCs和LS-SVM-LVs,对这些建模方法对预测的土壤有机质、碱解氮、速效磷和速效钾含量的结果进行评价,从中筛选出最佳模型。在预测土壤有机质、碱解氮、速效磷和速效钾时,LS-SVM-LVs模型优于PCR、PLSR、 BPNN-PCs、BPNN-LVs和LS-SVM-PCs模型。用LS-SVM-LVs模型得到的有机质、碱解氮、速效磷和速效钾预测集的决定系数和预测误差分别为0.8734,0.8310,0.7801,07353和2.92,16.49,4.97,13.42。本文采用的光谱预处理包括标准正态变换(SNV),多元散射校正(MSC)和SG(Savitzky Golay)平滑结合一阶导数,以消除系统噪声和外部干扰,分别应用偏最小二乘回归(PLSR)和最小二乘支持向量机(LS-SVM)方法建立校正模型,LS-SVM回归方法规避了高维数据处理时须面对的众多问题,较好地解决了非线性和高维数等现实问题。最小二乘支持向量机(LS-SVM)输入分别包括主成分分析得到的主成分(PCs)、 PLSR建模得到的潜在变量(LVs)和由PLSR模型回归系数得到有效波长(EWs)。结果表明,三种输入的LS-SVM模型都优于PLS模型,其中EWs-LS-SVM模型最佳,碱解氮(N)的决定系数(R2)和预测均方误差RMSEP分别0.82和17.2,速效钾(K)为0.72和15.0。由于采用原始光谱建模分析,数据量大,波长数多,本文探讨了多种特征波长提取方法,也称特征变量提取方法,如连续投影算法、遗传算法、无信息变量消除算法和有效波长提取方法等,并应用这些特征波长替代原始光谱进行建模分析,如为了提高模型分析方法的预测精度,研究了消除无信息建模变量对模型稳定性的影响,原始光谱平滑后采用蒙特卡罗无信息变量消除(Monte Carlo Uninformative Variables Elimination,MC-UVE)方法对土壤碱解氮(N)和速效钾(K)的建模变量进行筛选,应用偏最小二乘(PLS)方法建立校正模型。对于碱解氮(N)模型,采用MC-UVE-PLS方法,建模变量减少为210,碱解氮(N)的决定系数(R2)和预测均方误差RMSEP分别0.86和17.1。对于速效钾(K)的预测模型,采用MC-UVE方法后,建模变量减少为150,模型的预测决定系数为0.78,预测均方根误差为15.4。结果表明,利用可见光和短波近红外光谱(VIS/SW-NIR)(325-1075nm)结合MC-UVE方法可以有效的选择建模变量,能提高模型的稳定性,可以作为一个精确的土壤理化性质的测定方法。遗传算法在分析测量土壤碱解氮(N)和速效钾(K)含量的应用情况,根据遗传算法优化结果提取到的特征波长替代原始光谱数据作为输入,应用最小二乘支持向量机(LS-SVM)方法建立校正模型,预测结果优于偏最小二乘(PLS)建模。应用遗传算法优化后建模变量由原来的751个全谱变量减少到17个特征变量,大大简化了模型复杂度,并提高了模型预测精度。碱解氮(N)的决定系数(R2)和预测均方误差RMSEP分别0.81和17.8,速效钾(K)为0.71和15.6。表明应用遗传算法提取特征波长,将提取到的特征波长作为LS-SVM模型的输入,建立预测模型,这种方法也可以作为一个精确的土壤理化性质的测定方法。应用连续投影算法提取特征波长的方法,也是采用LS-SVM建模。分析过程是将原始光谱经平滑结合一阶微分预处理后,然后采集连续投影算法确定特征波长,作为建模集和预测集的光谱输入数据。发现采用基于连续投影算法得到的特征波长为输入的最小二乘支持向量机模型优于偏最小二乘回归法模型,连续投影算法从大量原始光谱数据中提取少数几列数据,高度概括了绝大多数样品光谱数据的有用信息,避免了信息重叠,同时去除了冗余信息,简化了模型。有机质的决定系数和预测均方误差分别0.8602和2.98,速效钾为0.7305和15.78。对于土壤全氮养分,应用留一法交互验证偏最小二乘回归模型(PLSR)对三个不同地区土壤样本光谱数据(三个独立模型)和所有土壤样本光谱数据(通用模型)分别建立全氮预测模型,三个地区土壤样本全氮独立预测集的决定系数(R2)分别为0.81,0.70,0.31,剩余预测偏差(RPD)分别为3.01,2.09,1.08,均方根预测误差(RMSEP)为0.06,0.03,0.03,通用模型独立预测集的决定系数(R2)为0.72,剩余预测偏差(RPD)为2.23,均方根预测误差(RMSEP)为0.05。研究发现,全氮理化值分布区间越大,R2和RPD也越大,故通用模型检测结果优于汪家和昌东两个地区,且样本理化值标准偏差(standard deviation,SD)越大,模型决定系数(R2)和剩余预测偏差(RPD)也越大,但是模型的均方根预测误差(RMSEP)也越大。因此,建模选择样本时,应确保模型的均方根预测误差(RMSEP)值较小的条件下,应尽量选择理化值分布区间大的样本用于建模,这样得到的模型达到最优。本文研究开发了一款应用近红外光谱分析技术、基于USB4000的便携式土壤养分(有机质)含量测定仪。便携式测定仪器由软件和硬件两部分组成。软件部分包括基于JAVA语言开发的土壤有机质含量检测软件以及USB4000底层驱动程序;硬件部分包括光源驱动电路、光纤、winCE开发板、便携式电源、触摸液晶显示电路和仪器机箱等组成。光源信号通过入射光纤传输到被测土壤表面,经过土壤发生漫反射,通过反射光纤传输到USB4000光谱仪得到土壤反射率值,软件系统获取这些反射率数据进行处理、显示、存储等处理,并土壤有机质含量结果显示在液晶显示屏上。

【Abstract】 It is necessary for precision agriculture to understand the spatial-temporal variability and real-time nutritional status of soil. And the digital agriculture also requests that the soil nutrition detection should be timely and effective. Soil organic matter (OM), total nitrogen (TN), available nitrogen (N), available phosphorus (P), available potassium (K) are the main nutrients for crop growth, soil nutrient management and soil testing important objects. Conventional detection methods of these parameters are adopted by many laboratories and soil fertilizer manage station, these methods require expensive testing equipment and complex manual and have many disadvantages such as low efficiency, few samples and high cost problem, which hold back soil fertility management and the development of precision agriculture management. With large-scale promotion of fertilization technology, it is urgent for a low-cost and reliable method to rapidly detect soil nutrients. As a rapid, convenient, nondestructive and green technique, spectroscopy analysis becomes more and more important in the area of soil nutrition detecting. Near infrared spectroscopy technique offers a quick analysis, little sample preparation requirement, and low cost. They are highly sensitive to both organic and inorganic components of the soil, making their use in the agricultural and environmental sciences particularly appropriate. The analytical abilities of visible near infrared spectrum (Vis/NIRS) depend on the repetitive and broad absorption of Vis/NIRS light by C-H, O-H and N-H bonds.Soil organic matter (OM), nitrogen (N), phosphorus (P) and potassium (K) are the main nutrients for crop growth, soil nutrient management and soil testing important objects. With large-scale promotion of fertilization technology, it is urgent for a low-cost and reliable method to rapidly detect soil nutrients.Different calibration methods were used to detect the soil nutrition based on near infrared spectrum technology. Near infrared diffuse reflectance spectroscopy data of soil samples was used for the principal component analysis (PCA) to get the first six principal components (PCs), and PLSR mold was built to get six latent variables (LVs), respectively. PCR, PLSR, BPNN-PCs, BPNN-LVs, LS-SVM-PCs and LS-SVM-LVs modeling methods were built to predict the content of soil organic matter, available nitrogen, available P and available K. These modeling methods were evaluated respectively and selected the best model. The results showed that all LS-SVM-LVs models outperformed PCR, PLSR, BPNN-PCs, BPNN-LVs and LS-SVM-PCs models. The best predictions were obtained with LS-SVM-LVs model for OM (R2=0.8734and RMSEP=2.92), N (R2=0.7801and RMSEP=16.49), P (R2=0.7801and RMSEP=4.97) and K (R2=0.7353and RMSEP=13.42). The near-infrared diffuse reflectance spectroscopy based on LS-SVM combined with PLSR can be used for the measurement of soil organic matter, available N, available P and available K.Near infrared diffuse reflectance spectroscopy was investigated for measurement accuracy of soil properties, namely, available nitrogen (N) and available potassium (K). Three types of pretreatments including standard normal variate (SNV), multiplicative scattering correction (MSC) and Savitzky-Golay smoothing+first derivative were adopted to eliminate the system noises and external disturbances. Then partial least squares (PLS) and least squares-support vector machine (LS-SVM) models analysis were implemented for calibration models, LS-SVM regression preferably solved the practical issues such as non-linearity, multi-dimension and so on. Simultaneously, the performance of least squares-support vector machine (LS-SVM) models was compared with three kinds of inputs, including PCA (PCs), latent variables (LVs), and effective wavelengths (EWs). The results indicated that all LS-SVM models outperformed PLS models. The performance of the model was evaluated by the determination coefficient (R2), RMSEP. The optimal EWs-LS-SVM models were achieved, and the determination coefficient (R2), RMSEP was0.82,17.2for N and0.72,15.0for K, respectively. The results indicated that visible and near infrared spectroscopy (Vis/NIRS)(325-1075nm) combined with LS-SVM could be utilized as a precision method for the determination of soil properties.For the reason that using the raw spectra data has many drawbacks such as big data, too many wavelength, So this research studied some selecting characteristic wavelengths way to choose characteristic wavelength or characteristic variables, these ways includegenetic algorithm (GA), successive projections algorithm (SPA),uninformative variable elimination (UVE) and effective wavelengths (EWs) and so on. In order to improve the predictive precision, and eliminate the influence of uninformative variables for model robustness, Monte carlo uninformative variables elimination (MC-UVE) methods were proposed for variable selection in available nitrogen (N) and available potassium (K) spectral modeling.Partial least squares (PLS) models analysis were implemented for calibration models.The modeling variable number was reduced to210from751for available nitrogen (N) calibration model and150for available potassium (K) calibration model. The performance of the model was evaluated by the determination coefficient (R2), RMSEP. The optimal MC-UVE-PLS models were achieved, and the determination coefficient (R2), RMSEP were0.86,17.1for N and0.78,15.4for K, respectively.The results indicated that visible and near infrared spectroscopy (Vis/NIRS)(325-1075nm) combined with MC-UVE could be utilized as a precision method for the determination of soil properties.The calibration was optimized by genetic algorithm (GA) in the wavelength range of325-1075nm. After optimizations, the sample number of calibration set decreased from751to17, then least squares-support vector machine (LS-SVM) models analysis were implemented for calibration models. Simultaneously, the performance of least squares-support vector machine (LS-SVM) models was compared with PLS models. The results indicated that LS-SVM models outperformed PLS models. The performance of the models was evaluated by the determination coefficient (R2), RMSEP. The optimal GA-LS-SVM models were achieved, and the determination coefficient (R2), RMSEP was0.81,17.8for N and0.71,15.6for K, respectively.The results indicated that visible and near infrared spectroscopy (Vis/NIRS)(325-1075nm) combined with LS-SVM based on GA could be utilized as a precision method for the determination of soil properties.Successive projections algorithm (SPA) based on NIR was investigated in this study for measurement of soil organic matter (OM) and available potassium (K). Four types of pretreatments including smoothing, SNV, MSC and SG smoothing+first derivative were adopted to eliminate the system noises and external disturbances. Then partial least squares regression (PLSR) and least squares-support vector machine (LS-SVM) models analysis were implemented for calibration models. The LS-SVM model was built by using characteristic wavelength based on successive projections algorithm (SPA). Simultaneously, the performance of LS-SVM models was compared with PLSR models. The results indicated that LS-SVM models using characteristic wavelength as inputs based on SPA outperformed PLSR models. The optimal SPA-LS-SVM models were achieved, and the determination coefficient (R2), RMSEP were0.8602,2.98for OM and0.7305,15.78for K, respectively.Spectra in the calibration set were subjected to partial least squares regression (PLSR) to establish calibration models of soil properties. Except for the Wangjia farm, individual farm models provided successful calibration result for total nitrogen (TN) with coefficient of determination (R2) of0.82-0.88and0.72-0.82and residual prediction deviation (RPD) of2.62-3.27and2.02-3.07for the calibration dataset and independent validation respectively. General calibration models gave improved prediction accuracies compared with models of farms in the Wangjia and Changdong, which was attributed to larger ranges in the variation of soil properties in general models compared with those in individual farm models. The results showed that larger standard deviations (SDs) and wider variation ranges have resulted in larger R2and RPD, meanwhile larger root mean square errors of prediction (RMSEP).Therefore, a compromise solution, which also results in small RMSEP values, soil samples should be selected for calibration to cover a wide variation range.The soil organic matter detection instrument was based on NIRS technology including USB4000optical spectrum instrument. The instrument consisted of two parts, software section and hardware section. The software section included soil organic matter detection software based on JAVE language and USB4000driven program. The hardware section included lamp source driven circuit, Y type optical fiber, win CE development board, portable power, and touch liquid crystal display circuit and instrument box. Incident light signals was transmit through optical fiber to the measured soil surface, diffuse reflection data was caused from the soil surface through reflection optical fiber transmission to the USB4000spectrometer to obtain soil reflectance value, the software system for processing, display, storage and other processing.

【关键词】 有机质速效钾速效磷便携式仪器
【Key words】 organic matternitrogenavailable Kavailable Pportable instrument
  • 【网络出版投稿人】 东华大学
  • 【网络出版年期】2014年 05期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络