改进随机森林的波长选择用于烟叶近红外稳健校正模型的建立
Wavelength Selection Based on Modified Random Forest for Establishing Robust Near Infrared Calibration Model of Tobacco
-
摘要: 为了增强烟叶定量近红外模型的稳健性,采用改进的随机森林(Random Forests,RF)重要性度量的方法选择与待测组分相关性强的波长变量,建立了抗干扰能力强的分析模型;用不同含水率和不同样品温度的烟叶测试样品,验证了所建模型的适用性。结果表明:①RF波长优选可以筛选出对外界变化敏感性低的波长变量。②模型更加简单,且具有更高的精度和稳健性。该方法对建立稳健的烟叶近红外品质快速分析模型具有一定的参考意义。Abstract: In order to improve the robustness of near infrared(NIR) quantitative model for tobacco leaves,an analytical model strongly resistant to external interference was developed by using improved random forest(RF) importance measure method to select wavelength variables which strongly correlated to the constituents to be tested.The developed model was validated with tobacco samples of different moisture contents and temperatures.The results showed that:1) RF wavelength optimization was capable of selecting wavelength variables less sensitive to external variations.2) The model was simpler and featured higher precision and robustness.This method provides a certain reference for the establishment of robust and rapid NIR model for tobacco leaf analysis.
下载: