本平台为互联网非涉密平台,严禁处理、传输国家秘密或工作秘密

基于高光谱成像和机器学习的烟叶霉变检测方法

Method for detecting mildew on tobacco leaves based on hyperspectral imaging and machine learning

  • 摘要: 为降低烟叶霉变对卷烟产品品质的影响,建立了一种烟叶霉变无损检测方法。使用高光谱成像仪采集受霉变影响的烟叶高光谱图像并提取光谱数据,采用归一化(Min-Max Scaler,MMS)、标准正态变化(Standard Normal Variate,SNV)、多元散射校正(Multivariate Scattering Correction,MSC)和平滑滤波(Savitzky-Golay,SG)等7种方法,对获得的烟叶光谱数据进行预处理;利用连续投影变换(Successive Projections Algorithm,SPA)、主成分分析(Principal Component Analysis,PCA)等算法对特征波长进行选择,并运用随机森林(Random Forest,RF)、支持向量机(Support Vector Machine,SVM)等机器学习算法建立6种分类模型。将基于7种预处理方法建立的6种分类模型进行对比测试,结果表明:①SNV为最优光谱预处理方法,而基于SPA选择特征波长建立的RF模型性能优异,SPA-RF模型在训练集和测试集上的识别准确率分别达到98.82%和98.64%,降低了基于全波长分类模型的运算时间,对于相同等级不同产地烟叶具有良好分类结果;②高光谱成像技术结合SPA-RF模型可有效实现烟叶霉变情况的准确识别。该技术可为提高烟叶品质管控质量提供支持。

     

    Abstract: To mitigate the impact of moldy tobacco leaves on the quality of cigarette products, a non-destructive method for detecting moldy tobacco leaves was established. Hyperspectral images of moldy tobacco leaves were acquired using a hyperspectral imager, and the spectral data were extracted. The acquired spectral data were pre-processed with seven methods including Min-Max Scaling (MMS), Standard Normal Variate (SNV), Multivariate Scatter Correction (MSC), and Savitzky-Golay smoothing filter (SG). The characteristic wavelengths were selected with Successive Projections Algorithm (SPA) and Principal Component Analysis (PCA). Six classification models were established using machine learning algorithms including Random Forest (RF) and Support Vector Machine (SVM), and comparatively tested. The results showed that: 1) SNV was the optimal spectral preprocessing method, and the RF model based on characteristic wavelengths selected by SPA exhibited superior performance, achieved recognition accuracies of 98.82% and 98.64% on the training and test sets, respectively. This approach also reduced the computational time of full-spectrum classification algorithms and achieved favorable classification results for tobacco of the same grade from different growing areas. 2) The hyperspectral imaging technology combined with the SPA-RF model could effectively and accurately identify the moldy states of tobacco leaves. This technology supports the promotion of tobacco leaf quality control and management.

     

/

返回文章
返回