本平台为互联网非涉密平台,严禁处理、传输国家秘密或工作秘密

基于人工提取特征的DBO-XGBoost烤烟部位识别方法

A stalk position recognition method for flue-cured tobacco leaves using DBO-XGBoost algorithm based on manually extracted features

  • 摘要: 为解决烤烟分级过程中烟叶部位识别混淆、有效特征数量少、识别准确率低等问题,根据烤烟烟叶着生位置建立了包含顶叶(T)、上二棚(B)、腰叶(C)、下二棚(X)和脚叶(P)5个部位的数据集;融合专家经验采用OpenCV图像处理技术将烤烟图像特征进行数字化转换,人工提取烤烟形态、颜色和纹理等155个特征;选取XGBoost(Extreme Gradient Boosting)集成学习算法,利用蜣螂优化算法(Dung Beetle Optimizer,DBO)对XGBoost超参数进行自动寻优,构建了DBO-XGBoost烤烟部位识别模型,并在烤烟分级设备上进行部署验证。结果表明:DBO-XGBoost模型对烤烟部位识别准确率达到98.88%,macroF1-score为0.987 3;使用阈值分割方法提取的橘色、烤红、青色及聚类色占比特征可显著提升模型性能;与XGBoost、支持向量机、随机森林、决策树、ResNet50、MobileNetv3-s、MobileViT-s、YOLOv8-s等经典模型相比,DBO-XGBoost模型识别准确率提高2.37百分点以上,识别速度达到0.3 s/片。所构建的DBO-XGBoost模型能够准确识别烤烟部位,满足实际收购过程中烤烟分级要求。

     

    Abstract: To overcome problems in flue-cured tobacco leaf grading such as confusion in stalk position recognition, limited effective features, and low recognition accuracy, a dataset containing data of leaves from five stalk positions: tips (T), second upper leaves (B), cutters (C), lugs (X), and flyings (P) was established. The features of leaf images were digitally transformed by combining OpenCV image processing technique with expert experiences, and 155 features including morphology, color, and texture of leaves were manually extracted. The XGBoost ensemble learning algorithm was selected, and the Dung Beetle Optimizer (DBO) was used to automatically optimize the hyperparameters of XGBoost to develop a DBO-XGBoost model for recognizing the stalk positions of flue-cured tobacco leaves. The DBO-XGBoost model was validated on a tobacco grading machine. The results showed that the DBO-XGBoost model achieved a recognition accuracy of 98.88% for the stalk positions of flue-cured tobacco leaves with a macroF1-score of 0.987 3. The features extracted using the threshold segmentation method, including the proportions of orange, scorched red, green, and clustered colors, significantly improved the performance of the model. In addition, the DBO-XGBoost model showed at least a 2.37 percentage increase in recognition accuracy compared with classical models such as XGBoost, Support Vector Machine, Random Forest, Decision Trees, ResNet50, MobileNetv3-s, MobileViT-s, and YOLOv8-s, with an recognition speed of 0.3 seconds per image. The DBO-XGBoost model is capable of accurately recognizing the stalk position of flue-cured tobacco leaves, and applicable to the actual flue-cured tobacco purchasing process.

     

/

返回文章
返回