Development and Application of Tobacco Climate Spot Disease Grading Recognition Model Based on Cu -ViT Deep Learning

JIN Yabo; SUN Jiazhao; LUO Jinzhou; WU Xiumiao; DING Wei; LUO Jianqin

doi:10.13718/j.cnki.zwyx.2025.06.008

2025 Volume 4 Issue 6

Article Contents

Previous Article Next Article

JIN Yabo, SUN Jiazhao, LUO Jinzhou, et al. Development and Application of Tobacco Climate Spot Disease Grading Recognition Model Based on Cu -ViT Deep Learning[J]. PLANT HEALTH AND MEDICINE, 2025, 4(6): 65-76. doi: 10.13718/j.cnki.zwyx.2025.06.008

Citation:

JIN Yabo, SUN Jiazhao, LUO Jinzhou, et al. Development and Application of Tobacco Climate Spot Disease Grading Recognition Model Based on Cu -ViT Deep Learning[J]. PLANT HEALTH AND MEDICINE, 2025, 4(6): 65-76. doi: 10.13718/j.cnki.zwyx.2025.06.008

Development and Application of Tobacco Climate Spot Disease Grading Recognition Model Based on Cu -ViT Deep Learning

1.
China Tobacco Guangxi Industrial Co. Ltd., Nanning 530000, China
2.
School of Plant Protection, Southwest University, Chongqing 400715, China

More Information

Corresponding author: LUO Jianqin
Received Date: 20/10/2025
Available Online: 25/12/2025
MSC: S435.72; TP181

Abstract

Accurate grading and identification of tobacco climate spot disease hold multidimensional value in agricultural production, disease control, and environmental protection. Manual identification suffers from issues such as high costs, strong subjectivity, and low efficiency, which can be addressed through image recognition technology. This study, based on the Vision Transformer (ViT) framework, replaces patch embedding with compression units to propose the Cu-ViT model, systematically enhancing the ViT model's capability in image capture and recognition. In simulated tests, the Cu-ViT model achieved an accuracy of 91.23%, with its F1 score, precision, and recall all surpassing those of ViT as well as advanced recognition models such as ResNet152, InceptionResNetV2, Swin Transformer (SwinT) and VGGNet19. The average recognition time per image was 104.23 milliseconds. Furthermore, the Cu-ViT model's accuracy, validated in real production environments, outperformed manual identification (p < 0.01). These results indicate that the Cu-ViT model is capable of grading and identifying tobacco climate spot disease.
- tobacco climate spot disease,
- disease grading,
- image recognition,
- deep learning

References

[1]	孙佳照, 冉渝澳, 冯俊, 等. 西南地区烟草潜在适生区预测[J]. 中国烟草科学, 2023, 44(5): 37-44, 61. Google Scholar
[2]	WANG G L, ZHU Q K, SONG C D, et al. MedKAFormer: When Kolmogorov-Arnold Theorem Meets Vision Transformer for Medical Image Representation[J]. IEEE Journal of Biomedical and Health Informatics, 2025, 29(6): 4303-4313. doi: 10.1109/JBHI.2025.3541982 CrossRef Google Scholar
[3]	WU Y H, LIU Y, ZHAN X, et al. P2T: Pyramid Pooling Transformer for Scene Understanding[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(11): 12760-12771. doi: 10.1109/TPAMI.2022.3202765 CrossRef Google Scholar
[4]	HARIDASAN A, THOMAS J, RAJ E D. Deep Learning System for Paddy Plant Disease Detection and Classification[J]. Environmental Monitoring and Assessment, 2023, 195(1): 120. doi: 10.1007/s10661-022-10656-x CrossRef Google Scholar
[5]	孙佳照, 李群岭, 林小兴, 等. 基于Resnet-101模型的烟蚜数量图像识别系统开发[J]. 植物医学, 2024, 3(4): 26-31. doi: 10.13718/j.cnki.zwyx.2024.04.004 CrossRef Google Scholar
[6]	LI Z C, ZHOU G X, HU Y W, et al. Maize Leaf Disease Identification Based on WG-MARNet[J]. PLoS One, 2022, 17(4): e0267650. doi: 10.1371/journal.pone.0267650 CrossRef Google Scholar
[7]	LIU H, CUI Y D, WANG J M, et al. Analysis and Research on Rice Disease Identification Method Based on Deep Learning[J]. Sustainability, 2023, 15(12): 9321. doi: 10.3390/su15129321 CrossRef Google Scholar
[8]	BEGUM N, HAZARIKA M K. Prediction of Physico-Chemical Properties in Tomatoes Using Deep Neural Architecture[J]. Agricultural Research, 2024: 1-11. Google Scholar
[9]	LIN J W, CHEN Y, PAN R Y, et al. CAMFFNet: a Novel Convolutional Neural Network Model for Tobacco Disease Image Recognition[J]. Computers and Electronics in Agriculture, 2022, 202: 107390. doi: 10.1016/j.compag.2022.107390 CrossRef Google Scholar
[10]	WU T N, ZHANG Y W, GONG Z W, et al. Quantification of Tobacco Leaf Appearance Quality Index Based on Computer Vision[J]. IEEE Access, 2022, 10: 120352-120368. doi: 10.1109/ACCESS.2022.3221978 CrossRef Google Scholar
[11]	HAN K, WANG Y H, CHEN H T, et al. A Survey on Vision Transformer[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(1): 87-110. doi: 10.1109/TPAMI.2022.3152247 CrossRef Google Scholar
[12]	HE F Y, LIU Y, LIU J F. ECA-ViT: Leveraging ECA and Vision Transformer for Crop Leaves Diseases Identification in Cultivation Environments[C]//2024 4th International Conference on Machine Learning and Intelligent Systems Engineering (MLISE). June 28-30, 2024. Zhuhai, China. IEEE, 2024: 101-104 DOI: 10.1109/mlise62164.2024.10674238. Google Scholar
[13]	SCHMIDT-HIEBER J. The Kolmogorov-Arnold Representation Theorem Revisited[J]. Neural Networks, 2021, 137: 119-126. doi: 10.1016/j.neunet.2021.01.020 CrossRef Google Scholar
[14]	SILVEIRA A C C, DO CARMO D S, UEDA L H, et al. VITMST++: Efficient Hyperspectral Reconstruction through Vision Transformer-Based Spatial Compression[J]. IEEE Open Journal of Signal Processing, 2025, 6: 398-404. doi: 10.1109/OJSP.2025.3544891 CrossRef Google Scholar
[15]	ZENG Z H, LIU C B, TANG Z, et al. AccTFM: an Effective Intra-Layer Model Parallelization Strategy for Training Large-Scale Transformer-Based Models[J]. IEEE Transactions on Parallel and Distributed Systems, 2022, 33(12): 4326-4338. doi: 10.1109/TPDS.2022.3187815 CrossRef Google Scholar
[16]	MACDOWALL F D H. Predisposition of Tobacco to Ozone Damage[J]. Canadian Journal of Plant Science, 1965, 45(1): 1-12. doi: 10.4141/cjps65-001 CrossRef Google Scholar
[17]	XUE W X, XU P J, WANG X F, et al. Natural-Enemy-Based Biocontrol of Tobacco Arthropod Pests in China[J]. Agronomy, 2023, 13(8): 1972. doi: 10.3390/agronomy13081972 CrossRef Google Scholar
[18]	HAQUE M A, DEB C K, GOLE P, et al. An Enhanced Vision Transformer Network for Efficient and Accurate Crop Disease Detection[J]. Expert Systems with Applications, 2025, 283: 127743. doi: 10.1016/j.eswa.2025.127743 CrossRef Google Scholar
[19]	MONTAVON G, SAMEK W, MVLLER K R. Methods for Interpreting and Understanding Deep Neural Networks[J]. Digital Signal Processing, 2018, 73: 1-15. doi: 10.1016/j.dsp.2017.10.011 CrossRef Google Scholar
[20]	SHARMA S K, VISHWAKARMA D K. Classification of Banana Plant Leaves Based on Nutrient Deficiency Using Vision Transformer[C]//2024 5th International Conference for Emerging Technology (INCET). May 24-26, 2024, Belgaum, India. IEEE, 2024: 1-6. Google Scholar
[21]	冉渝澳, 金亚波, 王振国, 等. 烟草靶斑病预测模型构建及数字化应用研发[J]. 植物医学, 2024, 3(4): 40-49. doi: 10.13718/j.cnki.zwyx.2024.04.006 CrossRef Google Scholar
[22]	SHINODA R, KATAOKA H, HARA K, et al. Transformer-Based Ripeness Segmentation for Tomatoes[J]. Smart Agricultural Technology, 2023, 4: 100196. doi: 10.1016/j.atech.2023.100196 CrossRef Google Scholar

Access History

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(10) / Tables(5)

Export Citation

PDF

XML

Article Metrics

Article views(232) PDF downloads(27) Cited by(0)

Access History

Other Articles By Authors

on this site
on Google Scholar

HTML

开放科学(资源服务)标识码(OSID)：
目前，全球有100多个国家和地区种植烟草，烟草种植行业已经成为一些国家的经济支柱^[1]。在中式卷烟的生产中，烟草原料的质量与烟叶的工业可用性直接挂钩，会影响后续的卷烟配方加工和卷烟产品的质量^[2]。由于病虫害的侵扰和危害，烟草的产量和质量逐年下降^[3]。在烟草种植过程中，叶部病害历来是导致作物减产和品质下降的主要原因。烟草在田间生长阶段，烟草气候斑病害是主要发生的非侵染性叶部病害之一。气候变化引起的大气臭氧浓度失衡已被广泛证实是该病害的诱发因素^[4]。在生产过程中，病害种类得以明确后，实施与病害特征相适配的补救方案。目前，烟草气候斑病害的诊断主要依靠传统经验，在病害类型和损害严重程度方面容易出现误判，迫切需要用数字化方法代替传统经验判别，以提高诊断的准确性和效率。

近年来，计算机技术与机器人技术的飞速发展，为烟草病害的智能识别提供了可能。人工智能(AI)与先进分类技术，尤其是基于图像的方法的应用，大幅提升了病害检测的效率^[5]。在玉米病害诊断领域，研究人员采用了WG-MARNet模型并结合数据增强技术，平均分类准确率达到97.96%^[6]。同样，在水稻病害识别方面，改进后的VGG网络架构结合ResNet模型成功检测出稻瘟病、纹枯病和白叶枯病，准确率达98.64%，展现出精准的病害分类能力^[7]。VGGNet19网络用于对番茄成熟度等级进行分类，分类效率达92%^[8]。在烟草病害研究方面，已有研究提出一种名为CAMIFFNet的卷积神经网络模型，用于在田间条件下识别烟草花叶病和烟草赤星病。该模型通过多特征融合模块和坐标注意力机制，能有效提取病害特征并降低环境干扰。实验结果表明，在烟草病害图像分类任务中，CAMIFFNet模型的准确率达到89.71%^[9]。现有模型尽管在病害类型分类方面已取得显著成果，但它们均侧重于识别不同种类的病害，若能对特定病害的严重程度进行分级识别，将进一步提升在农业生产中的实际应用价值。因此，对烟叶病害感染程度进行精准量化，将为针对性防治措施的实施提供更科学的指导，进而优化病害管理策略。

2020年，谷歌团队提出了ViT网络模型，该模型主要应用于图像分类任务。近年来，ViT模型通过各种视觉任务推进的最新技术，取得了显著的成功^[10]。作为一种图像识别模型，与卷积神经网络(Convolutional Neural Network，CNN) 模型相比，ViT模型可以通过自注意力机制捕获图像中不同区域之间的全局依赖关系，从而弥补了CNN在全局信息提取方面的不足，同时，其模块化结构使其易于扩展^[11]，因此，ViT越来越受到研究人员的青睐。例如，He等^[12]提出了一种用于水稻叶片病害识别的ECA-ViT模型，该模型将ECA模块集成到ViT模型的网络中，以弥补ViT模型在提取图像局部特征信息方面的不足。Schmidt ^[13]开发了一种使用迁移学习的ViT模型，用于分类和识别香蕉叶片中营养缺乏的类型。尽管使用了预训练模型来调整超参数并冻结ViT模型的网络层，但该模型在提取局部图像特征方面仍然表现出较弱的能力。上述两项研究都证明了ViT模型在作物病害识别中的效用。在本研究中，需要对受烟草气候斑病害不同程度影响的烟叶进行分类和识别，因此，将ViT模型作为基础模型采用。尽管ViT模型在作物病害图像识别领域表现出色，但它缺乏CNN固有的归纳偏置，并且在提取局部特征信息和处理多尺度信息方面存在不足。

为解决上述问题，本文提出一种基于ViT模型的改进型烟草病害识别模型，即Cu-ViT模型，用于烟草气候斑严重程度的分级识别。在ViT模型的基础上，引入压缩单元替代补丁嵌入，通过卷积层提取局部特征，以提升模型的识别性能。实验结果显示，Cu-ViT模型在模拟测试中的准确率达91.23%，显著高于人工识别准确率(p＜0.05)，单张图像平均识别时间为104.23 ms。

3. 讨论

不同类型的烟草病害之间存在显著差异，而同一种病害的不同严重程度等级之间的差异相对较小，开发用于烟草气候斑病害的等级识别模型具有挑战性。在本研究中，ViT模型被选为基线模型，其在农业分类和识别方面的优越性已得到广泛证实^[18-20]。烟草气候斑病害病变颜色多样且大小不均匀，直接使用ViT模型识别会导致准确率降低，无法作为有效的识别模型。近年来，许多研究人员对ViT模型进行了修改，以适应各种场景需求^[21]。

本研究以ViT模型为基础，考虑到不同严重程度的烟草气候斑病害特征主要位于烟叶的局部区域，因此，为了保留图像所携带的详细特征，将ViT模型的补丁嵌入(使用步长为16的单个16×16卷积)替换为压缩单元。该单元采用3个小型卷积层联级，然后进行ReLU激活，以逐步扩大感受野，从而能够从局部到中局和全局尺度提取多尺度特征。烟草气候斑病害早期(如白点)和晚期(如棕色坏死区域)的形态差异需要根据局部细节进行区分，而压缩单元可以有效地保留这些特征。与单尺度补丁嵌入相比，多尺度特征融合可以提供更丰富的表示，这在区分相似的病害等级(如轻度和中度)时尤为重要。类似的研究方法已应用于其他研究^[22]。在集成压缩单元后，该模型在所有指标上都表现出改进，准确率提高了约5.59%。因此，Cu-ViT模型在烟草气候斑病害图像的分级和识别任务中表现出色。与基线模型相比，它不仅提高了分类准确率，而且有效地减少了不同类别之间的混淆。这一结果对于烟草病害自动识别技术的应用具有重要意义，为针对不同病害等级实施的防控策略提供了精确的判别支持。

随着中国人口老龄化的加速和农业人口的急剧下降，劳动力的严重短缺已成为制约农业生产和可持续发展的主要因素。因此，本研究选取烟草气候斑病害不同损害程度的叶片图像，进行人工识别和Cu-ViT模型识别的比较分析。结果表明，Cu-ViT模型识别的平均准确率接近0.9，大多数数据点都聚集在这个水平附近，表明Cu-ViT模型在识别过程中能保持足够的稳定性和可靠性。相比之下，人工识别的平均准确率略低于模型，而且数据点的分布更为分散。这些测试结果表明，Cu-ViT模型已在实际场景中实现了初步的应用性能，可以在一定程度上替代传统的人工识别。

4. 结论

在本研究中，旨在对烟草气候斑病害的严重程度进行分级和识别。提出的Cu-ViT模型，基于ViT模型框架，通过将压缩单元替代补丁嵌入，提升了ViT模型多尺度和多层次特征提取的能力，并加强了非线性表达。结果表明，Cu-ViT模型在测试中达到了91.23%的准确率，其综合性能优于ResNet152、InceptionResNetV2、SwinT和VGGNet19等先进的图像分类模型。在本研究中，研究对象采用中国主要栽培品种“云烟87”，未来，可以收集更多不同品种的烟草图像作为训练样本，以提高模型的泛化能力。

Figure (10) Table (5) Reference (22)

Name
	Name cannot be empty!
E-mail
	Mailbox cannot be empty! Mailbox cannot be empty!
Telephone
	Mobile number cannot be empty! Please enter a valid mobile number!
Title

Content
Verification Code

Message Board

Development and Application of Tobacco Climate Spot Disease Grading Recognition Model Based on Cu -ViT Deep Learning

Abstract

References

Access History

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Access History

Other Articles By Authors