Improving the Accuracy of Tobacco Target Spot Disease Recognition through Data Augmentation

CHEN Haitao; SUN Jiazhao; LUO Jinzhou; DING Wei

doi:10.13718/j.cnki.zwyx.2025.06.009

2025 Volume 4 Issue 6

Article Contents

Previous Article Next Article

CHEN Haitao, SUN Jiazhao, LUO Jinzhou, et al. Improving the Accuracy of Tobacco Target Spot Disease Recognition through Data Augmentation[J]. PLANT HEALTH AND MEDICINE, 2025, 4(6): 77-84. doi: 10.13718/j.cnki.zwyx.2025.06.009

Citation:

CHEN Haitao, SUN Jiazhao, LUO Jinzhou, et al. Improving the Accuracy of Tobacco Target Spot Disease Recognition through Data Augmentation[J]. PLANT HEALTH AND MEDICINE, 2025, 4(6): 77-84. doi: 10.13718/j.cnki.zwyx.2025.06.009

Improving the Accuracy of Tobacco Target Spot Disease Recognition through Data Augmentation

1.
Chongqing Tobacco Branch, China National Tobacco Corporation, Chongqing 400000, China
2.
School of Plant Protection, Southwest University, Chongqing 400715, China

More Information

Corresponding author: DING Wei
Received Date: 25/04/2025
Available Online: 25/12/2025
MSC: S432; TP391.4

Abstract

Crop disease identification is of great significance for ensuring the healthy growth of crops and the stable development of agricultural production. In recent years, many studies have shown that the introduction of data augmentation techniques has significantly improved the accuracy of crop disease recognition models. This study proposes the application of data augmentation techniques to enhance the performance of tobacco target spot disease recognition models. The research employs various data augmentation methods, including image flipping, grayscale adjustment, brightness adjustment and chroma adjustment, as well as MixUp and CutMix data augmentation methods, to expand and diversify the image data of tobacco target spot disease. The data augmentation effects on the tobacco target spot disease image recognition models were verified using mainstream image recognition models, namely AlexNet, GoogleNet, and ResNet101. The results show that after the application of data augmentation, the training set accuracy and test set accuracy of the image recognition models were increased by up to 2.80% and 3.78%, respectively, compared to those without data augmentation. Meanwhile, the training set loss and test set loss were reduced by 10.84% and 4.73%, respectively. The study concludes that the use of data augmentation techniques can improve the performance of tobacco target spot disease recognition models. This method provides a data processing approach for the research of tobacco disease image recognition models and offers a scientific basis for the application of image recognition models.
- tobacco target spot disease,
- image recognition model,
- data augmentation,
- model performance

References

[1]	李树圆. 基于深度学习的变电站设备识别及应用研究[D]. 石家庄: 石家庄铁道大学, 2021. Google Scholar
[2]	施静. 基于深度学习的棉花苗蚜危害特征检测方法研究[D]. 郑州: 河南农业大学, 2023. Google Scholar
[3]	张俊林, 张木春, 邱盛军. 内蒙古农业信息化背景下玉米高产栽培技术要点分析[J]. 农村科学实验, 2024(2): 36-38. Google Scholar
[4]	黎妍妍, 邱梦娟, 李锡宏, 等. 烟草靶斑病LFD-RPA快速检测方法的建立[J]. 中国烟草科学, 2023, 44(5): 62-69. Google Scholar
[5]	GONZALEZ M, PUJOL M, METRAUX J P, et al. Tobacco Leaf Spot and Root Rot Caused by Rhizoctonia Solani Kühn[J]. Molecular Plant Pathology, 2011, 12(3): 209-216. doi: 10.1111/j.1364-3703.2010.00664.x CrossRef Google Scholar
[6]	VAN DYK D A, MENG X-L. The Art of Data Augmentation[J]. Journal of Computational and Graphical Statistics, 2001, 10(1): 1-50. doi: 10.1198/10618600152418584 CrossRef Google Scholar
[7]	孙佳照, 李群岭, 林小兴, 等. 基于Resnet-101模型的烟蚜数量图像识别系统开发[J]. 植物医学, 2024, 3(4): 26-31. doi: 10.13718/j.cnki.zwyx.2024.04.004 CrossRef Google Scholar
[8]	刘宇平, 刘程飞, 赵平伟. 基于DeepLabv3-Faster R-CNN的水稻叶片病害检测方法[J]. 中国农机化学报, 2025, 46(4): 108-113, 132. Google Scholar
[9]	陈永超, 何彦琪, 刘阳, 等. 一种基于生成对抗网络的低光照图像增强算法[J/OL]. 计算机工程与科学, 1-9[2025-04-03]. https://link.cnki.net/urlid/43.1258.TP.20250403.1006.002. Google Scholar
[10]	潘轲. 基于残差网络面向街道场景的轻量语义分割模型研究[D]. 西安: 长安大学, 2020. Google Scholar
[11]	HARRIS E, MARCU A, PAINTER M, et al. FMix: Enhancing Mixed Sample Data Augmentation[EB/OL]. 2020: arXiv: 2002. 12047. https://arxiv.org/abs/2002.12047. Google Scholar
[12]	曾武, 朱恒亮, 毛国君. DynamicMix: 一种动态的像素级混合的图像数据增强方法[J/OL]. 计算机应用与软件, 1-11[2025-03-03]. https://link.cnki.net/urlid/31.1260.TP.20250313.1343.002. Google Scholar
[13]	CAI Z T, XIN J M, YOU C Y, et al. Style Mixup Enhanced Disentanglement Learning for Unsupervised Domain Adaptation in Medical Image Segmentation[J]. Medical Image Analysis, 2025, 101: 103440. doi: 10.1016/j.media.2024.103440 CrossRef Google Scholar
[14]	邓相红, 阳霜, 龙铁光. 基于残差卷积神经网络模型的猴痘疾病图像识别[J]. 科技与创新, 2025(6): 40-42, 47. Google Scholar
[15]	江顺, 黄红星, 莫里楠, 等. 基于改进AlexNet的岭南水稻虫害识别方法研究[J]. 江苏农业科学, 2023, 51(23): 187-195. Google Scholar
[16]	戴敏, 孙文靖, 缪宏, 等. 基于轻量化CBAM-GoogLeNet的辣椒病虫害识别[J]. 中国农机化学报, 2025, 46(2): 224-229, 252. Google Scholar
[17]	李爱莲, 刘浩楠, 郭志斌, 等. 改进ResNet101网络下渣出钢状态识别研究[J]. 中国测试, 2020, 46(11): 116-119, 125. Google Scholar
[18]	KHAN R U, ZHANG X S, KUMAR R. Analysis of ResNet and GoogleNet Models for Malware Detection[J]. Journal of Computer Virology and Hacking Techniques, 2019, 15(1): 29-37. doi: 10.1007/s11416-018-0324-z CrossRef Google Scholar
[19]	ALOM M Z, TAHA T M, YAKOPCIC C, et al. The History Began from AlexNet: a Comprehensive Survey on Deep Learning Approaches[EB/OL]. 2018: arXiv: 1803.01164. https://arxiv.org/abs/1803.01164. Google Scholar
[20]	蔡靖, 谷承睿, 刘光达, 等. 基于改进AlexNet卷积神经网络人脸识别的研究[J]. 电子技术应用, 2024, 50(11): 42-46. Google Scholar
[21]	李伟, 何遥, 林东岳, 等. 基于高斯混合模型扣除毛发SERS信号中增强基底的背景峰[J]. 光谱学与光谱分析, 2023, 43(3): 854-860. Google Scholar
[22]	ZHANG L J, DENG Z, KAWAGUCHI K, et al. How Does Mixup Help with Robustness and Generalization?[EB/OL]. 2020: arXiv: 2010. 04819. https://arxiv.org/abs/2010.04819. Google Scholar
[23]	ALQAHTANI H, KAVAKLI-THORNE M, KUMAR G. Applications of Generative Adversarial Networks (GANs): an Updated Review[J]. Archives of Computational Methods in Engineering, 2021, 28(2): 525-552. doi: 10.1007/s11831-019-09388-y CrossRef Google Scholar

Access History

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(6) / Tables(2)

Export Citation

PDF

XML

Article Metrics

Article views(232) PDF downloads(24) Cited by(0)

Access History

Other Articles By Authors

on this site
on Google Scholar

HTML

开放科学(资源服务)标识码(OSID)：
近年来，随着深度学习技术的兴起，视觉领域的研究被广泛应用到图像识别模型中^[1]。图像识别技术能够快速、准确地识别病虫害类型，相较于传统的人工识别方法，其效率更高且误差更小^[2]。依托图像识别模型对农作物病虫害进行诊断，可显著提升农业生产的精准化水平。例如，卷积神经网络(CNN)和AlexNet等深度学习算法可以自动提取图像中的特征并进行分类，从而实现对病虫害的精准识别。通过精准识别病虫害，农民可以及时采取防治措施，减少农药的使用量，降低对环境的污染^[3]。烟草靶斑病是一种由真菌引起的烟草叶部病害，病原菌为瓜亡革菌(Rhizoctonia solani Kühn)^[4]，在烟草生长过程中具有较高危害性，严重威胁烟叶产量与质量。该病害症状在烟草各生长阶段均可显现，主要侵染叶片，偶见危害茎部^[5]。凭借人工经验判别靶斑病易引入主观误差，而传统检测手段依赖生化鉴定与鉴别培养基，流程冗长、效率较低。因此，构建图像识别模型对烟草靶斑病进行智能识别，可在提高准确率的同时，显著降低判断成本。

然而，在图像识别模型研究过程中，数据不足与数据样式单一化问题常导致识别模型性能欠佳。为此，研究者提出采用数据增强技术来弥补图像数据劣势。数据增强(Data Augmentation)是一种通过对现有图像数据进行变换和处理，生成更多训练样本的技术，其主要目的是增加数据集的多样性和数量，从而提升模型的泛化能力和鲁棒性^[6]。该技术在农业病虫害识别领域已得到广泛应用。例如，采用图像翻转、亮度调整等技术对带有烟蚜的烟叶图像进行数据增强，显著提升了ResNet101模型的识别准确率^[7]。在棉花作物病害检测中，定制深度学习模型耦合数据增强技术，同样实现了检测精度的跃升。在茶叶病害识别研究中，改进的生成对抗网络用于数据增强，生成新样本后，VGG16模型的鲁棒性与识别准确率均得到显著提升 ^[8]。基于此，本研究拟采用图像数据增强技术，围绕烟草靶斑病不同发病症状图像进行多样化扩增，以期提升烟草靶斑病病害识别模型的性能。

3. 讨论

3.1. 数据增强技术对烟草靶斑病识别模型性能的影响

本研究通过传统数据增强(图像翻转、灰度调整、亮度调整、色度调整等)与混合增强(MixUp和CutMix)方法，在AlexNet、GoogleNet和ResNet101 3种主流图像识别模型上验证了数据增强对烟草靶斑病识别模型的提升效果。结果表明，数据增强技术显著提升了模型的性能。具体而言，数据增强后，模型的训练集和测试集损失值均显著降低，训练集准确率和测试集准确率最高分别提升了4.98%和2.21%。这表明数据增强技术能够有效增加数据集的多样性和数量，从而提升模型的泛化能力和鲁棒性^[18]。

3.2. 不同模型对数据增强的响应差异

尽管数据增强技术对所有模型都产生了积极影响，但不同模型的响应程度存在差异。对于GoogleNet，数据增强显著提升了其准确率和泛化能力，测试集准确率提高了3.06%，训练集损失值降低了0.005 6，测试集损失值降低了0.043 0。对于ResNet101，数据增强也提升了模型性能，测试集准确率提高了0.58%，训练集损失值降低了0.097 0，测试集损失值降低了0.026 8。然而，对于AlexNet，数据增强的效果相对有限，测试集准确率提高了0.75%，训练集损失值甚至略有增加。这种差异可能与模型的结构复杂度和对数据多样性的敏感度有关。GoogleNet和ResNet101具有更复杂的网络结构，能够更好地利用数据增强带来的多样性，而AlexNet结构相对简单，对数据增强的响应不够敏感^[19-20]。

3.3. 数据增强方法的选择与优化

本研究综合应用变换增强和混合增强方法处理烟草靶斑病图像数据。变换增强通过旋转、反射、缩放、移动、翻转和裁剪等操作，有效克服了训练数据中的位置偏差。同时，通过色彩通道空间调整，增强了图像的亮度、对比度、灰度和色彩^[21]。混合增强(如MixUp和CutMix)通过将不同样本混合生成新的训练数据，进一步提高了模型的鲁棒性^[22]。然而，不同的数据增强方法对模型性能的提升效果存在差异。CutMix在GoogleNet上表现最优(测试准确率提升3.06%)，而MixUp在ResNet101上效果更显著(损失降低0.097 0)。未来可探索生成对抗网络(GAN)等先进技术，进一步丰富数据多样性并提升模型性能^[23]。

4. 结论

本研究通过数据增强技术显著提升了烟草靶斑病图像识别模型的性能，为烟草病害图像识别模型的研究提供了有效的数据处理方法，并为图像识别技术在农业领域的应用提供了科学依据。未来研究应进一步优化数据增强方法，扩大样本采集范围，并探索更先进的数据增强技术，以进一步提升模型的泛化能力和鲁棒性。

Figure (6) Table (2) Reference (23)

Name
	Name cannot be empty!
E-mail
	Mailbox cannot be empty! Mailbox cannot be empty!
Telephone
	Mobile number cannot be empty! Please enter a valid mobile number!
Title

Content
Verification Code

Message Board

Improving the Accuracy of Tobacco Target Spot Disease Recognition through Data Augmentation