A Lemon Fruit Recognition Method Based on Improved YOLOv8

LIU Yucheng; LIANG Xincheng; LI Falin; ZHANG Fengling; LI Yunwu

doi:10.13718/j.cnki.xdzk.2025.07.019

2025 Volume 47 Issue 7

Article Contents

Previous Article Next Article

LIU Yucheng, LIANG Xincheng, LI Falin, et al. A Lemon Fruit Recognition Method Based on Improved YOLOv8[J]. Journal of Southwest University Natural Science Edition, 2025, 47(7): 219-230. doi: 10.13718/j.cnki.xdzk.2025.07.019

Citation:

LIU Yucheng, LIANG Xincheng, LI Falin, et al. A Lemon Fruit Recognition Method Based on Improved YOLOv8[J]. Journal of Southwest University Natural Science Edition, 2025, 47(7): 219-230. doi: 10.13718/j.cnki.xdzk.2025.07.019

A Lemon Fruit Recognition Method Based on Improved YOLOv8

College of Engineering and Technology, Southwest University, Chongqing 400715, China

More Information

Corresponding author: LI Yunwu
Received Date: 12/03/2024
Available Online: 20/07/2025
MSC: TP391;S23

Abstract

In order to address the challenges of high costs and low efficiency of manual picking lemon fruits, and achieve swift and precise identification of lemon fruits in intricate environments, a lemon fruit recognition method based on the improved YOLOv8 model was established. Firstly, the SPDConv module was introduced into the backbone network to enhance the accuracy of model's detection for low-resolution images and small targets. Then, the EMA attention mechanism was added to effectively extract the features of obscured fruits. Finally, the CIoU bounding box loss function was replaced with Wise-IoU to reduce the dependence on high-quality anchor boxes and improve the generalization ability of the model. Tested on a self-constructed dataset, the YOLOv8-SEW model exhibited precision, recall and mean average precision values of 94.5%, 85.7% and 92.4% separately. Compared with before improvement, the precision, recall and mean average precision of the model was increased by 1.0%, 4.2% and 2.9%, respectively. The detection time for a single image was 44.8 ms, enabling rapid and accurate identification of lemon fruits, thus providing a technological foundation for automatic harvesting lemon fruits.
- recognition of lemon fruits,
- YOLOv8,
- SPD convolution module,
- Wise-IoU loss function,
- attention mechanism

References

[1]	潼南区委宣传部. 潼南: 小柠檬"走出去" 开拓国内国际大市场[J]. 重庆与世界, 2023(12): 56-59. Google Scholar
[2]	代小红, 钟光跃, 曹树梅, 等. 安岳柠檬品牌培育现状及建议[J]. 四川农业科技, 2023(1): 119-121. Google Scholar
[3]	周冰. 农业机械化助力乡村振兴中的影响与作用[J]. 当代农机, 2024(2): 39-40, 42. Google Scholar
[4]	郑太雄, 江明哲, 冯明驰. 基于视觉的采摘机器人目标识别与定位方法研究综述[J]. 仪器仪表学报, 2021, 42(9): 28-51. Google Scholar
[5]	LECUN Y, BENGIO Y, HINTON G. Deep Learning[J]. Nature, 2015, 521(7553): 436-444. doi: 10.1038/nature14539 CrossRef Google Scholar
[6]	ZHANG X H, WANG H P, XU C A, et al. A Lightweight Feature Optimizing Network for Ship Detection in SAR Image[J]. IEEE Access, 2019, 7: 141662-141678. Google Scholar
[7]	GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C] //2014 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2014: 580-587. Google Scholar
[8]	GIRSHICK R. Fast R-CNN[C] //2015 IEEE International Conference on Computer Vision (ICCV). New York: IEEE, 2015: 1440-1448. Google Scholar
[9]	REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. doi: 10.1109/TPAMI.2016.2577031 CrossRef Google Scholar
[10]	LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single Shot MultiBox Detector[C] // Computer Vision - ECCV 2016. Cham: Springer International Publishing, 2016: 21-37. Google Scholar
[11]	REDMON J, FARHADI A. YOLO9000: Better, Faster, Stronger[C] //2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). New York: IEEE, 2017: 6517-6525. Google Scholar
[12]	REDMON J, FARHADI A. YOLOv3: An Incremental Improvement[EB/OL]. (2018-04-08)[2024-03-14]. https://arxiv.org/abs/1804.02767. Google Scholar
[13]	BOCHKOVSKIY A, WANG C Y, LIAO H M. YOLOv4: Optimal Speed and Accuracy of Object Detection[EB/OL]. (2020-04-23)[2024-03-14]. https://arxiv.org/abs/2004.10934v1. Google Scholar
[14]	REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, Real-Time Object Detection[C] //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). New York: IEEE, 2016: 779-788. Google Scholar
[15]	LI G J, HUANG X J, AI J Y, et al. Lemon-YOLO: An Efficient Object Detection Method for Lemons in the Natural Environment[J]. IET Image Processing, 2021, 15(9): 1998-2009. Google Scholar
[16]	LAWAL M O. Tomato Detection Based on Modified YOLOv3 Framework[J]. Scientific Reports, 2021, 11(1): 1447. Google Scholar
[17]	GAI R L, CHEN N, YUAN H. A Detection Algorithm for Cherry Fruits Based on the Improved YOLO-V4 Model[J]. Neural Computing and Applications, 2023, 35(19): 13895-13906. Google Scholar
[18]	张成尧, 张艳诚, 张宇乾, 等. 基于YOLOv5的咖啡瑕疵豆检测方法[J]. 食品与机械, 2023, 39(2): 50-56, 175. Google Scholar
[19]	ZHONG Z Y, YUN L J, CHENG F Y, et al. Light-YOLO: A Lightweight and Efficient YOLO-Based Deep Learning Model for Mango Detection[J]. Agriculture, 2024, 14(1): 140. Google Scholar
[20]	RUIZ-PONCE P, ORTIZ-PEREZ D, GARCIA-RODRIGUEZ J, et al. POSEIDON: a Data Augmentation Tool for Small Object Detection Datasets in Maritime Environments[J]. Sensors, 2023, 23(7): 3691. Google Scholar
[21]	LI P, ZHENG J S, LI P Y, et al. Tomato Maturity Detection and Counting Model Based on MHSA-YOLOv8[J]. Sensors, 2023, 23(15): 6701. Google Scholar
[22]	李茂, 肖洋轶, 宗望远, 等. 基于改进YOLOv8模型的轻量化板栗果实识别方法[J]. 农业工程学报, 2024, 40(2): 1-9. Google Scholar
[23]	LUO Q, WU C B, WU G J, et al. A Small Target Strawberry Recognition Method Based on Improved YOLOv8n Model[J]. IEEE Access, 2024, 12: 14987-14995. Google Scholar
[24]	SUNKARA R, LUO T. No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects[C] // Machine Learning and Knowledge Discovery in Databases. Cham: Springer Nature Switzerland, 2023: 443-459. Google Scholar
[25]	OUYANG D L, HE S, ZHANG G Z, et al. Efficient Multi-Scale Attention Module with Cross-Spatial Learning[C] //ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). New York: IEEE, 2023: 1-5. Google Scholar
[26]	TONG Z, CHEN Y, XU Z, et al. Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing Mechanism[EB/OL]. (2020-01-24)[2024-03-19]. https://arxiv.org/abs/2301.10051. Google Scholar
[27]	赵德安, 吴任迪, 刘晓洋, 等. 基于YOLO深度卷积神经网络的复杂背景下机器人采摘苹果定位[J]. 农业工程学报, 2019, 35(3): 164-173. Google Scholar
[28]	BRAUWERS G, FRASINCAR F. A General Survey on Attention Mechanisms in Deep Learning[J]. IEEE Transactions on Knowledge and Data Engineering, 2023, 35(4): 3279-3298. Google Scholar
[29]	GUO M H, XU T X, LIU J J, et al. Attention Mechanisms in Computer Vision: A Survey[J]. Computational Visual Media, 2022, 8(3): 331-368. Google Scholar
[30]	NIU Z Y, ZHONG G Q, YU H. A Review on the Attention Mechanism of Deep Learning[J]. Neurocomputing, 2021, 452: 48-62. Google Scholar
[31]	LAI Q X, KHAN S, NIE Y W, et al. Understanding More about Human and Machine Attention in Deep Neural Networks[J]. IEEE Transactions on Multimedia, 2020, 23: 2086-2099. Google Scholar
[32]	WAN D H, LU R S, SHEN S Y, et al. Mixed Local Channel Attention for Object Detection[J]. Engineering Applications of Artificial Intelligence, 2023, 123: 106442. Google Scholar
[33]	YANG H, YUAN C F, ZHANG L, et al. STA-CNN: Convolutional Spatial-Temporal Attention Learning for Action Recognition[J]. IEEE Transactions on Image Processing, 2020, 29: 5783-5793. Google Scholar
[34]	JIN X, XIE Y P, WEI X S, et al. Delving Deep into Spatial Pooling for Squeeze-and-Excitation Networks[J]. Pattern Recognition, 2022, 121: 108159. Google Scholar
[35]	赵茂程, 邹涛, 齐亮, 等. 基于MobileViT-CBAM的枇杷表面缺陷检测方法[J]. 农业机械学报, 2024, 7(2): 1-10. Google Scholar
[36]	LI X, HU X L, YANG J. Spatial Group-Wise Enhance: Improving Semantic Feature Learning in Convolutional Networks[EB/OL]. (2020-04-23)[2024-06-14]. https://arxiv.org/abs/1905.09646v2. Google Scholar

Access History

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(11) / Tables(4)

Export Citation

PDF

XML

Article Metrics

Article views(77) PDF downloads(23) Cited by(0)

Access History

Other Articles By Authors

on this site
on Google Scholar

HTML

开放科学（资源服务）标识码（OSID）：
丘陵山区的柠檬因其皮厚气香、出汁率高而受到市场的广泛欢迎，在农业和食品行业中具有重要的经济价值和市场需求^[1-2]。而农业机械化帮助实现规模化种植与采摘是农业现代化的重要组成部分，且是乡村振兴这一国家战略的重要手段^[3]。传统的柠檬果实检测方法通常依靠人工视觉判断，效率低下且容易受主观因素影响。因此，开发高效精确的自动化柠檬目标检测模型对于提高采摘效率、降低人工成本具有重要意义^[4]。

当前，基于卷积神经网络的目标检测算法在果实检测领域取得显著进展^[5]。深度学习目标检测算法分为两类：一类是基于候选区域的双阶段目标检测算法^[6]，主要代表算法有RCNN(Regions with Convolutional Neural Network)和Faster RCNN^[7-9]等；另一类是基于回归的单阶段目标检测算法，主要代表算法有SSD(Single Shot Detector)^[10]和YOLO(You Only Look Once)^[11-14]等。相比其他目标检测算法，YOLO具有检测速度快、流程简单等优势。这种算法可以同时完成物体识别和位置定位的任务，而且不需要额外的候选区域提取过程，从而大大减少了计算量和内存占用。由于其高效、准确和易于实现的特点，YOLO系列算法被广泛应用于许多领域，例如果实采摘、自动驾驶、医疗影像分析等。

对于柠檬果实识别，文献[15]提出了改进YOLOv3的柠檬果实检测方法，精确率达到96.28%，但未以平均精度均值做参考，将相同网络用于葡萄数据集后出现大幅度精确率下降，模型依赖高质量锚框且泛化能力较弱；而对于其他果实识别，文献[16]在不均匀的环境条件下，用深度卷积神经网络对西红柿进行检测后精度达到98.3%，但YOLOv3训练后的权重过大，不利于在移动设备上部署；文献[17]在YOLOv4的基础上增强了特征提取并加深了网络结构，虽然检测樱桃果实的平均精度均值比改进前提高了0.15%，但检测精度仍然不佳；文献[18]在YOLOv5中加入CBAM注意力机制和Hardswish激活函数后提高了对咖啡瑕疵豆的精度均值，但光照对检测精度影响很大；文献[19]将Light-YOLO用于芒果果实的检测，在颈部网络中引入残差结构并加入注意力机制，从而提高模型的检测能力，但该模型在检测被严重遮挡的芒果时表现不佳。2023年初，Ultralytics公司发布了具有更快推理速度和更高精度的YOLOv8系列，且训练和调整也更加容易，现已成为最受欢迎的目标检测算法之一，在果实识别等领域得到了广泛应用^[20]。文献[21]提出了一种改进MHSA-YOLOv8的模型来检测番茄果实的成熟度，在测试场景中的平均精度均值达到86.4%，但由于只引入了MHSA注意力机制，在精度和召回率方面都有一定的提升空间；文献[22]提出了改进YOLOv8的板栗果实识别方法，引入了加权双向特征金字塔网络，更改了边界框损失函数，加快了模型的收敛速度，且使模型的召回率和平均精度均值分别提升了1.5%和1.8%；文献[23]基于YOLOv8引入针对小目标的空间到深度的卷积，提升了检测草莓果实的精度，但由于自然场景中的柠檬果实与树叶色彩相近，且容易被遮挡，因此精确检测复杂自然背景中的柠檬是一个技术难点。

为改进柠檬果实的识别精度，提出一种基于改进YOLOv8的目标检测算法，在主干网络中加入针对小目标的SPDConv(Space-to-Depth Convolution)模块^[24]和EMA(Efficient Multi-Scale Attention)高效的多尺度注意力模块^[25]，精确识别复杂自然环境中的柠檬，并将损失函数更改为Wise-IoU(Wise Intersection over Union) Loss^[26]，以期降低训练的损失值并提高模型的收敛速度。

4. 结论

为实现在复杂背景中对柠檬果实的快速精确识别，提出了基于YOLOv8的单阶段目标检测算法YOLOv8-SEW。首先在主干网络中加入EMA注意力机制，将所输出通道信息和上下文信息相结合以加强不同尺度下被遮挡的柠檬果实的特征学习；其次引入SPDConv卷积模块用于处理小目标和低分辨率图像细粒度信息丢失的问题；最后将损失函数替换为WIoU，降低模型对高质量锚框依赖的同时提高泛化性能和定位对象的能力，并且梯度下降速度和收敛后损失值均优于改进前。

对3个改进点进行排列组合的消融实验和对比实验，结果表明3个改进方法对模型的性能均有提升效果，并且优于所有的基线网络模型。在自建柠檬数据集上的实验表明，改进后的YOLOv8-SEW模型的精确率、召回率和平均精度均值分别达到94.5%、85.7%、92.4%，与YOLOv8相比分别增长1.0%、4.2%、2.9%。

在实际检测场景中随着果树与相机的距离拉远，柠檬果实在图片中占据的像素更小，后续可以加入小目标检测层来提高对小目标果实的检测性能。YOLO模型只能获取图像的二维坐标信息，后续可通过双目相机获取深度信息进行坐标融合得到果实的三维坐标，为自动化采摘机器提供定位方法。

Figure (11) Table (4) Reference (36)

Name
	Name cannot be empty!
E-mail
	Mailbox cannot be empty! Mailbox cannot be empty!
Telephone
	Mobile number cannot be empty! Please enter a valid mobile number!
Title

Content
Verification Code

Message Board

A Lemon Fruit Recognition Method Based on Improved YOLOv8

Abstract

References

Access History

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Access History

Other Articles By Authors