YOLO-FSL: Method of Maturity Detection for Plum in Natural Environment Based on Improved YOLOv8
DOI:
https://doi.org/10.52151/jae2025621.1915Keywords:
Deep learning, lightweight model, mean average precision, object detection, pruning techniques, squeeze and excitation attention mechanismAbstract
This study introduces an improved YOLOv8 model YOLO-FSL for rapid detection of plum maturity under natural conditions. Firstly, the Faster Net Block module was introduced into the C2f module of the YOLOv8s backbone network, forming the C2f_Faster module; Secondly, squeeze and excitation (SE) attention mechanism after each C2f_Faster module was added in the backbone network; Finally, the improved YOLOv8s model was pruned using channel pruning algorithm to simplify the detection model and ensure detection efficiency. The experimental results show that the mean average precision (mAP) and model size of the improved YOLO-FSL model is 91.2% and 4.1 MB (Mega Byte), respectively, which reduces the number of parameters by 82.57% and improves the mAP by 1.6% compared with the original network YOLOv8s. Compared with the mainstream target detection algorithms of YOLOv3-tiny, YOLOv5s, YOLOv6s, and YOLOv7-Tiny, the mAP of the YOLO-FSL model is improved by 4.4%, 2.0%, 1.0%, and 1.6%, respectively. The YOLO-FSL model has a higher detection accuracy, and at the same time reduces the amount of computation and the number of parameters, and lowers the hardware dependence, which provides a technical reference for the automated picking of plums.
Downloads
References
Bochkovskiy, A., Wang, C.Y., & Liao, H. Y. M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection [Paper presentation]. 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.2004.10934
Chen, S., Chen, X., Yu, Y., Zou, B., Bo, Z., Guo, D., & Lin, Y. (2023a). Research progress in nutritional value and processing of prunus domestica. L. China Fruit Vegetable, 43, 15–21. https://doi.org/10.19590/j.cnki.1008-1038.2023.06.004
Chen, J., Kao, S., He, H., Zhuo, W., Wen, S., Lee, C. H., & Chan, S. H. G. (2023b). Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks [Paper presentation]. 2023 IEEE/ CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/CVPR52729.2023.01157
Gao, G., Shuai, C., Wang, S., & Ding, T. (2024). Using improved YOLO V5s to recognize tomatoes in a continuous working environment. Signal, Image and Video Processing, 18, 4019–4028. https://doi.org/10.1007/s11760-024-03010-w
Girshick, R. (2015). Fast R-CNN. In 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, pp. 1440-1448. https://doi.org/10.1109/ICCV.2015.169
Guo, M-H., Xu, T-X., Liu, J-J., Liu, Z-N., Jiang, P-T., Mu, T-J., Zhang, S-H., Martin, R. R., Cheng, M-M., & Hu, S-M. (2021). Attention Mechanisms in Computer Vision: A Survey [Paper presentation]. 2021 IEEE/ CVF international conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.2111.07624
Guo, M-H., Xu, T-X., Liu, J-J., Liu, Z-N., Jiang, P-T., Mu, T-J., Zhang, S-H., Martin, R. R., Cheng, M-M., & Hu, S-M. (2022). Attention mechanisms in computer vision: A survey. Computational Visual Media, 8, 331–368. https://doi.org/10.1007/s41095-022-0271-y
He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN [Paper presentation]. 2017 IEEE/CVF international conference on computer vision (ICCV). https://doi.org/10.1109/ICCV.2017.322
Hu, J., Shen, L., Albanie, S., Sun, G., & Wu, E. (2017). Squeeze-and-Excitation Networks [Paper presentation]. 2017 IEEE/ CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.1709.01507
Huang, J., Zhao, X., Gao, F., Wen, X., Jin, S., & Zhang, Y. (2023). Recognizing and detecting the strawberry at multi-stages using improved lightweight YOLOv5s. Transactions of the Chinese Society of Agricultural Engineering, 39, 181–187. https://doi.org/10.11975/j.issn.1002-6819.202307186
Inbaraj, X.A., Villavicencio, C., Macrohon, J.J., Jeng, J.H., & Hsieh, J.G. (2021). Object identification and localization using Grad-CAM++ with mask regional convolution neural network. Electronics, 10(13), 1541. https://doi.org/10.3390/electronics10131541
Lee, J., Park, S., Mo, S., Ann, S., & Shin, J. (2021). Layer-adaptive sparsity for the Magnitude-based Pruning [Paper presentation]. 2021 ICLR/ CVF conference on machine learning. https://doi.org/10.48550/arXiv.2010.07611
Li, S., Zhang, S., Xue, J., & Sun, H. (2022a). Lightweight target detection for the field flat jujube based on improved YOLOv5. Computers and Electronics in Agriculture, 202, 107391. https://doi.org/10.1016/j.compag.2022.107391
Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., … & Wei, X. (2022b). YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications [Paper presentation]. 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.2209.02976
Li, D., Hua, C., & Liu, Y. (2023). Identifying apple leaf disease using a fine-grained distillation model. Transactions of the Chinese Society of Agricultural Engineering, 39(7), 185–194. https://doi.org/10.11975/j.issn.1002-6819.202211209
Liu, Z., Zhu, X., Zhao, Y., Zhang, Y., Shi, H., Luan, R., & Liu, J. (2024). Effects of short-term treatment with high concentration CO2 on Postharvest storage quality and antioxidant metabolism of plums. Food Industry Technology. 45, 311–318. https://doi.org/10.13386/j.issn1002-0306.2023080119
López-Barrios, J.D., Cabello, J.A.E., Gómez-Espinosa, A., & Montoya-Cavero, L.E. (2023). Green sweet pepper fruit and peduncle detection using mask R-CNN in greenhouses. Applied Sciences, 13(10), 6296. https://doi.org/10.3390/app13106296
Redmon, J., & Farhadi, A. (2018). YOLOv3: An Incremental Improvement [Paper presentation]. 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.1804.02767
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). "You Only Look Once: Unified, Real-Time Object Detection". In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp. 779-788. https://doi.org/10.1109/CVPR.2016.91
Ren, R., Sun, H., Zhang, S., Wang, N., Lu, X., Jing, J., Xin, M., & Cui, T. (2023). Intelligent detection of lightweight “Yuluxiang” pear in non-structural environment based on YOLO-GEW. Agronomy, 13(9), 2418. https://doi.org/10.3390/agronomy13092418
Ren, S., He, K., Girshick, R., & Sun, J. (2016). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [Paper presentation]. 2016 IEEE/ CVF international conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.1506.01497
Wang, Z., Ling, Y., Wang, X., Meng, D., Nie, L., An, G., & Wang, X. (2022a). An improved Faster R-CNN model for multi-object tomato maturity detection in complex scenarios. Ecological Informatics, 72, 101886. https://doi.org/10.1016/j.ecoinf.2022.101886
Wang, C., Bochkovskiy, A., & Liao, H.Y.M. (2022b). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors [Paper presentation]. 2022 IEEE/ CVF international conference on computer vision and pattern recognition (CVPR). https://doi.org/10.48550/arXiv.2207.02696
Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., & Hu, Q. (2020). ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, pp. 11531-11539. https://doi.org/10.1109/CVPR42600.2020.01155
Woo, S., Park, J., Lee, J. Y., Kweon, I. S. (2018). CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds) Computer Vision – ECCV 2018. ECCV 2018. Lecture Notes in Computer Science, vol. 11211. Springer, Cham. https://doi.org/10.1007/978-3-030-01234-2_1
Xiao, X., Wang, Y., Zhou, B., & Jiang, Y. (2024). Flexible hand claw picking method for citrus-picking robot based on target fruit recognition. Agriculture, 14(8), 1227. https://doi.org/10.3390/agriculture14081227
Yang, H., Liu, Y., Wang, S., Qu, H., Li, N., Wu, J., Yan, Y., Zhang, H., Wang, J., & Qiu, J. (2023a). Improved apple fruit target recognition method based on YOLOv7 model. Agriculture, 13(7), 1278. https://doi.org/10.3390/agriculture13071278
Yang, S., Wang, W., Gao, S., & Deng, Z. (2023b). Strawberry ripeness detection based on YOLOv8 algorithm fused with LW-Swin Transformer. Computers and Electronics in Agriculture, 215, 108360. https://doi.org/10.1016/j.compag.2023.108360
Yang, L., Zhang, R., Li, L., & Xie, X. (2021). SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks. In Proceedings of the 38th International Conference on Machine Learning, Proceedings of Machine Learning Research, 139:11863-11874 Available from https://proceedings.mlr.press/v139/yang21o.html.
Yu, Y., Zhang, K., Yang, L., & Zhang, D. (2019). Fruit detection for strawberry harvesting robot in non-structural environment based on Mask-RCNN. Computers and Electronics in Agriculture, 163, 104846. https://doi.org/10.1016/j.compag.2019.06.001
Yue, K., Zhang, P., Wang, L., Guo, Z., & Zhang, J. (2024). Recognizing citrus in complex environment using improved YOLOv8n. Transactions of the Chinese Society of Agricultural Engineering, 40(8), 152–158. https://doi.org/10.11975/j.issn.1002-6819.202401118
Zhang, X., Cui, J., Liu, H., Han, Y., Ai, H., Dong, C., Zhang, J., & Chu, Y. (2023). Weed identification in soybean seedling stage based on optimized faster R-CNN algorithm. Agriculture, 13(1), 175. https://doi.org/10.3390/agriculture13010175
Zhang, Z., Zhou, J., & Jiang, Z. (2024). Lightweight apple recognition method in natural orchard environment based on improved YOLO v7 model. Transactions of the Chinese Society for Agricultural Machinery, 55(3), 231-242. https://doi.org/10.6041/j.issn.1000-1298.2024.03.023
Zhou, H., Jin, S., Zhou, L., Guo, Z., Sun, M., & Shi, M. (2023). Classification and recognition of camellia oleifera fruit in the field based on transfer learning and YOLOv8n. Transactions of the Chinese Society of Agricultural Engineering, 39(20), 159–166. https://doi.org/10.11975/j.issn.1002-6819.202307244





