ISSN 1000-1239 CN 11-1777/TP

• 人工智能 •

基于改进RetinaNet的自然环境中蝴蝶种类识别

1. 1（陕西师范大学计算机科学学院 西安 710119）；2（陕西师范大学生命科学学院 西安 710119) (xiejuany@snnu.edu.cn)
• 出版日期: 2021-08-01
• 基金资助:
国家自然科学基金项目(62076159,61673251,12031010)；中央高校基本科研业务费专项资金项目(GK202105003);陕西师范大学研究生培养创新基金项目(2016CSY009,2018TS078)

Butterfly Species Identification from Natural Environment Based on Improved RetinaNet

Xie Juanying1, Lu Yinyuan1, Kong Weixuan1, Xu Shengquan2

1. 1（School of Computer Science, Shaanxi Normal University, Xi’an 710119);2(College of Life Sciences, Shaanxi Normal University, Xi’an 710119)
• Online: 2021-08-01
• Supported by:
This work was supported by the National Natural Science Foundation of China (62076159, 61673251, 12031010), the Fundamental Research Funds for the Central Universities (GK202105003), and the Innovation Funds of Graduate Programs at Shaanxi Normal University(2016CSY009, 2018TS078).

Abstract: Butterfly is a kind of insects that are sensitive to the habitat. The distribution of butterfly species in natural environment reflects the balance of regional ecosystem and the biodiversity of the region. To identify the species of butterflies manually is a heavy time consuming work for experts. Computer vision technology makes it possible to automatically identify butterfly species. This paper focuses on identifying the butterfly species via images taken in natural environment. This is a very challenging task because the butterfly wings in the images are always folded and the features identifying the butterfly species cannot be seen. Therefore two new attention mechanisms, referred to as DSEA (direct squeeze-and-excitation with global average pooling) and DSEM (direct squeeze-and-excitation with global max pooling), are proposed in this paper to advance the classical object detection algorithm RetinaNet. And the deformable convolution is simultaneously introduced to enhance the power of RetinaNet to simulate the butterfly deformation in images from natural environment, so as to realize the automatic butterfly species identification task according to the features of butterfly images from natural environment. The very famous criterion mAP (mean average precision) for target detection is taken to value the proposed model, and the visualization is adopted to investigate the primary factors influencing the performance of the predictive model. Extensive experiments demonstrate that the improved RetinaNet is valid in identifying the butterfly species from images taken in the natural environment, especially the RetinaNet embedded with DSEM module. The balanced data can improve the generalization of the predictive model, and the structural dissimilarity of samples is a key factor affecting the performance of the predictive model.