刊名 |
Agricultural Biotechnology |
作者 |
Hanlin XU, Shiyu WU, Guochao DING* |
作者单位 |
College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, China |
DOI |
DOI:10.19759/j.cnki.2164-4993.2025.01.018 |
年份 |
2025 |
刊期 |
1 |
页码 |
77-79 |
关键词 |
Fish; Group behavior; Behavior recognition; Deep learning; YOLOv10 |
摘要 |
A common but flawed design in existing CNN architectures is using strided convolutions and/or pooling layer, which will result in the loss of fine-grained feature information, especially for low-resolution images and small objects. In this paper, a new CNN building block named SPD-Conv was used, which completely eliminated stride and pooling operations and replaced them with a space-to-depth convolution and a non-strided convolution. Such new design has the advantage of downsampling feature maps while retaining discriminant feature information. It also represents a general unified method, which can be easily applied to any CNN architectures, and can also be applied to strided conversion and pooling in the same way. |