RTDETR-Refa: a real-time detection method for multi-breed classification of cattle
Journal of Real-Time Image Processing
(2025) 22:38
https://doi.org/10.1007/s11554-024-01613-7
RESEARCH
RTDETR‑Refa: a real‑time detection method for multi‑breed
classification of cattle
Bingxuan Li1,2,3 · Jiandong Fang1,2,3 · Yvdong Zhao2,3
Received: 14 June 2024 / Accepted: 17 December 2024
© The Author(s) 2025
Abstract
In the farming industry, to cope with problems such as complex pasture environments and dense targets, which lead to
increased difficulty in recognising and thus quickly classifying and automatically identifying cattle breeds to improve accuracy. In this paper, an RTDETR-Refa (RepConv Efficient Faster Attention) algorithm based on ResNet18 backbone network
is proposed for cattle breed classification and identification. First, new improvements are made to the ResNet18 backbone
network: the Faster-Block module is introduced to improve the feature extraction network and increase the computational
speed without sacrificing the accuracy; the 1×1 convolution in the Faster-Block module is replaced by a 3×3 convolution
using the RepConv reparameterised with the RepVGG block, which makes the algorithm more lightweight and improves
the inference speed. Second, in order to enhance the feature transformation and classification, the Efficient Multiscale Attention (EMA) module is added after the Faster-Block module at different stages. Finally, the above improved Faster-Block
module is used to replace the 4-layer BasicBlock after the 3 convolutional layers in the backbone network of ResNet18. The
training test results of the RTDETR-Refa algorithm are compared with other classical models to validate the superiority of
the RTDETR-Refa algorithm. The average accuracy of the RTDETR-Refa algorithm on the bovine The average accuracy
of RTDETR-Refa algorithm on the classification training set is 91.6%, which is 0.8% higher than the original model and
0.9–5.2% higher than other classical models. The experimental results show that the RTDETR-Refa model proposed in this
paper is able to identify and classify different breeds of cattle while guaranteeing similar detection speed, which proves the
feasibility of convolutional neural networks in breed identification and classification.
Keywords RT-DETR classification model · Classification of varieties · Toolkit for Identifying Detection and segmentation
Errors(TIDE) · Practical Scenario Application
1 Introduction
* Jiandong Fang
Bingxuan Li
Yvdong Zhao
1
College of Information Engineering, Inner Mongolia
University of Technology, Hohhot 010080, Inner Mongolia,
China
2
Inner Mongolia Key Laboratory of Perceptive Technology
and Intelligent Systems, Hohhot 010080, Inner Mongolia,
China
3
Inner Mongolia Synergy Innovation Center of Perception
Technology in Intelligent Agriculture and Animal
Husbandry, Hohhot 010080, Inner Mongolia, China
In the cattle breeding industry, as different breeds of cattle
produce different values, the accurate identification of cattle classification and numbers in unrestricted environments
can, to a certain extent, realise individualised and refined
management [1], and can also effectively solve the problems
of slowness, low accuracy, and subjectivity when manually determining cattle breeds [2]. Some studies [3–6] have
pointed out that the multiple behavioural postures of cattle
(lying, ruminating, crawling and straddling, etc.) lead to an
increase in the difficulty of recognition, and the problems
of multiple types and scales of target cattle within a single
picture, easy to stick when the recognition targets are dense,
and the existence of non-targets in the recognition map also
exist in the real environment. Therefore, not only is there a
high demand for recognition accuracy, but also the problems
Vol.:(0123456789)
38
Page 2 of 16
of target sticking and non-target misrecognition occurring in
practical applications need to be solved urgently. In order to
solve the current technical problems, deep learning counting
techniques can be used instead of human vision for classification detection and recognition of cattle images. At the
same time, these image recognition-based methods also have
high requirements for image quality and lighting conditions,
which may be limited in practical applications [7, 8]. Therefore, a more accurate, cost-effective, and intelligent method
for cattle breed classification and detection is now needed
[9, 10].
Duraiswami et al [11] suggested the use of different cattle
picture datasets to identify cattle breeds. Photographs will be
taken of the front, back and side of the cattle to extract key
features and these feature extraction will be used to uniquely
identify each cow. After image preprocessing, the image of
each cow will be categorised by breed. Based on the preprocessed cattle breed image dataset, another category of
rare cattle breeds will be identified. Yılmaz et al [12] aimed
to evaluate the performance of cattle detection and breed
classification based on specific regions on cattle that vary
by breed. In order to demonstrate the effectiveness of the
proposed cattle detection and classification technique, a customised dataset was created using images collected from the
Google image library. The experimental results show that
the YOLO algorithm is successful in cattle image detection
and breed classification with an accuracy of 92.85%. Manoj
et al [13] In order to identify cattle breeds, it is proposed to
use several cattle image datasets for identification. Images
of cattle will be taken to obtain salient features of their front,
back and flanks and then these features will be extracted to
identify each cattle and its information. Rare breed of cattle will be individually classified using preprocessed cattle
breed image dataset. CNN deep learning method is used
for recognising different shapes of face and whole body of
the cattle using CNN deep learning method. Gupta et al
[14] proposed a computer vision based approach to identify
cattle breeds. They customised a dataset using web mining
techniques and evaluated the performance of the YOLOv4
algorithm by training the model on different sets of training
parameters. A comprehensive analysis of the experimental
results shows that the proposed method achieves 81.07%
accuracy on the test dataset.
Chen et al. [15] proposed a system for image segmentation and recognition of Angus cattle using a deep learning
approach. Two databases of cattle were first collected and
annotated, one was a frontal image taken on the same day
in a laboratory setting with controlled lighting and pose.
The second database was taken on a farm on three different
days using natural light and background. The entire body
of the cow was photographed from different angles. Multiple design alternatives for cattle recognition were evaluated using three popular networks PrimNet, VGG16 and
Journal of Real-Time Image Processing
(2025) 22:38
ResNet50, with the VGG85 network achieving an accuracy
of 45.16%. Shanthakumari et al [16] A high-pre (...truncated)