یادگیری عمیق در مدیریت دامداری هوشمند: بررسی جامع نقش شبکه‌های عصبی کانولوشن

نوع مقاله : مقاله مروری

نویسنده

دانشجوی دکتری فیزیولوژی دام، گروه علوم دامی، دانشکده کشاورزی، دانشگاه کردستان، سنندج، ایران

10.22098/naas.2026.18530.1006

چکیده

هدف: تلفیق سیستم‌های بینایی کامپیوتری (CV) بر پایه شبکه عصبی کانولوشن (CNN) به‌سرعت در حال تغییر مدیریت حیوانات مزرعه‌ای در صنعت کشاورزی است. باوجود پیشرفت‌های قابل‌توجهی در این سیستم‌ها ، هنوز نیاز اساسی به یک مرور جامع و کامل از کاربردهای فعلی، نمونه‌های کاربردی، نواقص اصلی و راه‌حل‌های پیشنهادی آن‌ها وجود دارد. در این مقاله مروری به بررسی مراحل کاربردهای گوناگون بینایی کامپیوتری بر پایه شبکه عصبی(CNN) در حوزه مدیریت دام‌پروری هوشمند پرداخته‌شده است
روش پژوهش: در این مطالعه به تحلیل و بررسی کاربردهای مختلف و به‌کارگیری شبکه عصبی کانولوشن، در زمینه تشخیص اشیاء، دسته‌بندی معنایی نمونه، پیش‌بینی حالت و ردیابی در بین دام‌های اصلی مزرعه مانند گاو، گوسفند و بز، خوک و طیور پرداخته‌شده است. در این مطالعه مراحل مهم برای توسعه سیستم، ازجمله تنظیمات دقیق دوربین، روش‌های مختلف جمع‌آوری داده‌ها، واحدهای پردازش گرافیکی، آماده‌سازی و پردازش اولیه تصاویر و برچسب‌گذاری مؤثر داده‌ها را بررسی می‌کند. همچنین معماری‌های مختلف شبکه عصبی کانولوشن متناسب با وظایف خاص بینایی ماشینی را شکل کامل و دقیق جمع‌آوری مورد تحلیل قرارگرفته است.
یافته‌ها: این پژوهش نکات  کاربردی مهمی برای توسعه الگوریتم، مانند تقسیم‌بندی دقیق داده‌ها، افزایش داده‌ها، تنظیم داده‌ها قبل از شروع یادگیری و انتخاب معیارهای ارزیابی مناسب را ارائه می‌دهد. همچنین در این مطالعه یک چارچوب کلی و نظام‌مند برای تشخیص چالش‌های موجود و مشکلات آینده ارائه می‌دهد
نتیجه‌گیری: هدف نهایی این مقاله مروری ارائه یک راهنما به پژوهشگران و فعالان و متخصصان جهت توسعه و پیاده‌سازی موفقیت‌آمیز سیستم‌های بینایی ماشینی (CV) مبتنی بر شبکه عصبی کانولوشن(CNN)  برای مدیریت پایدار و صحیح در دام‌پروری نوین است.

کلیدواژه‌ها


عنوان مقاله [English]

Deep Learning in Intelligent Livestock Management: A Comprehensive Review of the Role of Convolutional Neural Networks

نویسنده [English]

  • Amjad Salimi
PhD student in Animal Physiology, Department of Animal Sciences, Faculty of Agriculture, University of Kurdistan, Sanandaj, Iran
چکیده [English]

Introduction
The Paradigm Shift in Precision Livestock Farming (PLF). The global livestock industry is currently undergoing a radical digital transformation. As demand for high-quality protein increases alongside a growing global population, traditional husbandry methods struggle to balance productivity with the growing requirements for animal welfare and environmental sustainability. Precision Livestock Farming (PLF) has emerged as the definitive solution, utilizing technologies such as the Internet of Things (IoT), smart sensors, and advanced analytics. Among these, Computer Vision (CV), powered by Convolutional Neural Networks (CNNs), stands out as the most promising non-invasive tool. Unlike wearable sensors that may cause physical stress or require constant battery maintenance, CNN-based systems offer a "set-and-forget" approach to continuous, real-time monitoring of ethological patterns. Despite the proliferation of individual studies, there is a critical need for a comprehensive synthesis that maps the entire development pipeline—from image acquisition hardware to high-level algorithmic inference—across diverse species, including cattle, small ruminants, swine, and poultry. This study addresses this gap by reviewing the integration of Deep Learning for identifying phenotypic traits and by providing a strategic roadmap for sustainable, modern production.
Method
This review was conducted in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines to ensure transparency and replicability. A systematic multi-database search was conducted in Scopus, Web of Science, PubMed, and Google Scholar, focusing on peer-reviewed literature published between 2015 and 2024. The search string integrated keywords such as "Convolutional Neural Networks," "Deep Learning," "Precision Livestock Farming," and specific CV tasks. Out of hundreds of screened papers, only those demonstrating high technical validity and practical relevance to livestock management were selected. The inclusion criteria focused on the four pillars of computer vision: Object Detection, Semantic/Instance Segmentation, Pose Estimation, and Multi-Object Tracking (MOT). The study synthesized data from various farm environments, specifically analyzing how these models handle the "domain shift" between controlled lab settings and the harsh, unpredictable conditions of commercial barns characterized by variable illumination and high-density stocking.
Results

Core Computer Vision Applications in Animal Husbandry

Object Detection and Individual Identification: Identifying, counting, and localizing animals is foundational to digital inventory management and Biosecurity. Advanced architectures such as the YOLO (You Only Look Once) series (v5-v8), Faster R-CNN, and SSD (Single Shot Multi Box Detector) have revolutionized this domain. Beyond simple counting, these models are now tuned for high-stakes event detection, such as Calving (Parturition) monitoring and Estrous behavior recognition. By automating the detection of these physiological milestones, farmers can intervene precisely when needed, reducing mortality rates and improving reproductive efficiency.
Segmentation for Morphological Analysis:
Image segmentation (specifically Mask R-CNN and U-Net) involves a pixel-level understanding of the animal's body. This technology is instrumental in automated Body Condition Scoring (BCS). By isolating specific anatomical markers, such as the rump, hooks, and pins, in dairy cows, the system can provide a numerical score of the animal's fat reserves. This objective, human-bias-free measurement is critical for managing nutritional intake and predicting milk yield, ensuring that metabolic diseases are caught before they escalate.
Pose Estimation and Locomotion Analysis:
By detecting skeletal key points and joints, systems such as Alpha Pose and Open Pose enable granular analysis of animal movement. In the dairy industry, this is applied to Locomotion Scoring for the early diagnosis of Lameness. Since CNNs can detect subtle changes in gait cycles and joint angles—often invisible to the human eye—they enable early treatment, preventing significant economic losses and protecting animal welfare.
Multi-Object Tracking (MOT) and Behavioral Time Budgets: Combining CNN detectors with tracking algorithms such as Deep SORT (Simple Online and Realtime Tracking) enables persistent monitoring of individuals within a herd. This creates a high-fidelity map of an animal's Time Budget, recording exactly how long it spends feeding, drinking, or resting. Anomalies in these patterns act as "early warning signals" for distress, social isolation, or the onset of sub-clinical illness.

Technical Development Pipeline: From Hardware to Inference

A successful CNN implementation in a livestock barn is not merely about the code; it is a complex engineering pipeline:
Data Acquisition and Hardware Selection:
The review highlights the importance of camera specifications (resolution, frame rate) and placement (nadir vs. oblique views). The role of Graphics Processing Units (GPUs) is emphasized as the engine for training these deep models, alongside the potential of thermal and depth cameras for low-light environments.
Pre-processing and Data Augmentation:
Farm images are often noisy, dusty, or poorly lit. Techniques to enhance the Contrast-to-Noise Ratio (CNR) are essential. To address the challenge of small "farm-specific" datasets, this study reviews strategies such as Data Augmentation (flipping, cropping, and color jittering) and Transfer Learning (leveraging pre-trained weights from ImageNet) to prevent Overfitting and ensure the model generalizes well to new barns.
Evaluation Benchmarks:
To ensure scientific rigor, the study advocates for the standardized use of Mean Average Precision (mAP) for detection tasks, Intersection over Union (IoU) for segmentation, and MOTA (Multi-Object Tracking Accuracy) for monitoring systems.

Discussion: Barriers to Implementation and Future Horizons

While the technical accuracy of these models is high, the "Real-world" farm presents significant hurdles. Occlusions-where animals block the camera's view of one another-and extreme animal density remain the primary causes of "ID switches" in tracking systems. Furthermore, the high computational demands of state-of-the-art CNNs make them difficult to deploy on Edge Devices (e.g., local, low-power farm servers). Looking ahead, the trend is toward Sensor Fusion, in which CV data are integrated with Bio-acoustic sensors and environmental data (e.g., humidity and temperature) to create a comprehensive Digital Twin of the farm. Additionally, the shift toward Semi-supervised and Unsupervised Learning will likely reduce the substantial labor costs associated with manual data labeling, thereby making these systems more adaptable to different breeds and global farming practices.
Conclusions
CNN-based computer vision systems represent the cornerstone of the next generation of Smart Livestock Management. By automating the monitoring of health, welfare, and productivity, these technologies enable producers to shift from reactive to proactive management. This review provides the necessary strategic and technical framework for researchers and industry practitioners to successfully deploy deep learning models, ensuring a future that is more efficient, welfare-centric, and economically sustainable.

کلیدواژه‌ها [English]

  • Deep Learning
  • Convolutional Neural Network (CNN)
  • Computer Vision
  • Precision Livestock Farming (PLF)