A power-aware vision-based virtual sensor for real-time edge computing (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1007/s11554-024-01482-0.pdf

A power-aware vision-based virtual sensor for real-time edge computing

Journal of Real-Time Image Processing (2024) 21:103 https://doi.org/10.1007/s11554-024-01482-0 RESEARCH A power‑aware vision‑based virtual sensor for real‑time edge computing Chiara Contoli1 · Lorenzo Calisti1 · Giacomo Di Fabrizio1 · Nicholas Kania1 · Alessandro Bogliolo1 · Emanuele Lattanzi1 Received: 31 January 2024 / Accepted: 20 May 2024 © The Author(s) 2024 Abstract Graphics processing units and tensor processing units coupled with tiny machine learning models deployed on edge devices are revolutionizing computer vision and real-time tracking systems. However, edge devices pose tight resource and power constraints. This paper proposes a real-time vision-based virtual sensors paradigm to provide power-aware multi-object tracking at the edge while preserving tracking accuracy and enhancing privacy. We thoroughly describe our proposed system architecture, focusing on the Dynamic Inference Power Manager (DIPM). Our proposed DIPM is based on an adaptive frame rate to provide energy savings. We implement and deploy the virtual sensor and the DIPM on the NVIDIA Jetson Nano edge platform to prove the effectiveness and efficiency of the proposed solution. The results of extensive experiments demonstrate that the proposed virtual sensor can achieve a reduction in energy consumption of about 36% in videos with relatively low dynamicity and about 21% in more dynamic video content while simultaneously maintaining tracking accuracy within a range of less than 1.2%. Keywords Edge computing · Virtual sensor · Energy-aware object tracking 1 Introduction The edge computing paradigm is gaining momentum thanks to the advent of real-time, low-power Graphical Processing Units (GPUs) and Tensor Processing Units (TPUs) able to run tiny Machine Learning (ML) models on resourceconstrained devices. The shift of paradigm of moving intelligence from the cloud towards the edge provides benefits * Chiara Contoli Lorenzo Calisti Giacomo Di Fabrizio Nicholas Kania Alessandro Bogliolo Emanuele Lattanzi 1 Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica 13, Urbino, Italy in terms of privacy, latency, and bandwidth [2, 49]. Of particular interest are applications enabled by combining edge computing with object detection and tracking tasks, such as remote sensing image [11, 24], video surveillance [1, 18], human–computer interaction [48, 54], and autonomous driving [7, 36], to cite a few. Object detection and tracking are two distinct computer vision tasks. Initially, those tasks were mainly carried out on powerful cloud server [12, 37]; subsequently, the idea was to adopt a collaborative approach between the cloud and the edge, where the two tasks are split across the edge computing architecture [6, 33]. A more recent approach envisages the design of efficient machine and deep learning solutions that are lightweight enough to be deployed on resource-constrained edge devices [13, 45]. Three well-known and widely adopted multi-object tracking-by-detection algorithms are Simple Online Realtime Tracker (SORT) [3], DeepSORT [43], and Intersection over Union (IoU) [35]. Many works in the existing literature compare the performance of SORT and DeepSORT algorithms against other state-of-the-art trackers [20, 31, 41, 44, 51]. The same applies to the IoU algorithm [10, 34, 46, 50]. Despite the rich literature, those works lack the Vol.:(0123456789) 103 Page 2 of 12 investigation of the energy efficiency of the tracker algorithms. Only a few papers, e.g., [30], consider skipping some frames for performance improvement. However, the authors consider a fixed number of frames to be skipped and do not explore the energy efficiency of their algorithm. If detection and tracking algorithms are required to run on low-power edge devices, their power consumption requires attention because of the challenges posed by resource constraints [55, 56]. Despite the high attention paid to energy expenditure, most of the literature only focuses on the impact of the detection phase. When the tracking phase is considered, many works, e.g., [47], only consider the single object tracking case. Our work advances existing literature by proposing an adaptive frame rate strategy for the IoU multi-object tracker to provide a real-time, power-aware, energy-efficient tracking algorithm. 1.1 Our contributions In this paper, we propose the real-time vision-based virtual sensors paradigm for energy-efficient multi-object tracking on edge devices. We first thoroughly describe our proposed system architecture, with a particular focus on the Dynamic Inference Power Manager (DIPM). We implement and deploy the virtual sensor and the DIPM to perform extensive experimental measurements to prove the effectiveness and efficiency of our proposed methodology. Specifically, we consider the Single Shot Detector (SSD) MobileNet [9], and the Train Adapt Optimize (TAO) TrafficCamNet [28] as object detectors, and the lightweight Intersection over Union (IoU) tracking algorithm [38]. Our testbed uses the NVIDIA Jetson Nano [27] as an edge device platform, and we tested it on well-known benchmarks based on the MultiObject Tracking (MOT) challenge [25]. Results show that the proposed virtual sensor can achieve a reduction in energy consumption of about 36% in videos with relatively low dynamicity and about 21% in more dynamic video content while simultaneously maintaining tracking accuracy within a range of less than 1.2%. Our contributions are summarized as follows: – Real-time vision-based virtual sensors: a family of synthetic sensors that process data from camera sources and extract anonymous numerical information. The virtual sensor boosts privacy by consolidating data processing on the edge device without sending sensitive data to a centralized server. – Dynamic Inference Power Manager: enhances the virtual sensors by implementing an adaptive frame rate approach to allow energy savings while preserving tracking accuracy. Journal of Real-Time Image Processing (2024) 21:103 – deployment on the NVIDIA Jetson Nano: we highlight the advantages of our methodology compared to conventional non-power-aware edge computing approaches. The rest of the paper is organized as follows: Sect. 2 provides related work on object detection and tracking on edge device platforms. Section 3 describes the design principles of the proposed vision-based virtual sensor and provides a detailed description of the proposed DIPM. In Sect. 4, we provide background on the performance evaluation metrics typically employed to evaluate tracking systems. Section 5 presents the experimental setup and methodology employed to assess the efficacy of our solution. Section 6 discusses the results and findings, while Sect. 7 concludes the paper. 2 Related work This section reviews the relevant literature on object detection and tracking efforts on testbed implementation considering edge hardware platforms. We also review the literature on (...truncated)