SDPH: a new technique for spatial detection of path holes from huge volume high-resolution raster images in near real-time (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1007/s11554-024-01451-7.pdf

SDPH: a new technique for spatial detection of path holes from huge volume high-resolution raster images in near real-time

Journal of Real-Time Image Processing (2024) 21:70 https://doi.org/10.1007/s11554-024-01451-7 RESEARCH SDPH: a new technique for spatial detection of path holes from huge volume high‑resolution raster images in near real‑time Murat Tasyurek1 Received: 4 January 2024 / Accepted: 3 March 2024 / Published online: 4 April 2024 © The Author(s) 2024 Abstract Detecting and repairing road defects is crucial for road safety, vehicle maintenance, and enhancing tourism on well-maintained roads. However, monitoring all roads by vehicle incurs high costs. With the widespread use of remote sensing technologies, high-resolution satellite images offer a cost-effective alternative. This study proposes a new technique, SDPH, for automated detection of damaged roads from vast, high-resolution satellite images. In the SDPH technique, satellite images are organized in a pyramid grid file system, allowing deep learning methods to efficiently process them. The images, generated as 256 × 256 dimensions, are stored in a directory with explicit location information. The SDPH technique employs a two-stage object detection models, utilizing classical and modified RCNNv3, YOLOv5, and YOLOv8. Classical RCNNv3, YOLOv5, and YOLOv8 and modified RCNNv3, YOLOv5, and YOLOv8 in the first stage for identifying roads, achieving f1 scores of 0.743, 0.716, 0.710, 0.955, 0.958, and 0.954, respectively. When the YOLOv5, with the highest f1 score, was fed to the second stage; modified RCNNv3, YOLOv5, and YOLOv8 detected road defects, achieving f1 scores of 0.957,0.971 and 0.964 in the second process. When the same CNN model was used for road and road defect detection in the proposed SDPH model, classical RCNNv3, improved RCNNv3, classical YOLOv5, improved YOLOv5, classical YOLOv8, improved RCNNv8 achieved micro f1 scores of 0.752, 0.956, 0.726, 0.969, 0.720 and 0.965, respectively. In addition, these models processed 11, 10, 33, 31, 37, and 36 FPS images by performing both stage operations, respectively. Evaluations on geotiff satellite images from Kayseri Metropolitan Municipality, ranging between 20 and 40 gigabytes, demonstrated the efficiency of the SDPH technique. Notably, the modified YOLOv5 outperformed, detecting paths and defects in 0.032 s with the micro f1 score of 0.969. Fine-tuning on TileCache enhanced f1 scores and reduced computational costs across all models. Keywords Near real-time · High-resolution imagery · Pat hole detection · Object detection with spatial location 1 Introduction Damaged paths cause traffic accidents and injuries [1, 2]. Imperfections such as potholes, cracks, or trapezoids can make vehicles difficult to control, distract drivers, and cause them to make sudden maneuvers. Detecting damaged roads improves driver safety and helps prevent accidents [3]. However, it is very costly to constantly check all roads by assigning personnel to detect broken roads. On the other hand, with the widespread use of remote sensing technology in recent years, huge volume and high-resolution images are obtained [4–6]. High-resolution satellite images are * Murat Tasyurek 1 Department of Computer Engineering, Kayseri University, Kayseri, Turkey images obtained from space or the air that show the details of the Earth in high resolution. These images usually have sufficient pixel density to distinguish very small objects or details [7]. However, these images take up a large amount of space on computer systems [8]. Large volumes of data are challenging to handle by classical data processing methods [9, 10]. Technology giants such as ArcGIS and Google Earth cut huge volumes of images into small pieces, allowing users to access these images at high speed [11–13]. Large-volume images are divided into small pieces in the form of small grids and accessed in pyramid grid file format [14]. This operation is called tile [15]. In the tiling process, huge volumes of satellite images are recorded on computer disks as z/x/y.mime type [16]. Z denotes the recording level of the image, x signifies the image’s position along the X-axis, the y-value represents its position along the Y-axis, and the Vol.:(0123456789) 70 Page 2 of 19 mime type specifies the format in which the image is to be stored. These images are produced and saved in 256 × 256 dimensions. When the user wants to access the image in a region, it takes a very long time to take the huge volume image and present the relevant region. Instead, the pyramid grid file system provides high-speed access to users because small-size images in the appropriate location are presented [17, 18]. In recent times, the advanced deep learning technique, a cutting-edge computer technology, has found widespread application in diverse domains such as image classification [19], object tracking [20], and pose estimation [21]. Numerous deep learning methodologies have gained widespread acceptance in this domain. Notably, among these techniques, convolutional neural networks (CNNs) have demonstrated remarkable efficacy, particularly in the realm of image classification [22]. Detection of road disturbances is a classification problem in computer vision. Within the literature, research on the identification of road disturbances through onboard cameras often employs CNN-based methods [23–25]. In this investigation, we propose a new method, referred to as SDPH, for identifying road disturbances along with their spatial coordinates within huge volumes and high-resolution satellite imagery. CNN models could not process satellite images ranging in size from 20 to 40 gigabits. In the suggested SDPH methodology, the conversion of these images into a pyramid grid file format is facilitated through the GeoServerTileCache software, employing an open-source strategy. This format ensures compatibility for processing the images through CNN models. In the satellite images, there are disturbances in the soil. In the recommended SDPH technique, a two-stage deep learning technique has been developed to detect only road disturbances. As a deep learning method, RCNNv3 [25, 26], YOLOv5 [24, 27], and YOLOv8 [28, 29] were used. 1.1 Contributions – A new technique, SDPH, is recommended to detect road disturbances and their corresponding spatial locations within vast volumes of high-resolution satellite imagery. – The achievement of the recommended technique has been tested on real satellite image data with sizes between 20 and 40 gigabytes. – A pyramid grid file system using GeoServer-TileCache has been recommended so that CNN models can process huge volume satellite images. – Thanks to the recommended pyramid grid technique, a huge volume satellite image that occupies at least 20 gigabytes of space and, therefore cannot be processed has been converted into small and processable images, the largest of which is 23 kilobytes. Journal of Real-Time Image Processing (2024) 21:70 – A fine-tuning technique has been developed to improve the object detection achievement of the recommended SDPH te (...truncated)