EURASIP Journal on Image and Video Processing

http://link.springer.com/journal/13640

List of Papers (Total 538)

Hierarchical complexity control algorithm for HEVC based on coding unit depth decision

The next-generation High Efficiency Video Coding (HEVC) standard reduces the bit rate by 44% on average compared to the previous-generation H.264 standard, resulting in higher encoding complexity. To achieve normal video coding in power-constrained devices and minimize the rate distortion degradation, this paper proposes a hierarchical complexity control algorithm for HEVC on the...

Experiments and assessments of a 3-DOF haptic device for interactive operation

Haptic devices have been applied in interactive operation to perform contact tasks. To explore the haptic perception characteristics of typical push-pull and rotation operation, an experimental system was built by incorporating a three degrees of freedom (3-DOF) haptic device and the virtual environment. In this system, the haptic device is used to provide motion commands to...

Research on 3D measurement model by line structure light vision

For serious radial distortion and high precision measurement, 3D measurement model is studied in the paper. Based on the two-step calibration algorithm, a round initialization window is proposed to calculate the initial value of the camera parameters for nonlinear optimization. In order to solve the problem that the Supersonic Transport Evaluation Group algorithm of light stripe...

Remote sensing image mosaic technology based on SURF algorithm in agriculture

The remote sensing technology of unmanned aerial vehicle (UAV) is a low altitude remote sensing technology. The technology has been widely used in military, agricultural, medical, geographical mapping, and other fields by virtue of the advantages of fast acquisition, high resolution, low cost, and good security. But limited by the flying height of UAV and the focal length of the...

Secure and efficient DRM watermark algorithm of forensics in mobile internet

With the development of mobile Internet technology, the characteristic of easy editing, transmitting, and forging the digital media bring great challenges in authenticity of multimedia. Due to that, the focus on the digital forensics and identification, such as source identification, content authentication, and information integrity, have become an important research content in...

Ensemble feature learning for material recognition with convolutional neural networks

Material recognition is the process of recognizing the constituent material of the object, and it is a crucial step in many fields. Therefore, it is valuable to create a system that could achieve material recognition automatically. This paper proposes a novel approach named ensemble learning for material recognition with convolutional neural networks (CNNs). In the proposed...

Image denoising with morphology- and size-adaptive block-matching transform domain filtering

BM3D is a state-of-the-art image denoising method. Its denoised results in the regions with strong edges can often be better than in the regions with smooth or weak edges, due to more accurate block-matching for the strong-edge regions. So using adaptive block sizes on different image regions may result in better image denoising. Based on these observations, in this paper, we...

Attribute-enhanced metric learning for face retrieval

Metric learning is a significant factor for media retrieval. In this paper, we propose an attribute label enhanced metric learning model to assist face image retrieval. Different from general cross-media retrieval, in the proposed model, the information of attribute labels are embedded in a hypergraph metric learning framework for face image retrieval tasks. The attribute labels...

A general framework for shiftable position-based dual-image reversible data hiding

This paper proposes an improved method for shiftable position-based dual-image reversible data hiding. During the procedure of embedding data, the total number of shiftable coordinates is set as the parameter to make a trade-off between distortion and embedding rate. First, the optimal parameter is sought and the one-to-one code table is generated according to the expected...

Multi-scale contrast and relative motion-based key frame extraction

The huge amount of video data available these days requires effective management techniques for storage, indexing, and retrieval. Video summarization, a method to manage video data, provides concise versions of the videos for efficient browsing and retrieval. Key frame extraction is a form of video summarization which selects only the most salient frames from a given video. Since...

Anchored neighborhood deep network for single-image super-resolution

Real-time image and video processing is a challenging problem in smart surveillance applications. It is necessary to trade off between high frame rate and high resolution to meet the limited bandwidth requirement in many specific applications. Thus, image super-resolution become one commonly used techniques in surveillance platform. The existing image super-resolution methods...

Variational approach for capsule video frame interpolation

Capsule video endoscopy, which uses a wireless camera to visualize the digestive tract, is emerging as an alternative to traditional colonoscopy. Colonoscopy is considered as the gold standard for visualizing the colon and takes 30 frames per second. Capsule images, on the other hand, are taken with low frame rate (average five frames per second), which makes it difficult to find...

An emergency task autonomous planning method of agile imaging satellite

As the number of satellite emergency imaging tasks grows, the main goal of satellites becomes putting forward solutions and meeting users’ demands in a relatively short time. This study aims to investigate the problem of emergency task planning for agile satellites. Through the analysis of the problem and its constraints, a model of emergency task autonomous planning was...

A novel texture-based asymmetric visibility threshold model for stereoscopic video coding

Asymmetric stereoscopic video coding is becoming increasingly popular, as it can reduce the bandwidth required for stereoscopic 3D delivery without degrading the visual quality. Based on the perceptual theory of binocular suppression, the left and right views of stereoscopic video are coded with different levels of quality. However, existing asymmetric perceptual coding...

Unpaved road detection based on spatial fuzzy clustering algorithm

Vision-based unpaved road detection is a challenging task due to the complex nature scene. In this paper, a novel algorithm is proposed to improve the accuracy and robustness of unpaved road detection and boundary extraction with low computational costs. The novelties of this paper are as follows: (1) We use a normal distribution with infrared images to detect the vanishing line...

Improved BM3D image denoising using SSIM-optimized Wiener filter

Image denoising is considered a salient pre-processing step in sophisticated imaging applications. Over the decades, numerous studies have been conducted in denoising. Recently proposed Block matching and 3D (BM3D) filtering added a new dimension to the study of denoising. BM3D is the current state-of-the-art of denoising and is capable of achieving better denoising as compared...

Triple Threshold Statistical Detection filter for removing high density random-valued impulse noise in images

This study presents a novel noise detection algorithm which satisfactorily detects noisy pixels in images corrupted by random-valued impulse noise of high levels up to 80% noise density. Three levels of adaptive thresholds along with an auxiliary condition are used in this method which adequately addresses the drawbacks of existing methods, especially the miss detection of noise...

Color sensors and their applications based on real-time color image segmentation for cyber physical systems

Color information plays an important role in the color image segmentation and real-time color sensor, which affects the result of video image segmentation and correct real-time temperature value. In this paper, a novel real-time color image segmentation method is proposed, which is based on color similarity in RGB color space. According to the color and luminance information in...

A neighborhood regression approach for removing multiple types of noises

Image denoising is an important first step to provide cleaned images for follow-up tasks such as image segmentation and object recognition. Many image denoising filters have been proposed, with most of the filters focusing on one particular type of additive or multiplicative noise. In this article, we propose a novel neighborhood regression approach. Using the neighboring pixels...

Automated approach for splicing detection using first digit probability distribution features

Digital image tampering operations destroy inbuilt fingerprints and create own new fingerprint in the tampered region. Considering the Internet speed and storage space, most of the images are circulated in the JPEG format. In a single compressed JPEG image, the first digits of DCT coefficients follow a logarithmic distribution. This distribution is not followed by DCT...

Optimized buffer allocation for video multicasting applications with virtual memory implementation

Memory requirement is a key issue when servicing more number of nodes in a heterogeneous environment. We have to ensure an optimum buffer size in order to reduce the initial latency. In this paper, we propose a novel approach, which involves, augmenting a virtual memory (VM) with the existing physical memory of the video buffer. In addition to the benefits of improved queuing...

Hands-on: deformable pose and motion models for spatiotemporal localization of fine-grained dyadic interactions

We introduce a novel spatiotemporal deformable part model for the localization of fine-grained human interactions of two persons in unsegmented videos. Our approach is the first to classify interactions and additionally provide the temporal and spatial extent of the interaction in the video. To this end, our models contain part detectors that support different scales as well as...

Hierarchical semantic segmentation of image scene with object labeling

Semantic segmentation of an image scene provides semantic information of image regions while less information of objects. In this paper, we propose a method of hierarchical semantic segmentation, including scene level and object level, which aims at labeling both scene regions and objects in an image. In the scene level, we use a feature-based MRF model to recognize the scene...

SIP-FS: a novel feature selection for data representation

Multiple features are widely used to characterize real-world datasets. It is desirable to select leading features with stability and interpretability from a set of distinct features for a comprehensive data description. However, most of existing feature selection methods focus on the predictability (e.g., prediction accuracy) of selected results yet neglect stability. To obtain...