EURASIP Journal on Image and Video Processing

http://link.springer.com/journal/13640

List of Papers (Total 528)

Noise-resistant network: a deep-learning method for face recognition under noise

With the development of deep learning, many architectures have recently been proposed for face recognition, and some even approach human performance. However, accurately recognizing an identity from seriously noisy face images remains a challenge. In this paper, we propose a carefully designed deep neural network coined noise-resistant network (NR-Network) for face...

Improved gradient local ternary patterns for facial expression recognition

Automated human emotion detection is a topic of significant interest in the field of computer vision. Over the past decade, much emphasis has been on using facial expression recognition (FER) to extract emotion from facial expressions. Many popular appearance-based methods such as local binary pattern (LBP), local directional pattern (LDP) and local ternary pattern (LTP) have...

Adaptive video streaming over HTTP through 4G wireless networks based on buffer analysis

One of the main challenges in video transmission is understanding and adapting to the varying network bandwidth. The traditional approach to bandwidth estimation is inaccurate because many factors, such as congestion, can delay the arrival of the ping packet, which may give the false impression that the bandwidth is low. Thus, the better approach to this problem will be...

Object detection using ensemble of linear classifiers with fuzzy adaptive boosting

AdaBoost (Freund and Schapire, Eur. Conf. Comput. Learn. Theory 23–37, 1995) chooses a good set of weak classifiers in rounds. In each round, it chooses the optimal classifier (optimal feature and its threshold value) by minimizing the weighted classification error. It also reweights the training data so that the next round focuses on data that are difficult to classify...
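The round structure described in this abstract can be illustrated with a minimal sketch of standard discrete AdaBoost over threshold classifiers (decision stumps). This is not the fuzzy adaptive boosting variant the paper proposes; the function names `stump_predict`, `train_adaboost`, and `predict`, the exhaustive threshold search, and the toy data are all assumptions made for illustration.

```python
import math

def stump_predict(x, feature, threshold, sign):
    """Weak classifier: returns sign if x[feature] >= threshold, else -sign."""
    return sign if x[feature] >= threshold else -sign

def train_adaboost(X, y, rounds):
    """Discrete AdaBoost with decision stumps (illustrative sketch).

    X: list of feature vectors, y: labels in {-1, +1}.
    Returns a list of (alpha, feature, threshold, sign) tuples.
    """
    n = len(X)
    weights = [1.0 / n] * n          # start with uniform sample weights
    ensemble = []
    for _ in range(rounds):
        # Pick the stump (feature, threshold, sign) with the lowest weighted error.
        best = None
        for feature in range(len(X[0])):
            for threshold in sorted({x[feature] for x in X}):
                for sign in (1, -1):
                    err = sum(w for w, xi, yi in zip(weights, X, y)
                              if stump_predict(xi, feature, threshold, sign) != yi)
                    if best is None or err < best[0]:
                        best = (err, feature, threshold, sign)
        err, feature, threshold, sign = best
        err = min(max(err, 1e-10), 1 - 1e-10)   # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)  # vote weight of this stump
        ensemble.append((alpha, feature, threshold, sign))
        # Reweight: misclassified samples gain weight, correct ones lose weight.
        weights = [w * math.exp(-alpha * yi * stump_predict(xi, feature, threshold, sign))
                   for w, xi, yi in zip(weights, X, y)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble

def predict(ensemble, x):
    """Sign of the alpha-weighted vote of all stumps."""
    score = sum(alpha * stump_predict(x, f, t, s) for alpha, f, t, s in ensemble)
    return 1 if score >= 0 else -1

# Toy 1-D data that no single threshold can separate (hypothetical example).
X = [[0], [1], [2], [3], [4], [5]]
y = [1, 1, -1, -1, 1, 1]
model = train_adaboost(X, y, rounds=3)
print([predict(model, x) for x in X])
```

On this toy set a single stump misclassifies at least two points, while three boosted rounds combine stumps that each fix the errors emphasized by the previous reweighting.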

Efficient pavement crack detection and classification

Each year, millions of dollars are invested in road maintenance and repair all over the world. To minimize costs, one of the main aspects is the early detection of road flaws. Different types of cracks require different types of repairs; therefore, not only crack detection but also crack type classification is required. Also, the earlier the crack is detected, the...

Damaged region filling and evaluation by symmetrical exemplar-based image inpainting for Thangka

Exemplar-based image inpainting, as proposed by Criminisi et al. (IEEE Trans Image Process 13(9):1200–1212, 2004), fills missing regions by using a similar exemplar. However, when the missing region is a unique texture patch, an incorrect texture is filled in the missing region because a similar exemplar of damaged patch could not be found. A new image inpainting method based on...

High performance and low complexity decoding light-weight video coding with motion estimation and mode decision at decoder

Light-weight video coding (LVC) follows distributed video coding (DVC) and is designed to move computational complexity from the encoder to the decoder, thus yielding a low-complexity encoder. The main components transferred are the encoder algorithms with high computational complexity in traditional video coding, namely motion estimation and mode decision. In order to alleviate...

Lossy image compression based on prediction error and vector quantisation

Lossy image compression has been gaining importance in recent years due to the enormous increase in the volume of image data employed for Internet and other applications. In a lossy compression, it is essential to ensure that the compression process does not affect the quality of the image adversely. The performance of a lossy compression algorithm is evaluated based on two...

Research of segmentation method on color image of Lingwu long jujubes based on the maximum entropy

This paper studies a color image segmentation method for Lingwu long jujubes based on maximum entropy, with the aim of achieving accurate image segmentation and improving the accuracy of machine recognition. Based on the relationship between the color of Lingwu long jujubes and the characteristics of the environment, and starting from the hue information, this paper is the first to explore the...

A variant of the Hough Transform for the combined detection of corners, segments, and polylines

The Hough Transform (HT) is an effective and popular technique for detecting image features such as lines and curves. From its standard form, numerous variants have emerged with the objective, in many cases, of extending the kind of image features that could be detected. Particularly, corner and line segment detection using HT has been separately addressed by several approaches...

Robust image hashing through DWT-SVD and spectral residual method

In the last few decades, the discovery of various methods for generating secure image hashes has been revolutionary in the field of image hashing. This paper presents an efficient approach to obtaining an image hash through DWT-SVD and a saliency detection technique using the spectral residual model. The latest image hashing technique based on ring partition and invariant vector distance...

DTCTH: a discriminative local pattern descriptor for image classification

Despite lots of effort being exerted in designing feature descriptors, it is still challenging to find generalized feature descriptors, with acceptable discrimination ability, which are able to capture prominent features in various image processing applications. To address this issue, we propose a computationally feasible discriminative ternary census transform histogram (DTCTH...

Recursive non-local means filter for video denoising

In this paper, we propose a computationally efficient algorithm for video denoising that exploits temporal and spatial redundancy. The proposed method is based on non-local means (NLM). NLM methods have been applied successfully in various image denoising applications. In the single-frame NLM method, each output pixel is formed as a weighted sum of the center pixels of...

Glyph-based video visualization on Google Map for surveillance in smart cities

Video visualization (VV) is considered to be an essential part of multimedia visual analytics. Many challenges have arisen from the enormous volume of video content from cameras; these can be addressed with the help of data analytics, which is hence gaining importance. However, the rapid advancement of digital technologies has resulted in an explosion of video data, which stimulates the need for...

Refining deep convolutional features for improving fine-grained image recognition

Fine-grained image recognition, a computer vision task filled with challenges due to its imperceptible inter-class variance and large intra-class variance, has been drawing increasing attention. While manual annotation can be utilized to effectively enhance performance in this task, it is extremely time-consuming and expensive. Recently, Convolutional Neural Networks (CNN...

Online motion smoothing for video stabilization via constrained multiple-model estimation

Video stabilization smooths camera motion estimates in a way that should adapt to different types of intentional motion. Corrective motion (the difference between smoothed and original motions) should be constrained so that black borders do not intrude into the (cropped) stabilized frames. Although offline smoothing can use all of the frames, online (real-time) smoothing can only...
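The constraint on corrective motion mentioned in this abstract can be sketched in one dimension. The sketch below is not the paper's constrained multiple-model estimator: it swaps in a plain exponential moving average as the online smoother, and the names `stabilize_online`, `alpha`, and `max_shift` (the crop margin in pixels) are hypothetical.

```python
def stabilize_online(path, alpha=0.9, max_shift=10.0):
    """Online 1-D camera path smoothing with a bounded correction (sketch).

    path:      per-frame camera positions (e.g. horizontal offset in pixels)
    alpha:     smoothing factor of the exponential moving average (assumed)
    max_shift: largest correction the crop margin can absorb (assumed)
    Returns (smoothed_path, corrections).
    """
    smoothed, corrections = [], []
    state = None
    for position in path:
        # Causal smoothing: only the current and past frames are used.
        state = position if state is None else alpha * state + (1 - alpha) * position
        # Corrective motion = smoothed motion minus original motion.
        correction = state - position
        # Clamp so the shifted frame never exposes a black border.
        correction = max(-max_shift, min(max_shift, correction))
        # Keep the filter state consistent with what was actually applied.
        state = position + correction
        smoothed.append(state)
        corrections.append(correction)
    return smoothed, corrections

# A sudden intentional pan of 100 px (hypothetical data).
smoothed, corrections = stabilize_online([0, 0, 0, 100, 100, 100])
print(corrections)
```

When the camera pans abruptly, the unclamped smoother would lag far behind the true position; clamping forces the smoothed path to follow the pan once the margin is used up.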

Segmentation method based on multiobjective optimization for very high spatial resolution satellite images

In this paper, a new multicriterion segmentation method is proposed for very high spatial resolution (VHSR) satellite images. It consists of the following process: for each region of the grayscale image, a center of gravity is calculated and a threshold is selected for its histogram. According to certain criteria, this approach has...

A novel architecture for parallel multi-view HEVC decoder on mobile device

The multi-view HEVC (MV-HEVC) extension was finalized in July of 2014 by the Moving Picture Experts Group and the Video Coding Experts Group. Recently, multi-view videos based on stereo representations are becoming widely popular. Also, a variety of multimedia contents are now available for mobile devices. A real-time multi-view video decoder is therefore needed. In mobile...

Instant messaging with emotion-embedded vectorized handwritings on mobile devices

Generally, handwriting can reflect a writer's personality, thoughts, and emotions, i.e., handwriting can deliver messages that carry emotion and sincerity. However, text messages and notes such as emails and instant messages are replacing handwritten letters and notes in communication due to the popularity and availability of mobile devices and personal computers. Furthermore, the...

Image analysis using new set of separable two-dimensional discrete orthogonal moments based on Racah polynomials

In this paper, we propose three new separable two-dimensional discrete orthogonal moments, named RTM (Racah-Tchebichef moments), RKM (Racah-Krawtchouk moments), and RdHM (Racah-dual Hahn moments). We present a comparative study between our proposed separable two-dimensional discrete orthogonal moments and the classical ones, in terms of gray-level image reconstruction accuracy...

Selection of optimal pixel resolution for landslide susceptibility analysis within the Bukit Antarabangsa, Kuala Lumpur, by using image processing and multivariate statistical tools

Landslides are considered one of the natural hazards responsible for casualties and damage to assets and infrastructure. In many situations, collecting field data from remote places is difficult because the landslide area is inaccessible. This paper examines landslide susceptibility in Bukit Antarabangsa, Kuala Lumpur, to ease geographical studies, using image processing...

Enhanced resampling detection based on image correlation of 3D stereoscopic images

In this paper, we propose a resampling detection method for stereoscopic images. Although previous resampling techniques can be applied to stereoscopic images, little performance improvement can be expected from the two separate results. In this research, we found a strong relationship between the left and right images derived from the characteristics of the stereoscopic images...

Querying geo-tagged videos for vision applications using spatial metadata

In this paper, we propose a novel geospatial image and video filtering tool (GIFT) to select the most relevant input images and videos for computer vision applications with geo-tagged mobile videos. GIFT tightly couples mobile media content and their geospatial metadata for fine granularity video manipulation in the spatial and temporal domain and intelligently indexes field of...