Computer Vision and Pattern Recognition

  1. Automatic Clustering with Single Optimal Solution.

    Authors: K. Karteeka Pavan, Allam Appa Rao, A. V. Dattatreya Rao
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Determining optimal number of clusters in a dataset is a challenging task.
    Though some methods are available, there is no algorithm that produces unique
    clustering solution. The paper proposes an Automatic Merging for Single Optimal
    Solution (AMSOS) which aims to generate unique and nearly optimal clusters for
    the given datasets automatically. The AMSOS is iteratively merges the closest
    clusters automatically by validating with cluster validity measure to find
    single and nearly optimal clusters for the given data set.

  2. Robust seed selection algorithm for k-means type algorithms.

    Authors: K. Karteeka Pavan, Allam Appa Rao, A.V. Dattatreya Rao, G.R.Sridhar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Selection of initial seeds greatly affects the quality of the clusters and in
    k-means type algorithms. Most of the seed selection methods result different
    results in different independent runs. We propose a single, optimal, outlier
    insensitive seed selection algorithm for k-means type algorithms as extension
    to k-means++. The experimental results on synthetic, real and on microarray
    data sets demonstrated that effectiveness of the new algorithm in producing the
    clustering results

  3. Comparing Methods for segmentation of Microcalcification Clusters in Digitized Mammograms.

    Authors: Hajar Moradmand, Saeed Setayeshi, Hossein Khazaei Targhi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The appearance of microcalcifications in mammograms is one of the early signs
    of breast cancer. So, early detection of microcalcification clusters (MCCs) in
    mammograms can be helpful for cancer diagnosis and better treatment of breast
    cancer. In this paper a computer method has been proposed to support
    radiologists in detection MCCs in digital mammography. First, in order to
    facilitate and improve the detection step, mammogram images have been enhanced
    with wavelet transformation and morphology operation. Then for segmentation of
    suspicious MCCs, two methods have been investigated.

  4. Task-Driven Adaptive Statistical Compressive Sensing of Gaussian Mixture Models.

    Authors: Guillermo Sapiro, Guoshen Yu, Lawrence Carin, Julio M. Duarte-Carvajalino
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A framework for adaptive and non-adaptive statistical compressive sensing is
    developed, where a statistical model replaces the standard sparsity model of
    classical compressive sensing. We propose within this framework optimal
    task-specific sensing protocols specifically and jointly designed for
    classification and reconstruction. A two-step adaptive sensing paradigm is
    developed, where online sensing is applied to detect the signal class in the
    first step, followed by a reconstruction step adapted to the detected class and
    the observed samples.

  5. Compressive Acquisition of Dynamic Scenes.

    Authors: Aswin C Sankaranarayanan, Pavan K Turaga, Rama Chellappa, Richard G Baraniuk
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Compressive sensing (CS) is a new approach for the acquisition and recovery
    of sparse signals and images that enables sampling rates significantly below
    the classical Nyquist rate. Despite significant progress in the theory and
    methods of CS, little headway has been made in compressive video acquisition
    and recovery. Video CS is complicated by the ephemeral nature of dynamic
    events, which makes direct extensions of standard CS imaging architectures and
    signal models difficult.

  6. A Novel Approach to Fast Image Filtering Algorithm of Infrared Images based on Intro Sort Algorithm.

    Authors: Kapil Kumar Gupta, Rizwan Beg, Jitendra Kumar Niranjan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this study we investigate the fast image filtering algorithm based on
    Intro sort algorithm and fast noise reduction of infrared images. Main feature
    of the proposed approach is that no prior knowledge of noise required. It is
    developed based on Stefan- Boltzmann law and the Fourier law. We also
    investigate the fast noise reduction approach that has advantage of less
    computation load. In addition, it can retain edges, details, text information
    even if the size of the window increases.

  7. On the Lagrangian Biduality of Sparsity Minimization Problems.

    Authors: S. Shankar Sastry, Allen Y. Yang, Dheeraj Singaraju, Ehsan Elhamifar, Roberto Tron
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Recent results in Compressive Sensing have shown that, under certain
    conditions, the solution to an underdetermined system of linear equations with
    sparsity-based regularization can be accurately recovered by solving convex
    relaxations of the original problem. In this work, we present a novel
    primal-dual analysis on a class of sparsity minimization problems.

  8. G-Lets: Signal Processing Using Transformation Groups.

    Authors: B.Rajathilagam, Murali Rangarajan, K.P.Soman
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We present an algorithm using transformation groups and their irreducible
    representations to generate an orthogonal basis for a signal in the vector
    space of the signal. It is shown that multiresolution analysis can be done with
    amplitudes using a transformation group. G-lets is thus not a single transform,
    but a group of linear transformations related by group theory. The algorithm
    also specifies that a multiresolution and multiscale analysis for each
    resolution is possible in terms of frequencies.

  9. Autonomous Cleaning of Corrupted Scanned Documents - A Generative Modeling Approach.

    Authors: Jörg Lücke, Zhenwen Dai
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We study the task of cleaning scanned text documents that are strongly
    corrupted by dirt such as manual line strokes, spilled ink etc. We aim at
    autonomously removing dirt from a single letter-size page based only on the
    information the page contains. Our approach, therefore, has to learn character
    representations without supervision and requires a mechanism to distinguish
    learned representations from irregular patterns.

  10. Re-initialization Free Level Set Evolution via Reaction Diffusion.

    Authors: Kaihua Zhang, Lei Zhang, Huihui Song, David Zhang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a novel reaction-diffusion (RD) method for implicit
    active contours, which is completely free of the costly re-initialization
    procedure in level set evolution (LSE). A diffusion term is introduced into
    LSE, resulting in a RD-LSE equation, to which a piecewise constant solution can
    be derived. In order to have a stable numerical solution of the RD based LSE,
    we propose a two-step splitting method (TSSM) to iteratively solve the RD-LSE
    equation: first iterating the LSE equation, and then solving the diffusion
    equation.

  11. POCS Based Super-Resolution Image Reconstruction Using an Adaptive Regularization Parameter.

    Authors: S.S. Panda, M.S.R.S Prasad, G. Jena
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Crucial information barely visible to the human eye is often embedded in a
    series of low-resolution images taken of the same scene. Super-resolution
    enables the extraction of this information by reconstructing a single image, at
    a high resolution than is present in any of the individual images. This is
    particularly useful in forensic imaging, where the extraction of minute details
    in an image can help to solve a crime.

  12. Classification with Invariant Scattering Representations.

    Authors: Stéphane Mallat, Joan Bruna
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A scattering transform defines a signal representation which is invariant to
    translations and Lipschitz continuous relatively to deformations. It is
    implemented with a non-linear convolution network that iterates over wavelet
    and modulus operators. Lipschitz continuity locally linearizes deformations.
    Complex classes of signals and textures can be modeled with low-dimensional
    affine spaces, computed with a PCA in the scattering domain. Classification is
    performed with a penalized model selection.

  13. Distributed Representation of Geometrically Correlated Images with Compressed Linear Measurements.

    Authors: Pascal Frossard, Vijayaraghavan Thirumalai
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper addresses the problem of distributed coding of images whose
    correlation is driven by the motion of objects or positioning of the vision
    sensors. It concentrates on the problem where images are encoded with
    compressed linear measurements. We propose a geometry-based correlation model
    in order to describe the common information in pairs of images. We assume that
    the constitutive components of natural images can be captured by visual
    features that undergo local transformations (e.g., translation) in different
    images.

  14. A New IRIS Normalization Process For Recognition System With Cryptographic Techniques.

    Authors: S. Nithyanandam, K. S. Gayathri, P. L. K. Priyadarshini
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Biometric technologies are the foundation of personal identification systems.
    It provides an identification based on a unique feature possessed by the
    individual. This paper provides a walkthrough for image acquisition,
    segmentation, normalization, feature extraction and matching based on the Human
    Iris imaging. A Canny Edge Detection scheme and a Circular Hough Transform, is
    used to detect the iris boundaries in the eye's digital image. The extracted
    IRIS region was normalized by using Image Registration technique.

  15. A Theory for Optical flow-based Transport on Image Manifolds.

    Authors: Richard G. Baraniuk, Sriram Nagaraj, Aswin C. Sankaranarayanan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    An image articulation manifold (IAM) is the collection of images formed when
    an object is articulated in front of a camera. IAMs arise in a variety of image
    processing and computer vision applications, where they provide a natural
    low-dimensional embedding of the collection of high-dimensional images.

  16. The Object Projection Feature Estimation Problem in Unsupervised Markerless 3D Motion Tracking.

    Authors: Luis Quesada, Alejandro J. León
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    3D motion tracking is a critical task in many computer vision applications.
    Existing 3D motion tracking techniques require either a great amount of
    knowledge on the target object or specific hardware. These requirements
    discourage the wide spread of commercial applications based on 3D motion
    tracking. 3D motion tracking systems that require no knowledge on the target
    object and run on a single low-budget camera require estimations of the object
    projection features (namely, area and position).

  17. A Novel Approach to Texture classification using statistical feature.

    Authors: V. Subbiah Bharathi, B. Vijayalakshmi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Texture is an important spatial feature which plays a vital role in content
    based image retrieval. The enormous growth of the internet and the wide use of
    digital data have increased the need for both efficient image database creation
    and retrieval procedure. This paper describes a new approach for texture
    classification by combining statistical texture features of Local Binary
    Pattern and Texture spectrum.

  18. New Method for 3D Shape Retrieval.

    Authors: Abdelghni Lakehal, Omar El Beqqali
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The recent technological progress in acquisition, modeling and processing of
    3D data leads to the proliferation of a large number of 3D objects databases.
    Consequently, the techniques used for content based 3D retrieval has become
    necessary. In this paper, we introduce a new method for 3D objects recognition
    and retrieval by using a set of binary images CLI (Characteristic level
    images). We propose a 3D indexing and search approach based on the similarity
    between characteristic level images using Hu moments for it indexing.

  19. Covariant fractional extension of the modified Laplace-operator used in 3D-shape recovery.

    Authors: Richard Herrmann
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Extending the Liouville-Caputo definition of a fractional derivative to a
    nonlocal covariant generalization of arbitrary bound operators acting on
    multidimensional Riemannian spaces an appropriate approach for the 3D shape
    recovery of aperture afflicted 2D slide sequences is proposed. We demonstrate,
    that the step from a local to a nonlocal algorithm yields an order of magnitude
    in accuracy and by using the specific fractional approach an additional factor
    2 in accuracy of the derived results.

  20. Sparsity and Robustness in Face Recognition.

    Authors: John Wright, Arvind Ganesh, Yi Ma, Zihan Zhou, Allen Yang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This report concerns the use of techniques for sparse signal representation
    and sparse error correction for automatic face recognition. Much of the recent
    interest in these techniques comes from the paper "Robust Face Recognition via
    Sparse Representation" by Wright et al. (2009), which showed how, under certain
    technical conditions, one could cast the face recognition problem as one of
    seeking a sparse representation of a given input face image in terms of a
    "dictionary" of training images and images of individual pixels.

  21. Graph Regularized Nonnegative Matrix Factorization for Hyperspectral Data Unmixing.

    Authors: Roozbeh Rajabi, Mahdi Khodadadzadeh, Hassan Ghassemian
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Spectral unmixing is an important tool in hyperspectral data analysis for
    estimating endmembers and abundance fractions in a mixed pixel. This paper
    examines the applicability of a recently developed algorithm called graph
    regularized nonnegative matrix factorization (GNMF) for this aim. The proposed
    approach exploits the intrinsic geometrical structure of the data besides
    considering positivity and full additivity constraints. Simulated data based on
    the measured spectral signatures, is used for evaluating the proposed
    algorithm.

  22. Text-Independent Speaker Recognition for Low SNR Environments with Encryption.

    Authors: Aman Chadha, Divya Jyoti, M. Mani Roja
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Recognition systems are commonly designed to authenticate users at the access
    control levels of a system. A number of voice recognition methods have been
    developed using a pitch estimation process which are very vulnerable in low
    Signal to Noise Ratio (SNR) environments thus, these programs fail to provide
    the desired level of accuracy and robustness. Also, most text independent
    speaker recognition programs are incapable of coping with unauthorized attempts
    to gain access by tampering with the samples or reference database.

  23. The proximal point method for a hybrid model in image restoration.

    Authors: Zhi-Feng Pang, Li-Lian Wang, Yu-Fei Yang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Models including two $L^1$ -norm terms have been widely used in image
    restoration. In this paper we first propose the alternating direction method of
    multipliers (ADMM) to solve this class of models. Based on ADMM, we then
    propose the proximal point method (PPM), which is more efficient than ADMM.
    Following the operator theory, we also give the convergence analysis of the
    proposed methods. Furthermore, we use the proposed methods to solve a class of
    hybrid models combining the ROF model with the LLT model.

  24. Linearized Additive Classifiers.

    Authors: Subhransu Maji
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We revisit the additive model learning literature and adapt a penalized
    spline formulation due to Eilers and Marx, to train additive classifiers
    efficiently. We also propose two new embeddings based two classes of orthogonal
    basis with orthogonal derivatives, which can also be used to efficiently learn
    additive classifiers. This paper follows the popular theme in the current
    literature where kernel SVMs are learned much more efficiently using a
    approximate embedding and linear machine.

  25. Non-Gaussian Scale Space Filtering with 2 by 2 Matrix of Linear Filters.

    Authors: Toshiro Kubota
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Construction of a scale space with a convolution filter has been studied
    extensively in the past. It has been proven that the only convolution kernel
    that satisfies the scale space requirements is a Gaussian type. In this paper,
    we consider a matrix of convolution filters introduced in [1] as a building
    kernel for a scale space, and shows that we can construct a non-Gaussian scale
    space with a $2\times 2$ matrix of filters. The paper derives sufficient
    conditions for the matrix of filters for being a scale space kernel, and
    present some numerical demonstrations.

  26. Learning to relate images: Mapping units, complex cells and simultaneous eigenspaces.

    Authors: Roland Memisevic
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A fundamental operation in many vision tasks, including motion understanding,
    stereopsis, visual odometry, or invariant recognition, is establishing
    correspondences between images or between images and data from other
    modalities. We present an analysis of the role that multiplicative interactions
    play in learning such correspondences, and we show how learning and inferring
    relationships between images can be viewed as detecting rotations in the
    eigenspaces shared among a set of orthogonal matrices.

  27. A Probabilistic Framework for Discriminative Dictionary Learning.

    Authors: Bernard Ghanem, Narendra Ahuja
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, we address the problem of discriminative dictionary learning
    (DDL), where sparse linear representation and classification are combined in a
    probabilistic framework. As such, a single discriminative dictionary and linear
    binary classifiers are learned jointly. By encoding sparse representation and
    discriminative classification models in a MAP setting, we propose a general
    optimization framework that allows for a data-driven tradeoff between faithful
    representation and accurate classification.

  28. Curvature Prior for MRF-based Segmentation and Shape Inpainting.

    Authors: Pushmeet Kohli, Carsten Rother, Alexander Shekhovtsov
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Most image labeling problems such as segmentation and image reconstruction
    are fundamentally ill-posed and suffer from ambiguities and noise. Higher order
    image priors encode high level structural dependencies between pixels and are
    key to overcoming these problems. However, these priors in general lead to
    computationally intractable models. This paper addresses the problem of
    discovering compact representations of higher order priors which allow
    efficient inference.

  29. Toward Parts-Based Scene Understanding with Pixel-Support Parts-Sparse Pictorial Structures.

    Authors: Jason J. Corso
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Scene understanding remains a significant challenge in the computer vision
    community. The visual psychophysics literature has demonstrated the importance
    of interdependence among parts of the scene. Yet, the majority of methods in
    computer vision remain local. Pictorial structures have arisen as a fundamental
    parts-based model for some vision problems, such as articulated object
    detection. However, the form of classical pictorial structures limits their
    applicability for global problems, such as semantic pixel labeling.

  30. Hierarchical Object Parsing from Noisy Point Clouds.

    Authors: Adrian Barbu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Object parsing and segmentation from point clouds are challenging tasks
    because the relevant data is available only as thin structures along object
    boundaries or other object features and is corrupted by large amounts of noise.
    One way to handle this kind of data is by employing shape models that can
    accurately follow the object boundaries.

  31. A Invertible Dimension Reduction of Curves on a Manifold.

    Authors: Sheng Yi, Hamid Krim, Larry K. Norris
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, we propose a novel lower dimensional representation of a shape
    sequence. The proposed dimension reduction is invertible and computationally
    more efficient in comparison to other related works. Theoretically, the
    differential geometry tools such as moving frame and parallel transportation
    are successfully adapted into the dimension reduction problem of high
    dimensional curves.

  32. Confidence-Based Dynamic Classifier Combination For Mean-Shift Tracking.

    Authors: Hakan Erdogan, Ibrahim Saygin Topkaya
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We introduce a novel tracking technique which uses dynamic confidence-based
    fusion of two different information sources for robust and efficient tracking
    of visual objects. Mean-shift tracking is a popular and well known method used
    in object tracking problems. Originally, the algorithm uses a similarity
    measure which is optimized by shifting a search area to the center of a
    generated weight image to track objects. Recent improvements on the original
    mean-shift algorithm involves using a classifier that differentiates the object
    from its surroundings.

  33. Filtering algorithms using shiftable kernels.

    Authors: Kunal Narayan Chaudhury
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    It was recently demonstrated in [4][arxiv:1105.4204] that the non-linear
    bilateral filter \cite{Tomasi} can be efficiently implemented using an O(1) or
    constant-time algorithm. At the heart of this algorithm was the idea of
    approximating the Gaussian range kernel of the bilateral filter using
    trigonometric functions. In this letter, we explain how the idea in [4] can be
    extended to few other linear and non-linear filters [18,21,2]. While some of
    these filters have received a lot of attention in recent years, they are known
    to be computationally intensive.

  34. Efficient variational inference in large-scale Bayesian compressed sensing.

    Authors: George Papandreou, Alan Yuille
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We study linear models under heavy-tailed priors from a probabilistic
    viewpoint. Instead of computing a single sparse most probable (MAP) solution as
    in standard compressed sensing, the focus in the Bayesian framework shifts
    towards capturing the full posterior distribution on the latent variables,
    which allows quantifying the estimation uncertainty and learning model
    parameters using maximum likelihood. The exact posterior distribution under the
    sparse linear model is intractable and we concentrate on a number of
    alternative variational Bayesian techniques to approximate it.

  35. The IHS Transformations Based Image Fusion.

    Authors: N.V. Kalyankar, Firouz Abdullah Al-Wassai, Ali A. Al-Zuky
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The IHS sharpening technique is one of the most commonly used techniques for
    sharpening. Different transformations have been developed to transfer a color
    image from the RGB space to the IHS space. Through literature, it appears that,
    various scientists proposed alternative IHS transformations and many papers
    have reported good results whereas others show bad ones as will as not those
    obtained which the formula of IHS transformation were used. In addition to
    that, many papers show different formulas of transformation matrix such as IHS
    transformation.

  36. Exploring New Directions in Iris Recognition.

    Authors: Nicolaie Popescu-Bodorin
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A new approach in iris recognition based on Circular Fuzzy Iris Segmentation
    (CFIS) and Gabor Analytic Iris Texture Binary Encoder (GAITBE) is proposed and
    tested here. CFIS procedure is designed to guarantee that similar iris segments
    will be obtained for similar eye images, despite the fact that the degree of
    occlusion may vary from one image to another. Its result is a circular iris
    ring (concentric with the pupil) which approximates the actual iris. GAITBE
    proves better encoding of statistical independence between the iris codes
    extracted from different irides using Hilbert Transform.

  37. A Fuzzy View on k-Means Based Signal Quantization with Application in Iris Segmentation.

    Authors: Nicolaie Popescu-Bodorin
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper shows that the k-means quantization of a signal can be interpreted
    both as a crisp indicator function and as a fuzzy membership assignment
    describing fuzzy clusters and fuzzy boundaries. Combined crisp and fuzzy
    indicator functions are defined here as natural generalizations of the ordinary
    crisp and fuzzy indicator functions, respectively. An application to iris
    segmentation is presented together with a demo program.

  38. Analysis and Improvement of Low Rank Representation for Subspace segmentation.

    Authors: Wei Siming, Lin Zhouchen
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We analyze and improve low rank representation (LRR), the state-of-the-art
    algorithm for subspace segmentation of data. We prove that for the noiseless
    case, the optimization model of LRR has a unique solution, which is the shape
    interaction matrix (SIM) of the data matrix. So in essence LRR is equivalent to
    factorization methods. We also prove that the minimum value of the optimization
    model of LRR is equal to the rank of the data matrix. For the noisy case, we
    show that LRR can be approximated as a factorization method that combines noise
    removal by column sparse robust PCA.

  39. Spatial Features for Multi-Font/Multi-Size Kannada Numerals and Vowels Recognition.

    Authors: B.V. Dhandra, Mallikarjun Hangarge, Gururaj Mukarambi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents multi-font/multi-size Kannada numerals and vowels
    recognition based on spatial features. Directional spatial features viz stroke
    density, stroke length and the number of stokes in an image are employed as
    potential features to characterize the printed Kannada numerals and vowels.
    Based on these features 1100 numerals and 1400 vowels are classified with
    Multi-class Support Vector Machines (SVM). The proposed system achieves the
    recognition accuracy as 98.45% and 90.64% for numerals and vowels respectively.

  40. Online Vehicle Detection For Estimating Traffic Status.

    Authors: Ranch Y.Q. Lai
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We propose a traffic congestion estimation system based on unsupervised
    on-line learning algorithm. The system does not rely on background extraction
    or motion detection. It extracts local features inside detection regions of
    variable size which are drawn on lanes in advance. The extracted features are
    then clustered into two classes using K-means and Gaussian Mixture Models(GMM).
    A Bayes classifier is used to detect vehicles according to the previous cluster
    information which keeps updated whenever system is running by on-line EM
    algorithm.

  41. Automatic Road Lighting System (ARLS) Model Based on Image Processing of Captured Video of Vehicle Toy Motion.

    Authors: Suprijadi, Thomas Muliawan, Sparisoma Viridi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Using a vehicle toy as a moving object an automatic road lighting system
    (ARLS) model is constructed. A video camera with 25 fps is used to capture the
    vehicle toy motion as it moves in the test segment of the road. Captured images
    are then processed to calculate vehicle toy speed. This information of the
    speed together with position of vehicle toy is then used to switch on and off
    the lighting system along the path that passes by the vehicle toy.

  42. A navigation filter for fusing DTM/correspondence updates.

    Authors: Oleg Kupervasser
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    An algorithm for pose and motion estimation using corresponding features in
    images and a digital terrain map is proposed. Using a Digital Terrain (or
    Digital Elevation) Map (DTM/DEM) as a global reference enables recovering the
    absolute position and orientation of the camera. In order to do this, the DTM
    is used to formulate a constraint between corresponding features in two
    consecutive frames. The utilization of data is shown to improve the robustness
    and accuracy of the inertial navigation algorithm.

  43. High Performance Human Face Recognition using Independent High Intensity Gabor Wavelet Responses: A Statistical Approach.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Arindam Kar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, we present a technique by which high-intensity feature vectors
    extracted from the Gabor wavelet transformation of frontal face images, is
    combined together with Independent Component Analysis (ICA) for enhanced face
    recognition. Firstly, the high-intensity feature vectors are automatically
    extracted using the local characteristics of each individual face from the
    Gabor transformed images. Then ICA is applied on these locally extracted
    high-intensity feature vectors of the facial images to obtain the independent
    high intensity feature (IHIF) vectors.

  44. Next Level of Data Fusion for Human Face Recognition.

    Authors: Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik, Gautam Majumdar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper demonstrates two different fusion techniques at two different
    levels of a human face recognition process. The first one is called data fusion
    at lower level and the second one is the decision fusion towards the end of the
    recognition process. At first a data fusion is applied on visual and
    corresponding thermal images to generate fused image. Data fusion is
    implemented in the wavelet domain after decomposing the images through
    Daubechies wavelet coefficients (db2). During the data fusion maximum of
    approximate and other three details coefficients are merged together.

  45. Polar Fusion Technique Analysis for Evaluating the Performances of Image Fusion of Thermal and Visual Images for Human Face Recognition.

    Authors: Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a comparative study of two different methods, which are
    based on fusion and polar transformation of visual and thermal images. Here,
    investigation is done to handle the challenges of face recognition, which
    include pose variations, changes in facial expression, partial occlusions,
    variations in illumination, rotation through different angles, change in scale
    etc. To overcome these obstacles we have implemented and thoroughly examined
    two different fusion techniques through rigorous experimentation.

  46. Comparing Haar-Hilbert and Log-Gabor Based Iris Encoders on Bath Iris Image Database.

    Authors: Nicolaie Popescu-Bodorin, Valentina E. Balas
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This papers introduces a new family of iris encoders which use 2-dimensional
    Haar Wavelet Transform for noise attenuation, and Hilbert Transform to encode
    the iris texture. In order to prove the usefulness of the newly proposed iris
    encoding approach, the recognition results obtained by using these new encoders
    are compared to those obtained using the classical Log- Gabor iris encoder.
    Twelve tests involving single/multienrollment and conducted on Bath Iris Image
    Database are presented here.

  47. Cubical Cohomology Ring of 3D Photographs.

    Authors: Rocio Gonzalez-Diaz, Maria Jose Jimenez, Belen Medrano
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Cohomology and cohomology ring of three-dimensional (3D) objects are
    topological invariants that characterize holes and their relations. Cohomology
    ring has been traditionally computed on simplicial complexes. Nevertheless,
    cubical complexes deal directly with the voxels in 3D images, no additional
    triangulation is necessary, facilitating efficient algorithms for the
    computation of topological invariants in the image context. In this paper, we
    present formulas to directly compute the cohomology ring of 3D cubical
    complexes without making use of any additional triangulation.

  48. Fast O(1) bilateral filtering using trigonometric range kernels.

    Authors: Kunal Narayan Chaudhury, Michael Unser, Daniel Sage
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    It is well-known that spatial averaging can be realized (in space or
    frequency domain) using algorithms whose complexity does not depend on the size
    or shape of the filter. These fast algorithms are generally referred to as
    constant-time or O(1) algorithms in the image processing literature. Along with
    the spatial filter, the edge-preserving bilateral filter [bilateralFilter]
    involves an additional range kernel. This is used to restrict the averaging to
    those neighborhood pixels whose intensity are similar or close to that of the
    pixel of interest.

  49. Human Identity Verification based on Heart Sounds: Recent Advances and Future Directions.

    Authors: Francesco Beritelli, Andrea Spadaccini
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Identity verification is an increasingly important process in our daily
    lives, and biometric recognition is a natural solution to the authentication
    problem.

    One of the most important research directions in the field of biometrics is
    the characterization of novel biometric traits that can be used in conjunction
    with other traits, to limit their shortcomings or to enhance their performance.

  50. Invariant Representative Cocycles of Cohomology Generators using Irregular Graph Pyramids.

    Authors: Rocio Gonzalez-Diaz, Adrian Ion, Mabel Iglesias-Ham, Walter G. Kropatsch
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Structural pattern recognition describes and classifies data based on the
    relationships of features and parts. Topological invariants, like the Euler
    number, characterize the structure of objects of any dimension. Cohomology can
    provide more refined algebraic invariants to a topological space than does
    homology. It assigns `quantities' to the chains used in homology to
    characterize holes of any dimension. Graph pyramids can be used to describe
    subdivisions of the same object at multiple levels of detail.

  51. In Search of Autocorrelation Based Vocal Cord Cues for Speaker Identification.

    Authors: Md. Sahidullah, Goutam Saha
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we investigate a technique to find out vocal source based
    features from the LP residual of speech signal for automatic speaker
    identification. Autocorrelation with some specific lag is computed for the
    residual signal to derive these features. Compared to traditional features like
    MFCC, PLPCC which represent vocal tract information, these features represent
    complementary vocal cord information. Our experiment in fusing these two
    sources of information in representing speaker characteristics yield better
    speaker identification accuracy.

  52. Convex Approaches to Model Wavelet Sparsity Patterns.

    Authors: Robert D. Nowak, Nikhil S Rao, Stephen J. Wright, Nick G. Kingsbury
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Statistical dependencies among wavelet coefficients are commonly represented
    by graphical models such as hidden Markov trees(HMTs). However, in linear
    inverse problems such as deconvolution, tomography, and compressed sensing, the
    presence of a sensing or observation matrix produces a linear mixing of the
    simple Markovian dependency structure. This leads to reconstruction problems
    that are non-convex optimizations. Past work has dealt with this issue by
    resorting to greedy or suboptimal iterative reconstruction methods.

  53. GEOMIR2K9 - A Similar Scene Finder.

    Authors: Alwin de Rooij
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The main goal of the GEOMIR2K9 project is to create a software program that
    is able to find similar scenic images clustered by geographical location and
    sorted by similarity based only on their visual content. The user should be
    able to input a query image, based on this given query image the program should
    find relevant visual content and present this to the user in a meaningful way.
    Technically the goal for the GEOMIR2K9 project is twofold.

  54. Template-based matching using weight maps.

    Authors: Kwie Min Wong
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Template matching is one of the most prevalent pattern recognition methods
    worldwide. It has found uses in most visual concept detection fields. In this
    work, we investigate methods for improving template matching by adjusting the
    weights of different regions of the template. We compare several weight maps
    and test the methods using the FERET face test set in the context of human eye
    detection.

  55. Fuzzy Rules and Evidence Theory for Satellite Image Analysis.

    Authors: Arijit Laha, J. Das
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Design of a fuzzy rule based classifier is proposed. The performance of the
    classifier for multispectral satellite image classification is improved using
    Dempster- Shafer theory of evidence that exploits information of the
    neighboring pixels. The classifiers are tested rigorously with two known images
    and their performance are found to be better than the results available in the
    literature. We also demonstrate the improvement of performance while using D-S
    theory along with fuzzy rule based classifiers over the basic fuzzy rule based
    classifiers for all the test cases.

  56. Gaussian Affine Feature Detector.

    Authors: Xiaopeng Xu, Xiaochun Zhang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A new method is proposed to get image features' geometric information. Using
    Gaussian as an input signal, a theoretical optimal solution to calculate
    feature's affine shape is proposed. Based on analytic result of a feature
    model, the method is different from conventional iterative approaches. From the
    model, feature's parameters such as position, orientation, background
    luminance, contrast, area and aspect ratio can be extracted. Tested with
    synthesized and benchmark data, the method achieves or outperforms existing
    approaches in term of accuracy, speed and stability.

  57. Internal Constraints of the Trifocal Tensor.

    Authors: Stuart B. Heinrich, Wesley E. Snyder
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The fundamental matrix and trifocal tensor are convenient algebraic
    representations of the epipolar geometry of two and three view configurations,
    respectively. The estimation of these entities is central to most
    reconstruction algorithms, and a solid understanding of their properties and
    constraints is therefore very important. The fundamental matrix has 1 internal
    constraint which is well understood, whereas the trifocal tensor has 8
    independent algebraic constraints.

  58. Improved Edge Awareness in Discontinuity Preserving Smoothing.

    Authors: Stuart B. Heinrich, Wesley E. Snyder
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Discontinuity preserving smoothing is a fundamentally important procedure
    that is useful in a wide variety of image processing contexts. It is directly
    useful for noise reduction, and frequently used as an intermediate step in
    higher level algorithms. For example, it can be particularly useful in edge
    detection and segmentation. Three well known algorithms for discontinuity
    preserving smoothing are nonlinear anisotropic diffusion, bilateral filtering,
    and mean shift filtering.

  59. A Parametric Level Set Approach to Simultaneous Object Identification and Background Reconstruction for Dual Energy Computed Tomography.

    Authors: Oguz Semerci, Eric L. Miller
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Dual energy computerized tomography has gained great interest because of its
    ability to characterize the chemical composition of a material rather than
    simply providing relative attenuation images as in conventional tomography.

  60. A Comparison of Two Human Brain Tumor Segmentation Methods for MRI Data.

    Authors: Miriam H. A. Bauer, Jan Egger, Daniela Kuhnt, Bernd Freisleben, Christopher Nimsky, Dženan Zukić, Barbara Carl, Andreas Kolb
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The most common primary brain tumors are gliomas, evolving from the cerebral
    supportive cells. For clinical follow-up, the evaluation of the preoperative
    tumor volume is essential. Volumetric assessment of tumor volume with manual
    segmentation of its outlines is a time-consuming process that can be overcome
    with the help of computerized segmentation methods. In this contribution, two
    methods for World Health Organization (WHO) grade IV glioma segmentation in the
    human brain are compared using magnetic resonance imaging (MRI) patient data
    from the clinical routine.

  61. All Roads Lead To Rome.

    Authors: Xin Li
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This short article presents a class of projection-based solution algorithms
    to the problem considered in the pioneering work on compressed sensing -
    perfect reconstruction of a phantom image from 22 radial lines in the frequency
    domain. Under the framework of projection-based image reconstruction, we will
    show experimentally that several old and new tools of nonlinear filtering
    (including Perona-Malik diffusion, nonlinear diffusion, Translation-Invariant
    thresholding and SA-DCT thresholding) all lead to perfect reconstruction of the
    phantom image.

  62. Ray-Based and Graph-Based Methods for Fiber Bundle Boundary Estimation.

    Authors: Miriam H. A. Bauer, Jan Egger, Daniela Kuhnt, Sebastiano Barbieri, Jan Klein, Horst K. Hahn, Bernd Freisleben, Christopher Nimsky
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Diffusion Tensor Imaging (DTI) provides the possibility of estimating the
    location and course of eloquent structures in the human brain. Knowledge about
    this is of high importance for preoperative planning of neurosurgical
    interventions and for intraoperative guidance by neuronavigation in order to
    minimize postoperative neurological deficits. Therefore, the segmentation of
    these structures as closed, three-dimensional object is necessary.

  63. Geometric Models with Co-occurrence Groups.

    Authors: Stéphane Mallat, Joan Bruna
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A geometric model of sparse signal representations is introduced for classes
    of signals. It is computed by optimizing co-occurrence groups with a maximum
    likelihood estimate calculated with a Bernoulli mixture model. Applications to
    face image compression and MNIST digit classification illustrate the
    applicability of this model.

  64. Statistical Compressed Sensing of Gaussian Mixture Models.

    Authors: Guillermo Sapiro, Guoshen Yu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A novel framework of compressed sensing, namely statistical compressed
    sensing (SCS), that aims at efficiently sampling a collection of signals that
    follow a statistical distribution, and achieving accurate reconstruction on
    average, is introduced.

  65. A correspondence-less approach to matching of deformable shapes.

    Authors: Alexander M. Bronstein, Michael M. Bronstein, Jonathan Pokrass
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Finding a match between partially available deformable shapes is a
    challenging problem with numerous applications. The problem is usually
    approached by computing local descriptors on a pair of shapes and then
    establishing a point-wise correspondence between the two. In this paper, we
    introduce an alternative correspondence-less approach to matching fragments to
    an entire shape undergoing a non-rigid deformation. We use diffusion geometric
    descriptors and optimize over the integration domains on which the integral
    descriptors of the two parts match.

  66. Diffusion framework for geometric and photometric data fusion in non-rigid shape analysis.

    Authors: Alexander M. Bronstein, Michael M. Bronstein, Ron Kimmel, Artiom Kovnatsky
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, we explore the use of the diffusion geometry framework for the
    fusion of geometric and photometric information in local and global shape
    descriptors. Our construction is based on the definition of a diffusion process
    on the shape manifold embedded into a high-dimensional space where the
    embedding coordinates represent the photometric information. Experimental
    results show that such data fusion is useful in coping with different
    challenges of shape analysis where pure geometric and pure photometric methods
    fail.

  67. Automated Image Processing for the Analysis of DNA Repair Dynamics.

    Authors: Thorsten Riess, Christian Dietz, Martin Tomas, Elisa Ferrando-May, Dorit Merhof
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The efficient repair of cellular DNA is essential for the maintenance and
    inheritance of genomic information. In order to cope with the high frequency of
    spontaneous and induced DNA damage, a multitude of repair mechanisms have
    evolved. These are enabled by a wide range of protein factors specifically
    recognizing different types of lesions and finally restoring the normal DNA
    sequence. This work focuses on the repair factor XPC (xeroderma pigmentosum
    complementation group C), which identifies bulky DNA lesions and initiates
    their removal via the nucleotide excision repair pathway.

  68. Introduction to the Bag of Features Paradigm for Image Classification and Retrieval.

    Authors: Stephen O'Hara, Bruce A. Draper
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The past decade has seen the growing popularity of Bag of Features (BoF)
    approaches to many computer vision tasks, including image classification, video
    search, robot localization, and texture recognition. Part of the appeal is
    simplicity. BoF methods are based on orderless collections of quantized local
    image descriptors; they discard spatial information and are therefore
    conceptually and computationally simpler than many alternative methods.

  69. Support vector machines/relevance vector machine for remote sensing classification: A review.

    Authors: Mahesh Pal
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Kernel-based machine learning algorithms are based on mapping data from the
    original input feature space to a kernel feature space of higher dimensionality
    to solve a linear problem in that space. Over the last decade, kernel based
    classification and regression approaches such as support vector machines have
    widely been used in remote sensing as well as in various civil engineering
    applications.

  70. A Review of Research on Devnagari Character Recognition.

    Authors: V J Dongre, V H Mankar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    English Character Recognition (CR) has been extensively studied in the last
    half century and progressed to a level, sufficient to produce technology driven
    applications. But same is not the case for Indian languages which are
    complicated in terms of structure and computations. Rapidly growing
    computational power may enable the implementation of Indic CR methodologies.
    Digital document processing is gaining popularity for application to office and
    library automation, bank and postal services, publishing houses and
    communication technology.

  71. Application of Freeman Chain Codes: An Alternative Recognition Technique for Malaysian Car Plates.

    Authors: Jasni Mohamad Zain, Nor Amizam Jusoh
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Various applications of car plate recognition systems have been developed
    using various kinds of methods and techniques by researchers all over the
    world. The applications developed were only suitable for specific country due
    to its standard specification endorsed by the transport department of
    particular countries. The Road Transport Department of Malaysia also has
    endorsed a specification for car plates that includes the font and size of
    characters that must be followed by car owners. However, there are cases where
    this specification is not followed.

  72. Detecting Image Forgeries using Geometric Cues.

    Authors: Yang Wang, Lin Wu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This chapter presents a framework for detecting fake regions by using various
    methods including watermarking technique and blind approaches. In particular,
    we describe current categories on blind approaches which can be divided into
    five: pixel-based techniques, format-based techniques, camera-based techniques,
    physically-based techniques and geometric-based techniques. Then we take a
    second look on the geometric-based techniques and further categorize them in
    detail. In the following section, the state-of-the-art methods involved in the
    geometric technique are elaborated.

  73. Sparse motion segmentation using multiple six-point consistencies.

    Authors: Vasileios Zografos, Klas Nordberg, Liam Ellis
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We present a method for segmenting an arbitrary number of moving objects in
    image sequences using the geometry of 6 points in 2D to infer motion
    consistency. The method has been evaluated on the Hopkins 155 database and
    surpasses current state-of-the-art methods such as SSC, both in terms of
    overall performance on two and three motions but also in terms of maximum
    errors. The method works by ?nding initial clusters in the spatial domain, and
    then classifying each remaining point as belonging to the cluster that
    minimizes a motion consistency score.

  74. Sparsity tracking for low rank matrix recovery from noise.

    Authors: Yue Deng, Qionghai Dai, Zengke Zhang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Rank-based analysis is a basic approach for many real world applications.
    Recently, with the developments of compressive sensing, an interesting problem
    was proposed to recover a lowrank matrix from sparse noise. In this paper, we
    will address this problem and propose a low rank matrix recovery algorithm
    based on sparsity tacking. The core of the proposed Sparsity Tracking
    Recovery(STR) is a heuristic kernel, which is introduced to penalize the noise
    distribution. With the heuristic method, the sparse entries in the noise matrix
    can be accurately tracked and discouraged to be zero.

  75. Learning sparse representations of depth.

    Authors: Bruno A. Olshausen, Ivana Tosic, Benjamin J. Culpepper
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We propose a method for learning sparse representations of depth (disparity)
    maps, which is able to cope with noise and unreliable depth measurements. The
    proposed algorithm relaxes the usual assumption of the stationary noise model
    in sparse coding and enables learning from data corrupted with spatially
    varying noise or uncertainty. Different noise statistics at each pixel location
    are inferred from the data, and the learning rule is adapted with respect to
    the noise level.

  76. Edge Preserving Image Denoising in Reproducing Kernel Hilbert Spaces.

    Authors: Sergios Theodoridis, Pantelis Bouboulis
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The goal of this paper is the development of a novel approach for the problem
    of Noise Removal, based on the theory of Reproducing Kernels Hilbert Spaces
    (RKHS). The problem is cast as an optimization task in a RKHS, by taking
    advantage of the celebrated semiparametric Representer Theorem. Examples verify
    that in the presence of gaussian noise the proposed method performs relatively
    well compared to wavelet based technics and outperforms them significantly in
    the presence of impulse or mixed noise.

  77. Generalized Tree-Based Wavelet Transform.

    Authors: Michael Elad, Idan Ram, Israel Cohen
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we propose a new wavelet transform applicable to functions
    defined on graphs, high dimensional data and networks. The proposed method
    generalizes the Haar-like transform proposed in \cite{gavish2010mwot}, and it
    is similarly defined via a hierarchical tree, which is assumed to capture the
    geometry and structure of the input data. It is applied to the data using a
    multiscale filtering and decimation scheme, which can employ different wavelet
    filters. We propose a tree construction method which results in efficient
    representation of the input function in the transform domain.

  78. A Fuzzy Clustering Model for Fuzzy Data with Outliers.

    Authors: M.H.Fazel Zarandi, Zahra S. Razaee
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper a fuzzy clustering model for fuzzy data with outliers is
    proposed. The model is based on Wasserstein distance between interval valued
    data which is generalized to fuzzy data. In addition, Keller's approach is used
    to identify outliers and reduce their influences. We have also defined a
    transformation to change our distance to the Euclidean distance. With the help
    of this approach, the problem of fuzzy clustering of fuzzy data is reduced to
    fuzzy clustering of crisp data.

  79. Warping Peirce Quincuncial Panoramas.

    Authors: Chamberlain Fong
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The Peirce quincuncial projection is a mapping of the surface of a sphere to
    a square. It is a conformal mapping except for four points on the equator.
    These points of non-conformality cause significant artifacts in photographic
    applications. In this paper, we propose an algorithm and a user-interface to
    mitigate these artifacts. We then promote the Peirce quincuncial projection as
    a viable alternative to the stereographic projection in photographic
    applications.

  80. The Data Replication Method for the Classification with Reject Option.

    Authors: Ricardo Sousa, Jaime S. Cardoso
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Classification is one of the most important tasks of machine learning.
    Although the most well studied model is the two-class problem, in many
    scenarios there is the opportunity to label critical items for manual revision,
    instead of trying to automatically classify every item. In this paper we adapt
    a paradigm initially proposed for the classification of ordinal data to address
    the classification problem with reject option.

  81. Tensor-SIFT based Earth Mover's Distance for Contour Tracking.

    Authors: Peihua Li
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Contour tracking in adverse environments is a challenging problem due to
    cluttered background, illumination variation, occlusion, and noise, among
    others. This paper presents a robust contour tracking method by contributing to
    some of the key issues involved, including (a) a region functional formulation
    and its optimization; (b) design of a robust and effective feature; and (c)
    development of an integrated tracking algorithm.

  82. Featureless 2D-3D Pose Estimation by Minimising an Illumination-Invariant Loss.

    Authors: Marcus Hutter, Nathan Brewer, Srimal Jayawardena
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The problem of identifying the 3D pose of a known object from a given 2D
    image has important applications in Computer Vision ranging from robotic vision
    to image analysis. Our proposed method of registering a 3D model of a known
    object on a given 2D photo of the object has numerous advantages over existing
    methods: It does neither require prior training nor learning, nor knowledge of
    the camera parameters, nor explicit point correspondences or matching features
    between image and model.

  83. Multiple View Reconstruction of Calibrated Images using Singular Value Decomposition.

    Authors: Ayan Chaudhury, Abhishek Gupta, Sumita Manna, Subhadeep Mukherjee, Amlan Chakrabarti
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Calibration in a multi camera network has widely been studied for over
    several years starting from the earlier days of photogrammetry. Many authors
    have presented several calibration algorithms with their relative advantages
    and disadvantages. In a stereovision system, multiple view reconstruction is a
    challenging task. However, the total computational procedure in detail has not
    been presented before.

  84. Fast Color Quantization Using Weighted Sort-Means Clustering.

    Authors: M. Emre Celebi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Color quantization is an important operation with numerous applications in
    graphics and image processing. Most quantization methods are essentially based
    on data clustering algorithms. However, despite its popularity as a general
    purpose clustering algorithm, k-means has not received much respect in the
    color quantization literature because of its high computational requirements
    and sensitivity to initialization. In this paper, a fast color quantization
    method based on k-means is presented.

  85. Statistical Compressive Sensing of Gaussian Mixture Models.

    Authors: Guillermo Sapiro, Guoshen Yu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A new framework of compressive sensing (CS), namely statistical compressive
    sensing (SCS), that aims at efficiently sampling a collection of signals that
    follow a statistical distribution and achieving accurate reconstruction on
    average, is introduced.

  86. Fast Inference in Sparse Coding Algorithms with Applications to Object Recognition.

    Authors: Yann LeCun, Koray Kavukcuoglu, Marc'Aurelio Ranzato
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Adaptive sparse coding methods learn a possibly overcomplete set of basis
    functions, such that natural image patches can be reconstructed by linearly
    combining a small subset of these bases. The applicability of these methods to
    visual object recognition tasks has been limited because of the prohibitive
    cost of the optimization algorithms required to compute the sparse
    representation. In this work we propose a simple and efficient algorithm to
    learn basis functions.

  87. Hybrid Linear Modeling via Local Best-fit Flats.

    Authors: Teng Zhang, Arthur Szlam, Gilad Lerman, Yi Wang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we present a simple and fast geometric method for modeling data
    by a union of affine sets. The method begins by forming a collection of local
    best fit affine subspaces. The correct sizes of the local neighborhoods are
    determined automatically by the Jones' $\beta_2$ numbers; we prove under
    certain geometric conditions that good local neighborhoods exist and are found
    by our method. The collection is further processed by a greedy selection
    procedure or a spectral method to generate the final model.

  88. A Microwave Imaging and Enhancement Technique from Noisy Synthetic Data.

    Authors: Sugata Sanyal, Anjan Kumar Kundu, Bijoy Bandopadhyay
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    An inverse iterative algorithm for microwave imaging based on moment method
    solution is presented here. The iterative scheme has been developed on
    constrained optimization technique and is certain to converge. Different mesh
    size for the model has been used here to overcome the Inverse Crime. The
    synthetic data at the receivers is contaminated with different percentage of
    noise. The ill-posedness of the problem is solved by Levenberg-Marquardt
    method. The algorithm is applied to synthetic data and the reconstructed image
    is then further enhanced through the Image enhancement technique

  89. An Embarrassingly Simple Speed-Up of Belief Propagation with Robust Potentials.

    Authors: James M. Coughlan, Huiying Shen
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We present an exact method of greatly speeding up belief propagation (BP) for
    a wide variety of potential functions in pairwise MRFs and other graphical
    models. Specifically, our technique applies whenever the pairwise potentials
    have been {\em truncated} to a constant value for most pairs of states, as is
    commonly done in MRF models with robust potentials (such as stereo) that impose
    an upper bound on the penalty assigned to discontinuities; for each of the $M$
    possible states in one node, only a smaller number $m$ of compatible states in
    a neighboring node are assigned milder penalties.

  90. Face Detection with Effective Feature Extraction.

    Authors: Chunhua Shen, Jian Zhang, Sakrapee Paisitkriangkrai
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    There is an abundant literature on face detection due to its important role
    in many vision applications. Since Viola and Jones proposed the first real-time
    AdaBoost based face detector, Haar-like features have been adopted as the
    method of choice for frontal face detection. In this work, we show that simple
    features other than Haar-like features can also be applied for training an
    effective face detector.

  91. Rotation Invariant Face Detection Using Wavelet, PCA and Radial Basis Function Networks.

    Authors: Md. Saiful Islam, S. M. Kamruzzaman, Md. Emdadul Haque, Firoz Ahmed Siddiqi, Mohammad Shamsul Alam
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper introduces a novel method for human face detection with its
    orientation by using wavelet, principle component analysis (PCA) and redial
    basis networks. The input image is analyzed by two-dimensional wavelet and a
    two-dimensional stationary wavelet. The common goals concern are the image
    clearance and simplification, which are parts of de-noising or compression. We
    applied an effective procedure to reduce the dimension of the input vectors
    using PCA.

  92. Modeling Instantaneous Changes In Natural Scenes.

    Authors: Vikram Dhillon
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper proposes a framework for modeling instantaneous changes natural
    scenes in real time using Lagrangian Particle Framework and a fluid-particle
    grid approach. This research can be divided into 3 distinct sections: the first
    one discusses a multi-camera rig that can measure ego-motion accurately up to
    88%, how this device becomes the backbone of our framework, and some
    improvements devised to optimize a know framework for depth maps and 3d
    structure estimation from a single still image called make3d.

  93. Balancing clusters to reduce response time variability in large scale image search.

    Authors: Romain Tavenard, Laurent Amsaleg, Hervé Jégou
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Many algorithms for approximate nearest neighbor search in high-dimensional
    spaces partition the data into clusters. At query time, in order to avoid
    exhaustive search, an index selects the few (or a single) clusters nearest to
    the query point. Clusters are often produced by the well-known $k$-means
    approach since it has several desirable properties. On the downside, it tends
    to produce clusters having quite different cardinalities. Imbalanced clusters
    negatively impact both the variance and the expectation of query response
    times.

  94. 3D-Mesh denoising using an improved vertex based anisotropic diffusion.

    Authors: Mohammed EL Hassouni, Driss Aboutajdine
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper deals with an improvement of vertex based nonlinear diffusion for
    mesh denoising. This method directly filters the position of the vertices using
    Laplace, reduced centered Gaussian and Rayleigh probability density functions
    as diffusivities. The use of these PDFs improves the performance of a
    vertex-based diffusion method which are adapted to the underlying mesh
    structure. We also compare the proposed method to other mesh denoising methods
    such as Laplacian flow, mean, median, min and the adaptive MMSE filtering. To
    evaluate these methods of filtering, we use two error metrics.

  95. Asymmetric Totally-corrective Boosting for Real-time Object Detection.

    Authors: Chunhua Shen, Peng Wang, Nick Barnes, Hong Zheng, Zhang Ren
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Real-time object detection is one of the core problems in computer vision.
    The cascade boosting framework proposed by Viola and Jones has become the
    standard for this problem. In this framework, the learning goal for each node
    is asymmetric, which is required to achieve a high detection rate and a
    moderate false positive rate. We develop new boosting algorithms to address
    this asymmetric learning problem. We show that our methods explicitly optimize
    asymmetric loss objectives in a totally corrective fashion.

  96. Invariant Spectral Hashing of Image Saliency Graph.

    Authors: Laurent Jacques, Maxime Taquet, Christophe De Vleeschouwer, Benoit Macq
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Image hashing is the process of associating a short vector of bits to an
    image. The resulting summaries are useful in many applications including image
    indexing, image authentication and pattern recognition. These hashes need to be
    invariant under transformations of the image that result in similar visual
    content, but should drastically differ for conceptually distinct contents. This
    paper proposes an image hashing method that is invariant under rotation,
    scaling and translation of the image.

  97. Estimation of Infants' Cry Fundamental Frequency using a Modified SIFT algorithm.

    Authors: Dror Lederman
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper addresses the problem of infants' cry fundamental frequency
    estimation. The fundamental frequency is estimated using a modified simple
    inverse filtering tracking (SIFT) algorithm. The performance of the modified
    SIFT is studied using a real database of infants' cry.

  98. Evolutionary Computational Method of Facial Expression Analysis for Content-based Video Retrieval using 2-Dimensional Cellular Automata.

    Authors: P. Geetha, Vasumathi Narayanan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, Deterministic Cellular Automata (DCA) based video shot
    classification and retrieval is proposed. The deterministic 2D Cellular
    automata model captures the human facial expressions, both spontaneous and
    posed. The determinism stems from the fact that the facial muscle actions are
    standardized by the encodings of Facial Action Coding System (FACS) and Action
    Units (AUs). Based on these encodings, we generate the set of evolutionary
    update rules of the DCA for each facial expression.

  99. Approximate Lesion Localization in Dermoscopy Images.

    Authors: M. Emre Celebi, Hitoshi Iyatomi, Gerald Schaefer, William V. Stoecker
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Background: Dermoscopy is one of the major imaging modalities used in the
    diagnosis of melanoma and other pigmented skin lesions. Due to the difficulty
    and subjectivity of human interpretation, automated analysis of dermoscopy
    images has become an important research area. Border detection is often the
    first step in this analysis. Methods: In this article, we present an
    approximate lesion localization method that serves as a preprocessing step for
    detecting borders in dermoscopy images. In this method, first the black frame
    around the image is removed using an iterative algorithm.

  100. Proliferating cell nuclear antigen (PCNA) allows the automatic identification of follicles in microscopic images of human ovarian tissue.

    Authors: Thomas W Kelsey, Benedicta Caserta, Luis Castillo, W Hamish B Wallace, Francisco Cóppola Gonzálvez
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Human ovarian reserve is defined by the population of nongrowing follicles
    (NGFs) in the ovary. Direct estimation of ovarian reserve involves the
    identification of NGFs in prepared ovarian tissue. Previous studies involving
    human tissue have used hematoxylin and eosin (HE) stain, with NGF populations
    estimated by human examination either of tissue under a microscope, or of
    images taken of this tissue. In this study we replaced HE with proliferating
    cell nuclear antigen (PCNA), and automated the identification and enumeration
    of NGFs that appear in the resulting microscopic images.

  101. Optimally Training a Cascade Classifier.

    Authors: Chunhua Shen, Anton van den Hengel, Peng Wang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Cascade classifiers are widely used in real-time object detection. Different
    from conventional classifiers that are designed for a low overall
    classification error rate, a classifier in each node of the cascade is required
    to achieve an extremely high detection rate and moderate false positive rate.
    Although there are a few reported methods addressing this requirement in the
    context of object detection, there is no a principled feature selection method
    that explicitly takes into account this asymmetric node learning objective. We
    provide such an algorithm here.

  102. The Differential of the Exponential Map, Jacobi Fields and Exact Principal Geodesic Analysis.

    Authors: Stefan Sommer, François Lauze, Mads Nielsen
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The importance of manifolds and Riemannian geometry in mathematics is
    spreading to applied fields in which the need to model non-linear structure has
    spurred wide-spread interest in geometry. The transfer of interest has created
    demand for methods for computing classical constructs of geometry on manifolds
    occurring in practical applications. This paper develops initial value problems
    for the computation of the differential of the exponential map and Jacobi
    fields on parametrically and implicitly represented manifolds.

  103. Biometric Authentication using Nonparametric Methods.

    Authors: S. V. Sheela, K. R. Radhika
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The physiological and behavioral trait is employed to develop biometric
    authentication systems. The proposed work deals with the authentication of iris
    and signature based on minimum variance criteria. The iris patterns are
    preprocessed based on area of the connected components. The segmented image
    used for authentication consists of the region with large variations in the
    gray level values. The image region is split into quadtree components. The
    components with minimum variance are determined from the training samples. Hu
    moments are applied on the components.

  104. An Efficient Automatic Mass Classification Method In Digitized Mammograms Using Artificial Neural Network.

    Authors: Mohammed J. Islam, Majid Ahmadi, Maher A. Sid-Ahmed
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we present an efficient computer aided mass classification
    method in digitized mammograms using Artificial Neural Network (ANN), which
    performs benign-malignant classification on region of interest (ROI) that
    contains mass. One of the major mammographic characteristics for mass
    classification is texture. ANN exploits this important factor to classify the
    mass into benign or malignant. The statistical textural features used in
    characterizing the masses are mean, standard deviation, entropy, skewness,
    kurtosis and uniformity.

  105. Orthogonal multifilters image processing of astronomical images from scanned photographic plates.

    Authors: Vasil Kolev
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper orthogonal multifilters for astronomical image processing are
    presented. We obtained new orthogonal multifilters based on the orthogonal
    wavelet of Haar and Daubechies. Recently, multiwavelets have been introduced as
    a more powerful multiscale analysis tool. It adds several degrees of freedom in
    multifilter design and makes it possible to have several useful properties such
    as symmetry, orthogonality, short support, and a higher number of vanishing
    moments simultaneously. Multifilter decomposition of scanned photographic
    plates with astronomical images is made.

  106. Video Event Recognition for Surveillance Applications (VERSA).

    Authors: Stephen O'Hara
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    VERSA provides a general-purpose framework for defining and recognizing
    events in live or recorded surveillance video streams. The approach for event
    recognition in VERSA is using a declarative logic language to define the
    spatial and temporal relationships that characterize a given event or activity.
    Doing so requires the definition of certain fundamental spatial and temporal
    relationships and a high-level syntax for specifying frame templates and query
    parameters.

  107. A Review of Fast l1-Minimization Algorithms for Robust Face Recognition.

    Authors: Arvind Ganesh, Yi Ma, Zihan Zhou, S. Shankar Sastry, Allen Y. Yang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    l1-minimization refers to finding the minimum l1-norm solution to an
    underdetermined linear system b=Ax. It has recently received much attention,
    mainly motivated by the new compressive sensing theory that shows that under
    quite general conditions the minimum l1-norm solution is also the sparsest
    solution to the system of linear equations. Although the underlying problem is
    a linear program, conventional algorithms such as interior-point methods suffer
    from poor scalability for large-scale real world problems.

  108. Neural Network Based Reconstruction of a 3D Object from a 2D Wireframe.

    Authors: Kyle Johnson, Clayton Chang, Hod Lipson
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We propose a new approach for constructing a 3D representation from a 2D
    wireframe drawing. A drawing is simply a parallel projection of a 3D object
    onto a 2D surface; humans are able to recreate mental 3D models from 2D
    representations very easily, yet the process is very difficult to emulate
    computationally. We hypothesize that our ability to perform this construction
    relies on the angles in the 2D scene, among other geometric properties. Being
    able to reproduce this reconstruction process automatically would allow for
    efficient and robust 3D sketch interfaces.

  109. Registration of Brain Images using Fast Walsh Hadamard Transform.

    Authors: D.Sasikala, R.Neelaveni
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A lot of image registration techniques have been developed with great
    significance for data analysis in medicine, astrophotography, satellite imaging
    and few other areas. This work proposes a method for medical image registration
    using Fast Walsh Hadamard transform. This algorithm registers images of the
    same or different modalities. Each image bit is lengthened in terms of Fast
    Walsh Hadamard basis functions. Each basis function is a notion of determining
    various aspects of local structure, e.g., horizontal edge, corner, etc.

  110. Bilateral filters: what they can and cannot do.

    Authors: Oleg S. Pianykh
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Nonlinear bilateral filters (BF) deliver a fine blend of computational
    simplicity and blur-free denoising. However, little is known about their
    nature, noise-suppressing properties, and optimal choices of filter parameters.
    Our study is meant to fill this gap-explaining the underlying mechanism of
    bilateral filtering and providing the methodology for optimal filter selection.
    Practical application to CT image denoising is discussed to illustrate our
    results.

  111. Human Face Recognition using Line Features.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this work we investigate a novel approach to handle the challenges of face
    recognition, which includes rotation, scale, occlusion, illumination etc. Here,
    we have used thermal face images as those are capable to minimize the affect of
    illumination changes and occlusion due to moustache, beards, adornments etc.
    The proposed approach registers the training and testing thermal face images in
    polar coordinate, which is capable to handle complicacies introduced by scaling
    and rotation. Line features are extracted from thermal polar images and feature
    vectors are constructed using these line.

  112. Classification of Log-Polar-Visual Eigenfaces using Multilayer Perceptron.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we present a simple novel approach to tackle the challenges of
    scaling and rotation of face images in face recognition. The proposed approach
    registers the training and testing visual face images by log-polar
    transformation, which is capable to handle complicacies introduced by scaling
    and rotation. Log-polar images are projected into eigenspace and finally
    classified using an improved multi-layer perceptron. In the experiments we have
    used ORL face database and Object Tracking and Classification Beyond Visible
    Spectrum (OTCBVS) database for visual face images.

  113. Classification of fused face images using multilayer perceptron neural network.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a concept of image pixel fusion of visual and thermal
    faces, which can significantly improve the overall performance of a face
    recognition system. Several factors affect face recognition performance
    including pose variations, facial expression changes, occlusions, and most
    importantly illumination changes. So, image pixel fusion of thermal and visual
    images is a solution to overcome the drawbacks present in the individual
    thermal and visual face images. Fused images are projected into eigenspace and
    finally classified using a multi-layer perceptron.

  114. Classification of Fused Images using Radial Basis Function Neural Network for Human Face Recognition.

    Authors: Debotosh Bhattacharjee, M. Kundu, D. K. Basu, M. Nasipuri, M.K. Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Here an efficient fusion technique for automatic face recognition has been
    presented. Fusion of visual and thermal images has been done to take the
    advantages of thermal images as well as visual images. By employing fusion a
    new image can be obtained, which provides the most detailed, reliable, and
    discriminating information. In this method fused images are generated using
    visual and thermal face images in the first step. In the second step, fused
    images are projected into eigenspace and finally classified using a radial
    basis function neural network.

  115. Image Pixel Fusion for Human Face Recognition.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we present a technique for fusion of optical and thermal face
    images based on image pixel fusion approach. Out of several factors, which
    affect face recognition performance in case of visual images, illumination
    changes are a significant factor that needs to be addressed. Thermal images are
    better in handling illumination conditions but not very consistent in capturing
    texture details of the faces.

  116. A Parallel Framework for Multilayer Perceptron for Human Face Recognition.

    Authors: Debotosh Bhattacharjee, M. Kundu, D. K. Basu, M. Nasipuri, M.K. Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Artificial neural networks have already shown their success in face
    recognition and similar complex pattern recognition tasks. However, a major
    disadvantage of the technique is that it is extremely slow during training for
    larger classes and hence not suitable for real-time complex problems such as
    pattern recognition. This is an attempt to develop a parallel framework for the
    training algorithm of a perceptron. In this paper, two general architectures
    for a Multilayer Perceptron (MLP) have been demonstrated.

  117. Fusion of Wavelet Coefficients from Visual and Thermal Face Images for Human Face Recognition - A Comparative Study.

    Authors: Debotosh Bhattacharjee, M. Kundu, D. K. Basu, M. Nasipuri, M. K. Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we present a comparative study on fusion of visual and thermal
    images using different wavelet transformations. Here, coefficients of discrete
    wavelet transforms from both visual and thermal images are computed separately
    and combined. Next, inverse discrete wavelet transformation is taken in order
    to obtain fused face image. Both Haar and Daubechies (db2) wavelet transforms
    have been used to compare recognition results. For experiments IRIS
    Thermal/Visual Face Database was used.

  118. Fusion of Daubechies Wavelet Coefficients for Human Face Recognition.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper fusion of visual and thermal images in wavelet transformed
    domain has been presented. Here, Daubechies wavelet transform, called as D2,
    coefficients from visual and corresponding coefficients computed in the same
    manner from thermal images are combined to get fused coefficients. After
    decomposition up to fifth level (Level 5) fusion of coefficients is done.
    Inverse Daubechies wavelet transform of those coefficients gives us fused face
    images.

  119. Quotient Based Multiresolution Image Fusion of Thermal and Visual Images Using Daubechies Wavelet Transform for Human Face Recognition.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Mrinal Kanti Bhowmik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper investigates the multiresolution level-1 and level-2 Quotient
    based Fusion of thermal and visual images. In the proposed system, the method-1
    namely "Decompose then Quotient Fuse Level-1" and the method-2 namely
    "Decompose-Reconstruct then Quotient Fuse Level-2" both work on wavelet
    transformations of the visual and thermal face images. The wavelet transform is
    well-suited to manage different image resolution and allows the image
    decomposition in different kinds of coefficients, while preserving the image
    information without any loss.

  120. Face Synthesis (FASY) System for Determining the Characteristics of a Face Image.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Santanu Halder
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper aims at determining the characteristics of a face image by
    extracting its components. The FASY (FAce SYnthesis) System is a Face Database
    Retrieval and new Face generation System that is under development. One of its
    main features is the generation of the requested face when it is not found in
    the existing database, which allows a continuous growing of the database also.
    To generate the new face image, we need to store the face components in the
    database. So we have designed a new technique to extract the face components by
    a sophisticated method.

  121. A Fast Decision Technique for Hierarchical Hough Transform for Line Detection.

    Authors: Chandan Singh, Nitin Bhatia
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Many techniques have been proposed to speedup the performance of classic
    Hough Transform. These techniques are primarily based on converting the voting
    procedure to a hierarchy based voting method. These methods use approximate
    decision-making process. In this paper, we propose a fast decision making
    process that enhances the speed and reduces the space requirements.
    Experimental results demonstrate that the proposed algorithm is much faster
    than a similar Fast Hough Transform.

  122. Fuzzy Classification of Facial Component Parameters.

    Authors: M. Kundu, D. K. Basu, M. Nasipuri, S. Halder, Debotosh Bhattacherjee
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a novel type-2 Fuzzy logic System to define the Shape of
    a facial component with the crisp output. This work is the part of our main
    research effort to design a system (called FASY) which offers a novel face
    construction approach based on the textual description and also extracts and
    analyzes the facial components from a face image by an efficient technique. The
    Fuzzy model, designed in this paper, takes crisp value of width and height of a
    facial component and produces the crisp value of Shape for different facial
    components.

  123. FPGA Based Assembling of Facial Components for Human Face Construction.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Debotosh Bhattacharjee, Santanu Halder
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper aims at VLSI realization for generation of a new face from textual
    description. The FASY (FAce SYnthesis) System is a Face Database Retrieval and
    new Face generation System that is under development. One of its main features
    is the generation of the requested face when it is not found in the existing
    database. The new face generation system works in three steps - searching
    phase, assembling phase and tuning phase. In this paper the tuning phase using
    hardware description language and its implementation in a Field Programmable
    Gate Array (FPGA) device is presented.

  124. Classification Of Gradient Change Features Using MLP For Handwritten Character Recognition.

    Authors: Mita Nasipuri, Sandhya Arora, Debotosh Bhattacharjee, Latesh Malik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A novel, generic scheme for off-line handwritten English alphabets character
    images is proposed. The advantage of the technique is that it can be applied in
    a generic manner to different applications and is expected to perform better in
    uncertain and noisy environments. The recognition scheme is using a multilayer
    perceptron(MLP) neural networks. The system was trained and tested on a
    database of 300 samples of handwritten characters. For improved generalization
    and to avoid overtraining, the whole available dataset has been divided into
    two subsets: training set and test set.

  125. A novel approach for handwritten Devnagari character recognition.

    Authors: Mita Nasipuri, Sandhya Arora, Debotosh Bhattacharjee, Latesh Malik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper a method for recognition of handwritten devanagari characters
    is described. Here, feature vector is constituted by accumulated directional
    gradient changes in different segments, number of intersections points for the
    character, type of spine present and type of shirorekha present in the
    character. One Multi-layer Perceptron with conjugate-gradient training is used
    to classify these feature vectors. This method is applied to a database with
    1000 sample characters and the recognition rate obtained is 88.12%

  126. A Two Stage Classification Approach for Handwritten Devanagari Characters.

    Authors: Mita Nasipuri, Sandhya Arora, Debotosh Bhattacharjee, Latesh Malik
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The paper presents a two stage classification approach for handwritten
    devanagari characters The first stage is using structural properties like
    shirorekha, spine in character and second stage exploits some intersection
    features of characters which are fed to a feedforward neural network. Simple
    histogram based method does not work for finding shirorekha, vertical bar
    (Spine) in handwritten devnagari characters.

  127. Multiple Classifier Combination for Off-line Handwritten Devnagari Character Recognition.

    Authors: Mahantapas Kundu, Mita Nasipuri, Dipak Kumar Basu, Sandhya Arora, Debotosh Bhattacharjee
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This work presents the application of weighted majority voting technique for
    combination of classification decision obtained from three Multi_Layer
    Perceptron(MLP) based classifiers for Recognition of Handwritten Devnagari
    characters using three different feature sets. The features used are
    intersection, shadow feature and chain code histogram features. Shadow features
    are computed globally for character image while intersection features and chain
    code histogram features are computed by dividing the character image into
    different segments.

  128. Application of Statistical Features in Handwritten Devnagari Character Recognition.

    Authors: Debotosh Bhattacharjee, S. Arora, M. Nasipuri, D.K. Basu, M.Kundu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper a scheme for offline Handwritten Devnagari Character
    Recognition is proposed, which uses different feature extraction methodologies
    and recognition algorithms. The proposed system assumes no constraints in
    writing style or size. First the character is preprocessed and features namely
    : Chain code histogram and moment invariant features are extracted and fed to
    Multilayer Perceptrons as a preliminary recognition step. Finally the results
    of both MLP's are combined using weighted majority scheme.

  129. Recognition of Non-Compound Handwritten Devnagari Characters using a Combination of MLP and Minimum Edit Distance.

    Authors: Mita Nasipuri, Sandhya Arora, Debotosh Bhattacharjee, M. Kundu, D. K. Basu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper deals with a new method for recognition of offline Handwritten
    non-compound Devnagari Characters in two stages. It uses two well known and
    established pattern recognition techniques: one using neural networks and the
    other one using minimum edit distance. Each of these techniques is applied on
    different sets of characters for recognition. In the first stage, two sets of
    features are computed and two classifiers are applied to get higher recognition
    accuracy. Two MLP's are used separately to recognize the characters.

  130. Performance Comparison of SVM and ANN for Handwritten Devnagari Character Recognition.

    Authors: Mita Nasipuri, Sandhya Arora, Debotosh Bhattacharjee, L. Malik, M. Kundu, D. K. Basu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Classification methods based on learning from examples have been widely
    applied to character recognition from the 1990s and have brought forth
    significant improvements of recognition accuracies. This class of methods
    includes statistical methods, artificial neural networks, support vector
    machines (SVM), multiple classifier combination, etc.

  131. 3D Visual Tracking with Particle and Kalman Filters.

    Authors: Burak Bayramli
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    One of the most visually demonstrable and straightforward uses of filtering
    is in the field of Computer Vision. In this document we will try to outline the
    issues encountered while designing and implementing a particle and kalman
    filter based tracking system.

  132. Optimization of Weighted Curvature for Image Segmentation.

    Authors: Noha El-Zehiry, Leo Grady
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Minimization of boundary curvature is a classic regularization technique for
    image segmentation in the presence of noisy image data. Techniques for
    minimizing curvature have historically been derived from descent methods which
    could be trapped in a local minimum and therefore required a good
    initialization. Recently, combinatorial optimization techniques have been
    applied to the optimization of curvature which provide a solution that achieves
    nearly a global optimum. However, when applied to image segmentation these
    methods required a meaningful data term.

  133. Action Recognition in Videos: from Motion Capture Labs to the Web.

    Authors: Ana Paula Brandão Lopes, Eduardo Alves do Valle Jr., Jussara Marques de Almeida, Arnaldo Albuquerque de Araújo
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a survey of human action recognition approaches based on
    visual data recorded from a single video camera. We propose an organizing
    framework which puts in evidence the evolution of the area, with techniques
    moving from heavily constrained motion capture scenarios towards more
    challenging, realistic, "in the wild" videos. The proposed organization is
    based on the representation used as input for the recognition task, emphasizing
    the hypothesis assumed and thus, the constraints imposed on the type of video
    that each technique is able to address.

  134. Solving Inverse Problems with Piecewise Linear Estimators: From Gaussian Mixture Models to Structured Sparsity.

    Authors: Stéphane Mallat, Guillermo Sapiro, Guoshen Yu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A general framework for solving image inverse problems is introduced in this
    paper. The approach is based on Gaussian mixture models, estimated via a
    computationally efficient MAP-EM algorithm. A dual mathematical interpretation
    of the proposed framework with structured sparse estimation is described, which
    shows that the resulting piecewise linear estimate stabilizes the estimation
    when compared to traditional sparse inverse problem techniques. This
    interpretation also suggests an effective dictionary motivated initialization
    for the MAP-EM algorithm.

  135. Image Segmentation Using Weak Shape Priors.

    Authors: Oleg Michailovich, Robert Sheng Xu, Magdy Salama
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The problem of image segmentation is known to become particularly challenging
    in the case of partial occlusion of the object(s) of interest, background
    clutter, and the presence of strong noise. To overcome this problem, the
    present paper introduces a novel approach segmentation through the use of
    "weak" shape priors.

  136. Biometric Authentication using Nonparametric Methods.

    Authors: S. V. Sheela, K. R. Radhika
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The physiological and behavioral trait is employed to develop biometric
    authentication systems. The proposed work deals with the authentication of iris
    and signature based on minimum variance criteria. The iris patterns are
    preprocessed based on area of the connected components. The segmented image
    used for authentication consists of the region with large variations in the
    gray level values. The image region is split into quadtree components. The
    components with minimum variance are determined from the training samples. Hu
    moments are applied on the components.

  137. Detection of Bleeding in Wireless Capsule Endoscopy Images Using Range Ratio Color.

    Authors: Amer A. Al-Rahayfeh, Abdelshakour A. Abuzneid
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Wireless Capsule Endoscopy (WCE) is device to detect abnormalities in
    colon,esophagus,small intestinal and stomach, to distinguish bleeding in WCE
    images from non bleeding is a hard job by human reviewing and very time
    consuming. Consequently, automation for classifying bleeding frames not only
    will expedite the process but will reduce the burden on the doctors. Using the
    purity of the red color we can detect the Bleeding areas in WCE images.

  138. Classification of LULC Change Detection using Remotely Sensed Data for Coimbatore City, Tamilnadu, India.

    Authors: Y.Babykalpana, K.ThanushKodi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Maps are used to describe far-off places . It is an aid for navigation and
    military strategies. Mapping of the lands are important and the mapping work is
    based on (i). Natural resource management & development (ii). Information
    technology ,(iii). Environmental development ,(iv). Facility management and
    (v). e-governance.

  139. Classification via Incoherent Subspaces.

    Authors: Pierre Vandergheynst, Karin Schnass
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This article presents a new classification framework that can extract
    individual features per class. The scheme is based on a model of incoherent
    subspaces, each one associated to one class, and a model on how the elements in
    a class are represented in this subspace. After the theoretical analysis an
    alternate projection algorithm to find such a collection is developed. The
    classification performance and speed of the proposed method is tested on the AR
    and YaleB databases and compared to that of Fisher's LDA and a recent approach
    based on on $\ell_1$ minimisation.

  140. Multistage Hybrid Arabic/Indian Numeral OCR System.

    Authors: Yasser M. Alginaih, Abdul Ahad Siddiqi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The use of OCR in postal services is not yet universal and there are still
    many countries that process mail sorting manually. Automated Arabic/Indian
    numeral Optical Character Recognition (OCR) systems for Postal services are
    being used in some countries, but still there are errors during the mail
    sorting process, thus causing a reduction in efficiency. The need to
    investigate fast and efficient recognition algorithms/systems is important so
    as to correctly read the postal codes from mail addresses and to eliminate any
    errors during the mail sorting stage.

  141. Randomized hybrid linear modeling by local best-fit flats.

    Authors: Teng Zhang, Arthur Szlam, Gilad Lerman, Yi Wang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The hybrid linear modeling problem is to identify a set of d-dimensional
    affine sets in a D-dimensional Euclidean space. It arises, for example, in
    object tracking and structure from motion. The hybrid linear model can be
    considered as the second simplest (behind linear) manifold model of data. In
    this paper we will present a very simple geometric method for hybrid linear
    modeling based on selecting a set of local best fit flats that minimize a
    global l1 error measure.

  142. An Efficient Vein Pattern-based Recognition System.

    Authors: Mohit Soni, Phalguni Gupta, Sandesh Gupta, M.S. Rao
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents an efficient human recognition system based on vein
    pattern from the palma dorsa. A new absorption based technique has been
    proposed to collect good quality images with the help of a low cost camera and
    light source. The system automatically detects the region of interest from the
    image and does the necessary preprocessing to extract features. A Euclidean
    Distance based matching technique has been used for making the decision.

  143. Isometric Embeddings in Imaging and Vision: Facts and Fiction.

    Authors: Emil Saucan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We explore the practicability of Nash's Embedding Theorem in vision and
    imaging sciences. In particular, we investigate the relevance of a result of
    Burago and Zalgaller regarding the existence of isometric embeddings of
    polyhedral surfaces in $\mathbb{R}^3$ and we show that their proof does not
    extended directly to higher dimensions.

  144. Logical methods of object recognition on satellite images using spatial constraints.

    Authors: R.K. Fedorov
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A logical approach to object recognition on image is proposed. The main idea
    of the approach is to perform the object recognition as a logical inference on
    a set of rules describing an object shape.

  145. Deblured Gaussian Blurred Images.

    Authors: Salem Saleh Al-amri, N.V. Kalyankar, Khamitkar S.D
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper attempts to undertake the study of Restored Gaussian Blurred
    Images. by using four types of techniques of deblurring image as Wiener filter,
    Regularized filter, Lucy Richardson deconvlutin algorithm and Blind
    deconvlution algorithm with an information of the Point Spread Function (PSF)
    corrupted blurred image with Different values of Size and Alfa and then
    corrupted by Gaussian noise.

  146. An Efficient Watermarking Algorithm to Improve Payload and Robustness without Affecting Image Perceptual Quality.

    Authors: Er. Anantdeep, Er. Sandeep kaur, Er. Deepak Aggarwal
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Capacity, Robustness, & Perceptual quality of watermark data are very
    important issues to be considered. A lot of research is going on to increase
    these parameters for watermarking of the digital images, as there is always a
    tradeoff among them. . In this paper an efficient watermarking algorithm to
    improve payload and robustness without affecting perceptual quality of image
    data based on DWT is discussed.

  147. Spatially-Adaptive Reconstruction in Computed Tomography Based on Statistical Learning.

    Authors: Michael Elad, Joseph Shtok, Michael Zibulevsky
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We propose a direct reconstruction algorithm for Computed Tomography, based
    on a local fusion of a few preliminary image estimates by means of a non-linear
    fusion rule. One such rule is based on a signal denoising technique which is
    spatially adaptive to the unknown local smoothness. Another, more powerful
    fusion rule, is based on a neural network trained off-line with a high-quality
    training set of images. Two types of linear reconstruction algorithms for the
    preliminary images are employed for two different reconstruction tasks.

  148. Hashing Image Patches for Zooming.

    Authors: Mithun Das Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper we present a Bayesian image zooming/super-resolution algorithm
    based on a patch based representation. We work on a patch based model with
    overlap and employ a Locally Linear Embedding (LLE) based approach as our data
    fidelity term in the Bayesian inference. The image prior imposes continuity
    constraints across the overlapping patches. We apply an error back-projection
    technique, with an approximate cross bilateral filter. The problem of nearest
    neighbor search is handled by a variant of the locality sensitive hashing (LSH)
    scheme.

  149. Simultaneous Bayesian inference of motion velocity fields and probabilistic models in successive video-frames described by spatio-temporal MRFs.

    Authors: Jun-ichi Inoue, Yuya Inagaki
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We numerically investigate a mean-field Bayesian approach with the assistance
    of the Markov chain Monte Carlo method to estimate motion velocity fields and
    probabilistic models simultaneously in consecutive digital images described by
    spatio-temporal Markov random fields. Preliminary to construction of our
    procedure, we find that mean-field variables in the iteration diverge due to
    improper normalization factor of regularization terms appearing in the
    posterior.

  150. Facial Expression Representation Using Heteroscedastic Linear Discriminant Analysis and Gabor Wavelets.

    Authors: Mahmoud Khademi, Mehran Safayani, Mohammad H. Kiapour, Mohammad T. Manzuri
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, a novel representation for facial expressions in
    two-dimensional image sequences is presented. We apply a variation of
    two-dimensional heteroscedastic linear discriminant analysis (2DHLDA)
    algorithm, as an efficient dimensionality reduction technique, to Gabor
    representation of the input sequence. 2DHLDA is an extension of the
    two-dimensional linear discriminant analysis (2DLDA) approach and removes the
    equal within-class covariance. By applying 2DHLDA in two directions, we
    eliminate the correlations between both image columns and image rows.

  151. New Clustering Algorithm for Vector Quantization using Rotation of Error Vector.

    Authors: H. B. Kekre, Tanuja K. Sarode
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The paper presents new clustering algorithm. The proposed algorithm gives
    less distortion as compared to well known Linde Buzo Gray (LBG) algorithm and
    Kekre's Proportionate Error (KPE) Algorithm. Constant error is added every time
    to split the clusters in LBG, resulting in formation of cluster in one
    direction which is 1350 in 2-dimensional case. Because of this reason
    clustering is inefficient resulting in high MSE in LBG. To overcome this
    drawback of LBG proportionate error is added to change the cluster orientation
    in KPE.

  152. A Robust Fuzzy Clustering Technique with Spatial Neighborhood Information for Effective Medical Image Segmentation.

    Authors: S. Zulaikha Beevi, M. Mohammed Sathik, K. Senthamaraikannan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Medical image segmentation demands an efficient and robust segmentation
    algorithm against noise. The conventional fuzzy c-means algorithm is an
    efficient clustering algorithm that is used in medical image segmentation. But
    FCM is highly vulnerable to noise since it uses only intensity values for
    clustering the images. This paper aims to develop a novel and efficient fuzzy
    spatial c-means clustering algorithm which is robust to noise. The proposed
    clustering algorithm uses fuzzy spatial information to calculate membership
    value. The input image is clustered using proposed ISFCM algorithm.

  153. Maximized Posteriori Attributes Selection from Facial Salient Landmarks for Face Recognition.

    Authors: Dakshina Ranjan Kisku, Jamuna Kanta Sing, Phalguni Gupta, Massimo Tistarelli
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a robust and dynamic face recognition technique based on
    the extraction and matching of devised probabilistic graphs drawn on SIFT
    features related to independent face areas. The face matching strategy is based
    on matching individual salient facial graph characterized by SIFT features as
    connected to facial landmarks such as the eyes and the mouth. In order to
    reduce the face matching errors, the Dempster-Shafer decision theory is applied
    to fuse the individual matching scores obtained from each pair of salient
    facial features.

  154. Feature Level Fusion of Face and Palmprint Biometrics by Isomorphic Graph-based Improved K-Medoids Partitioning.

    Authors: Dakshina Ranjan Kisku, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a feature level fusion approach which uses the improved
    K-medoids clustering algorithm and isomorphic graph for face and palmprint
    biometrics. Partitioning around medoids (PAM) algorithm is used to partition
    the set of n invariant feature points of the face and palmprint images into k
    clusters. By partitioning the face and palmprint images with scale invariant
    features SIFT points, a number of clusters is formed on both the images. Then
    on each cluster, an isomorphic graph is drawn.

  155. A New Approach to Lung Image Segmentation using Fuzzy Possibilistic C-Means Algorithm.

    Authors: M. Gomathi, P.Thangaraj
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Image segmentation is a vital part of image processing. Segmentation has its
    application widespread in the field of medical images in order to diagnose
    curious diseases. The same medical images can be segmented manually. But the
    accuracy of image segmentation using the segmentation algorithms is more when
    compared with the manual segmentation.

  156. Regularized Richardson-Lucy Algorithm for Sparse Reconstruction of Poissonian Images.

    Authors: Oleg Michailovich, Elad Shaked
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Restoration of digital images from their degraded measurements has always
    been a problem of great theoretical and practical importance in numerous
    applications of imaging sciences. A specific solution to the problem of image
    restoration is generally determined by the nature of degradation phenomenon as
    well as by the statistical properties of measurement noises. The present study
    is concerned with the case in which the images of interest are corrupted by
    convolutional blurs and Poisson noises.

  157. Signature Recognition using Multi Scale Fourier Descriptor And Wavelet Transform.

    Authors: Ismail A. Ismail, Mohammed A. Ramadan, Talaat S. El danaf, Ahmed H. Samak
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper present a novel off-line signature recognition method based on
    multi scale Fourier Descriptor and wavelet transform . The main steps of
    constructing a signature recognition system are discussed and experiments on
    real data sets show that the average error rate can reach 1%. Finally we
    compare 8 distance measures between feature vectors with respect to the
    recognition performance.

    Key words: signature recognition; Fourier Descriptor; Wavelet transform;
    personal verification

  158. Extended Two-Dimensional PCA for Efficient Face Representation and Recognition.

    Authors: Mahmoud Khademi, Mehran Safayani, Mohammad T. Manzuri-Shalmani
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper a novel method called Extended Two-Dimensional PCA (E2DPCA) is
    proposed which is an extension to the original 2DPCA. We state that the
    covariance matrix of 2DPCA is equivalent to the average of the main diagonal of
    the covariance matrix of PCA. This implies that 2DPCA eliminates some
    covariance information that can be useful for recognition. E2DPCA instead of
    just using the main diagonal considers a radius of r diagonals around it and
    expands the averaging so as to include the covariance information within those
    diagonals. The parameter r unifies PCA and 2DPCA.

  159. Analysis, Interpretation, and Recognition of Facial Action Units and Expressions Using Neuro-Fuzzy Modeling.

    Authors: Mahmoud Khademi, Mohammad T. Manzuri-Shalmani, Ali A. Kiaei, Mohammad Hadi Kiapour
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper an accurate real-time sequence-based system for representation,
    recognition, interpretation, and analysis of the facial action units (AUs) and
    expressions is presented.

  160. Recognizing Combinations of Facial Action Units with Different Intensity Using a Mixture of Hidden Markov Models and Neural Network.

    Authors: Mahmoud Khademi, Mohammad T. Manzuri-Shalmani, Mohammad H. Kiapour, Ali A. Kiaei
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Facial Action Coding System consists of 44 action units (AUs) and more than
    7000 combinations. Hidden Markov models (HMMs) classifier has been used
    successfully to recognize facial action units (AUs) and expressions due to its
    ability to deal with AU dynamics. However, a separate HMM is necessary for each
    single AU and each AU combination. Since combinations of AU numbering in
    thousands, a more efficient method will be needed. In this paper an accurate
    real-time sequence-based system for representation and recognition of facial
    AUs is presented.

  161. Multilinear Biased Discriminant Analysis: A Novel Method for Facial Action Unit Representation.

    Authors: Mahmoud Khademi, Mehran Safayani, Mohammad T. Manzuri-Shalmani
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper a novel efficient method for representation of facial action
    units by encoding an image sequence as a fourth-order tensor is presented. The
    multilinear tensor-based extension of the biased discriminant analysis (BDA)
    algorithm, called multilinear biased discriminant analysis (MBDA), is first
    proposed. Then, we apply the MBDA and two-dimensional BDA (2DBDA) algorithms,
    as the dimensionality reduction techniques, to Gabor representations and the
    geometric features of the input image sequence respectively.

  162. A stochastic model of human visual attention with a dynamic Bayesian network.

    Authors: Akisato kimura, Derek Pang, Tatsuto Takeuchi, Kouji Miyazato, Junji Yamato, Kunio Kashino
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Recent studies in the field of human vision science suggest that the human
    responses to the stimuli on a visual display are non-deterministic. People may
    attend to different locations on the same visual input at the same time. Based
    on this knowledge, we propose a new stochastic model of visual attention by
    introducing a dynamic Bayesian network to predict the likelihood of where
    humans typically focus on a video scene. The proposed model is composed of a
    dynamic Bayesian network with 4 layers.

  163. A novel scheme for binarization of vehicle images using hierarchical histogram equalization technique.

    Authors: Subhadip Basu, Mita Nasipuri, Satadal Saha, Dipak Kumar Basu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Automatic License Plate Recognition system is a challenging area of research
    now-a-days and binarization is an integral and most important part of it. In
    case of a real life scenario, most of existing methods fail to properly
    binarize the image of a vehicle in a congested road, captured through a CCD
    camera. In the current work we have applied histogram equalization technique
    over the complete image and also over different hierarchy of image
    partitioning. A novel scheme is formulated for giving the membership value to
    each pixel for each hierarchy of histogram equalization.

  164. Development of an automated Red Light Violation Detection System (RLVDS) for Indian vehicles.

    Authors: Subhadip Basu, Mita Nasipuri, Satadal Saha, Dipak Kumar Basu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Integrated Traffic Management Systems (ITMS) are now implemented in different
    cities in India to primarily address the concerns of road-safety and security.
    An automated Red Light Violation Detection System (RLVDS) is an integral part
    of the ITMS. In our present work we have designed and developed a complete
    system for generating the list of all stop-line violating vehicle images
    automatically from video snapshots of road-side surveillance cameras.

  165. Offline Signature Identification by Fusion of Multiple Classifiers using Statistical Learning Theory.

    Authors: Dakshina Ranjan Kisku, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper uses Support Vector Machines (SVM) to fuse multiple classifiers
    for an offline signature system. From the signature images, global and local
    features are extracted and the signatures are verified with the help of
    Gaussian empirical rule, Euclidean and Mahalanobis distance based classifiers.
    SVM is used to fuse matching scores of these matchers.

  166. Recognition of Handwritten Roman Script Using Tesseract Open source OCR Engine.

    Authors: Subhadip Basu, Sandip Rakshit
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In the present work, we have used Tesseract 2.01 open source Optical
    Character Recognition (OCR) Engine under Apache License 2.0 for recognition of
    handwriting samples of lower case Roman script. Handwritten isolated and
    free-flow text samples were collected from multiple users. Tesseract is trained
    to recognize user-specific handwriting samples of both the categories of
    document pages. On a single user model, the system is trained with 1844
    isolated handwritten characters and the performance is tested on 1133
    characters, taken form the test set.

  167. Development of a multi-user handwriting recognition system using Tesseract open source OCR engine.

    Authors: Subhadip Basu, Sandip Rakshit
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The objective of the paper is to recognize handwritten samples of lower case
    Roman script using Tesseract open source Optical Character Recognition (OCR)
    engine under Apache License 2.0. Handwritten data samples containing isolated
    and free-flow text were collected from different users. Tesseract is trained
    with user-specific data samples of both the categories of document pages to
    generate separate user-models representing a unique language-set. Each such
    language-set recognizes isolated and free-flow handwritten test samples
    collected from the designated user.

  168. Recognition of Handwritten Textual Annotations using Tesseract Open Source OCR Engine for information Just In Time (iJIT).

    Authors: Subhadip Basu, Sandip Rakshit, Hisashi Ikeda
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Objective of the current work is to develop an Optical Character Recognition
    (OCR) engine for information Just In Time (iJIT) system that can be used for
    recognition of handwritten textual annotations of lower case Roman script.
    Tesseract open source OCR engine under Apache License 2.0 is used to develop
    user-specific handwriting recognition models, viz., the language sets, for the
    said system, where each user is identified by a unique identification tag
    associated with the digital pen.

  169. Development of a Multi-User Recognition Engine for Handwritten Bangla Basic Characters and Digits.

    Authors: Subhadip Basu, Sandip Rakshit, Debkumar Ghosal, Tanmoy Das, Subhrajit Dutta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The objective of the paper is to recognize handwritten samples of basic
    Bangla characters using Tesseract open source Optical Character Recognition
    (OCR) engine under Apache License 2.0. Handwritten data samples containing
    isolated Bangla basic characters and digits were collected from different
    users. Tesseract is trained with user-specific data samples of document pages
    to generate separate user-models representing a unique language-set. Each such
    language-set recognizes isolated basic Bangla handwritten test samples
    collected from the designated users.

  170. Recognition of handwritten Roman Numerals using Tesseract open source OCR engine.

    Authors: Subhadip Basu, Sandip Rakshit, Amitava Kundu, Mrinmoy Maity, Subhajit Mandal, Satwika Sarkar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The objective of the paper is to recognize handwritten samples of Roman
    numerals using Tesseract open source Optical Character Recognition (OCR)
    engine. Tesseract is trained with data samples of different persons to generate
    one user-independent language model, representing the handwritten Roman
    digit-set. The system is trained with 1226 digit samples collected form the
    different users.

  171. Robust multi-camera view face recognition.

    Authors: Dakshina Ranjan Kisku, Hunny Mehrotra, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents multi-appearance fusion of Principal Component Analysis
    (PCA) and generalization of Linear Discriminant Analysis (LDA) for multi-camera
    view offline face recognition (verification) system. The generalization of LDA
    has been extended to establish correlations between the face classes in the
    transformed representation and this is called canonical covariate.

  172. Tuning CLD Maps.

    Authors: Amelia Carolina Sparavigna, Roberto Marazzato
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The Coherence Length Diagram and the related maps have been shown to
    represent a useful tool for image analysis. Setting threshold parameters is one
    of the most important issues when dealing with such applications, as they
    affect both the computability, which is outlined by the support map, and the
    appearance of the coherence length diagram itself and of defect maps.

  173. Active Testing for Face Detection and Localization.

    Authors: Raphael Sznitman, Bruno Jedynak
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We provide a novel search technique, which uses a hierarchical model and a
    mutual information gain heuristic to efficiently prune the search space when
    localizing faces in images. We show exponential gains in computation over
    traditional sliding window approaches, while keeping similar performance
    levels.

  174. The Video Genome.

    Authors: Alexander M. Bronstein, Michael M. Bronstein, Ron Kimmel
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Fast evolution of Internet technologies has led to an explosive growth of
    video data available in the public domain and created unprecedented challenges
    in the analysis, organization, management, and control of such content. The
    problems encountered in video analysis such as identifying a video in a large
    database (e.g. detecting pirated content in YouTube), putting together video
    fragments, finding similarities and common ancestry between different versions
    of a video, have analogous counterpart problems in genetic research and
    analysis of DNA and protein sequences.

  175. Image Compression and Watermarking scheme using Scalar Quantization.

    Authors: Kilari Veera Swamy, B.Chandra Mohan, Y.V.Bhaskar Reddy, S.Srinivas Kumar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a new compression technique and image watermarking
    algorithm based on Contourlet Transform (CT). For image compression, an energy
    based quantization is used. Scalar quantization is explored for image
    watermarking. Double filter bank structure is used in CT. The Laplacian Pyramid
    (LP) is used to capture the point discontinuities, and then followed by a
    Directional Filter Bank (DFB) to link point discontinuities.

  176. Towards automated high-throughput screening of C. elegans on agar.

    Authors: Yoav Freund, Mayank Kabra, Annie L. Conery, Eyleen J. O'Rourke, Xin Xie, Vebjorn Ljosa, Thouis R. Jones, Frederick M. Ausubel, Gary Ruvkun, Anne E. Carpenter
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    High-throughput screening (HTS) using model organisms is a promising method
    to identify a small number of genes or drugs potentially relevant to human
    biology or disease. In HTS experiments, robots and computers do a significant
    portion of the experimental work. However, one remaining major bottleneck is
    the manual analysis of experimental results, which is commonly in the form of
    microscopy images. This manual inspection is labor intensive, slow and
    subjective.

  177. Land-cover Classification and Mapping for Eastern Himalayan State Sikkim.

    Authors: M. K. Ghose, Ratika Pradhan, Mohan P. Pradhan, Ashish Bhusan, Ronak K. Pradhan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Area of classifying satellite imagery has become a challenging task in
    current era where there is tremendous growth in settlement i.e. construction of
    buildings, roads, bridges, dam etc. This paper suggests an improvised k-means
    and Artificial Neural Network (ANN) classifier for land-cover mapping of
    Eastern Himalayan state Sikkim. The improvised k-means algorithm shows
    satisfactory results compared to existing methods that includes k-Nearest
    Neighbor and maximum likelihood classifier.

  178. A Comprehensive Review of Image Enhancement Techniques.

    Authors: Raman Maini, Himanshu Aggarwal
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Principle objective of Image enhancement is to process an image so that
    result is more suitable than original image for specific application. Digital
    image enhancement techniques provide a multitude of choices for improving the
    visual quality of images. Appropriate choice of such techniques is greatly
    influenced by the imaging modality, task at hand and viewing conditions. This
    paper will provide an overview of underlying concepts, along with algorithms
    commonly used for image enhancement.

  179. System-theoretic approach to image interest point detection.

    Authors: Vitaly Pimenov
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Interest point detection is a common task in various computer vision
    applications. Although a big variety of detector are developed so far
    computational efficiency of interest point based image analysis remains to be
    the problem. Current paper proposes a system-theoretic approach to interest
    point detection. Starting from the analysis of interdependency between detector
    and descriptor it is shown that given a descriptor it is possible to introduce
    to notion of detector redundancy. Furthermore for each detector it is possible
    to construct its irredundant and equivalent modification.

  180. On MMSE and MAP Denoising Under Sparse Representation Modeling Over a Unitary Dictionary.

    Authors: Michael Elad, Javier Turek, Irad Yavneh, Matan Protter
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Among the many ways to model signals, a recent approach that draws
    considerable attention is sparse representation modeling. In this model, the
    signal is assumed to be generated as a random linear combination of a few atoms
    from a pre-specified dictionary. In this work we analyze two Bayesian denoising
    algorithms -- the Maximum-Aposteriori Probability (MAP) and the
    Minimum-Mean-Squared-Error (MMSE) estimators, under the assumption that the
    dictionary is unitary.

  181. The Projected GSURE for Automatic Parameter Tuning in Iterative Shrinkage Methods.

    Authors: Michael Elad, Raja Giryes, Yonina C Eldar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Linear inverse problems are very common in signal and image processing. Many
    algorithms that aim at solving such problems include unknown parameters that
    need tuning. In this work we focus on optimally selecting such parameters in
    iterative shrinkage methods for image deblurring and image zooming. Our work
    uses the projected Generalized Stein Unbiased Risk Estimator (GSURE) for
    determining the threshold value lambda and the iterations number K in these
    algorithms. The proposed parameter selection is shown to handle any degradation
    operator, including ill-posed and even rectangular ones.

  182. Sliding window approach based Text Binarisation from Complex Textual images.

    Authors: Chitrakala Gopalan, D.Manjula
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Text binarisation process classifies individual pixels as text or background
    in the textual images. Binarization is necessary to bridge the gap between
    localization and recognition by OCR. This paper presents Sliding window method
    to binarise text from textual images with textured background. Suitable
    preprocessing techniques are applied first to increase the contrast of the
    image and blur the background noises due to textured background. Then Edges are
    detected by iterative thresholding.

  183. Pattern recognition using inverse resonance filtration.

    Authors: Olga Sofina, Yuriy Bunyak, Roman Kvetnyy
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    An approach to textures pattern recognition based on inverse resonance
    filtration (IRF) is considered. A set of principal resonance harmonics of
    textured image signal fluctuations eigen harmonic decomposition (EHD) is used
    for the IRF design. It was shown that EHD is invariant to textured image linear
    shift. The recognition of texture is made by transfer of its signal into
    unstructured signal which simple statistical parameters can be used for texture
    pattern recognition. Anomalous variations of this signal point on foreign
    objects.

  184. Fast space-variant elliptical filtering using box splines.

    Authors: Kunal Narayan Chaudhury, Michael Unser, Arrate Munoz-Barrutia
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The efficient realization of linear space-variant (non-convolution) filters
    is a challenging computational problem in image processing. In this paper, we
    demonstrate that it is possible to filter an image with a Gaussian-like
    elliptic window of varying size, elongation and orientation using a fixed
    number of computations per pixel. The associated algorithm, which is based on a
    family of smooth compactly supported piecewise polynomials, the
    radially-uniform box splines, is realized using pre-integration and local
    finite-differences.

  185. Facial Gesture Recognition Using Correlation And Mahalanobis Distance.

    Authors: Supriya Kapoor, Shruti Khanna, Rahul Bhatia
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Augmenting human computer interaction with automated analysis and synthesis
    of facial expressions is a goal towards which much research effort has been
    devoted recently. Facial gesture recognition is one of the important component
    of natural human-machine interfaces; it may also be used in behavioural
    science, security systems and in clinical practice. Although humans recognise
    facial expressions virtually without effort or delay, reliable expression
    recognition by machine is still a challenge.

  186. Properties of the Discrete Pulse Transform for Multi-Dimensional Arrays.

    Authors: Roumen Anguelov, Inger Fabris-Rotelli
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This report presents properties of the Discrete Pulse Transform on
    multi-dimensional arrays introduced by the authors two or so years ago. The
    main result given here in Lemma 2.1 is also formulated in a paper to appear in
    IEEE Transactions on Image Processing. However, the proof, being too technical,
    was omitted there and hence it appears in full in this publication.

  187. Scalable Large-Margin Mahalanobis Distance Metric Learning.

    Authors: Chunhua Shen, Junae Kim, Lei Wang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    For many machine learning algorithms such as $k$-Nearest Neighbor ($k$-NN)
    classifiers and $ k $-means clustering, often their success heavily depends on
    the metric used to calculate distances between different data points.

  188. Binarizing Business Card Images for Mobile Devices.

    Authors: Nibaran Das, Ram Sarkar, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri, Ayatullah Faruk Mollah
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Business card images are of multiple natures as these often contain graphics,
    pictures and texts of various fonts and sizes both in background and
    foreground. So, the conventional binarization techniques designed for document
    images can not be directly applied on mobile devices. In this paper, we have
    presented a fast binarization technique for camera captured business card
    images. A card image is split into small blocks. Some of these blocks are
    classified as part of the background based on intensity variance.

  189. Text Region Extraction from Business Card Images for Mobile Devices.

    Authors: Nibaran Das, Ram Sarkar, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri, Ayatullah Faruk Mollah
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Designing a Business Card Reader (BCR) for mobile devices is a challenge to
    the researchers because of huge deformation in acquired images, multiplicity in
    nature of the business cards and most importantly the computational constraints
    of the mobile devices. This paper presents a text extraction method designed in
    our work towards developing a BCR for mobile devices. At first, the background
    of a camera captured image is eliminated at a coarse level. Then, various rule
    based techniques are applied on the Connected Components (CC) to filter out the
    noises and picture regions.

  190. CLD-shaped Brushstrokes in Non-Photorealistic Rendering.

    Authors: Amelia Carolina Sparavigna, Roberto Marazzato
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Rendering techniques based on a random grid can be improved by adapting
    brushstrokes to the shape of different areas of the original picture. In this
    paper, the concept of Coherence Length Diagram is applied to determine the
    adaptive brushstrokes, in order to simulate an impressionist painting. Some
    examples are provided to instance the proposed algorithm.

  191. Supervised Learning of Digital image restoration based on Quantization Nearest Neighbor algorithm.

    Authors: Md. Imran Hossain, Syed Golam Rajib
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, an algorithm is proposed for Image Restoration. Such algorithm
    is different from the traditional approaches in this area, by utilizing priors
    that are learned from similar images. Original images and their degraded
    versions by the known degradation operators are utilized for designing the
    Quantization. The code vectors are designed using the blurred images. For each
    such vector, the high frequency information obtained from the original images
    is also available.

  192. Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier.

    Authors: Nibaran Das, Bindaban Das, Ram Sarkar, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A novel approach for recognition of handwritten compound Bangla characters,
    along with the Basic characters of Bangla alphabet, is presented here. Compared
    to English like Roman script, one of the major stumbling blocks in Optical
    Character Recognition (OCR) of handwritten Bangla script is the large number of
    complex shaped character classes of Bangla alphabet. In addition to 50 basic
    character classes, there are nearly 160 complex shaped compound character
    classes in Bangla alphabet.

  193. Automatic diagnosis of retinal diseases from color retinal images.

    Authors: D. Jayanthi, N. Devi, S. SwarnaParvathi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Teleophthalmology holds a great potential to improve the quality, access, and
    affordability in health care. For patients, it can reduce the need for travel
    and provide the access to a superspecialist. Ophthalmology lends itself easily
    to telemedicine as it is a largely image based diagnosis. The main goal of the
    proposed system is to diagnose the type of disease in the retina and to
    automatically detect and segment retinal diseases without human supervision or
    interaction.

  194. Intrinsic dimension estimation of data by principal component analysis.

    Authors: Bo Zhang, Hong Qiao, Mingyu Fan, Nannan Gu
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Estimating intrinsic dimensionality of data is a classic problem in pattern
    recognition and statistics. Principal Component Analysis (PCA) is a powerful
    tool in discovering dimensionality of data sets with a linear structure; it,
    however, becomes ineffective when data have a nonlinear structure. In this
    paper, we propose a new PCA-based method to estimate intrinsic dimension of
    data with nonlinear structures.

  195. Detection of Microcalcification in Mammograms Using Wavelet Transform and Fuzzy Shell Clustering.

    Authors: T. Balakumaran, I.L.A. Vennila, C. Gowri Shankar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Microcalcifications in mammogram have been mainly targeted as a reliable
    earliest sign of breast cancer and their early detection is vital to improve
    its prognosis. Since their size is very small and may be easily overlooked by
    the examining radiologist, computer-based detection output can assist the
    radiologist to improve the diagnostic accuracy. In this paper, we have proposed
    an algorithm for detecting microcalcification in mammogram.

  196. A Comparative Study of Removal Noise from Remote Sensing Image.

    Authors: Salem Saleh Al-amri, N. V. Kalyankar, S.D. Khamitkar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper attempts to undertake the study of three types of noise such as
    Salt and Pepper (SPN), Random variation Impulse Noise (RVIN), Speckle (SPKN).
    Different noise densities have been removed between 10% to 60% by using five
    types of filters as Mean Filter (MF), Adaptive Wiener Filter (AWF), Gaussian
    Filter (GF), Standard Median Filter (SMF) and Adaptive Median Filter (AMF). The
    same is applied to the Saturn remote sensing image and they are compared with
    one another. The comparative study is conducted with the help of Mean Square
    Errors (MSE) and Peak-Signal to Noise Ratio (PSNR).

  197. Ball-Scale Based Hierarchical Multi-Object Recognition in 3D Medical Images.

    Authors: Ulas Bagci, Jayaram K. Udupa, Xinjian Chen
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper investigates, using prior shape models and the concept of ball
    scale (b-scale), ways of automatically recognizing objects in 3D images without
    performing elaborate searches or optimization. That is, the goal is to place
    the model in a single shot close to the right pose (position, orientation, and
    scale) in a given image so that the model boundaries fall in the close vicinity
    of object boundaries in the image. This is achieved via the following set of
    key ideas: (a) A semi-automatic way of constructing a multi-object shape model
    assembly.

  198. The Influence of Intensity Standardization on Medical Image Registration.

    Authors: Ulas Bagci, Jayaram K. Udupa, Li Bai
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Acquisition-to-acquisition signal intensity variations (non-standardness) are
    inherent in MR images. Standardization is a post processing method for
    correcting inter-subject intensity variations through transforming all images
    from the given image gray scale into a standard gray scale wherein similar
    intensities achieve similar tissue meanings. The lack of a standard image
    intensity scale in MRI leads to many difficulties in tissue characterizability,
    image display, and analysis, including image segmentation.

  199. Face Recognition by Fusion of Local and Global Matching Scores using DS Theory: An Evaluation with Uni-classifier and Multi-classifier Paradigm.

    Authors: Dakshina Ranjan Kisku, Jamuna Kanta Sing, Phalguni Gupta, Massimo Tistarelli
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Faces are highly deformable objects which may easily change their appearance
    over time. Not all face areas are subject to the same variability. Therefore
    decoupling the information from independent areas of the face is of paramount
    importance to improve the robustness of any face recognition technique. This
    paper presents a robust face recognition technique based on the extraction and
    matching of SIFT features related to independent face areas. Both a global and
    local (as recognition from parts) matching strategy is proposed.

  200. Feature Level Clustering of Large Biometric Database.

    Authors: Dakshina Ranjan Kisku, Hunny Mehrotra, Phalguni Gupta, V. Bhawani Radhika, Banshidhar Majhi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper proposes an efficient technique for partitioning large biometric
    database during identification. In this technique feature vector which
    comprises of global and local descriptors extracted from offline signature are
    used by fuzzy clustering technique to partition the database. As biometric
    features posses no natural order of sorting, thus it is difficult to index them
    alphabetically or numerically. Hence, some supervised criteria is required to
    partition the search space.

  201. Face Identification by SIFT-based Complete Graph Topology.

    Authors: Dakshina Ranjan Kisku, Ajita Rattani, Enrico Grosso, Massimo Tistarelli
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a new face identification system based on Graph Matching
    Technique on SIFT features extracted from face images. Although SIFT features
    have been successfully used for general object detection and recognition, only
    recently they were applied to face recognition. This paper further investigates
    the performance of identification techniques based on Graph matching topology
    drawn on SIFT features which are invariant to rotation, scaling and
    translation.

  202. SIFT-based Ear Recognition by Fusion of Detected Keypoints from Color Similarity Slice Regions.

    Authors: Dakshina Ranjan Kisku, Hunny Mehrotra, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Ear biometric is considered as one of the most reliable and invariant
    biometrics characteristics in line with iris and fingerprint characteristics.
    In many cases, ear biometrics can be compared with face biometrics regarding
    many physiological and texture characteristics. In this paper, a robust and
    efficient ear recognition system is presented, which uses Scale Invariant
    Feature Transform (SIFT) as feature descriptor for structural representation of
    ear images.

  203. Feature Level Fusion of Biometrics Cues: Human Identification with Doddingtons Caricature.

    Authors: Dakshina Ranjan Kisku, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents a multimodal biometric system of fingerprint and ear
    biometrics. Scale Invariant Feature Transform (SIFT) descriptor based feature
    sets extracted from fingerprint and ear are fused. The fused set is encoded by
    K-medoids partitioning approach with less number of feature points in the set.
    K-medoids partition the whole dataset into clusters to minimize the error
    between data points belonging to the clusters and its center. Reduced feature
    set is used to match between two biometric sets.

  204. Fusion of Multiple Matchers using SVM for Offline Signature Identification.

    Authors: Dakshina Ranjan Kisku, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper uses Support Vector Machines (SVM) to fuse multiple classifiers
    for an offline signature system. From the signature images, global and local
    features are extracted and the signatures are verified with the help of
    Gaussian empirical rule, Euclidean and Mahalanobis distance based classifiers.
    SVM is used to fuse matching scores of these matchers.

  205. Some considerations on how the human brain must be arranged in order to make its replication in a thinking machine possible.

    Authors: Emanuel Diamant
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    For the most of my life, I have earned my living as a computer vision
    professional busy with image processing tasks and problems. In the computer
    vision community there is a widespread belief that artificial vision systems
    faithfully replicate human vision abilities or at least very closely mimic
    them. It was a great surprise to me when one day I have realized that computer
    and human vision have next to nothing in common.

  206. Kannada Character Recognition System A Review.

    Authors: K. Indira, S. Sethu Selvi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Intensive research has been done on optical character recognition ocr and a
    large number of articles have been published on this topic during the last few
    decades. Many commercial OCR systems are now available in the market, but most
    of these systems work for Roman, Chinese, Japanese and Arabic characters. There
    are no sufficient number of works on Indian language character recognition
    especially Kannada script among 12 major scripts in India. This paper presents
    a review of existing work on printed Kannada script and their results.

  207. Threshold Based Indexing of Commercial Shoe Print to Create Reference and Recovery Images.

    Authors: S. Rathinavel, S. Arumugam
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    One of the important evidence in a crime scene that is normally overlooked
    but very important evidence is shoe print as the criminal is normally unaware
    of the mask for this. In this paper we use image processing technique to
    process reference shoe images to make it index-able for a search from the
    database the shoe print impressions available in the commercial market.

  208. Detection and Demarcation of Tumor using Vector Quantization in MRI images.

    Authors: H. B. Kekre, Tanuja K. Sarode, Saylee M. Gharge
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Segmenting a MRI images into homogeneous texture regions representing
    disparate tissue types is often a useful preprocessing step in the
    computer-assisted detection of breast cancer. That is why we proposed new
    algorithm to detect cancer in mammogram breast cancer images. In this paper we
    proposed segmentation using vector quantization technique. Here we used Linde
    Buzo-Gray algorithm (LBG) for segmentation of MRI images. Initially a codebook
    of size 128 was generated for MRI images. These code vectors were further
    clustered in 8 clusters using same LBG algorithm.

  209. SVM-based Multiview Face Recognition by Generalization of Discriminant Analysis.

    Authors: Dakshina Ranjan Kisku, Hunny Mehrotra, Jamuna Kanta Sing, Phalguni Gupta
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Identity verification of authentic persons by their multiview faces is a real
    valued problem in machine vision. Multiview faces are having difficulties due
    to non-linear representation in the feature space. This paper illustrates the
    usability of the generalization of LDA in the form of canonical covariate for
    face recognition to multiview faces. In the proposed work, the Gabor filter
    bank is used to extract facial features that characterized by spatial
    frequency, spatial locality and orientation.

  210. Multi-camera Realtime 3D Tracking of Multiple Flying Animals.

    Authors: Andrew D. Straw, Kristin Branson, Titus R. Neumann, Michael H. Dickinson
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Automated tracking of animal movement allows analyses that would not
    otherwise be possible by providing great quantities of data. The additional
    capability of tracking in realtime - with minimal latency - opens up the
    experimental possibility of manipulating sensory feedback, thus allowing
    detailed explorations of the neural basis for control of behavior. Here we
    describe a new system capable of tracking the position and body orientation of
    animals such as flies and birds. The system operates with less than 40 msec
    latency and can track multiple animals simultaneously.

  211. Gradient Based Seeded Region Grow method for CT Angiographic Image Segmentation.

    Authors: T.R. Gopalakrishnan Nair, G.N. Harikrishna Rai
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Segmentation of medical images using seeded region growing technique is
    increasingly becoming a popular method because of its ability to involve
    high-level knowledge of anatomical structures in seed selection process. Region
    based segmentation of medical images are widely used in varied clinical
    applications like visualization, bone detection, tumor detection and
    unsupervised image retrieval in clinical databases. As medical images are
    mostly fuzzy in nature, segmenting regions based intensity is the most
    challenging task.

  212. Hybrid Medical Image Classification Using Association Rule Mining with Decision Tree Algorithm.

    Authors: P. Rajendran, M.Madheswaran
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The main focus of image mining in the proposed method is concerned with the
    classification of brain tumor in the CT scan brain images. The major steps
    involved in the system are: pre-processing, feature extraction, association
    rule mining and hybrid classifier. The pre-processing step has been done using
    the median filtering process and edge features have been extracted using canny
    edge detection technique. The two image mining approaches with a hybrid manner
    have been proposed in this paper.

  213. 3D Skull Recognition Using 3D Matching Technique.

    Authors: A.A Zaidan, B.B Zaidan, Hamdan.O.Alanazi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Biometrics has become a "hot" area. Governments are funding research programs
    focused on biometrics. In this paper the problem of person recognition and
    verification based on a different biometric application has been addressed. The
    system is based on the 3DSkull recognition using 3D matching technique, in fact
    this paper present several bio-metric approaches in order of assign the weak
    point in term of used the biometric from the authorize person and insure the
    person who access the data is the real person.

  214. Features Based Text Similarity Detection.

    Authors: Chow Kok Kent, Naomie Salim
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    As the Internet help us cross cultural border by providing different
    information, plagiarism issue is bound to arise. As a result, plagiarism
    detection becomes more demanding in overcoming this issue. Different plagiarism
    detection tools have been developed based on various detection techniques.
    Nowadays, fingerprint matching technique plays an important role in those
    detection tools. However, in handling some large content articles, there are
    some weaknesses in fingerprint matching technique especially in space and time
    consumption issue.

  215. An Explicit Nonlinear Mapping for Manifold Learning.

    Authors: Di Wang, Bo Zhang, Hong Qiao, Peng Zhang
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Manifold learning is a hot research topic in the field of computer science
    and has many applications in the real world. A main drawback of manifold
    learning methods is, however, that there is no explicit mappings from the input
    data manifold to the output embedding. This prohibits the application of
    manifold learning methods in many practical problems such as classification and
    target detection.

  216. Analytical shape determination of fiber-like objects with Virtual Image Correlation.

    Authors: Benoit Semin, Marc Louis Maurice François, Harold Auradou
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper reports a method allowing for the determination of the shape of
    deformed fiber-like objects. Compared to existing methods, it provides
    analytical results including the local slope and curvature which are of first
    importance, for instance, in beam mechanics. The presented VIC (Virtual Image
    Correlation) method consists in looking for the best correlation between the
    image of the fiber-like object and a virtual beam image, using an algorithm
    close to the Digital Image Correlation method developed in experimental solid
    mechanics.

  217. An Improved Image Mining Technique For Brain Tumour Classification Using Efficient Classifier.

    Authors: P. Rajendran, M. Madheswaran
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    An improved image mining technique for brain tumor classification using
    pruned association rule with MARI algorithm is presented in this paper. The
    method proposed makes use of association rule mining technique to classify the
    CT scan brain images into three categories namely normal, benign and malign. It
    combines the low level features extracted from images and high level knowledge
    from specialists. The developed algorithm can assist the physicians for
    efficient classification with multiple keywords per image to improve the
    accuracy.

  218. A Topological derivative based image segmentation for sign language recognition system using isotropic filter.

    Authors: M. Krishnaveni, Dr. V. Radha
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The need of sign language is increasing radically especially to hearing
    impaired community. Only few research groups try to automatically recognize
    sign language from video, colored gloves and etc. Their approach requires a
    valid segmentation of the data that is used for training and of the data that
    is used to be recognized. Recognition of a sign language image sequence is
    challenging because of the variety of hand shapes and hand motions.

  219. A New Method to Extract Dorsal Hand Vein Pattern using Quadratic Inference Function.

    Authors: Maleika Heenaye Mamode Khan, Naushad Ali Mamode Khan
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Among all biometric, dorsal hand vein pattern is attracting the attention of
    researchers, of late. Extensive research is being carried out on various
    techniques in the hope of finding an efficient one which can be applied on
    dorsal hand vein pattern to improve its accuracy and matching time. One of the
    crucial step in biometric is the extraction of features. In this paper, we
    propose a method based on quadratic inference function to the dorsal hand vein
    features to extract its features. The biometric system developed was tested on
    a database of 100 images.

  220. Recognition of Regular Shapes in Satelite Images.

    Authors: Ali Pourmohammad, Ahmad Reza Eskandari
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper has been withdrawn by the author ali pourmohammad.

  221. Using SLP Neural Network to Persian Handwritten Digits Recognition.

    Authors: Ali Pourmohammad, Seyed Mohammad Ahadi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper has been withdrawn by the author ali pourmohammad.

  222. Boosting k-NN for categorization of natural scenes.

    Authors: Frank Nielsen, Paolo Piro, Richard Nock, Michel Barlaud
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The k-nearest neighbors (k-NN) classification rule has proven extremely
    successful in countless many computer vision applications. For example, image
    categorization often relies on uniform voting among the nearest prototypes in
    the space of descriptors. In spite of its good properties, the classic k-NN
    rule suffers from high variance when dealing with sparse prototype datasets in
    high dimensions. A few techniques have been proposed to improve k-NN
    classification, which rely on either deforming the nearest neighborhood
    relationship or modifying the input space.

  223. An Unsupervised Algorithm For Learning Lie Group Transformations.

    Authors: Jascha Sohl-Dickstein, Jimmy C. Wang, Bruno A. Olshausen
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We describe a method for learning Lie, or continuous transformation, group
    descriptions of the dynamics of natural scenes. Naively, doing so is made
    difficult by the O(N^6) computational complexity in the number of pixels N for
    learning of the Lie group operators, and an abundance of local minima while
    inferring transformations for specific image sequences.

  224. Accelerating Competitive Learning Graph Quantization.

    Authors: Klaus Obermayer, Brijnesh J. Jain
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Vector quantization(VQ) is a lossy data compression technique from signal
    processing for which simple competitive learning is one standard method to
    quantize patterns from the input space. Extending competitive learning VQ to
    the domain of graphs results in competitive learning for quantizing input
    graphs. In this contribution, we propose an accelerated version of competitive
    learning graph quantization (GQ) without trading computational time against
    solution quality. For this, we lift graphs locally to vectors in order to avoid
    unnecessary calculations of intractable graph distances.

  225. A Novel Feature Extraction for Robust EMG Pattern Recognition.

    Authors: Angkoon Phinyomark, Chusak Limsakul, Pornchai Phukpattaranont
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Varieties of noises are major problem in recognition of Electromyography
    (EMG) signal. Hence, methods to remove noise become most significant in EMG
    signal analysis. White Gaussian noise (WGN) is used to represent interference
    in this paper. Generally, WGN is difficult to be removed using typical
    filtering and solutions to remove WGN are limited. In addition, noise removal
    is an important step before performing feature extraction, which is used in
    EMG-based recognition. This research is aimed to present a novel feature that
    tolerate with WGN.

  226. Matching 2-D Ellipses to 3-D Circles with Application to Vehicle Pose Estimation.

    Authors: Marcus Hutter, Nathan Brewer
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Finding the three-dimensional representation of all or a part of a scene from
    a single two dimensional image is a challenging task. In this paper we propose
    a method for identifying the pose and location of objects with circular
    protrusions in three dimensions from a single image and a 3d representation or
    model of the object of interest. To do this, we present a method for
    identifying ellipses and their properties quickly and reliably with a novel
    technique that exploits intensity differences between objects and a geometric
    technique for matching an ellipse in 2d to a circle in 3d.

  227. Synthesis of supervised classification algorithm using intelligent and statistical tools.

    Authors: Ali Douik, Mourad Moussa Jlassi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A fundamental task in detecting foreground objects in both static and dynamic
    scenes is to take the best choice of color system representation and the
    efficient technique for background modeling. We propose in this paper a
    non-parametric algorithm dedicated to segment and to detect objects in color
    images issued from a football sports meeting. Indeed segmentation by pixel
    concern many applications and revealed how the method is robust to detect
    objects, even in presence of strong shadows and highlights.

  228. Heart Rate Variability Analysis Using Threshold of Wavelet Package Coefficients.

    Authors: G. Kheder, A. Kachouri, M. Ben Massoued, M. Samet
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    In this paper, a new efficient feature extraction method based on the
    adaptive threshold of wavelet package coefficients is presented. This paper
    especially deals with the assessment of autonomic nervous system using the
    background variation of the signal Heart Rate Variability HRV extracted from
    the wavelet package coefficients. The application of a wavelet package
    transform allows us to obtain a time-frequency representation of the signal,
    which provides better insight in the frequency distribution of the signal with
    time.

  229. Maximin affinity learning of image segmentation.

    Authors: Srinivas C. Turaga, Kevin L. Briggman, Moritz Helmstaedter, Winfried Denk, H. Sebastian Seung
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Images can be segmented by first using a classifier to predict an affinity
    graph that reflects the degree to which image pixels must be grouped together
    and then partitioning the graph to yield a segmentation. Machine learning has
    been applied to the affinity classifier to produce affinity graphs that are
    good in the sense of minimizing edge misclassification rates. However, this
    error measure is only indirectly related to the quality of segmentations
    produced by ultimately partitioning the affinity graph.

  230. Pigment Melanin: Pattern for Iris Recognition.

    Authors: Mahdi S. Hosseini, Babak N. Araabi, Hamid Soltanian-Zadeh
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Recognition of iris based on Visible Light (VL) imaging is a difficult
    problem because of the light reflection from the cornea. Nonetheless, pigment
    melanin provides a rich feature source in VL, unavailable in Near-Infrared
    (NIR) imaging. This is due to biological spectroscopy of eumelanin, a chemical
    not stimulated in NIR. In this case, a plausible solution to observe such
    patterns may be provided by an adaptive procedure using a variational technique
    on the image histogram. To describe the patterns, a shape analysis method is
    used to derive feature-code for each subject.

  231. Non-photorealistic image processing: an Impressionist rendering.

    Authors: Amelia Carolina Sparavigna, Roberto Marazzato
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The paper describes an image processing for a non-photorealistic rendering.
    The algorithm is based on a random choice of a set of pixels from those ot the
    original image and substitution of them with colour spots. An iterative
    procedure is applied to cover, at a desired level, the canvas. The resulting
    effect mimics the impressionist painting and Pointillism.

  232. CanICA: Model-based extraction of reproducible group-level ICA patterns from fMRI time series.

    Authors: Gaël Varoquaux, Sepideh Sadaghiani, Jean Baptiste Poline, Bertrand Thirion
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Spatial Independent Component Analysis (ICA) is an increasingly used
    data-driven method to analyze functional Magnetic Resonance Imaging (fMRI)
    data. To date, it has been used to extract meaningful patterns without prior
    information. However, ICA is not robust to mild data variation and remains a
    parameter-sensitive algorithm. The validity of the extracted patterns is hard
    to establish, as well as the significance of differences between patterns
    extracted from different groups of subjects.

  233. Breast Cancer Detection Using Multilevel Thresholding.

    Authors: Y. Ireaneus Anna Rejani, S.Thamarai Selvi
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    This paper presents an algorithm which aims to assist the radiologist in
    identifying breast cancer at its earlier stages. It combines several image
    processing techniques like image negative, thresholding and segmentation
    techniques for detection of tumor in mammograms. The algorithm is verified by
    using mammograms from Mammographic Image Analysis Society. The results obtained
    by applying these techniques are described.

  234. An Innovative Scheme For Effectual Fingerprint Data Compression Using Bezier Curve Representations.

    Authors: Vani Perumal, Jagannathan Ramaswamy
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Naturally, with the mounting application of biometric systems, there arises a
    difficulty in storing and handling those acquired biometric data. Fingerprint
    recognition has been recognized as one of the most mature and established
    technique among all the biometrics systems. In recent times, with fingerprint
    recognition receiving increasingly more attention the amount of fingerprints
    collected has been constantly creating enormous problems in storage and
    transmission.

  235. The Cyborg Astrobiologist: Testing a Novelty-Detection Algorithm on Two Mobile Exploration Systems at Rivas Vaciamadrid in Spain and at the Mars Desert Research Station in Utah.

    Authors: P.C. McGuire, C. Gross, L. Wendt, A. Bonnici, V. Souza-Egipsy, J. Ormo, E. Diaz-Martinez, B.H. Foing, R. Bose, S. Walter, M. Oesker, J. Ontrup, R. Haschke, H. Ritter
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    (ABRIDGED)In previous work, two platforms have been developed for testing
    computer-vision algorithms for robotic planetary exploration (McGuire et al.
    2004b,2005; Bartolo et al. 2007). The wearable-computer platform has been
    tested at geological and astrobiological field sites in Spain (Rivas
    Vaciamadrid and Riba de Santiuste), and the phone-camera has been tested at a
    geological field site in Malta.

  236. An Iterative Shrinkage Approach to Total-Variation Image Restoration.

    Authors: Oleg Michailovich
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The problem of restoration of digital images from their degraded measurements
    plays a central role in a multitude of practically important applications. A
    particularly challenging instance of this problem occurs in the case when the
    degradation phenomenon is modeled by an ill-conditioned operator. In such a
    case, the presence of noise makes it impossible to recover a valuable
    approximation of the image of interest without using some a priori information
    about its properties. Such a priori information is essential for image
    restoration, rendering it stable and robust to noise.

  237. Algorithms for Image Analysis and Combination of Pattern Classifiers with Application to Medical Diagnosis.

    Authors: Harris Georgiou
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Medical Informatics and the application of modern signal processing in the
    assistance of the diagnostic process in medical imaging is one of the more
    recent and active research areas today. This thesis addresses a variety of
    issues related to the general problem of medical image analysis, specifically
    in mammography, and presents a series of algorithms and design approaches for
    all the intermediate levels of a modern system for computer-aided diagnosis
    (CAD).

  238. Behavior Subtraction.

    Authors: V. Saligrama, P. M. Jodoin, J. Konrad
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Background subtraction has been a driving engine for many computer vision and
    video analytics tasks. Although its many variants exist, they all share the
    underlying assumption that photometric scene properties are either static or
    exhibit temporal stationarity. While this works in some applications, the model
    fails when one is interested in discovering {\it changes in scene dynamics}
    rather than those in a static background; detection of unusual pedestrian and
    motor traffic patterns is but one example.

  239. Fractional differentiation based image processing.

    Authors: Amelia Carolina Sparavigna
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    There are many resources useful for processing images, most of them freely
    available and quite friendly to use. In spite of this abundance of tools, a
    study of the processing methods is still worthy of efforts. Here, we want to
    discuss the new possibilities arising from the use of fractional differential
    calculus. This calculus evolved in the research field of pure mathematics until
    1920, when applied science started to use it. Only recently, fractional
    calculus was involved in image processing methods.

  240. Positive Semidefinite Metric Learning with Boosting.

    Authors: Chunhua Shen, Junae Kim, Lei Wang, Anton van den Hengel
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The learning of appropriate distance metrics is a critical problem in image
    classification and retrieval. In this work, we propose a boosting-based
    technique, termed \BoostMetric, for learning a Mahalanobis distance metric. One
    of the primary difficulties in learning such a metric is to ensure that the
    Mahalanobis matrix remains positive semidefinite.

  241. A dyadic solution of relative pose problems.

    Authors: Patrick Erik Bradley
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A hierarchical interval subdivision is shown to lead to a $p$-adic encoding
    of image data. This allows in the case of the relative pose problem in computer
    vision and photogrammetry to derive equations having 2-adic numbers as
    coefficients, and to use Hensel's lifting method to their solution. This method
    is applied to the linear and non-linear equations coming from eight, seven or
    five point correspondences. An inherent property of the method is its
    robustness.

  242. A Matlab Implementation of a Flat Norm Motivated Polygonal Edge Matching Method using a Decomposition of Boundary into Four 1-Dimensional Currents.

    Authors: Wotao Yin, Simon P Morgan, Kevin R Vixie
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We describe and provide code and examples for a polygonal edge matching
    method.

  243. Improvements of the 3D images captured with Time-of-Flight cameras.

    Authors: D. Falie
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    3D Time-of-Flight camera's images are affected by errors due to the diffuse
    (indirect) light and to the flare light. The presented method improves the 3D
    image reducing the distance's errors to dark surface objects. This is achieved
    by placing one or two contrast tags in the scene at different distances from
    the ToF camera. The white and black parts of the tags are situated at the same
    distance to the camera but the distances measured by the camera are different.
    This difference is used to compute a correction vector.

  244. A Method for Extraction and Recognition of Isolated License Plate Characters.

    Authors: Yon Ping Chen, Tien Der Yeh
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    A method to extract and recognize isolated characters in license plates is
    proposed. In extraction stage, the proposed method detects isolated characters
    by using Difference-of-Gaussian (DOG) function, The DOG function, similar to
    Laplacian of Gaussian function, was proven to produce the most stable image
    features compared to a range of other possible image functions. The candidate
    characters are extracted by doing connected component analysis on different
    scale DOG images.

  245. A possible low-level explanation of "temporal dynamics of brightness induction and White's illusion".

    Authors: Subhajit Karmakar, Sandip Sarkar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Based upon physiological observation on time dependent orientation
    selectivity in the cells of macaque's primary visual cortex together with the
    psychophysical studies on the tuning of orientation detectors in human vision
    we suggest that time dependence in brightness perception can be accommodated
    through the time evolution of cortical contribution to the orientation tuning
    of the ODoG filter responses. A set of Difference of Gaussians functions has
    been used to mimic the time dependence of orientation tuning.

  246. Median K-flats for hybrid linear modeling with many outliers.

    Authors: Teng Zhang, Arthur Szlam, Gilad Lerman
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We describe the Median K-Flats (MKF) algorithm, a simple online method for
    hybrid linear modeling, i.e., for approximating data by a mixture of flats.
    This algorithm simultaneously partitions the data into clusters while finding
    their corresponding best approximating l1 d-flats, so that the cumulative l1
    error is minimized. The current implementation restricts d-flats to be
    d-dimensional linear subspaces.

  247. An Efficient Secure Multimodal Biometric Fusion Using Palmprint and Face Image.

    Authors: M. Nageshkumar, P.K. Mahesh, M.N.S. Swamy
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Biometrics based personal identification is regarded as an effective method
    for automatically recognizing, with a high confidence a person's identity. A
    multimodal biometric systems consolidate the evidence presented by multiple
    biometric sources and typically better recognition performance compare to
    system based on a single biometric modality. This paper proposes an
    authentication method for a multimodal biometric system identification using
    two traits i.e. face and palmprint. The proposed system is designed for
    application where the training data contains a face and palmprint.

  248. Sparsity and `something else'.

    Authors: James Bowley, Laura Rebollo-Neira
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The property of sparse representations concerning capability for information
    storage is discussed. It is shown that this feature can be used, for instance,
    for an application that we term Image Folding. The proposed procedure is
    applicable by means of any suitable transformation. However, it is also the aim
    of this paper to draw attention in regard to the gain in the sparsity of an
    image representation achieved by combination of Discrete Cosine a Dirac
    dictionaries.

  249. Sparsity and `something else'.

    Authors: James Bowley, Laura Rebollo-Neira
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    The property of sparse representations concerning capability for information
    storage is discussed. It is shown that this feature can be used, for instance,
    for an application that we term Image Folding. The proposed procedure is
    applicable by means of any suitable transformation. However, it is also the aim
    of this paper to draw attention in regard to the gain in the sparsity of an
    image representation achieved by combination of Discrete Cosine a Dirac
    dictionaries.

  250. Motion Segmentation by SCC on the Hopkins 155 Database.

    Authors: G. Chen, G. Lerman
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    We apply the Spectral Curvature Clustering (SCC) algorithm to a benchmark
    database of 155 motion sequences, and show that it outperforms all other
    state-of-the-art methods. The average misclassification rate by SCC is 1.41%
    for sequences having two motions and 4.85% for three motions.

  251. Kernel Spectral Curvature Clustering (KSCC).

    Authors: G. Chen, S. Atev, G. Lerman
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Multi-manifold modeling is increasingly used in segmentation and data
    representation tasks in computer vision and related fields. While the general
    problem, modeling data by mixtures of manifolds, is very challenging, several
    approaches exist for modeling data by mixtures of affine subspaces (which is
    often referred to as hybrid linear modeling). We translate some important
    instances of multi-manifold modeling to hybrid linear modeling in embedded
    spaces, without explicitly performing the embedding but applying the kernel
    trick.

  252. Scale-Based Gaussian Coverings: Combining Intra and Inter Mixture Models in Image Segmentation.

    Authors: Fionn Murtagh, Pedro Contreras, Jean-Luc Starck
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    By a "covering" we mean a Gaussian mixture model fit to observed data.
    Approximations of the Bayes factor can be availed of to judge model fit to the
    data within a given Gaussian mixture model. Between families of Gaussian
    mixture models, we propose the R\'enyi quadratic entropy as an excellent and
    tractable model comparison framework.

  253. Handwritten Farsi Character Recognition using Artificial Neural Network.

    Authors: Reza Gharoie Ahangar, Mohammad Farajpoor Ahangar
    Subjects: Computer Vision and Pattern Recognition
    Abstract

    Neural Networks are being used for character recognition from last many years
    but most of the work was confined to English character recognition. Till date,
    a very little work has been reported for Handwritten Farsi Character
    recognition. In this paper, we have made an attempt to recognize handwritten
    Farsi characters by using a multilayer perceptron with one hidden layer. The
    error backpropagation algorithm has been used to train the MLP network.

Syndicate content