Computation

  1. Simulation of stochastic systems via polynomial chaos expansions and convex optimization.

    Authors: Lorenzo Fagiano, Mustafa Khammash
    Subjects: Computation
    Abstract

    Polynomial Chaos Expansions represent a powerful tool to simulate stochastic
    models of dynamical systems. Yet, deriving the expansion's coefficients for
    complex systems might require a significant and non-trivial manipulation of the
    model, or the computation of large numbers of simulation runs, rendering the
    approach too time consuming and impracticable for applications with more than a
    handful of random variables. We introduce a novel computationally tractable
    technique for computing the coefficients of polynomial chaos expansions.

  2. Mcmc Methods for Functions: Modifying Old Algorithms to make them Faster.

    Authors: Gareth Roberts, Andrew Stuart, Simon Cotter, David White
    Subjects: Computation
    Abstract

    Many problems arising in applications result in the need to probe a
    probability distribution for functions. Examples include Bayesian nonparametric
    statistics and conditioned diffusion processes. Standard MCMC algorithms
    typically become arbitrarily slow under the mesh refinement dictated by
    nonparametric description of the unknown function. We describe an approach to
    modifying a whole range of MCMC methods which ensures that their speed of
    convergence is robust under mesh refinement.

  3. Split HMC for Gaussian Process Models.

    Authors: Babak Shahbaba, Shiwei Lan
    Subjects: Computation
    Abstract

    In this paper, we discuss an extension of the Split Hamiltonian Monte Carlo
    (Split HMC) method for Gaussian process model (GPM). This method is based on
    splitting the Hamiltonian in a way that allows much of the movement around the
    state space to be done at low computational cost. To this end, we approximate
    the negative log density (i.e., the energy function) of the distribution of
    interest by a quadratic function U0 for which Hamiltonian dynamics can be
    solved analytically. The overall energy function U is then written as U0 + U1,
    where U1 is the approximation error.

  4. Bayesian Parameter Inference for Partially Observed Stopped Processes.

    Authors: Ajay Jasra, Nikolas Kantas
    Subjects: Computation
    Abstract

    In this article we consider Bayesian parameter inference associated to
    partially-observed stochastic processes that start from a set B0 and are
    stopped or killed at the first hitting time of a known set A. Such processes
    occur naturally within the context of a wide variety of applications. The
    associated posterior distributions are highly complex and posterior parameter
    inference requires the use of advanced Markov chain Monte Carlo (MCMC)
    techniques. Our approach uses a recently introduced simulation methodology,
    particle Markov chain Monte Carlo (PMCMC) (Andrieu et. al.

  5. A Generic Path Algorithm for Regularized Statistical Estimation.

    Authors: Yichao Wu, Hua Zhou
    Subjects: Computation
    Abstract

    Regularization is widely used in statistics and machine learning to prevent
    overfitting and gear solution towards prior information. In general, a
    regularized estimation problem minimizes the sum of a loss function and a
    penalty term. The penalty term is usually weighted by a tuning parameter and
    encourages certain constraints on the parameters to be estimated. Particular
    choices of constraints lead to the popular lasso, fused-lasso, and other
    generalized $l_1$ penalized regression methods.

  6. Path Following and Empirical Bayes Model Selection for Sparse Regression.

    Authors: Hua Zhou, Artin Armagan, David B. Dunson
    Subjects: Computation
    Abstract

    In recent years, a rich variety of regularization procedures have been
    proposed for high dimensional regression problems. However, tuning parameter
    choice and computational efficiency in ultra-high dimensional problems remain
    vexing issues. The routine use of $\ell_1$ regularization is largely
    attributable to the computational efficiency of the LARS algorithm, but similar
    efficiency for better behaved penalties has remained elusive. In this article,
    we propose a highly efficient path following procedure for combination of any
    convex loss function and a broad class of penalties.

  7. Bergm: Bayesian Exponential Random Graphs in R.

    Authors: N. Friel, A. Caimo
    Subjects: Computation
    Abstract

    In this paper we describe the basic features of the Bergm package for the
    open-source R software which provides a comprehensive framework for Bayesian
    analysis for exponential random graph models: tools for parameter estimation,
    model selection and goodness-of-fit diagnostics. We illustrate the capabilities
    of this package describing the algorithms that drive the package through a
    tutorial analysis of two well-known network datasets.

  8. Mixed Beta Regression: A Bayesian Perspective.

    Authors: Silvia L.P. Ferrari, Jorge I. Figueroa-Zuñiga, Reinaldo B. Arellano-Valle
    Subjects: Computation
    Abstract

    This paper builds on recent research that focuses on regression modeling of
    continuous bounded data, such as proportions measured on a continuous scale.
    Specifically, it deals with beta regression models with mixed effects from a
    Bayesian approach.

  9. Bayesian model selection for exponential random graph models.

    Authors: Nial Friel, Alberto Caimo
    Subjects: Computation
    Abstract

    Exponential random graph models are a class of widely used exponential family
    models for social networks. The topological structure of an observed network is
    modeled by the relative prevalence of a set of local sub-graph configurations
    termed network statistics. One of the key tasks in the application of these
    models is which network statistics to include in the model. This can be thought
    of as statistical model selection problem.

  10. Simply Explicitly Invertible Approximations to 4 Decimals of Error Function and Normal Cumulative Distribution Function.

    Authors: A. Soranzo, E. Epure
    Subjects: Computation
    Abstract

    We improve the Modified Winitzki's Approximation of the error function
    $erf(x)\cong \sqrt{1-e^{-x^2\frac{\frac{4}{\pi}+0.147x^2}{1+0.147x^2}}}$ which
    has error $|\varepsilon (x)| < 1.25 \cdot 10^{-4}$ $\forall x \ge 0$ till
    reaching 4 decimals of precision with $|\varepsilon (x)| < 2.27 \cdot 10^{-5}$;
    also reducing slightly the relative error. Old formula and ours are both
    explicitly invertible, essentially solving a biquadratic equation, after
    obvious substitutions.

  11. Summarizing posterior distributions in signal decomposition problems when the number of components is unknown.

    Authors: Julien Bect, Alireza Roodaki, Gilles Fleury
    Subjects: Computation
    Abstract

    This paper addresses the problem of summarizing the posterior distributions
    that typically arise, in a Bayesian framework, when dealing with signal
    decomposition problems with unknown number of components. Such posterior
    distributions are defined over union of subspaces of differing dimensionality
    and can be sampled from using modern Monte Carlo techniques, for instance the
    increasingly popular RJ-MCMC method. No generic approach is available, however,
    to summarize the resulting variable-dimensional samples and extract from them
    component-specific parameters.

  12. Note on the computation of the Metropolis-Hastings ratio for Birth-or-Death moves in trans-dimensional MCMC algorithms for signal decomposition problems.

    Authors: Julien Bect, Alireza Roodaki, Gilles Fleury
    Subjects: Computation
    Abstract

    Reversible jump MCMC (RJ-MCMC) sampling techniques, which allow to jointly
    tackle model selection and parameter estimation problems in a coherent Bayesian
    framework, have become increasingly popular in the signal processing literature
    since the seminal paper of Andrieu and Doucet (IEEE Trans. Signal Process.,
    47(10), 1999). Crucial to the implementation of any RJ-MCMC sampler is the
    computation of the so-called Metropolis-Hastings-Green (MHG) ratio, which
    determines the acceptance probability for the proposed moves.

  13. Particle Approximation of the Filtering Density for State-Space Markov Models in Discrete Time.

    Authors: Dan Crisan, Joaquin Miguez
    Subjects: Computation
    Abstract

    Sequential Monte Carlo (SMC) methods, also known as particle filters, are
    simulation-based recursive algorithms for the approximation of the a posteriori
    probability measures generated by state-space dynamical models. At any given
    (discrete) time t, a particle filter produces a set of samples over the state
    space of the system of interest. These samples are referred to as "particles"
    and can be used to build a discrete approximation of the a posteriori
    probability distribution of the state, conditional on a sequence of available
    observations.

  14. Self-Avoiding Random Dynamics on Integer Complex Systems.

    Authors: Firas Hamze, Nando de Freitas, Ziyu Wang
    Subjects: Computation
    Abstract

    This paper introduces a new specialized algorithm for equilibrium Monte Carlo
    sampling of binary-valued systems, which allows for large moves in the state
    space. This is achieved by constructing self-avoiding walks (SAWs) in the state
    space. As a consequence, many bits are flipped in a single MCMC step. We name
    the algorithm SARDONICS, an acronym for Self-Avoiding Random Dynamics on
    Integer Complex Systems. The algorithm has several free parameters, but we show
    that Bayesian optimization can be used to automatically tune them.

  15. The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo.

    Authors: Andrew Gelman, Matthew D. Hoffman
    Subjects: Computation
    Abstract

    Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) algorithm
    that avoids the random walk behavior and sensitivity to correlated parameters
    that plague many MCMC methods by taking a series of steps informed by
    first-order gradient information. These features allow it to converge to
    high-dimensional target distributions much more quickly than simpler methods
    such as random walk Metropolis or Gibbs sampling. However, HMC's performance is
    highly sensitive to two user-specified parameters: a step size {\epsilon} and a
    desired number of steps L.

  16. Asymptotic Approximation of Marginal Likelihood Integrals.

    Authors: Shaowei Lin
    Subjects: Computation
    Abstract

    The accurate asymptotic evaluation of marginal likelihood integrals is a
    fundamental problem in Bayesian statistics. Following the approach introduced
    by Watanabe, we translate this into a problem of computational algebraic
    geometry, namely, to determine the real log canonical threshold of a polynomial
    ideal, and we present effective methods for solving this problem. Our results
    are based on resolution of singularities, and they apply to all statistical
    models for discrete data that admit a parametrization by real analytic
    functions.

  17. Student's T Robust Bundle Adjustment Algorithm.

    Authors: Aleksandr Y. Aravkin, Michael Styer, Zachary Moratto, Ara Nefian, Michael Broxton
    Subjects: Computation
    Abstract

    Bundle adjustment (BA) is the problem of refining a visual reconstruction to
    produce better structure and viewing parameter estimates. This problem is often
    formulated as a nonlinear least squares problem, where data arises from
    interest point matching. Mismatched interest points cause serious problems in
    this approach, as a single mismatch will affect the entire reconstruction. In
    this paper, we propose a novel robust Student's t BA algorithm (RST-BA).

  18. Going off grid: Computationally efficient inference for log-Gaussian Cox processes.

    Authors: H&#xe5;vard Rue, Finn Lindgren, Daniel Simpson, Janine Illian, Sigrunn S&#xf8;rbye
    Subjects: Computation
    Abstract

    In this paper we introduce a new method for performing computational
    inference on log-Gaussian Cox processes (LGCP). Contrary to current practice,
    we do not approximate by a counting process on a partition of the domain, but
    rather attack the point process likelihood directly. In order to do this, we
    use the continuously specified Markovian random fields introduced by
    \citet{Lindgren2011}. The inference is performed using the \texttt{R-INLA}
    package of \citet{art451}, which allows us to perform fast approximate
    inference on quite complicated models.

  19. Sparse recovery conditions for Orthogonal Least Squares.

    Authors: R&#xe9;mi Gribonval, Charles Soussen, J&#xe9;r&#xf4;me Idier, C&#xe9;dric Herzet
    Subjects: Computation
    Abstract

    We extend Tropp's analysis of Orthogonal Matching Pursuit (OMP) using the
    Exact Recovery Condition (ERC) to a first exact recovery analysis of Orthogonal
    Least Squares (OLS). We show that when ERC is met, OLS is guaranteed to exactly
    recover the unknown support. Moreover, we provide a closer look at the analysis
    of both OMP and OLS when ERC is not fulfilled. We show that there exist
    dictionaries for which some subsets are never recovered with OMP. This
    phenomenon, which also appears with $\ell_1$ minimization, does not occur for
    OLS.

  20. Functional Uniform Priors for Nonlinear Modelling.

    Authors: Bj&#xf6;rn Bornkamp
    Subjects: Computation
    Abstract

    This paper considers the topic of finding prior distributions when a major
    component of the statistical model depends on a nonlinear function. Using
    results on how to construct uniform distributions in general metric spaces, we
    propose a prior distribution that is uniform in the space of functional shapes
    of the underlying nonlinear function and then back-transform to obtain a prior
    distribution for the original model parameters. The primary application
    considered in this article is nonlinear regression, but the idea might be of
    interest beyond this case.

  21. Efficient Bayesian Multivariate Surface Regression.

    Authors: Feng Li, Mattias Villani
    Subjects: Computation
    Abstract

    Methods for choosing a fixed set of knot locations in additive spline models
    are fairly well established in the statistical literature. While most of these
    methods are in principle directly extendable to non-additive surface models,
    they are likely to be less successful in that setting because of the curse of
    dimensionality, especially when there are more than a couple of covariates.

  22. Bayesian Post-Processor and other Enhancements of Subset Simulation for Estimating Failure Probabilities in High Dimensions.

    Authors: Konstantin M. Zuev, James L. Beck, Siu-Kui Au, Lambros S. Katafygiotis
    Subjects: Computation
    Abstract

    Estimation of small failure probabilities is one of the most important and
    challenging computational problems in reliability engineering. The failure
    probability is usually given by an integral over a high-dimensional uncertain
    parameter space that is difficult to evaluate numerically. This paper focuses
    on enhancements to Subset Simulation (SS), proposed by Au and Beck, which
    provides an efficient algorithm based on MCMC (Markov chain Monte Carlo)
    simulation for computing small failure probabilities for general
    high-dimensional reliability problems.

  23. Two algorithms for fitting constrained marginal models.

    Authors: Robin J. Evans, Antonio Forcina
    Subjects: Computation
    Abstract

    There are two main algorithms for fitting constrained marginal models to
    discrete data available in the literature. We show that they are equivalent,
    each being advantageous in different circumstances, and give some results on
    their convergence properties. Extensions of the method are provided to allow
    the inclusion of individual covariates, and for maximization under
    $L_1$-penalties.

  24. Efficient Particle Markov Chain Monte Carlo - Bayesian inference with a couple of particles.

    Authors: Fredrik Lindsten, Thomas B. Sch&#xf6;n
    Subjects: Computation
    Abstract

    Recently, Andrieu, Doucet and Holenstein (2010) introduced a general
    framework for using particle filters (PFs) to construct proposal kernels for
    Markov chain Monte Carlo (MCMC) methods. This framework, termed Particle Markov
    chain Monte Carlo (PMCMC), was shown to provide powerful methods for joint
    Bayesian state and parameter inference in nonlinear/non-Gaussian state-space
    models. However, the mixing of the resulting MCMC kernels can be quite
    sensitive, both to the number of particles used in the underlying PF and to the
    number of observations in the data.

  25. Abstract tubes associated with perturbed polyhedra with applications to multidimensional normal probability computations.

    Authors: Satoshi Kuriki, Tetsuhisa Miwa, Anthony J. Hayter
    Subjects: Computation
    Abstract

    Let $K$ be a closed convex polyhedron defined by a finite number of linear
    inequalities. In this paper we refine the theory of abstract tubes (Naiman and
    Wynn, 1997) associated with $K$ when $K$ is perturbed. In particular, we focus
    on the perturbation that is lexicographic and in an outer direction. An
    algorithm for constructing the abstract tube by means of linear programming and
    its implementation are discussed.

  26. An algorithm to compute the power of Monte Carlo tests with guaranteed precision.

    Authors: Axel Gandy, Patrick Rubin-Delanchy
    Subjects: Computation
    Abstract

    This article presents an algorithm that generates an exact (conservative)
    confidence interval of a specified length and coverage probability for the
    power of a Monte Carlo test (such as a bootstrap or permutation test). It is
    the first method that achieves this aim for almost any Monte Carlo test.

  27. On the rank-one approximation of symmetric tensors.

    Authors: Michael James O&#x27;Hara
    Subjects: Computation
    Abstract

    The problem of rank-one approximation of symmetric tensors is important in
    independent components analysis, also known as blind source separation, as well
    as polynomial optimization. The rank-one approximation problem reduces to
    finding the principal Z eigenvector, and an effective algorithm was recently
    discovered for computing symmetric tensor eigenvectors.

  28. Bayesian Quantile Regression for Single-Index Models.

    Authors: Heng Lian, Robert B. Gramacy, Yuao Hua
    Subjects: Computation
    Abstract

    Using an asymmetric Laplace distribution, which provides a mechanism for
    Bayesian inference of quantile regression models, we develop a fully Bayesian
    approach to fitting single-index models in conditional quantile regression. In
    this work, we use a Gaussian process prior for the unknown nonparametric link
    function and a Laplace distribution on the index vector, with the latter
    motivated by the recent popularity of the Bayesian lasso idea. We design a
    Markov chain Monte Carlo algorithm for posterior inference.

  29. $\epsilon$-Strong Simulation of the Brownian Path.

    Authors: Gareth Roberts, Alexandros Beskos, Stefano Peluchetti
    Subjects: Computation
    Abstract

    We present an iterative sampling method which delivers upper and lower
    bounding processes for the Brownian path. We develop such processes with
    particular emphasis on being able to unbiasedly simulate them on a personal
    computer. The dominating processes converge almost surely in the supremum and
    $L_1$ norms.

  30. Stability properties of some particle filters.

    Authors: Nick Whiteley
    Subjects: Computation
    Abstract

    Under multiplicative drift and other regularity conditions, it is established
    that the asymptotic variance associated with a particle filter approximation of
    the prediction filter is bounded uniformly in time, and the non-asymptotic,
    relative variance associated with the particle approximation of the normalizing
    constant is bounded linearly in time. The conditions are demonstrated to hold
    for some hidden Markov models on non-compact state spaces.

  31. Approximate Bayesian Computing for Spatial Extremes.

    Authors: Richard L. Smith, Robert J. Erhardt
    Subjects: Computation
    Abstract

    Statistical analysis of max-stable processes used to model spatial extremes
    has been limited by the difficulty in calculating the joint likelihood
    function. This precludes all standard likelihood-based approaches, including
    Bayesian approaches. In this paper we present a Bayesian approach through the
    use of approximate Bayesian computing. This circumvents the need for a joint
    likelihood function by instead relying on simulations from the (unavailable)
    likelihood. This method is compared with an alternative approach based on the
    composite likelihood.

  32. GLMMLasso: An Algorithm for High-Dimensional Generalized Linear Mixed Models Using L1-Penalization.

    Authors: Peter B&#xfc;hlmann, J&#xfc;rg Schelldorfer
    Subjects: Computation
    Abstract

    We propose an L1-penalized algorithm for fitting high-dimensional generalized
    linear mixed models. Generalized linear mixed models (GLMMs) can be viewed as
    an extension of generalized linear models for clustered observations. This
    Lasso-type approach for GLMMs should be mainly used as variable screening
    method to reduce the number of variables below the sample size. We then suggest
    a refitting by maximum likelihood based on the selected variables only. This is
    an effective correction to overcome problems stemming from the variable
    screening procedure which are more severe with GLMMs.

  33. An Adaptive Interacting Wang-Landau Algorithm for Automatic Density Exploration.

    Authors: Pierre Del Moral, Arnaud Doucet, Pierre Jacob, Luke Bornn
    Subjects: Computation
    Abstract

    While statisticians are well-accustomed to performing exploratory analysis in
    the modeling stage of an analysis, the notion of conducting preliminary
    general-purpose exploratory analysis in the Monte Carlo stage (or more
    generally, the model-fitting stage) of an analysis is an area which we feel
    deserves much further attention. Towards this aim, this paper proposes a
    general-purpose algorithm for automatic density exploration.

  34. Rigorous Computing of Rectangle Scan Probabilities for Markov Increments.

    Authors: Jannis Dimitriadis
    Subjects: Computation
    Abstract

    Extending recent work of Corrado, we derive an algorithm that computes
    rigorous upper and lower bounds for rectangle scan probabilities for Markov
    increments. We experimentally examine the closeness of the bounds computed by
    the algorithm and we examine the range of tractable input variables.

  35. Bayesian Inference with Optimal Maps.

    Authors: Youssef M. Marzouk, Tarek A. El Moselhy
    Subjects: Computation
    Abstract

    We present a new approach to Bayesian inference that entirely avoids Markov
    chain simulation, by constructing a map that pushes forward the prior measure
    to the posterior measure. Existence and uniqueness of a suitable
    measure-preserving map is established by formulating the problem in the context
    of optimal transport theory. We discuss various means of explicitly
    parameterizing the map and computing it efficiently through solution of an
    optimization problem, exploiting gradient information from the forward model
    when possible.

  36. Solving the 100 Swiss Francs Problem.

    Authors: Mingfu Zhu, Guangran Jiang, Shuhong Gao
    Subjects: Computation
    Abstract

    Sturmfels offered 100 Swiss Francs in 2005 to a conjecture, which deals with
    a special case of the maximum likelihood estimation for a latent class model.
    This paper confirms the conjecture positively.

  37. Precise Computation of Position Accuracy in GNSS Systems.

    Authors: Juan Pablo Boyero Garrido
    Subjects: Computation
    Abstract

    Accuracy and Availability computations for a GNSS System - or combination of
    Systems - through Service Volume Simulations take considerable time. Therefore,
    the computation of the accuracy in 2D and 3D are often simplified by an
    approximate solution. The drawback is that such simplifications can lead to
    accuracy results that are too conservative (up to 25% in the 2D case and up to
    43% in the 3D case, for a 95% confidence level), which in turn translates into
    pessimistic System Availability.

  38. Linear Variance Bounds for Particle Approximations of Time-Homogeneous Feynman-Kac Formulae.

    Authors: Ajay Jasra, Nick Whiteley, Nikolas Kantas
    Subjects: Computation
    Abstract

    This article establishes sufficient conditions for a linear-in-time bound on
    the non-asymptotic variance of particle approximations of time-homogeneous
    Feynman-Kac formulae. These formulae appear in a wide variety of applications
    including option pricing in finance and risk sensitive control in engineering.
    In direct Monte Carlo approximation of these formulae, the non-asymptotic
    variance typically increases at an exponential rate in the time parameter.

  39. Generalized Direct Sampling for Hierarchical Bayesian Models.

    Authors: Michael Braun, Paul Damien
    Subjects: Computation
    Abstract

    In this paper, we develop a new method to sample from posterior distributions
    in hierarchical models without using Markov chain Monte Carlo. This method is
    generally applicable to high-dimensional models involving large data sets.
    Illustrative analysis exemplifies the ease with which one could implement our
    method, which results in independent samples from the posterior distributions
    of interest.

  40. Online Variational Bayes Inference for High-Dimensional Correlated Data.

    Authors: David B. Dunson, Sylvie Tchumtchoua, Jeffrey S. Morris
    Subjects: Computation
    Abstract

    High-dimensional data with hundreds of thousands of observations are becoming
    commonplace in many disciplines. The analysis of such data poses many
    computational challenges, especially when the observations are correlated over
    time and/or across space. In this paper we propose flexible hierarchical
    regression models for analyzing such data that accommodate serial and/or
    spatial correlation. We address the computational challenges involved in
    fitting these models by adopting an approximate inference framework.

  41. Adaptive Gaussian Predictive Process Approximation.

    Authors: Surya T Tokdar
    Subjects: Computation
    Abstract

    We address the issue of knots selection for Gaussian predictive process
    methodology. Predictive process approximation provides an effective solution to
    the cubic order computational complexity of Gaussian process models. This
    approximation crucially depends on a set of points, called knots, at which the
    original process is retained, while the rest is approximated via a
    deterministic extrapolation. Knots should be few in number to keep the
    computational complexity low, but provide a good coverage of the process domain
    to limit approximation error.

  42. Expectation-Propagation for Summary-Less, Likelihood-Free Inference.

    Authors: Nicolas Chopin, Simon Barthelm&#xe9;
    Subjects: Computation
    Abstract

    Many models of interest in the natural and social sciences have no
    closed-form likelihood function, which means that they cannot be treated using
    the usual techniques of statistical inference. In the case where such models
    can be efficiently simulated, Bayesian inference is still possible thanks to
    the Approximate Bayesian Computation (ABC) algorithm. Although many refinements
    have since been suggested, the technique suffers from three major shortcomings.
    First, it requires introducing a vector of "summary statistics", the choice of
    which is arbitrary and may lead to strong biases.

  43. On the Weakenesses of Correlation Measures used for Search Engines' Results (Unsupervised Comparison of Search Engine Rankings).

    Authors: Paolo D&#x27;Alberto, Ali Dasdan
    Subjects: Computation
    Abstract

    The correlation of the result lists provided by search engines is fundamental
    and it has deep and multidisciplinary ramifications. Here, we present automatic
    and unsupervised methods to assess whether or not search engines provide
    results that are comparable or correlated. We have two main contributions:
    First, we provide evidence that for more than 80% of the input queries -
    independently of their frequency - the two major search engines share only
    three or fewer URLs in their search results, leading to an increasing
    divergence.

  44. Decision Based Uncertainty Propagation Using Adaptive Gaussian Mixtures.

    Authors: Gabriel Terejanu, Tarunraj Singh, Peter D. Scott, Puneet Singla
    Subjects: Computation
    Abstract

    Given a decision process based on the approximate probability density
    function returned by a data assimilation algorithm, an interaction level
    between the decision making level and the data assimilation level is designed
    to incorporate the information held by the decision maker into the data
    assimilation process. Here the information held by the decision maker is a loss
    function at a decision time which maps the state space onto real numbers which
    represent the threat associated with different possible outcomes or states.

  45. Approximate Interval Method for Epistemic Uncertainty Propagation using Polynomial Chaos and Evidence Theory.

    Authors: Gabriel Terejanu, Tarunraj Singh, Peter D. Scott, Puneet Singla
    Subjects: Computation
    Abstract

    The paper builds upon a recent approach to find the approximate bounds of a
    real function using Polynomial Chaos expansions. Given a function of random
    variables with compact support probability distributions, the intuition is to
    quantify the uncertainty in the response using Polynomial Chaos expansion and
    discard all the information provided about the randomness of the output and
    extract only the bounds of its compact support.

  46. Considerate Approaches to Achieving Sufficiency for ABC model selection.

    Authors: Sarah Filippi, Chris Barnes, Michael P.H. Stumpf, Thomas Thorne
    Subjects: Computation
    Abstract

    For nearly any challenging scientific problem evaluation of the likelihood is
    problematic if not impossible. Approximate Bayesian computation (ABC) allows us
    to employ the whole Bayesian formalism to problems where we can use simulations
    from a model, but cannot evaluate the likelihood directly.

  47. On optimal kernel in ABC SMC.

    Authors: Sarah Filippi, Chris Barnes, Michael P.H. Stumpf
    Subjects: Computation
    Abstract

    computation (ABC). Here we discuss how to construct the perturbation kernels
    that are required in ABC-SMC approaches, in order to construct a set of
    distributions that start out from a suitably defined prior and converge towards
    the unknown posterior. We derive optimality criteria for different kernels,
    which are based on the Kullback-Leibler divergence between intermediate
    distributions and the posterior distribution. We will show that for many
    complicated posterior distributions locally adapted kernels tend to show the
    best performance.

  48. Split Hamiltonian Monte Carlo.

    Authors: Radford M. Neal, Babak Shahbaba, Shiwei Lan, Wesley O. Johnson
    Subjects: Computation
    Abstract

    We show how the Hamiltonian Monte Carlo algorithm can sometimes be speeded up
    by "splitting" the Hamiltonian in a way that allows much of the movement around
    the state space to be done at low computational cost. One context where this is
    possible is when the log density of the distribution of interest (the potential
    energy function) can be written as the log of a Gaussian density, which is a
    quadratic function, plus a slowly varying function. Hamiltonian dynamics for
    quadratic energy functions can be analytically solved.

  49. Markov Chain Monte Carlo Based on Deterministic Transformations.

    Authors: Somak Dutta, Sourabh Bhattacharya
    Subjects: Computation
    Abstract

    In this article we propose a novel MCMC method based on deterministic
    transformations T : X x D --> X where X is the state-space and D is some set
    which may or may not be a subset of X. We refer to our new methodology as
    Transformation-based Markov chain Monte Carlo (TMCMC).

  50. Spatial wavelet Markov models are more efficient than covariance tapering and process convolutions.

    Authors: David Bolin, Finn Lindgren
    Subjects: Computation
    Abstract

    The Mat\'ern covariance function is a popular choice for modeling dependence
    in spatial environmental data. Standard Mat\'ern covariance models are,
    however, often computationally infeasible for large data sets. In this work,
    recent results for Markov approximations of Gaussian Mat\'ern fields based on
    Hilbert space approximations are extended using wavelet basis functions. These
    Markov approximations are compared with two of the most popular methods for
    efficient covariance approximations; covariance tapering and the process
    convolution method.

  51. Improved estimator of the entropy and goodness of fit tests in ranked set sampling.

    Authors: Morteza Amini, M. Mehdizadeh, N. R. Arghami
    Subjects: Computation
    Abstract

    The entropy is one of the most applicable uncertainty measures in many
    statistical and en- gineering problems. In statistical literature, the entropy
    is used in calculation of the Kullback- Leibler (KL) information which is a
    powerful mean for performing goodness of fit tests. Ranked Set Sampling (RSS)
    seems to provide improved estimators of many parameters of the popu- lation in
    the huge studied problems in the literature. It is developed for situations
    where the variable of interest is difficult or expensive to measure, but where
    ranking in small sub-samples is easy.

  52. Robust adaptive Metropolis algorithm with coerced acceptance rate.

    Authors: Matti Vihola
    Subjects: Computation
    Abstract

    The adaptive Metropolis (AM) algorithm of Haario, Saksman and Tamminen
    [Bernoulli 7 (2001) 223-242] uses the estimated covariance of the target
    distribution in the proposal distribution. This paper introduces a new robust
    adaptive Metropolis algorithm estimating the shape of the target distribution
    and simultaneously coercing the acceptance rate. The adaptation rule is
    computationally simple adding no extra cost compared with the AM algorithm.

  53. Posterior model probabilities computed from model-specific Gibbs output.

    Authors: Richard J. Barker, William A. Link
    Subjects: Computation
    Abstract

    Reversible jump Markov chain Monte Carlo (RJMCMC) extends ordinary MCMC
    methods for use in Bayesian multimodel inference. We show that RJMCMC can be
    implemented as Gibbs sampling with alternating updates of a model indicator and
    a vector-valued "palette" of parameters denoted $\bm \psi$. Like an artist uses
    the palette to mix dabs of color for specific needs, we create model-specific
    parameters from the set available in $\bm \psi$.

  54. Parameter estimation in high dimensional Gaussian distributions.

    Authors: Erlend Aune, Daniel P. Simpson
    Subjects: Computation
    Abstract

    In order to compute the log-likelihood for high dimensional spatial Gaussian
    models, it is necessary to compute the determinant of the large, sparse,
    symmetric positive definite precision matrix, Q. Traditional methods for
    evaluating the log-likelihood for very large models may fail due to the massive
    memory requirements. We present a novel approach for evaluating such
    likelihoods when the matrix-vector product, Qv, is fast to compute. In this
    approach we utilise matrix functions, Krylov subspaces, and probing vectors to
    construct an iterative method for computing the log-likelihood.

  55. An empirical Bayes procedure for the selection of Gaussian graphical models.

    Authors: Jean-Michel Marin, Sophie Donnet
    Subjects: Computation
    Abstract

    A new methodology for model determination in decomposable graphical Gaussian
    models is developed. The Bayesian paradigm is used and, for each given graph, a
    hyper inverse Wishart prior distribution on the covariance matrix is
    considered. This prior distribution depends on hyper-parameters. It is
    well-known that the models's posterior distribution is sensitive to the
    specification of these hyper-parameters and no completely satisfactory method
    is registered.

  56. Iterative bias reduction multivariate smoothing in R: The ibr package.

    Authors: P. A. Cornillon, N. Hengartner, E. Matzner-L&#xf8;ber
    Subjects: Computation
    Abstract

    In multivariate nonparametric analysis, sparseness of the covariates also
    called curse of dimensionality, forces one to use large smoothing parameters.
    This leads to a biased smoother. Instead of focusing on optimally selecting the
    smoothing parameter, we fix it to some reasonably large value to ensure an
    over-smoothing of the data. The resulting base smoother has a small variance
    but a substantial bias. In this paper, we propose an R package named ibr to
    iteratively correct the initial bias of the (base) estimator by an estimate of
    the bias obtained by smoothing the residuals.

  57. Approximate inference via variational sampling.

    Authors: Alexis Roche
    Subjects: Computation
    Abstract

    We recast importance sampling in terms of minimizing a Monte Carlo estimate
    of the Kullback-Leibler divergence over a well-chosen exponential family. We
    further examine two alternative Kullback-Leibler approximations that do not
    require the target distribution to be normalized. One of these makes the
    variational problem equivalent to normalized importance sampling, while the
    other leads to a new moment estimation technique that works remarkably well if
    the target distribution is reasonably close to the approximating family even
    though the proposal distribution is mismatched.

  58. EM algorithm and variants: an informal tutorial.

    Authors: Alexis Roche
    Subjects: Computation
    Abstract

    The expectation-maximization (EM) algorithm introduced by Dempster et al in
    1977 is a very general method to solve maximum likelihood estimation problems.
    In this informal report, we review the theory behind EM as well as a number of
    EM variants, suggesting that beyond the current state of the art is an even
    much wider territory still to be discovered.

  59. Deviance Information Criteria for Model Selection in Approximate Bayesian Computation.

    Authors: Olivier Francois, Guillaume Laval
    Subjects: Computation
    Abstract

    Approximate Bayesian computation (ABC) is a class of algorithmic methods in
    Bayesian inference using statistical summaries and computer simulations. ABC
    has become popular in evolutionary genetics and in other branches of biology.
    However model selection under ABC algorithms has been a subject of intense
    debate during the recent years. Here we propose novel approaches to model
    selection based on posterior predictive distributions and approximations of the
    deviance.

  60. The Multivariate Watson Distribution: Maximum-Likelihood Estimation and other Aspects.

    Authors: Suvrit Sra, Dmitrii Karp
    Subjects: Computation
    Abstract

    This paper studies fundamental aspects of modelling data using multivariate
    Watson distributions. Although these distributions are natural for modelling
    axially symmetric data (i.e., unit vectors where $\pm \x$ are equivalent), for
    high-dimensions using them can be difficult. Why so? Largely because for Watson
    distributions even basic tasks such as maximum-likelihood are numerically
    challenging. To tackle the numerical difficulties some approximations have been
    derived---but these are either grossly inaccurate in high-dimensions
    (\emph{Directional Statistics}, Mardia & Jupp.

  61. Conservative Hypothesis Tests and Confidence Intervals using Importance Sampling.

    Authors: Matthew T. Harrison
    Subjects: Computation
    Abstract

    Importance sampling is a common technique for Monte Carlo approximation,
    including Monte Carlo approximation of p-values. Here it is shown that a simple
    correction of the usual importance sampling p-values creates valid p-values,
    meaning that a hypothesis test created by rejecting the null when the p-value
    is <= alpha will also have a type I error rate <= alpha. This correction uses
    the importance weight of the original observation, which gives valuable
    diagnostic information under the null hypothesis.

  62. Exact Enumeration and Sampling of Matrices with Specified Margins.

    Authors: Jeffrey W. Miller, Matthew T. Harrison
    Subjects: Computation
    Abstract

    We describe a dynamic programming algorithm for exact counting and exact
    uniform sampling of matrices with specified row and column sums. The algorithm
    runs in polynomial time when the column sums are bounded. Binary or
    non-negative integer matrices are handled. The method is distinguished by
    applicability to non-regular margins, tractability on large matrices, and the
    capacity for exact sampling.

  63. Dynamic Markov Bases.

    Authors: Adrian Dobra
    Subjects: Computation
    Abstract

    We present a computational approach for generating Markov bases for multi-way
    contin- gency tables whose cells counts might be constrained by fixed marginals
    and by lower and upper bounds. Our framework includes tables with structural
    zeros as a particular case. In- stead of computing the entire Markov basis in
    an initial step, our framework finds sets of local moves that connect each
    table in the reference set with a set of neighbor tables. We construct a Markov
    chain on the reference set of tables that requires only a set of local moves at
    each iteration.

  64. An algorithm for the principal component analysis of large data sets.

    Authors: Nathan Halko, Per-Gunnar Martinsson, Mark Tygert, Yoel Shkolnisky
    Subjects: Computation
    Abstract

    Recently popularized randomized methods for principal component analysis
    (PCA) efficiently and reliably produce nearly optimal accuracy --- even on
    parallel processors --- unlike the classical (deterministic) alternatives. We
    adapt one of these randomized methods for use with data sets that are too large
    to be stored in random-access memory (RAM). (The traditional terminology is
    that our procedure works efficiently "out-of-core.") We illustrate the
    performance of the algorithm via several numerical examples.

  65. Sequential Monte Carlo samplers: error bounds and insensitivity to initial conditions.

    Authors: Nick Whiteley
    Subjects: Computation
    Abstract

    This paper addresses finite sample stability properties of sequential Monte
    Carlo methods for approximating sequences of probability distributions. The
    results presented herein are applicable in the scenario where the start and end
    distributions in the sequence are fixed and the number of intermediate steps is
    a parameter of the algorithm. Under assumptions which hold on non-compact
    spaces, it is shown that the effect of the initial distribution decays
    exponentially fast in the number of intermediate steps and the corresponding
    stochastic error is stable in \mathbb{L}_{p} norm.

  66. On the Stability of Sequential Monte Carlo Methods in High Dimensions.

    Authors: Alexandros Beskos, Ajay Jasra, Dan Crisan
    Subjects: Computation
    Abstract

    We investigate the stability of a Sequential Monte Carlo (SMC) method applied
    to the problem of sampling from a target distribution on R^d for large d. It is
    well known that, using a single importance sampling step, one produces an
    approximation for the target distribution that deteriorates as the dimension
    $d$ increases, unless the number of Monte Carlo samples $N$ increases at an
    exponential rate in d.

  67. A Path Algorithm for Constrained Estimation.

    Authors: Hua Zhou, Kenneth Lange
    Subjects: Computation
    Abstract

    Many least squares problems involve affine equality and inequality
    constraints. Although there are variety of methods for solving such problems,
    most statisticians find constrained estimation challenging. The current paper
    proposes a new path following algorithm for quadratic programming based on
    exact penalization. Similar penalties arise in $l_1$ regularization in model
    selection. Classical penalty methods solve a sequence of unconstrained problems
    that put greater and greater stress on meeting the constraints.

  68. Approximating Probability Densities by Iterated Laplace Approximations.

    Authors: Bj&#xf6;rn Bornkamp
    Subjects: Computation
    Abstract

    The Laplace approximation is an old, but frequently used method to
    approximate integrals for Bayesian calculations. In this paper we develop an
    extension of the Laplace approximation, by applying it iteratively to the
    residual, i.e., the difference between the current approximation and the true
    function. The final approximation is thus a linear combination of multivariate
    normal densities, where the coefficients are chosen to achieve a good fit to
    the target distribution.

  69. Diagonal Based Feature Extraction for Handwritten Alphabets Recognition System using Neural Network.

    Authors: J. Pradeep, E. Srinivasan, S. Himavathi
    Subjects: Computation
    Abstract

    An off-line handwritten alphabetical character recognition system using
    multilayer feed forward neural network is described in the paper. A new method,
    called, diagonal based feature extraction is introduced for extracting the
    features of the handwritten alphabets. Fifty data sets, each containing 26
    alphabets written by various people, are used for training the neural network
    and 570 different handwritten alphabetical characters are used for testing.

  70. Computationally efficient algorithms for statistical image processing. Implementation in R.

    Authors: Mikhail A. Langovoy, Olaf Wittich
    Subjects: Computation
    Abstract

    In the series of our earlier papers on the subject, we proposed a novel
    statistical hypothesis testing method for detection of objects in noisy images.
    The method uses results from percolation theory and random graph theory. We
    developed algorithms that allowed to detect objects of unknown shapes in the
    presence of nonparametric noise of unknown level and of unknown distribution.
    No boundary shape constraints were imposed on the objects, only a weak bulk
    condition for the object's interior was required. Our algorithms have linear
    complexity and exponential accuracy.

  71. Confidence intervals for sensitivity indices using reduced-basis metamodels.

    Authors: Alexandre Janon, Ma&#xeb;lle Nodet, Cl&#xe9;mentine Prieur
    Subjects: Computation
    Abstract

    Global sensitivity analysis is often impracticable for complex and time
    demanding numerical models, as it requires a large number of runs. The
    reduced-basis approach provides a way to replace the original model by a much
    faster to run code. In this paper, we are interested in the information loss
    induced by the approximation on the estimation of sensitivity indices. We
    present a method to provide a robust error assessment, hence enabling
    significant time savings without sacrifice on precision and rigourousness.

  72. Perfect Simulation for Mixtures with Known and Unknown Number of components.

    Authors: Sabyasachi Mukhopadhyay, Sourabh Bhattacharya
    Subjects: Computation
    Abstract

    We propose and develop a novel and effective perfect sampling methodology for
    simulating from posteriors corresponding to mixtures with either known (fixed)
    or unknown number of components. For the latter we consider the Dirichlet
    process-based mixture model developed by these authors, and show that our ideas
    are applicable to conjugate, and importantly, to non-conjugate cases.

  73. Online EM Algorithm for Hidden Markov Models.

    Authors: Olivier Capp&#xe9;
    Subjects: Computation
    Abstract

    Online (also called "recursive" or "adaptive") estimation of fixed model
    parameters in hidden Markov models is a topic of much interest in times series
    modelling. In this work, we propose an online parameter estimation algorithm
    that combines two key ideas. The first one, which is deeply rooted in the
    Expectation-Maximization (EM) methodology consists in reparameterizing the
    problem using complete-data sufficient statistics. The second ingredient
    consists in exploiting a purely recursive form of smoothing in HMMs based on an
    auxiliary recursion.

  74. Dual-Tree Fast Gauss Transforms.

    Authors: Alexander G. Gray, Dongryeol Lee, Andrew W. Moore
    Subjects: Computation
    Abstract

    Kernel density estimation (KDE) is a popular statistical technique for
    estimating the underlying density distribution with minimal assumptions.
    Although they can be shown to achieve asymptotic estimation optimality for any
    input distribution, cross-validating for an optimal parameter requires
    significant computation dominated by kernel summations. In this paper we
    present an improvement to the dual-tree algorithm, the first practical kernel
    summation algorithm for general dimension. Our extension is based on the
    series-expansion for the Gaussian kernel used by fast Gauss transform.

  75. Adaptive Gibbs samplers and related MCMC methods.

    Authors: Krzysztof Latuszynski, Gareth O. Roberts, Jeffrey S. Rosenthal
    Subjects: Computation
    Abstract

    We consider various versions of adaptive Gibbs and Metropolis- within-Gibbs
    samplers, which update their selection probabilities (and per- haps also their
    proposal distributions) on the y during a run, by learning as they go in an
    attempt to optimise the algorithm.We present a cautionary example of how even a
    simple-seeming adaptive Gibbs sampler may fail to converge.We then present
    various positive results guaranteeing convergence of adaptive Gibbs samplers
    under certain conditions.

  76. Nonasymptotic bounds on the mean square error for MCMC estimates via renewal techniques.

    Authors: Krzysztof Latuszynski, Blazej Miasojedow, Wojciech Niemiro
    Subjects: Computation
    Abstract

    The Nummellin's split chain construction allows to decompose a Markov chain
    Monte Carlo (MCMC) trajectory into i.i.d. "excursions". RegenerativeMCMC
    algorithms based on this technique use a random number of samples. They have
    been proposed as a promising alternative to usual fixed length simulation [25,
    33, 14]. In this note we derive nonasymptotic bounds on the mean square error
    (MSE) of regenerative MCMC estimates via techniques of renewal theory and
    sequential statistics. These results are applied to costruct confidence
    intervals.

  77. Why approximate Bayesian computational (ABC) methods cannot handle model choice problems.

    Authors: Christian Robert, Natesh S. Pillai, Jean-Michel Marin
    Subjects: Computation
    Abstract

    Approximate Bayesian computation (ABC), also known as likelihood-free
    methods, have become a favourite tool for the analysis of complex stochastic
    models, primarily in population genetics but also in financial analyses. We
    advocated in Grelaud et al.

  78. Efficient Bayesian inference in stochastic chemical kinetic models using graphical processing units.

    Authors: Jarad Niemi, Matthew Wheeler
    Subjects: Computation
    Abstract

    A goal of systems biology is to understand the dynamics of intracellular
    systems. Stochastic chemical kinetic models are often utilized to accurately
    capture the stochastic nature of these systems due to low numbers of molecules.
    Collecting system data allows for estimation of stochastic chemical kinetic
    rate parameters. We describe a well-known, but typically impractical data
    augmentation Markov chain Monte Carlo algorithm for estimating these
    parameters.

  79. Fast clustering of large datasets with sequential $k$-medians : a stochastic gradient approach.

    Authors: Herv&#xe9; Cardot, Peggy C&#xe9;nac, Jean-Marie Monnez
    Subjects: Computation
    Abstract

    Clustering with fast algorithms large samples of high dimensional data is an
    important challenge in computational statistics. Borrowing ideas from MacQueen
    (1967) who introduced a sequential version of the $k$-means algorithm, we
    propose in this paper a new class of recursive stochastic gradient algorithms
    designed for the $k$-medians loss criterion. By their recursive nature, these
    algorithms are very fast and are well adapted to deal with large samples of
    data that are allowed to arrive sequentially.

  80. SMC^2: A sequential Monte Carlo algorithm with particle Markov chain Monte Carlo updates.

    Authors: Nicolas Chopin, Omiros Papaspiliopoulos, Pierre E. Jacob
    Subjects: Computation
    Abstract

    We consider the generic problem of performing sequential Bayesian inference
    in a state-space model with observation process $(y_{t})$, state process
    $(x_{t})$ and fixed parameter $\theta$. An idealized approach would be to apply
    the \emph{iterated batch importance sampling} (IBIS) algorithm of
    \citet{Chopin:IBIS}. This is a sequential Monte Carlo algorithm \emph{in the

  81. Marginal Likelihood Computation via Arrogance Sampling.

    Authors: Benedict Escoto
    Subjects: Computation
    Abstract

    This paper describes a method for estimating the marginal likelihood or Bayes
    factors of Bayesian models using non-parametric importance sampling ("arrogance
    sampling"). This method can also be used to compute the normalizing constant of
    probability distributions. Because the required inputs are samples from the
    distribution to be normalized and the scaled density at those samples, this
    method may be a convenient replacement for the harmonic mean estimator. The
    method has been implemented in the open source R package margLikArrogance.

  82. Approximate Bayesian Computational methods.

    Authors: Christian P. Robert, Jean-Michel Marin, Pierre Pudlo, Robin Ryder
    Subjects: Computation
    Abstract

    Also known as likelihood-free methods, approximate Bayesian computational
    (ABC) methods have appeared in the past ten years as the most satisfactory
    approach to untractable likelihood problems, first in genetics then in a
    broader spectrum of applications. However, these methods suffer to some degree
    from calibration difficulties that make them rather volatile in their
    implementation and thus render them suspicious to the users of more traditional
    Monte Carlo methods.

  83. Adaptive Multiple Importance Sampling.

    Authors: Christian P. Robert, Jean-Michel Marin, Antonietta Mira, Jean-Marie Cornuet
    Subjects: Computation
    Abstract

    The Adaptive Multiple Importance Sampling (AMIS) algorithm is aimed at an
    optimal recycling of past simulations in an iterated importance sampling
    scheme. The difference with earlier adaptive importance sampling
    implementations like Population Monte Carlo is that the importance weights of
    all simulated values, past as well as present, are recomputed at each
    iteration, following the technique of the deterministic multiple mixture
    estimator of Owen and Zhou (2000).

  84. Exact sampling for intractable probability distributions via a Bernoulli factory.

    Authors: James M. Flegal, Radu Herbei
    Subjects: Computation
    Abstract

    Many applications in the field of statistics require Markov chain Monte Carlo
    methods. Determining appropriate starting values and run lengths can be both
    analytically and empirically challenging. A desire to overcome these problems
    has led to the development of exact, or perfect, sampling algorithms which
    convert a Markov chain into an algorithm that produces i.i.d.\ samples from the
    stationary distribution. Unfortunately, very few of these algorithms have been
    developed for the intractable distributions that arise in statistical
    applications, which typically have uncountable support.

  85. Zero Variance Markov Chain Monte Carlo for Bayesian Estimators.

    Authors: Antonietta Mira, Reza Solgi, Daniele Imparato
    Subjects: Computation
    Abstract

    A general purpose variance reduction technique for Markov chain Monte Carlo
    estimators based on the zero-variance principle introduced in the physics
    literature by Assaraf and Caffarel (1999, 2003), is proposed. Conditions for
    unbiasedness of the zero-variance estimator are derived. A central limit
    theorem is also proved under regularity conditions. The potential of the new
    idea is illustrated with real applications to Bayesian inference for probit,
    logit and GARCH models.

  86. Two Proposals for Robust PCA using Semidefinite Programming.

    Authors: Michael McCoy, Joel Tropp
    Subjects: Computation
    Abstract

    The performance of principal component analysis (PCA) suffers badly in the
    presence of outliers. This paper proposes two novel approaches for robust PCA
    based on semidefinite programming. The first method, maximum mean absolute
    deviation rounding (MDR), seeks directions of large spread in the data while
    damping the effect of outliers. The second method produces a low-leverage
    decomposition (LLD) of the data that attempts to form a low-rank model for the
    data by separating out corrupted observations. This paper also presents
    efficient computational methods for solving these SDPs.

  87. A coordinate-wise optimization algorithm for the Fused Lasso.

    Authors: Holger H&#xf6;fling, Harald Binder, Martin Schumacher
    Subjects: Computation
    Abstract

    L1 -penalized regression methods such as the Lasso (Tibshirani 1996) that
    achieve both variable selection and shrinkage have been very popular. An
    extension of this method is the Fused Lasso (Tibshirani and Wang 2007), which
    allows for the incorporation of external information into the model. In this
    article, we develop new and fast algorithms for solving the Fused Lasso which
    are based on coordinate-wise optimization. This class of algorithms has
    recently been applied very successfully to solve L1 -penalized problems very
    quickly (Friedman et al. 2007).

  88. Network inference using asynchronously updated kinetic Ising Model.

    Authors: Erik Aurell, Hong-Li Zeng, Mikko Alava, Hamed Mahmoudi
    Subjects: Computation
    Abstract

    Network structures are reconstructed from dynamical data by respectively
    naive mean field (nMF) and Thouless-Anderson-Palmer (TAP) approximations. For
    TAP approximation, we use two methods to reconstruct the network: a) iteration
    method; b) casting the inference formula to a set of cubic equations and
    solving it directly. We investigate inference of the asymmetric Sherrington-
    Kirkpatrick (S-K) model using asynchronous update. The solutions of the sets
    cubic equation depend of temperature T in the S-K model, and a critical
    temperature Tc is found around 2.1.

  89. Approximate simulation free multiple changepoint analysis with Gaussian Markov random field segment models.

    Authors: Nial Friel, Jason Wyse, H&#xe5;vard Rue
    Subjects: Computation
    Abstract

    This paper proposes approaches for the analysis of multiple changepoint
    models when dependency in the data is modelled through a hierarchical Gaussian
    Markov random field model. Integrated nested Laplace approximations are used to
    approximate data quantities, and an approximate filtering recursions approach
    is proposed for savings in compuational cost when detecting changepoints. All
    of these methods are simulation free. Analysis of real data demonstrates the
    usefulness of the approach in general.

  90. Slice Sampling with Adaptive Multivariate Steps: The Shrinking-Rank Method.

    Authors: Radford M. Neal, Madeleine B. Thompson
    Subjects: Computation
    Abstract

    The shrinking rank method is a variation of slice sampling that is efficient
    at sampling from multivariate distributions with highly correlated parameters.
    It requires that the gradient of the log-density be computable. At each
    individual step, it approximates the current slice with a Gaussian occupying a
    shrinking-dimension subspace. The dimension of the approximation is shrunk
    orthogonally to the gradient at rejected proposals, since the gradients at
    points outside the current slice tend to point towards the slice.

  91. An Alternating Direction Method for Finding Dantzig Selectors.

    Authors: Zhaosong Lu, Yong Zhang, Ting Kei Pong
    Subjects: Computation
    Abstract

    In this paper, we study the alternating direction method for finding the
    Dantzig selectors, which are first introduced in [8]. In particular, at each
    iteration we apply the nonmonotone gradient method proposed in [17] to
    approximately solve one subproblem of this method. We compare our approach with
    a first-order method proposed in [3]. The computational results show that our
    approach usually outperforms that method in terms of CPU time while producing
    solutions of comparable quality.

  92. Block clustering with collapsed latent block models.

    Authors: Nial Friel, Jason Wyse
    Subjects: Computation
    Abstract

    We introduce a Bayesian extension of the latent block model for model-based
    block clustering of data matrices. Our approach considers a block model where
    block parameters may be integrated out. The result is a posterior defined over
    the number of clusters in rows and columns and cluster memberships. The number
    of row and column clusters need not be known in advance as these are sampled
    along with cluster memberhips using Markov chain Monte Carlo.

  93. Simulation-based Bayesian analysis for multiple changepoints.

    Authors: Nial Friel, Jason Wyse
    Subjects: Computation
    Abstract

    This paper presents a Markov chain Monte Carlo method to generate approximate
    posterior samples in retrospective multiple changepoint problems where the
    number of changes is not known in advance. The method uses conjugate models
    whereby the marginal likelihood for the data between consecutive changepoints
    is tractable. Inclusion of hyperpriors gives a near automatic algorithm
    providing a robust alternative to popular filtering recursions approaches in
    cases which may be sensitive to prior information. Three real examples are used
    to demonstrate the proposed approach.

  94. Efficient Bayesian Inference for Switching State-Space Models using Discrete Particle Markov Chain Monte Carlo Methods.

    Authors: Arnaud Doucet, Christophe Andrieu, Nick Whiteley
    Subjects: Computation
    Abstract

    Switching state-space models (SSSM) are a very popular class of time series
    models that have found many applications in statistics, econometrics and
    advanced signal processing. Bayesian inference for these models typically
    relies on Markov chain Monte Carlo (MCMC) techniques. However, even
    sophisticated MCMC methods dedicated to SSSM can prove quite inefficient as
    they update potentially strongly correlated discrete-valued latent variables
    one-at-a-time (Carter and Kohn, 1996; Gerlach et al., 2000; Giordani and Kohn,
    2008).

  95. Space Alternating Penalized Kullback Proximal Point Algorithms for Maximing Likelihood with Nondifferentiable Penalty.

    Authors: St&#xe9;phane Chr&#xe9;tien, Alfred Hero, Herv&#xe9; Perdry
    Subjects: Computation
    Abstract

    The EM algorithm is a widely used methodology for penalized likelihood
    estimation. Provable monotonicity and convergence are the hallmarks of the EM
    algorithm and these properties are well established for smooth likelihood and
    smooth penalty functions. However, many relaxed versions of variable selection
    penalties are not smooth. The goal of this paper is to introduce a new class of
    Space Alternating Penalized Kullback Proximal extensions of the EM algorithm
    for nonsmooth likelihood inference.

  96. Discussions on "Riemann manifold Langevin and Hamiltonian Monte Carlo methods".

    Authors: Arnaud Doucet, Christian P. Robert, Nicolas Chopin, Jean-Michel Marin, Pierre Jacob, Simon Barthelme, Magali Beffy, Adam M. Johansen
    Subjects: Computation
    Abstract

    This is a collection of discussions of `Riemann manifold Langevin and
    Hamiltonian Monte Carlo methods" by Girolami and Calderhead, to appear in the
    Journal of the Royal Statistical Society, Series B.

  97. A Very Fast Algorithm for Matrix Factorization.

    Authors: Vladimir Nikulin, Tian-Hsiang Huang, Shu-Kay Ng, Suren I Rathnayake, Geoffrey J McLachlan
    Subjects: Computation
    Abstract

    We present a very fast algorithm for general matrix factorization of a data
    matrix for use in the statistical analysis of high-dimensional data via latent
    factors. Such data are prevalent across many application areas and generate an
    ever-increasing demand for methods of dimension reduction in order to undertake
    the statistical analysis of interest. Our algorithm uses a gradient-based
    approach which can be used with an arbitrary loss function provided the latter
    is differentiable.

  98. A Comparison of Methods for Computing Autocorrelation Time.

    Authors: Madeleine B. Thompson
    Subjects: Computation
    Abstract

    This paper describes four methods for estimating autocorrelation time and
    evaluates these methods with a test set of seven series. Fitting an
    autoregressive process appears to be the most accurate method of the four. An R
    package is provided for extending the comparison to more methods and test
    series.

  99. Discussion of "Riemann manifold Langevin and Hamiltonian Monte Carlo methods'' by M. Girolami and B. Calderhead.

    Authors: Julien Cornebise, Gareth W. Peters, Luke Bornn
    Subjects: Computation
    Abstract

    This technical report is the union of two contributions to the discussion of
    the Read Paper "Riemann manifold Langevin and Hamiltonian Monte Carlo methods"
    by B. Calderhead and M. Girolami, presented in front of the Royal Statistical
    Society on October 13th 2010 and to appear in the Journal of the Royal
    Statistical Society Series B. The first comment establishes a parallel and
    possible interactions with Adaptive Monte Carlo methods.

  100. Exact Bayesian Analysis of Mixtures.

    Authors: Christian P. Robert, Kerrie L. Mengersen
    Subjects: Computation
    Abstract

    In this paper, we show how a complete and exact Bayesian analysis of a
    parametric mixture model is possible in some cases when components of the
    mixture are taken from exponential families and when conjugate priors are used.
    This restricted set-up allows us to show the relevance of the Bayesian approach
    as well as to exhibit the limitations of a complete analysis, namely that it is
    impossible to conduct this analysis when the sample size is too large, when the
    data are not from an exponential family, or when priors that are more complex
    than conjugate priors are used.

  101. Maximum Likelihood Estimation of Nonnegative Trigonometric Sum Models Using a Newton-like Algorithm on Manifolds.

    Authors: Fern&#xe1;ndez-Dur&#xe1;n, J.J., Gregorio-Dom&#xed;nguez
    Subjects: Computation
    Abstract

    In Fern\'andez-Dur\'an (2004), a new family of circular distributions based
    on nonnegative trigonometric sums (NNTS models) is developed. Because the
    parameter space of this family is the surface of the hypersphere, an efficient
    Newton-like algorithm on manifolds is generated in order to obtain the maximum
    likelihood estimates of the parameters.

  102. Parameter expansion in local-shrinkage models.

    Authors: James G. Scott
    Subjects: Computation
    Abstract

    This paper considers the problem of using MCMC to fit sparse Bayesian models
    based on normal scale-mixture priors. Examples of this framework include the
    Bayesian LASSO and the horseshoe prior. We study the usefulness of parameter
    expansion (PX) for improving convergence in such models, which is notoriously
    slow when the global variance component is near zero. Our conclusion is that
    parameter expansion does improve matters in LASSO-type models, but only
    modestly.

  103. A Bregman Extension of quasi-Newton updates I: An Information Geometrical framework.

    Authors: Takafumi Kanamori, Atsumi Ohara
    Subjects: Computation
    Abstract

    We study quasi-Newton methods from the viewpoint of information geometry
    induced associated with Bregman divergences. Fletcher has studied a variational
    problem which derives the approximate Hessian update formula of the
    quasi-Newton methods. We point out that the variational problem is identical to
    optimization of the Kullback-Leibler divergence, which is a discrepancy measure
    between two probability distributions. The Kullback-Leibler divergence for the
    multinomial normal distribution corresponds to the objective function Fletcher
    has considered.

  104. A Bregman Extension of quasi-Newton updates II: Convergence and Robustness Properties.

    Authors: Takafumi Kanamori, Atsumi Ohara
    Subjects: Computation
    Abstract

    We propose an extension of quasi-Newton methods, and investigate the
    convergence and the robustness properties of the proposed update formulae for
    the approximate Hessian matrix. Fletcher has studied a variational problem
    which derives the approximate Hessian update formula of the quasi-Newton
    methods. We point out that the variational problem is identical to optimization
    of the Kullback-Leibler divergence, which is a discrepancy measure between two
    probability distributions.

  105. Using parallel computation to improve Independent Metropolis--Hastings based estimation.

    Authors: Christian P. Robert, Pierre Jacob, Murray H. Smith
    Subjects: Computation
    Abstract

    In this paper, we consider the implications of the fact that parallel
    raw-power can be exploited by a generic Metropolis--Hastings algorithm if the
    proposed values are independent. In particular, we present improvements to the
    independent Metropolis--Hastings algorithm that significantly decrease the
    variance of any estimator derived from the MCMC output, for a null computing
    cost since those improvements are based on a fixed number of target density
    evaluations.

  106. Tuning Tempered Transitions.

    Authors: Nial Friel, Gundula Behrens, Merrilee Hurn
    Subjects: Computation
    Abstract

    The method of tempered transitions was proposed by Neal (1996) for tackling
    the difficulties arising when using Markov chain Monte Carlo to sample from
    multimodal distributions. In common with methods such as simulated tempering
    and Metropolis-coupled MCMC, the key idea is to utilise a series of
    successively easier to sample distributions to improve movement around the
    state space. Tempered transitions does this by incorporating moves through
    these less modal distributions into the MCMC proposals.

  107. Practical Estimation of High Dimensional Stochastic Differential Mixed-Effects Models.

    Authors: Umberto Picchini, Susanne Ditlevsen
    Subjects: Computation
    Abstract

    Stochastic differential equations (SDEs) are established tools to model
    physical phenomena whose dynamics are affected by random noise. By estimating
    parameters of an SDE intrinsic randomness of a system around its drift can be
    identified and separated from the drift itself. When it is of interest to model
    dynamics within a given population, i.e. to model simultaneously the
    performance of several experiments or subjects, mixed-effects modelling allows
    for the distinction of between and within experiment variability.

  108. Sequential design of computer experiments for the estimation of a probability of failure.

    Authors: Emmanuel Vazquez, Julien Bect, David Ginsbourger, Ling Li, Victor Picheny
    Subjects: Computation
    Abstract

    This paper deals with the problem of estimating the volume of the excursion
    set of a function $f:\mathbb{R}^d \to \mathbb{R}$ above a given threshold,
    under a probability measure on $\mathbb{R}^d$ that is assumed to be known. In
    the industrial world, this corresponds to the problem of estimating a
    probability of failure of a system. When only an expensive-to-simulate model of
    the system is available, the budget for simulations is usually severely limited
    and therefore classical Monte Carlo methods ought to be avoided.

  109. The Time Machine: A Simulation Approach for Stochastic Trees.

    Authors: Ajay Jasra, Maria De Iorio, Marc Chadeau-Hyam
    Subjects: Computation
    Abstract

    In the following paper we consider a simulation technique for stochastic
    trees. One of the most important areas in computational genetics is the
    calculation and subsequent maximization of the likelihood function associated
    to such models. This typically consists of using importance sampling (IS) and
    sequential Monte Carlo (SMC) techniques. The approach proceeds by simulating
    the tree, backward in time from observed data, to a most recent common ancestor
    (MRCA). However, in many cases, the computational time and variance of
    estimators are often too high to make standard approaches useful.

  110. Bayesian Tracking of Emerging Epidemics Using Ensemble Optimal Statistical Interpolation (EnOSI).

    Authors: Jan Mandel, Loren Cobb, Ashok Krishnamurthy, Jonathan Beezley
    Subjects: Computation
    Abstract

    We explore the use of the optimal statistical interpolation (OSI) data
    assimilation method for the statistical tracking of emerging epidemics and to
    study the spatial dynamics of a disease. The epidemic models that we used for
    this study are spatial variants of the common susceptible-infectious-removed
    (S-I-R) compartmental model of epidemiology. The spatial S-I-R epidemic model
    is illustrated by application to simulated spatial dynamic epidemic data from
    the historic "Black Death" plague of 14th century Europe.

  111. Bayesian matching of unlabelled point sets using Procrustes and configuration models.

    Authors: Ian L. Dryden, Kim Kenobi
    Subjects: Computation
    Abstract

    The problem of matching unlabelled point sets using Bayesian inference is
    considered. Two recently proposed models for the likelihood are compared, based
    on the Procrustes size-and-shape and the full configuration. Bayesian inference
    is carried out for matching point sets using Markov chain Monte Carlo
    simulation. An improvement to the existing Procrustes algorithm is proposed
    which improves convergence rates, using occasional large jumps in the burn-in
    period.

  112. Approximate variances for tapered spectral estimates.

    Authors: Michael Amrein, Hans R. K&#xfc;nsch
    Subjects: Computation
    Abstract

    We propose an approximation of the asymptotic variance that removes a certain
    discontinuity in the formula for the raw and the smoothed periodogram in case a
    data taper is used. It is based on an approximation of the covariance of the
    (tapered) periodogram at two arbitrary frequencies.

  113. Interior Point Methods for Computing Optimal Designs.

    Authors: Zhaosong Lu, Ting Kei Pong
    Subjects: Computation
    Abstract

    In this paper we study interior point (IP) methods for solving optimal design
    problems. In particular, we propose a primal IP method for solving the problems
    with general convex optimality criteria and establish its global convergence.
    In addition, we reformulate the problems with A-, D- and E-criterion into
    linear or log-determinant semidefinite programs (SDPs) and apply standard
    primal-dual IP solvers such as SDPT3 [21,25] to solve the resulting SDPs. We
    also compare the IP methods with the widely used multiplicative algorithm
    introduced by Silvey et al. [18].

  114. Optimal designs for rational function regression.

    Authors: D&#xe1;vid Papp
    Subjects: Computation
    Abstract

    We consider optimal non-sequential designs for a large class of (linear and
    nonlinear) regression models involving polynomials and rational functions with
    heteroscedastic noise also given by a polynomial or rational weight function.
    The proposed method generates a polynomial whose zeros are the support points
    of the optimal approximate design, and generalizes a number of previously known
    results of the same flavor.

  115. Estimating Discrete Markov Models From Various Incomplete Data Schemes.

    Authors: Nicolas Bousquet, Alberto Pasanisi, Shuai Fu
    Subjects: Computation
    Abstract

    The parameters of a discrete stationary Markov model are transition
    probabilities between states. Traditionally, data consists in sequences of
    observed states for a given number of individuals over the whole observation
    period. In such a case, the estimation of transition probabilities is
    straightforwardly made by counting one-step moves from a given state to
    another. In many real-life problems, however, the inference is much more
    difficult as state sequences are not fully observed, namely the state of each
    individual is known only for some given values of the time variable $t$.

  116. Great Expectations: EM Algorithms for Discretely Observed Linear Birth-Death-Immigration Processes.

    Authors: Vladimir N. Minin, Marc A. Suchard, Charles R. Doss, Ian Holmes, Midori Kato-Maeda
    Subjects: Computation
    Abstract

    Estimating parameters of continuous-time linear birth-death-immigration
    processes, observed discretely at unevenly spaced time points, is a recurring
    theme in statistical analyses of population dynamics. Viewing this task as a
    missing data problem, we develop two novel expectation-maximization (EM)
    algorithms. When the rate of immigration is either zero or proportional to the
    birth rate, we use Kendall's generating function method to reduce the E-step of
    the EM algorithm, as well as calculation of the Fisher information, to one
    dimensional integration.

  117. Rate estimation in partially observed Markov jump processes with measurement errors.

    Authors: Michael Amrein, Hans R. Kuensch
    Subjects: Computation
    Abstract

    We present a simulation methodology for Bayesian estimation of rate
    parameters in Markov jump processes arising for example in stochastic kinetic
    models. To handle the problem of missing components and measurement errors in
    observed data, we embed the Markov jump process into the framework of a general
    state space model. We do not use diffusion approximations. Markov chain Monte
    Carlo and particle filter type algorithms are introduced, which allow sampling
    from the posterior distribution of the rate parameters and the Markov jump
    process also in data-poor scenarios.

  118. Multiplicative random walk Metropolis-Hastings on the real line.

    Authors: Somak Dutta
    Subjects: Computation
    Abstract

    In this article we propose multiplication based random walk Metropolis
    Hastings (MH) algorithm on the real line. We call it the random dive MH (RDMH)
    algorithm. This algorithm, even if simple to apply, was not studied earlier in
    Markov chain Monte Carlo literature. The associated kernel is shown to have
    standard properties like irreducibility, aperiodicity and Harris recurrence
    under some mild assumptions. These ensure basic convergence (ergodicity) of the
    kernel. Further the kernel is shown to be geometric ergodic for a large class
    of target densities on $\mathbb{R}$.

  119. Bayes Factors via MCMC and Path Sampling - The case of Factor Models.

    Authors: Ritabrata Dutta, Jayanta K Ghosh
    Subjects: Computation
    Abstract

    An overview is provided for the problem of calculating Bayes factors (BF) for
    model selection using methods based on variants of importance sampling and
    variants of Laplace Approximation, specifically path sampling (PS). These
    methods are compared on several simulated and two real life data set. It is
    noted that there are unexpected instabilities in PS and we provide both a
    theoretical explanation and suggestions for improvement. Estimates of the same
    BF may differ widely across methods.

  120. Control Variates for Reversible MCMC Samplers.

    Authors: Ioannis Kontoyiannis, Petros Dellaportas
    Subjects: Computation
    Abstract

    A general methodology is introduced for the construction and effective
    application of control variates to estimation problems involving data from
    reversible MCMC samplers. We propose the use of a specific class of functions
    as control variates, and we introduce a new, consistent estimator for the
    values of the coefficients of the optimal linear combination of these
    functions. The form and proposed construction of the control variates is
    derived from our solution of the Poisson equation associated with a specific
    MCMC scenario.

  121. Cases for the nugget in modeling computer experiments.

    Authors: Robert B. Gramacy, Herbert K.H. Lee
    Subjects: Computation
    Abstract

    Most surrogate models for computer experiments are interpolators, and the
    most common interpolator is a Gaussian process (GP) that deliberately omits a
    small-scale (measurement) error term called the nugget. The explanation is that
    computer experiments are, by definition, "deterministic", and so there is no
    measurement error. We think this is too narrow a focus for a computer
    experiment and a statistically inefficient way to model them.

  122. An Algorithm for Learning the Essential Graph.

    Authors: John M. Noble
    Subjects: Computation
    Abstract

    This article presents an algorithm for learning the essential graph of a
    Bayesian network. The basis of the algorithm is the Maximum Minimum Parents and
    Children algorithm developed by previous authors, with three substantial
    modifications. The MMPC algorithm is the first stage of the Maximum Minimum
    Hill Climbing algorithm for learning the directed acyclic graph of a Bayesian
    network, introduced by previous authors. The MMHC algorithm runs in two phases;
    firstly, the MMPC algorithm to locate the skeleton and secondly an edge
    orientation phase.

  123. Split Bregman method for large scale fused Lasso.

    Authors: Gui-Bo Ye, Xiaohui Xie
    Subjects: Computation
    Abstract

    rdering of regression or classification coefficients occurs in many
    real-world applications. Fused Lasso exploits this ordering by explicitly
    regularizing the differences between neighboring coefficients through an
    $\ell_1$ norm regularizer. However, due to nonseparability and nonsmoothness of
    the regularization term, solving the fused Lasso problem is computationally
    demanding. Existing solvers can only deal with problems of small or medium
    size, or a special case of the fused Lasso problem in which the predictor
    matrix is identity matrix.

  124. Adjusted Bayesian inference for selected parameters.

    Authors: Daniel Yekutieli
    Subjects: Computation
    Abstract

    We address the problem of providing inference for parameters selected after
    viewing the data from a Bayesian perspective. A frequentist solution to this
    problem is constructing False Coverage-statement Rate adjusted confidence
    intervals for the subset of selected parameters. We illustrate the limitations
    of the frequentist solution. We argue that if the parameter is elicited a
    non-informative prior, or if it is a ``fixed'' effect that is generated before
    selection is applied, then it is necessary to adjust the Bayesian inference for
    selection.

  125. Free energy Sequential Monte Carlo, application to mixture modelling.

    Authors: Nicolas Chopin, Pierre Jacob
    Subjects: Computation
    Abstract

    We introduce a new class of Sequential Monte Carlo (SMC) methods, which we
    call free energy SMC. This class is inspired by free energy methods, which
    originate from Physics, and where one samples from a biased distribution such
    that a given function $\xi(\theta)$ of the state $\theta$ is forced to be
    uniformly distributed over a given interval.

  126. General Purpose Convolution Algorithm in S4-Classes by means of FFT.

    Authors: Peter Ruckdeschel, Matthias Kohl
    Subjects: Computation
    Abstract

    Object orientation provides a flexible framework for the implementation of
    the convolution of arbitrary distributions of real-valued random variables.

    We discuss an algorithm which is based on the Discrete Fourier Transformation
    and its fast computability via the Fast Fourier Transformation. It directly
    applies to lattice-supported distributions. In the case of continuous
    distributions an additional discretization to a linear lattice is necessary and
    the resulting lattice-supported distributions are suitably smoothed after
    convolution.

  127. Slice sampling covariance hyperparameters of latent Gaussian models.

    Authors: Ryan Prescott Adams, Iain Murray
    Subjects: Computation
    Abstract

    The Gaussian process (GP) is a popular way to specify dependencies between
    random variables in a probabilistic model. In the Bayesian framework the
    covariance structure can be specified using unknown hyperparameters.
    Integrating over these hyperparameters considers different possible
    explanations for the data when making predictions. This integration is often
    performed using Markov chain Monte Carlo (MCMC) sampling. However, with
    non-Gaussian observations standard hyperparameter sampling approaches require
    careful tuning and may converge slowly.

  128. Classification using distance nearest neighbours.

    Authors: Nial Friel, Anthony N. Pettitt
    Subjects: Computation
    Abstract

    This paper proposes a new probabilistic classification algorithm using a
    Markov random field approach. The joint distribution of class labels is
    explicitly modelled using the distances between feature vectors. Intuitively, a
    class label should depend more on class labels which are closer in the feature
    space, than those which are further away. Our approach builds on previous work
    by Holmes and Adams (2002, 2003) and Cucala et al. (2008). Our work shares many
    of the advantages of these approaches in providing a probabilistic basis for
    the statistical inference.

  129. Computing the confidence levels for a root-mean-square test of goodness-of-fit.

    Authors: Mark Tygert, Rachel Ward, William Perkins
    Subjects: Computation
    Abstract

    The classic chi-squared statistic for testing goodness-of-fit has long been a
    cornerstone of modern statistical practice. The statistic consists of a sum in
    which each summand involves multiplying by the inverse of (i.e., dividing by)
    the probability associated with the corresponding bin in the distribution being
    tested for goodness-of-fit. This inversion typically precipitates rebinning to
    uniformize the probabilities associated with the bins, in order to make the
    test reasonably powerful. With the now widespread availability of computers,
    there is no longer any need for this.

  130. A note on target distribution ambiguity of likelihood-free samplers.

    Authors: Y. Fan, S. A. Sisson, G. W. Peters, M. Briers
    Subjects: Computation
    Abstract

    Methods for Bayesian simulation in the presence of computationally
    intractable likelihood functions are of growing interest. Termed
    likelihood-free samplers, standard simulation algorithms such as Markov chain
    Monte Carlo have been adapted for this setting. In this article, by presenting
    generalisations of existing algorithms, we demonstrate that likelihood-free
    samplers can be ambiguous over the form of the target distribution. We also
    consider the theoretical justification of these samplers.

  131. A simple and efficient algorithm for fused lasso signal approximator with convex loss function.

    Authors: Heng Lian
    Subjects: Computation
    Abstract

    We consider the augmented Lagrangian method (ALM) as a solver for the fused
    lasso signal approximator (FLSA) problem. The ALM is a dual method in which
    squares of the constraint functions are added as penalties to the Lagrangian.
    In order to apply this method to FLSA, two types of auxiliary variables are
    introduced to transform the original unconstrained minimization problem into a
    linearly constrained minimization problem. Each updating in this iterative
    algorithm consists of just a simple one-dimensional convex programming problem,
    with closed form solution in many cases.

  132. Sequential Monte Carlo Methods for Option Pricing.

    Authors: Pierre Del Moral, Ajay Jasra
    Subjects: Computation
    Abstract

    In the following paper we provide a review and development of sequential
    Monte Carlo (SMC) methods for option pricing. SMC are a class of Monte
    Carlo-based algorithms, that are designed to approximate expectations w.r.t a
    sequence of related probability measures. These approaches have been used,
    successfully, for a wide class of applications in engineering, statistics,
    physics and operations research. SMC methods are highly suited to many option
    pricing problems and sensitivity/Greek calculations due to the nature of the
    sequential simulation.

  133. A statistical analysis of probabilistic counting algorithms.

    Authors: Peter Clifford, Ioana A. Cosma
    Subjects: Computation
    Abstract

    We consider the problem of cardinality estimation in data stream
    applications, focusing on two techniques that use pseudo-random variates to
    form low-dimensional data sketches. We apply conventional statistical methods
    to compare algorithms based on storing either selected order statistics or
    random projections. We derive estimators of the cardinality in both cases and
    show that the maximal-term estimator is recursively computable and has
    exponentially decreasing error bounds.

  134. On a Multiplicative Algorithm for Computing Bayesian D-optimal Designs.

    Authors: Yaming Yu
    Subjects: Computation
    Abstract

    We use the minorization-maximization principle (Lange, Hunter and Yang 2000)
    to establish the monotonicity of a multiplicative algorithm for computing
    Bayesian D-optimal designs. This proves a conjecture of Dette, Pepelyshev and
    Zhigljavsky (2008).

  135. Efficient computation of the cdf of the maximal difference between Brownian bridge and its concave majorant.

    Authors: Fadoua Balabdaoui, Karim Filali
    Subjects: Computation
    Abstract

    In this paper, we describe two computational methods for calculating the
    cumulative distribution function and the upper quantiles of the maximal
    difference between a Brownian bridge and its concave majorant. The first method
    has two different variants that are both based on a Monte Carlo approach,
    whereas the second uses the Gaver-Stehfest (GS) algorithm for numerical
    inversion of Laplace transform.

  136. An Adaptive Sequential Monte Carlo Sampler.

    Authors: Paul Fearnhead, Benjamin M. Taylor
    Subjects: Computation
    Abstract

    Sequential Monte Carlo (SMC) methods are not only a popular tool in the
    analysis of state{space models, but o?er an alternative to MCMC in situations
    where Bayesian inference must proceed via simulation. This paper introduces a
    new SMC method that uses adaptive MCMC kernels for particle dynamics. The
    proposed algorithm features an online stochastic optimization procedure to
    select the best MCMC kernel and simultaneously learn optimal tuning parameters.
    Theoretical results are presented that justify the approach and give guidance
    on how it should be implemented.

  137. Notes on Using Control Variates for Estimation with Reversible MCMC Samplers.

    Authors: Ioannis Kontoyiannis, Petros Dellaportas
    Subjects: Computation
    Abstract

    A general methodology is presented for the construction and effective use of
    control variates for reversible MCMC samplers. The values of the coefficients
    of the optimal linear combination of the control variates are computed, and
    adaptive, consistent MCMC estimators are derived for these optimal
    coefficients. All methodological and asymptotic arguments are rigorously
    justified. Numerous MCMC simulation examples from Bayesian inference
    applications demonstrate that the resulting variance reduction can be quite
    dramatic.

  138. Conditional Sampling for Max-Stable Random Fields.

    Authors: Yizao Wang, Stilian A. Stoev
    Subjects: Computation
    Abstract

    Max-stable random fields play a central role in modeling extreme value
    phenomena. Their conditional distributions, however, are notoriously difficult
    to handle. This makes the prediction problem for such random fields a
    formidable challenge. Here, we obtain an explicit formula for the conditional
    probability in general max-linear models, which include a large class of
    max-stable random fields. As a consequence, we develop an algorithm for
    efficient and exact sampling from the conditional distributions.

  139. Bayesian estimation of regularization and PSF parameters for Wiener-Hunt deconvolution.

    Authors: Francois Orieux, Jean-Francois Giovannelli, Thomas Rodet
    Subjects: Computation
    Abstract

    This paper tackles the problem of image deconvolution with joint estimation
    of PSF parameters and hyperparameters. Within a Bayesian framework, the
    solution is inferred via a global a posteriori law for unknown parameters and
    object. The estimate is chosen as the posterior mean, numerically calculated by
    means of a Monte-Carlo Markov chain algorithm. The estimates are efficiently
    computed in the Fourier domain and the effectiveness of the method is shown on
    simulated examples.

  140. Exact posterior distributions over the segmentation space and model selection for multiple change-point detection problems.

    Authors: Guillem Rigaill, Emilie Lebarbier, St&#xe9;phane Robin
    Subjects: Computation
    Abstract

    In segmentation problems, inference on change-point position and model
    selection are two difficult issues due to the discrete nature of change-points.
    In a Bayesian context, we derive exact, non-asymptotic, explicit and tractable
    formulae for the posterior distribution of variables such as the number of
    change-points or their positions. We also derive a new selection criterion that
    accounts for the reliability of the results. All these results are based on an
    efficient strategy to explore the whole segmentation space, which is very
    large.

  141. Pooling Design and Bias Correction in DNA Library Screening.

    Authors: Takafumi Kanamori, Hiroaki Uehara, Masakazu Jimbo
    Subjects: Computation
    Abstract

    We study the group test for DNA library screening based on probabilistic
    approach. Group test is a method of detecting a few positive items from among a
    large number of items, and has wide range of applications. In DNA library
    screening, positive item corresponds to the clone having a specified DNA
    segment, and it is necessary to identify and isolate the positive clones for
    compiling the libraries. In the group test, a group of items, called pool, is
    assayed in a lump in order to save the cost of testing, and positive items are
    detected based on the observation from each pool.

  142. Pruned dynamic programming for optimal multiple change-point detection.

    Authors: Guillem Rigaill
    Subjects: Computation
    Abstract

    Multiple change-point detection models assume that the observed data is a
    realization of an independent random process affected by K-1 abrupt changes,
    called change-points, at some unknown positions. For off-line detection a
    dynamic programming (DP) algorithm retrieves the K-1 change-points minimizing
    the quadratic loss and reduces the complexity from \Theta(n^K) to \Theta(Kn^2)
    where n is the number of observations. The quadratic complexity in n still
    restricts the use of such an algorithm to small or intermediate values of n.

  143. Maximin design on non hypercube domain and kernel interpolation.

    Authors: Jean-Michel Marin, Yves Auffray, Pierre Barbillon
    Subjects: Computation
    Abstract

    In the paradigm of computer experiments, the choice of an experimental design
    is an important issue. When no information is available about the black-box
    function to be approximated, an exploratory design have to be used. In this
    context, two dispersion criteria are usually considered: the minimax and the
    maximin ones. In the case of a hypercube domain, a standard strategy consists
    of taking the maximin design within the class of latin hypercube designs.
    However, in a non hypercube context, it does not make sense to use the latin
    hypercube strategy.

  144. The Dynamic ECME Algorithm.

    Authors: Chuanhai Liu, Yunxiao He
    Subjects: Computation
    Abstract

    The ECME algorithm has proven to be an effective way of accelerating the EM
    algorithm for many problems. Recognising the limitation of using prefixed
    acceleration subspace in ECME, we propose the new Dynamic ECME (DECME)
    algorithm which allows the acceleration subspace to be chosen dynamically. Our
    investigation of an inefficient special case of DECME, the classical Successive
    Overrelaxation (SOR) method, leads to an efficient, simple, and widely
    applicable DECME implementation, called DECME_v1.

  145. Computational Methods in Bayesian Statistics.

    Authors: Alan Tua, Kristian Zarb Adami
    Subjects: Computation
    Abstract

    This paper focuses on utilizing two different Bayesian methods to deal with a
    variety of toy problems which occur in data analysis. In particular we
    implement the Variational Bayesian and Nested Sampling methods to tackle the
    problems of polynomial selection and Gaussian Mixture Models, comparing the
    algorithms in terms of processing speed and accuracy. In the problems tackled
    here it is the Variational Bayesian algorithms which are the faster though both
    results give similar results.

  146. Graphics Processing Units and High-Dimensional Optimization.

    Authors: Hua Zhou, Kenneth Lange, Marc A. Suchard
    Subjects: Computation
    Abstract

    This paper discusses the potential of graphics processing units (GPUs) in
    high-dimensional optimization problems. A single GPU card with hundreds of
    arithmetic cores can be inserted in a personal computer and dramatically
    accelerates many statistical algorithms. To exploit these devices fully,
    optimization algorithms should reduce to multiple parallel tasks, each
    accessing a limited amount of data. These criteria favor EM and MM algorithms
    that separate parameters and data. To a lesser extent block relaxation and
    coordinate descent and ascent also qualify.

  147. Covariance-Adaptive Slice Sampling.

    Authors: Madeleine Thompson, Radford M. Neal
    Subjects: Computation
    Abstract

    We describe two slice sampling methods for taking multivariate steps using
    the crumb framework. These methods use the gradients at rejected proposals to
    adapt to the local curvature of the log-density surface, a technique that can
    produce much better proposals when parameters are highly correlated. We
    evaluate our methods on four distributions and compare their performance to
    that of a non-adaptive slice sampling method and a Metropolis method. The
    adaptive methods perform favorably on low-dimensional target distributions with
    highly-correlated parameters.

  148. Data Driven Computing by the Morphing Fast Fourier Transform Ensemble Kalman Filter in Epidemic Spread Simulations.

    Authors: Jan Mandel, Jonathan D. Beezley, Loren Cobb, Ashok Krishnamurthy
    Subjects: Computation
    Abstract

    The FFT EnKF data assimilation method is proposed and applied to a stochastic
    cell simulation of an epidemic, based on the S-I-R spread model. The FFT EnKF
    combines spatial statistics and ensemble filtering methodologies into a
    localized and computationally inexpensive version of EnKF with a very small
    ensemble, and it is further combined with the morphing EnKF to assimilate
    changes in the position of the epidemic.

  149. Free energy methods for efficient exploration of mixture posterior densities.

    Authors: Nicolas Chopin, Gabriel Stoltz, Tony Lelievre
    Subjects: Computation
    Abstract

    Because of their multimodality, mixture posterior densities are difficult to
    sample with standard Markov chain Monte Carlo (MCMC) methods. We propose a
    strategy to enhance the sampling of MCMC in this context, using a biasing
    procedure which originates from computational statistical physics. The
    principle is first to choose a "reaction coordinate", that is, a direction in
    which the target density is multimodal. In a second step, the marginal
    log-density of the reaction coordinate is estimated; this quantity is called
    "free energy" in the computational statistical physics literature.

  150. Improved EM for Mixture Proportions with Applications to Nonparametric ML Estimation for Censored Data.

    Authors: Yaming Yu
    Subjects: Computation
    Abstract

    Improved EM strategies, based on the idea of efficient data augmentation
    (Meng and van Dyk 1997, 1998), are presented for ML estimation of mixture
    proportions. The resulting algorithms inherit the simplicity, ease of
    implementation, and monotonic convergence properties of EM, but have
    considerably improved speed. Because conventional EM tends to be slow when
    there exists a large overlap between the mixture components, we can improve the
    speed without sacrificing the simplicity or stability, if we can reformulate
    the problem so as to reduce the amount of overlap.

  151. Evolutionary Stochastic Search for Bayesian model exploration.

    Authors: Sylvia Richardson, Leonardo Bottolo
    Subjects: Computation
    Abstract

    Implementing Bayesian variable selection for linear Gaussian regression
    models for analysing high dimensional data sets is of current interest in many
    fields. In order to make such analysis operational, we propose a new sampling
    algorithm based upon Evolutionary Monte Carlo and designed to work under the
    "large p, small n" paradigm, thus making fully Bayesian multivariate analysis
    feasible, for example, in genetics/genomics experiments. Two real data examples
    in genomics are presented, demonstrating the performance of the algorithm in a
    space of up to 10,000 covariates.

  152. Bayesian computational methods.

    Authors: Christian P. Robert
    Subjects: Computation
    Abstract

    In this chapter, we will first present the most standard computational
    challenges met in Bayesian Statistics, focussing primarily on mixture
    estimation and on model choice issues, and then relate these problems with
    computational solutions. Of course, this chapter is only a terse introduction
    to the problems and solutions related to Bayesian computations. For more
    complete references, see Robert and Casella (2004, 2009), or Marin and Robert
    (2007), among others.

  153. On computational tools for Bayesian data analysis.

    Authors: Christian P. Robert, Jean-Michel Marin
    Subjects: Computation
    Abstract

    While Robert and Rousseau (2010) addressed the foundational aspects of
    Bayesian analysis, the current chapter details its practical aspects through a
    review of the computational methods available for approximating Bayesian
    procedures. Recent innovations like Monte Carlo Markov chain, sequential Monte
    Carlo methods and more recently Approximate Bayesian Computation techniques
    have considerably increased the potential for Bayesian applications and they
    have also opened new avenues for Bayesian inference, first and foremost
    Bayesian model choice.

  154. Computationally Efficient Estimation of Factor Multivariate Stochastic Volatility Models.

    Authors: Robert Kohn, Weijun Xu, Li Yang
    Subjects: Computation
    Abstract

    An MCMC simulation method based on a two stage delayed rejection
    Metropolis-Hastings algorithm is proposed to estimate a factor multivariate
    stochastic volatility model. The first stage uses kstep iteration towards the
    mode, with k small, and the second stage uses an adaptive random walk proposal
    density. The marginal likelihood approach of Chib (1995) is used to choose the
    number of factors, with the posterior density ordinates approximated by
    Gaussian copula.

  155. Measures of Analysis of Time Series (MATS): A MATLAB Toolkit for Computation of Multiple Measures on Time Series Data Bases.

    Authors: Dimitris Kugiumtzis, Alkiviadis Tsimpiris
    Subjects: Computation
    Abstract

    In many applications, such as physiology and finance, large time series data
    bases are to be analyzed requiring the computation of linear, nonlinear and
    other measures. Such measures have been developed and implemented in commercial
    and freeware softwares rather selectively and independently. The Measures of
    Analysis of Time Series ({\tt MATS}) {\tt MATLAB} toolkit is designed to handle
    an arbitrary large set of scalar time series and compute a large variety of
    measures on them, allowing for the specification of varying measure parameters
    as well.

  156. A New Approximation to the Normal Distribution Quantile Function.

    Authors: Paul M Voutier
    Subjects: Computation
    Abstract

    We present a new approximation to the normal distribution quantile function.
    It has a similar form to the approximation of Beasley and Springer [3],
    providing a maximum absolute error of less than $2.5 \cdot 10^{-5}$. This is
    less accurate than [3], but still sufficient for many applications. However it
    is faster than [3]. This is its primary benefit, which can be crucial to many
    applications, including in financial markets.

  157. MM Algorithms for Minimizing Nonsmoothly Penalized Objective Functions.

    Authors: Martin T. Wells, Elizabeth D. Schifano, Robert L. Strawderman
    Subjects: Computation
    Abstract

    In this paper, we propose a general class of algorithms for optimizing an
    extensive variety of nonsmoothly penalized objective functions that satisfy
    certain regularity conditions. The proposed framework utilizes the
    majorization-minimization (MM) algorithm as its core optimization engine. The
    resulting algorithms rely on iterated soft-thresholding, implemented
    componentwise, allowing for fast, stable updating that avoids the need for any
    high-dimensional matrix inversion.

  158. Student's t Kalman Smoother.

    Authors: Aleksandr Aravkin
    Subjects: Computation
    Abstract

    We propose a new nonlinear outlier-robust Kalman smoother, which we call
    T-Robust. The smoother finds the maximum {\it a posteriori} likelihood (MAP)
    solutions for statistical models with heavy tailed observation noise.
    Specifically, the T-Robust Kalman smoother assumes a Gaussian process model and
    Student's t observation model. It compares favorably to the recently developed
    $\ell_1$-Robust smoother - it performs as well as the $\ell_1$-Robust on
    several simulated examples and even better when the level of contamination is
    high or contaminating errors come from the uniform distribution.

  159. Adaptive Gibbs samplers.

    Authors: Krzysztof Latuszynski, Jeffrey S. Rosenthal
    Subjects: Computation
    Abstract

    We consider various versions of adaptive Gibbs and Metropolis within-Gibbs
    samplers, which update their selection probabilities (and perhaps also their
    proposal distributions) on the fly during a run, by learning as they go in an
    attempt to optimise the algorithm. We present a cautionary example of how even
    a simple-seeming adaptive Gibbs sampler may fail to converge. We then present
    various positive results guaranteeing convergence of adaptive Gibbs samplers
    under certain conditions.

  160. Monotonic Convergence of a General Algorithm for Computing Optimal Designs.

    Authors: Yaming Yu
    Subjects: Computation
    Abstract

    Monotonic convergence is established for a general class of multiplicative
    algorithms introduced by Silvey et al. (1978) for computing optimal designs. A
    conjecture of Titterington (1978) is confirmed as a consequence. Optimal
    designs for logistic regression are used as an illustration.

  161. An alternative marginal likelihood estimator for phylogenetic models.

    Authors: Serena Arima, Luca Tardella
    Subjects: Computation
    Abstract

    Bayesian phylogenetic methods are generating noticeable enthusiasm in the
    field of molecular systematics. Several phylogenetic models are often at stake
    and different approaches are used to compare them within a Bayesian framework.
    The Bayes factor, defined as the ratio of the marginal likelihoods of two
    competing models, plays a key role in Bayesian model selection. However, its
    computation is still a challenging problem.

  162. BSA - exact algorithm computing LTS estimate.

    Authors: Karel Klouda
    Subjects: Computation
    Abstract

    The main result of this paper is a new exact algorithm computing the estimate
    given by the Least Trimmed Squares (LTS). The algorithm works under very weak
    assumptions. To prove that, we study the respective objective function using
    basic techniques of analysis and linear algebra.

  163. Elliptical Slice Sampling.

    Authors: Ryan Prescott Adams, Iain Murray, David J.C. MacKay
    Subjects: Computation
    Abstract

    Many probabilistic models introduce strong dependencies between variables
    using a latent multivariate Gaussian distribution or a Gaussian process. We
    present a new Markov chain Monte Carlo algorithm for performing inference in
    models with multivariate Gaussian priors. Its key properties are: 1) it has
    simple, generic code applicable to many models, 2) it has no free parameters,
    3) it works well for a variety of Gaussian process based models.

  164. Nonparametric Bayesian Density Modeling with Gaussian Processes.

    Authors: Ryan Prescott Adams, Iain Murray, David J.C. MacKay
    Subjects: Computation
    Abstract

    We present the Gaussian process density sampler (GPDS), an exchangeable
    generative model for use in nonparametric Bayesian density estimation. Samples
    drawn from the GPDS are consistent with exact, independent samples from a
    distribution defined by a density that is a transformation of a function drawn
    from a Gaussian process prior. Our formulation allows us to infer an unknown
    density from data using Markov chain Monte Carlo, which gives samples from the
    posterior distribution over density functions and from the predictive
    distribution on data space. We describe two such MCMC methods.

  165. Likelihood-free Bayesian inference for alpha-stable models.

    Authors: Y. Fan, S. A. Sisson, G. W. Peters
    Subjects: Computation
    Abstract

    $\alpha$-stable distributions are utilised as models for heavy-tailed noise
    in many areas of statistics, finance and signal processing engineering.

    However, in general, neither univariate nor multivariate $\alpha$-stable
    models admit closed form densities which can be evaluated pointwise. This
    complicates the inferential procedure.

    As a result, $\alpha$-stable models are practically limited to the univariate
    setting under the Bayesian paradigm, and to bivariate models under the
    classical framework.

  166. Diffusive Nested Sampling.

    Authors: Brendon J. Brewer, Livia B. P&#xe1;rtay, G&#xe1;bor Cs&#xe1;nyi
    Subjects: Computation
    Abstract

    We introduce a general Monte Carlo method based on Nested Sampling (NS), for
    sampling complex probability distributions and estimating the normalising
    constant. The method uses a single particle, which explores a mixture of nested
    probability distributions, each successive distribution occupying ~ e^{-1}
    times the enclosed prior mass of the previous distribution. While classic NS
    technically requires independent generation of particles, imperfect Markov
    Chain Monte Carlo (MCMC) exploration fits naturally into this technique.

  167. A Numerical Approach to Performance Analysis of Quickest Change-Point Detection Procedures.

    Authors: Aleksey S. Polunchenko, Alexander G. Tartakovsky, George V. Moustakides
    Subjects: Computation
    Abstract

    For the most popular sequential change detection rules such as CUSUM, EWMA,
    and the Shiryaev-Roberts test, we develop integral equations and a concise
    numerical method to compute a number of performance metrics, including average
    detection delay and average time to false alarm. We pay special attention to
    the Shiryaev-Roberts procedure and evaluate its performance for various
    initialization strategies.

  168. Convergence properties of the expected improvement algorithm.

    Authors: Emmanuel Vazquez, Julien Bect
    Subjects: Computation
    Abstract

    This paper deals with the convergence of the expected improvement algorithm,
    a popular global optimization algorithm based on a Gaussian process model of
    the function to be optimized. The first result is that under some mild
    hypotheses on the covariance function $k$ of the Gaussian process, the expected
    improvement algorithm produces a dense sequence of evaluation points in the
    search domain, when the function to be optimized is in the reproducing kernel
    Hilbert space (RKHS) generated by $k$.

  169. Computation- and Space-Efficient Implementation of SSA.

    Authors: Anton Korobeynikov
    Subjects: Computation
    Abstract

    The computational complexity of different steps of the basic SSA is
    discussed. It is shown that the use of the general-purpose "blackbox" routines
    (e.g. found in packages like LAPACK) leads to huge waste of time resources
    since the special Hankel structure of the trajectory matrix is not taken into
    account. We outline several state-of-the-art algorithms (for example,
    Lanczos-based truncated SVD) which can be modified to exploit the structure of
    the trajectory matrix. The key components here are hankel matrix-vector
    multiplication and hankelization operator.

  170. Simulating Events of Unknown Probabilities via Reverse Time Martingales.

    Authors: Krzysztof Latuszynski, Ioannis Kosmidis, Omiros Papaspiliopoulos, Gareth O. Roberts
    Subjects: Computation
    Abstract

    Assume that one aims to simulate an event of unknown probability $s\in (0,1)$
    which is uniquely determined, however only its approximations can be obtained
    using a finite computational effort. Such settings are often encountered in
    statistical simulations. We consider two specific examples. First, the exact
    simulation of non-linear diffusions, second, the celebrated Bernoulli factory
    problem of generating an $f(p)-$coin given a sequence $X_1,X_2,...$ of
    independent tosses of a $p-$coin (with known $f$ and unknown $p$).

  171. On sequential Monte Carlo, partial rejection control and approximate Bayesian computation.

    Authors: Y. Fan, S. A. Sisson, G. W. Peters
    Subjects: Computation
    Abstract

    We present a sequential Monte Carlo sampler variant of the partial rejection
    control algorithm, and show that this variant can be considered as a sequential
    Monte Carlo sampler with a modified mutation kernel. We prove that the new
    sampler can reduce the variance of the incremental importance weights when
    compared with standard sequential Monte Carlo samplers. We provide a study of
    theoretical properties of the new algorithm, and make connections with some
    existing algorithms.

  172. Spatial global sensitivity analysis.

    Authors: Bertrand Iooss, Amandine Marrel, Michel Jullien, Beatrice Laurent, Elena Volkova
    Subjects: Computation
    Abstract

    The global sensitivity analysis of a complex numerical model often requires
    the estimation of variance-based importance measures, called Sobol indices.
    Metamodel-based techniques have been developed in order to replace the cpu time
    expensive computer code with an inexpensive mathematical function, predicting
    the computer code output.

  173. Comments on "Particle Markov chain Monte Carlo" by C. Andrieu, A. Doucet, and R. Hollenstein.

    Authors: Christian P. Robert, Nicolas Chopin, Pierre Jacob, Havard Rue
    Subjects: Computation
    Abstract

    This is the compilation of our comments submitted to the Journal of the Royal
    Statistical Society, Series B, to be published within the discussion of the
    Read Paper of Andrieu, Doucet and Hollenstein.

  174. Probability matrices, non-negative rank, and parameterizations of mixture models.

    Authors: Enrico Carlini, Fabio Rapallo
    Subjects: Computation
    Abstract

    In this paper we parameterize non-negative matrices of sum one and rank at
    most two. More precisely, we give a family of parameterizations using the least
    possible number of parameters. We also show how these parameterizations relate
    to a class of statistical models, known in Probability and Statistics as
    mixture models for contingency tables.

  175. A cautionary tale on the efficiency of some adaptive Monte Carlo Schemes.

    Authors: Yves F. Atchade
    Subjects: Computation
    Abstract

    There is a growing interest in the literature for adaptive Markov Chain Monte
    Carlo methods based on sequences of random transition kernels $\{P_n\}$ where
    the kernel $P_n$ is allowed to have an invariant distribution $\pi_n$ not
    necessarily equal to the distribution of interest $\pi$ (target distribution).
    These algorithms are designed such that as $n\to\infty$, $P_n$ converges to
    $P$, a kernel that has the correct invariant distribution $\pi$. Typically, $P$
    is a kernel with good convergence properties, but one that cannot be directly
    implemented.

  176. Particle filtering within adaptive Metropolis Hastings sampling.

    Authors: Ralph Silva, Paolo Giordani, Robert Kohn, Mike Pitt
    Subjects: Computation
    Abstract

    We show that it is feasible to carry out exact Bayesian inference for
    non-Gaussian state space models using an adaptive Metropolis Hastings sampling
    scheme with the likelihood approximated by the particle filter. Furthermore, an
    adapyive independent Metropolis Hastings sampler based on a mixture of normals
    proposal is computationally much more efficient than an adaptive random walk
    proposal because the cost of constructing a good adaptive proposal is
    negligible compared to the cost of approximating the likelihood.

  177. D-optimal designs via a cocktail algorithm.

    Authors: Yaming Yu
    Subjects: Computation
    Abstract

    A "cocktail algorithm" is proposed for numerical computation of (approximate)
    D-optimal designs. This new algorithm combines and extends the multiplicative
    algorithm of Silvey et al. (1978) and the vertex exchange method (VEM) of
    Bohning (1986), and shares their simplicity and monotonic convergence
    properties. Numerical examples show that the cocktail algorithm can lead to
    dramatically improved speed, sometimes by orders of magnitude, relative to
    either VEM or the multiplicative algorithm.

  178. A vanilla Rao--Blackwellisation of Metropolis-Hastings algorithms.

    Authors: Christian P. Robert, Randal Douc
    Subjects: Computation
    Abstract

    Casella and Robert (1996) presented a general Rao--Blackwellisation principle
    for accept-reject and Metropolis-Hastings schemes that leads to significant
    decreases in the variance of the resulting estimators, but at a high cost in
    computing and storage. Adopting a completely different perspective, we
    introduce instead a universal scheme that guarantees variance reductions in all
    Metropolis-Hastings based estimators while keeping the computing cost under
    control.

  179. Tutorial on ABC rejection and ABC SMC for parameter estimation and model selection.

    Authors: Tina Toni, Michael P. H. Stumpf
    Subjects: Computation
    Abstract

    In this tutorial we schematically illustrate four algorithms:

    (1) ABC rejection for parameter estimation

    (2) ABC SMC for parameter estimation

    (3) ABC rejection for model selection on the joint space

    (4) ABC SMC for model selection on the joint space.

  180. Gibbs Sampling for a Bayesian Hierarchical General Linear Model.

    Authors: Galin L. Jones, Alicia A. Johnson
    Subjects: Computation
    Abstract

    We consider a Bayesian hierarchical version of the normal theory general
    linear model which is practically relevant in the sense that it is general
    enough to have many applications and it is not straightforward to sample
    directly from the corresponding posterior distribution. Thus we study a block
    Gibbs sampler that has the posterior as its invariant distribution. We
    establish that the Gibbs sampler converges at a geometric rate. This allows us
    to establish conditions for a central limit theorem for the ergodic averages
    used to estimate features of the posterior.

  181. Efficient Bayesian analysis of multiple changepoint models with dependence across segments.

    Authors: Paul Fearnhead, Zhen Liu
    Subjects: Computation
    Abstract

    We consider Bayesian analysis of a class of multiple changepoint models.
    While there are a variety of efficient ways to analyse these models if the
    parameters associated with each segment are independent, there are few general
    approaches for models where the parameters are dependent. Under the assumption
    that the dependence is Markov, we propose an efficient online algorithm for
    sampling from an approximation to the posterior distribution of the number and
    position of the changepoints. In a simulation study, we show that the
    approximation introduced is negligible.

  182. Importance sampling methods for Bayesian discrimination between embedded models.

    Authors: Christian P. Robert, Jean-Michel Marin
    Subjects: Computation
    Abstract

    This paper surveys some well-established approaches on the approximation of
    Bayes factors used in Bayesian model choice, mostly as covered in Chen et al.
    (2000). Our focus here is on methods that are based on importance sampling
    strategies rather than variable dimension techniques like reversible jump MCMC,
    including: crude Monte Carlo, maximum likelihood based importance sampling,
    bridge and harmonic mean sampling, as well as Chib's method based on the
    exploitation of a functional equality.

  183. A path algorithm for the Fused Lasso Signal Approximator.

    Authors: Holger Hoefling
    Subjects: Computation
    Abstract

    The Lasso is a very well known penalized regression model, which adds an
    $L_{1}$ penalty with parameter $\lambda_{1}$ on the coefficients to the squared
    error loss function. The Fused Lasso extends this model by also putting an
    $L_{1}$ penalty with parameter $\lambda_{2}$ on the difference of neighboring
    coefficients, assuming there is a natural ordering. In this paper, we develop a
    fast path algorithm for solving the Fused Lasso Signal Approximator that
    computes the solutions for all values of $\lambda_1$ and $\lambda_2$.

  184. Particle learning of Gaussian process models for sequential design and optimization.

    Authors: Robert B. Gramacy, Nicholas G. Polson
    Subjects: Computation
    Abstract

    We develop a simulation-based method for the online updating of Gaussian
    process (GP) models by particle learning (PL) for regression and classification
    problems. Our method exploits the sequential nature of PL inference to produce
    a thrifty sequential design algorithm, in terms of computational speed,
    compared to batch and other MCMC-based methods which must be re-started and
    iterated to convergence with the inclusion of each new design point.

  185. Efficient Simulation of a Bivariate Exponential Conditionals Distribution.

    Authors: Yaming Yu
    Subjects: Computation
    Abstract

    The bivariate distribution with exponential conditionals (BEC) is introduced
    by Arnold and Strauss [Bivariate distributions with exponential conditionals,
    J. Amer. Statist. Assoc. 83 (1988) 522--527]. This work presents a simple and
    fast algorithm for simulating random variates from this density.

  186. Approximate Bayesian Computation: a non-parametric perspective.

    Authors: Michael Blum
    Subjects: Computation
    Abstract

    Approximate Bayesian Computation is a family of likelihood-free inference
    techniques that are well-suited to models defined in terms of a stochastic
    generating mechanism. In a nutshell, Approximate Bayesian Computation proceeds
    by computing summary statistics ${\bf s}_{obs}$ from the data and simulating
    synthetic summary statistics for different values of the parameter $\Theta$.
    The posterior distribution is then approximated with an estimator of the
    conditional density $g(\Theta|{\bf s}_{obs})$.

  187. Optimal scaling of the random walk Metropolis on elliptically symmetric unimodal targets.

    Authors: Chris Sherlock, Gareth Roberts
    Subjects: Computation
    Abstract

    Scaling of proposals for Metropolis algorithms is an important practical
    problem in MCMC implementation. Criteria for scaling based on empirical
    acceptance rates of algorithms have been found to work consistently well across
    a broad range of problems. Essentially, proposal jump sizes are increased when
    acceptance rates are high and decreased when rates are low. In recent years,
    considerable theoretical support has been given for rules of this type which
    work on the basis that acceptance rates around 0.234 should be preferred.

  188. Optimal scaling of the random walk Metropolis on elliptically symmetric unimodal targets.

    Authors: Chris Sherlock, Gareth Roberts
    Subjects: Computation
    Abstract

    Scaling of proposals for Metropolis algorithms is an important practical
    problem in MCMC implementation. Criteria for scaling based on empirical
    acceptance rates of algorithms have been found to work consistently well across
    a broad range of problems. Essentially, proposal jump sizes are increased when
    acceptance rates are high and decreased when rates are low. In recent years,
    considerable theoretical support has been given for rules of this type which
    work on the basis that acceptance rates around 0.234 should be preferred.

  189. Grapham: Graphical Models with Adaptive Random Walk Metropolis Algorithms.

    Authors: Matti Vihola
    Subjects: Computation
    Abstract

    Recently developed adaptive Markov chain Monte Carlo (MCMC) methods have been
    applied successfully to many problems in Bayesian statistics. Grapham is a new
    open source implementation covering several such methods, with emphasis on
    graphical models for directed acyclic graphs.

  190. Monte Carlo Methods in Statistics.

    Authors: Christian P. Robert
    Subjects: Computation
    Abstract

    Monte Carlo methods are now an essential part of the statistician's toolbox,
    to the point of being more familiar to graduate students than the measure
    theoretic notions upon which they are based! We recall in this note some of the
    advances made in the design of Monte Carlo techniques towards their use in
    Statistics, referring to Robert and Casella (2004,2010) for an in-depth
    coverage.

  191. Latin hypercube sampling with inequality constraints.

    Authors: Matthieu Petelet, Bertrand Iooss, Olivier Asserin, Alexandre Loredo
    Subjects: Computation
    Abstract

    In some studies requiring predictive and CPU-time consuming numerical models,
    the sampling design of the model input variables has to be chosen with caution.
    For this purpose, Latin hypercube sampling has a long history and has shown its
    robustness capabilities. In this paper we propose and discuss a new algorithm
    to build a Latin hypercube sample (LHS) taking into account inequality
    constraints between the sampled variables. This technique, called constrained
    Latin hypercube sampling (cLHS), consists in doing permutations on an initial
    LHS to honor the desired monotonic constraints.

  192. A History of Markov Chain Monte Carlo--Subjective Recollections from Incomplete Data--.

    Authors: Christian Robert, George Casella
    Subjects: Computation
    Abstract

    In this note we attempt to trace the history and development of Markov chain
    Monte Carlo (MCMC) from its early inception in the late 1940's through its use
    today. We see how the earlier stages of the Monte Carlo (MC, not MCMC) research
    have led to the algorithms currently in use. More importantly, we see how the
    development of this methodology has not only changed our solutions to problems,
    but has changed the way we think about problems.

  193. Numerical Comparison of Cusum and Shiryaev-Roberts Procedures for Detecting Changes in Distributions.

    Authors: Aleksey S. Polunchenko, Alexander G. Tartakovsky, George V. Moustakides
    Subjects: Computation
    Abstract

    The CUSUM procedure is known to be optimal for detecting a change in
    distribution under a minimax scenario, whereas the Shiryaev-Roberts procedure
    is optimal for detecting a change that occurs at a distant time horizon. As a
    simpler alternative to the conventional Monte Carlo approach, we propose a
    numerical method for the systematic comparison of the two detection schemes in
    both settings, i.e., minimax and for detecting changes that occur in the
    distant future.

  194. A simple sketching algorithm for entropy estimation.

    Authors: Peter Clifford, Ioana Ada Cosma
    Subjects: Computation
    Abstract

    We consider the problem of approximating the empirical Shannon entropy of a
    high-frequency data stream when space limitations make exact computation
    infeasible. It is known that \alpha-dependent quantities such as the Renyi and
    Tsallis entropies can be estimated efficiently and unbiasedly from
    low-dimensional \alpha-stable data sketches. An approximation to the Shannon
    entropy can be obtained from either of these quantities by taking \alpha
    sufficiently close to 1. However, practical guidelines for the choice of
    $\alpha$ are lacking. We avoid this problem by going directly to the limit.

Syndicate content