Bernhard Schölkopf

  1. Learning from Distributions via Support Measure Machines.

    Authors: Kenji Fukumizu, Bernhard Schölkopf, Francesco Dinuzzo, Krikamol Muandet
    Subjects: Machine Learning
    Abstract

    This paper presents a kernel-based discriminative learning framework on
    probability measures. Rather than relying on large collections of vectorial
    training examples, our framework learns using a collection of probability
    distributions that have been constructed to meaningfully represent training
    data. By representing these probability distributions as mean embeddings in the
    reproducing kernel Hilbert space (RKHS), we are able to apply many standard
    kernel-based learning techniques in straightforward fashion.

  2. Robust Learning via Cause-Effect Models.

    Authors: Bernhard Schölkopf, Dominik Janzing, Jonas Peters, Kun Zhang
    Subjects: Machine Learning
    Abstract

    We consider the problem of function estimation in the case where the data
    distribution may shift between training and test time, and additional
    information about it may be available at test time. This relates to popular
    scenarios such as covariate shift, concept drift, transfer learning and
    semi-supervised learning. This working paper discusses how these tasks could be
    tackled depending on the kind of changes of the distributions. It argues that
    knowledge of an underlying causal direction can facilitate several of these
    tasks.

  3. Causal Inference on Discrete Data using Additive Noise Models.

    Authors: Bernhard Schölkopf, Dominik Janzing, Jonas Peters
    Subjects: Machine Learning
    Abstract

    Inferring the causal structure of a set of random variables from a finite
    sample of the joint distribution is an important problem in science. Recently,
    methods using additive noise models have been suggested to approach the case of
    continuous variables. In many situations, however, the variables of interest
    are discrete or even have only finitely many states. In this work we extend the
    notion of additive noise models to these cases.

  4. On integral probability metrics, \phi-divergences and binary classification.

    Authors: Kenji Fukumizu, Bharath K. Sriperumbudur, Arthur Gretton, Bernhard Schölkopf, Gert R. G. Lanckriet
    Subjects: Information Theory
    Abstract

    A class of distance measures on probabilities -- the integral probability
    metrics (IPMs) -- is addressed: these include the Wasserstein distance, Dudley
    metric, and Maximum Mean Discrepancy. IPMs have thus far mostly been used in
    more abstract settings, for instance as theoretical tools in mass
    transportation problems, and in metrizing the weak topology on the set of all
    Borel probability measures defined on a metric space.

Syndicate content