Topic models can be seen as a generalization of the clustering problem, in
that they posit that observations are generated due to multiple latent factors
(e.g. the words in each document are generated as a mixture of several active
topics, as opposed to just one). This increased representational power comes at
the cost of a more challenging unsupervised learning problem of estimating the
topic probability vectors (the distributions over words for each topic), when
only the words are observed and the corresponding topics are hidden.
We study the tracking problem, namely, estimating the hidden state of an
object over time, from unreliable and noisy measurements. The standard
framework for the tracking problem is the generative framework, which is the
basis of solutions such as the Bayesian algorithm and its approximation, the
particle filters. However, these solutions can be very sensitive to model
mismatches. In this paper, motivated by online learning, we introduce a new
framework for tracking. We provide an efficient tracking algorithm for this
framework.
Mixture models are a fundamental tool in applied statistics and machine
learning for treating data taken from multiple subpopulations. The current
practice for estimating the parameters of such models relies on local search
heuristics (e.g., the EM algorithm) which are prone to failure, and existing
consistent methods are unfavorable due to their high computational and sample
complexity which typically scale exponentially with the number of mixture
components.
We prove an exponential probability tail inequality for positive semidefinite
quadratic forms in a subgaussian random vector. The bound is analogous to one
that holds when the vector has independent Gaussian entries.
This paper addresses the problem of minimizing a convex, Lipschitz function
$f$ over a convex, compact set $\xset$ under a stochastic bandit feedback
model. In this model, the algorithm is allowed to observed noisy realizations
of the function value $f(x)$ at any query point $x \in \xset$. The quantity of
interest is regret of the algorithm, which is the sum of the function values at
algorithm's query points minus the optimal function value. We demonstrate a
generalization of the ellipsoid algorithm that incurs $\otil(\poly(d)\sqrt{T})$
regret.
This work considers the problem of learning the structure of a broad class of
multivariate latent variable tree models, which include a variety of continuous
and discrete models (including the widely used linear-Gaussian models, hidden
Markov models, and Markov evolutionary trees). The setting is one where we only
have samples from certain observed variables in the tree and our goal is to
estimate the tree structure (i.e., the graph of how the underlying hidden
variables are connected to the observed variables).
We address the problem of learning in an online setting where the learner
repeatedly observes features, selects among a set of actions, and receives
reward for the action taken. We provide the first efficient algorithm with an
optimal regret. Our algorithm uses a cost sensitive classification learner as
an oracle and has a running time $\mathrm{polylog}(N)$, where $N$ is the number
of classification rules among which the oracle might choose. This is
exponentially faster than all previous algorithms that achieve optimal regret
in this setting.
The random design setting for linear regression concerns estimators based on
a random sample of covariate/response pairs. This work gives explicit bounds on
the prediction error for the ordinary least squares estimator and the ridge
regression estimator under mild assumptions on the covariate/response
distributions. In particular, this work provides sharp results on the
"out-of-sample" prediction error, as opposed to the "in-sample" (fixed design)
error. Our analysis also explicitly reveals the effect of noise vs. modeling
errors.
In this work we study parallelization of online learning, a core primitive in
machine learning. In a parallel environment all known approaches for parallel
online learning lead to delayed updates, where the model is updated using
out-of-date information.
Suppose a given observation matrix can be decomposed as the sum of a low-rank
matrix and a sparse matrix (outliers), and the goal is to recover these
individual components from the observed sum. Such additive decompositions have
applications in a variety of numerical problems including system
identification, latent variable graphical modeling, and principal components
analysis. We study conditions under which recovering such a decomposition is
possible via a combination of $\ell_1$ norm and trace norm minimization.
We study the problem of decision-theoretic online learning (DTOL). Motivated
by practical applications, we focus on DTOL when the number of actions is very
large. Previous algorithms for learning in this framework have a tunable
learning rate parameter, and a barrier to using online-learning in practical
applications is that it is not understood how to set this parameter optimally,
particularly when the number of actions is large.
We study the tracking problem, namely, estimating the hidden state of an
object over time, from unreliable and noisy measurements. The standard
framework for the tracking problem is the generative framework, which is the
basis of solutions such as the Bayesian algorithm and its approximation, the
particle filters. However, the problem with these solutions is that they are
very sensitive to model mismatches. In this paper, motivated by online
learning, we introduce a new framework -- an {\em explanatory} framework -- for
tracking.