We find lower and upper bounds for the risk of estimating a manifold in
Hausdorff distance under several models. We also show that there are close
connections between manifold estimation and the problem of deconvolving a
singular measure.
Discussion on "Brownian distance covariance" by G\'{a}bor J. Sz\'{e}kely and
Maria L. Rizzo [arXiv:1010.0297]
We develop nonparametric methods for estimating filamentary structure from
planar point process data and find the minimax lower bound for this problem. We
show that, under weak conditions, the filaments have a simple geometric
representation as the medial axis of the data distribution's support. Our
methods convert an estimator of the support's boundary into an estimator of the
filaments. We find the rates of convergence of our estimators and show that
when using an optimal boundary estimator, they achieve the minimax rate.
The next generation of telescopes will acquire terabytes of image data on a
nightly basis. Collectively, these large images will contain billions of
interesting objects, which astronomers call sources. The astronomers' task is
to construct a catalog detailing the coordinates and other properties of the
sources. The source catalog is the primary data product for most telescopes and
is an important input for testing new astrophysical theories, but to construct
the catalog one must first detect the sources.
We consider the problem of reliably finding filaments in point clouds.
Realistic data sets often have numerous filaments of various sizes and shapes.
Statistical techniques exist for finding one (or a few) filaments but these
methods do not handle noisy data sets with many filaments. Other methods can be
found in the astronomy literature but they do not have rigorous statistical
guarantees. We propose the following method. Starting at each data point we
construct the steepest ascent path along a kernel density estimator.
We consider the problem of reliably finding filaments in point clouds.
Realistic data sets often have numerous filaments of various sizes and shapes.
Statistical techniques exist for finding one (or a few) filaments but these
methods do not handle noisy data sets with many filaments. Other methods can be
found in the astronomy literature but they do not have rigorous statistical
guarantees. We propose the following method. Starting at each data point we
construct the steepest ascent path along a kernel density estimator.