Data Structures and Algorithms

  1. FastSIR Algorithm: A Fast Algorithm for simulation of epidemic spread in large networks by using SIR compartment model.

    Authors: Nino Antulov-Fantulin, Alen Lancic, Hrvoje Stefancic, Mile Sikic
    Subjects: Data Structures and Algorithms
    Abstract

    The epidemic spreading on arbitrary complex networks is studied in SIR
    (Susceptible Infected Recovered) compartment model. We propose our
    implementation of a Naive SIR algorithm for epidemic simulation spreading on
    networks that uses data structures efficiently to reduce running time. The
    Naive SIR algorithm models full epidemic dynamics and can be easily upgraded to
    parallel version. We also propose novel algorithm for epidemic simulation
    spreading on networks called the FastSIR algorithm that has better average case
    running time than the Naive SIR algorithm.

  2. Real-Time Monitoring of Undirected Networks: Articulation Points, Bridges, and Connected and Biconnected Components.

    Authors: Giorgio Ausiello, Donatella Firmani, Luigi Laura
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we present the first algorithm in the streaming model to
    characterize completely the biconnectivity properties of undirected networks:
    articulation points, bridges, and connected and biconnected components. The
    motivation of our work was the development of a real-time algorithm to monitor
    the connectivity of the Autonomous Systems (AS) Network, but the solution
    provided is general enough to be applied to any network.

  3. Multiple-Source Shortest Paths in Embedded Graphs.

    Authors: Sergio Cabello, Erin Wolf Chambers, Jeff Erickson
    Subjects: Data Structures and Algorithms
    Abstract

    Let G be a directed graph with n vertices and non-negative weights in its
    directed edges, embedded on a surface of genus g, and let F be an arbitrary
    face of G. We describe an algorithm to preprocess the graph in O(gn log n)
    time, so that the shortest-path distance from any vertex on the boundary of F
    to any other vertex in G can be retrieved in O(log n) time. Our result directly
    generalizes the O(n log n)-time algorithm of Klein [SODA 2005] for
    multiple-source shortest paths in planar graphs.

  4. Advanced Coarsening Schemes for Graph Partitioning.

    Authors: Ilya Safro, Peter Sanders, Christian Schulz
    Subjects: Data Structures and Algorithms
    Abstract

    The graph partitioning problem is widely used and studied in many practical
    and theoretical applications. The multilevel strategies represent today one of
    the most effective and efficient generic frameworks for solving this problem on
    large-scale graphs. Most of the attention in designing the multilevel
    partitioning frameworks has been on the refinement phase. In this work we focus
    on the coarsening phase, which is responsible for creating structurally similar
    to the original but smaller graphs.

  5. The black-and-white coloring problem on permutation graphs.

    Authors: Ton Kloks
    Subjects: Data Structures and Algorithms
    Abstract

    Given a graph G and integers b and w. The black-and-white coloring problem
    asks if there exist disjoint sets of vertices B and W with |B|=b and |W|=w such
    that no vertex in B is adjacent to any vertex in W. In this paper we show that
    the problem is polynomial when restricted to permutation graphs.

  6. Deterministic Polynomial-Time Algorithms for Designing Short DNA Words.

    Authors: Yong Zhang, He Sun, Ming-Yang Kao, Henry C. M. Leung
    Subjects: Data Structures and Algorithms
    Abstract

    Designing short DNA words is a problem of constructing a set (i.e., code) of
    n DNA strings (i.e., words) with the minimum length such that the Hamming
    distance between each pair of words is at least k and the n words satisfy a set
    of additional constraints. This problem has applications in, e.g., DNA
    self-assembly and DNA arrays. Previous works include those that extended
    results from coding theory to obtain bounds on code and word sizes for
    biologically motivated constraints and those that applied heuristic local
    searches, genetic algorithms, and randomized algorithms.

  7. Faster and Simpler Width-Independent Parallel Algorithms for Positive Semidefinite Programming.

    Authors: Richard Peng, Kanat Tangwongsan
    Subjects: Data Structures and Algorithms
    Abstract

    This paper studies the problem of finding a (1+eps)-approximation to positive
    semidefinite programs. These are semidefinite programs in which all matrices in
    the constraints and objective are positive semidefinite and all scalars are
    non-negative. Previous work (Jain and Yao, FOCS'11) gave an NC algorithm that
    requires at least Omega((1/eps)^13\log^{13}n) iterations. The algorithm
    performs at least Omega(n^\omega) work per iteration, where n is the dimension
    of the matrices involved, since each iteration involves computing spectral
    decomposition.

  8. I Like Her more than You: Self-determined Communities.

    Authors: Shang-Hua Teng, Christian Borgs, Jennifer Chayes, Maria-Florina Balcan, Mark Braverman
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we define what we call an affinity system, which is a set of
    individuals, each with a vector characterizing its preference for all other
    individuals in the set. The preference of a member can be given either by a
    ranking of all members or by a weighted vector that defines the degrees of its
    affinity to others. Affinity systems are useful for modeling social systems as
    well as general data sets, as social interactions are often determined by
    affinities among the members.

  9. An efficient parallel algorithm for the longest path problem in meshes.

    Authors: Fatemeh Keshavarz-Kohjerdi, Alireza Bagheri
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, first we give a sequential linear-time algorithm for the
    longest path problem in meshes. This algorithm can be considered as an
    improvement of [13]. Then based on this sequential algorithm, we present a
    constant-time parallel algorithm for the problem which can be run on every
    parallel machine.

  10. A simple D^2-sampling based PTAS for k-means and other Clustering Problems.

    Authors: Amit Kumar, Ragesh Jaiswal, Sandeep Sen
    Subjects: Data Structures and Algorithms
    Abstract

    Given a set of points $P \subset \mathbb{R}^d$, the $k$-means clustering
    problem is to find a set of $k$ {\em centers} $C = \{c_1,...,c_k\}, c_i \in
    \mathbb{R}^d,$ such that the objective function $\sum_{x \in P} d(x,C)^2$,
    where $d(x,C)$ denotes the distance between $x$ and the closest center in $C$,
    is minimized. This is one of the most prominent objective functions that have
    been studied with respect to clustering.

  11. Nearly Optimal Sparse Fourier Transform.

    Authors: Piotr Indyk, Eric Price, Haitham Hassanieh, Dina Katabi
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of computing the k-sparse approximation to the
    discrete Fourier transform of an n-dimensional signal. We show:

    * An O(k log n)-time algorithm for the case where the input signal has at
    most k non-zero Fourier coefficients, and

    * An O(k log n log(n/k))-time algorithm for general input signals.

  12. Relationships in Large-Scale Graph Computing.

    Authors: Dan Petrovic
    Subjects: Data Structures and Algorithms
    Abstract

    In 2009 Grzegorz Czajkowski from Google's system infrastructure team has
    published an article which didn't get much attention in the SEO community at
    the time. It was titled "Large-scale graph computing at Google" and gave an
    excellent insight into the future of Google's search. This article highlights
    some of the little known facts which lead to transformation of Google's
    algorithm in the last two years.

  13. There is no 16-Clue Sudoku: Solving the Sudoku Minimum Number of Clues Problem.

    Authors: Gary Mcguire, Bastian Tugemann, Gilles Civario
    Subjects: Data Structures and Algorithms
    Abstract

    We apply our new hitting set enumeration algorithm to solve the sudoku
    minimum number of clues problem, which is the following question: What is the
    smallest number of clues (givens) that a sudoku puzzle may have? It was
    conjectured that the answer is 17. We have performed an exhaustive search for a
    16-clue sudoku puzzle, and we did not find one, thereby proving that the answer
    is indeed 17. This article describes our method and the actual search.

  14. Cache-Oblivious Implicit Predecessor Dictionaries with the Working Set Property.

    Authors: Gerth Stølting Brodal, Casper Kejlberg-Rasmussen
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we present an implicit dynamic dictionary with the working-set
    property, supporting \op{insert}($e$) and \op{delete}($e$) in $O(\log n)$ time,
    \op{predecessor}($e$) in $O(\log \ws{\pred(e)})$ time, \op{successor}($e$) in
    $O(\log \ws{\succ(e)})$ time and \op{search}($e$) in $O(\log
    \min(\ws{\pred(e)},\ws{e}, \ws{\succ(e)}))$ time, where $n$ is the number of
    elements stored in the dictionary, $\ws{e}$ is the number of distinct elements
    searched for since element $e$ was last searched for and $\pred(e)$ and
    $\succ(e)$ are the predecessor and successor of $e$, respectively.

  15. Strongly connected components of directed hypergraphs.

    Authors: Xavier Allamigeon
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of determining strongly connected components (SCCs) of
    directed hypergraphs. The main contribution is an algorithm computing the
    terminal strongly connected components (ie SCCs which do not reach any other
    components than themselves). The time complexity of the algorithm is almost
    linear, which is a significant improvement over the known methods which are
    quadratic time. This also proves that the problems of testing strong
    connectivity, and determining the existence of a sink, can be both solved in
    almost linear time in directed hypergraphs.

  16. Correlation Decay up to Uniqueness in Spin Systems.

    Authors: Pinyan Lu, Liang Li, Yitong Yin
    Subjects: Data Structures and Algorithms
    Abstract

    We give a complete characterization of the two-state anti-ferromagnetic spin
    systems which are of exponential correlation decay. We show that a system is of
    correlation decay on all graphs with maximum degree \Delta\ if and only if the
    system has a unique Gibbs measure on all infinite regular trees up to degree
    \Delta, where \Delta\ can either be bounded or unbounded.

  17. Compressed Matrix Multiplication.

    Authors: Rasmus Pagh
    Subjects: Data Structures and Algorithms
    Abstract

    Motivated by the problems of computing sample covariance matrices, and of
    transforming a collection of vectors to a basis where they are sparse, we
    present a simple algorithm that computes an approximation of the product of two
    n-by-n real matrices A and B. Let ||AB||_F denote the Frobenius norm of AB, and
    b be a parameter determining the time/accuracy trade-off.

  18. On-line construction of position heaps.

    Authors: Gregory Kucherov
    Subjects: Data Structures and Algorithms
    Abstract

    We propose a simple linear-time on-line algorithm for constructing a position
    heap for a string [Ehrenfeucht et al, 2011]. Our definition of position heap
    differs slightly from the one proposed in [Ehrenfeucht et al, 2011] in that it
    considers the suffixes ordered from left to right. Our construction is based on
    classic suffix pointers and resembles the Ukkonen's algorithm for suffix trees
    [Ukkonen, 1995]. Using suffix pointers, the position heap can be extended into
    the augmented position heap that allows for a linear-time string matching
    algorithm [Ehrenfeucht et al, 2011].

  19. Fast Compressed Tries through Path Decompositions.

    Authors: Roberto Grossi, Giuseppe Ottaviano
    Subjects: Data Structures and Algorithms
    Abstract

    Tries are popular data structures for storing a set of strings, where common
    prefixes are represented by common root-to-node paths. Over fifty years of
    usage have produced many variants and implementations to overcome some of their
    limitations. We explore new succinct representations of path-decomposed tries
    and experimentally evaluate the corresponding reduction in space usage and
    memory latency, comparing with the state of the art.

  20. Algorithms for Weighted Capacity and Admission Control in Wireless Networks.

    Authors: Magnus M. Halldorsson, Pradipta Mitra
    Subjects: Data Structures and Algorithms
    Abstract

    We give algorithms with constant-factor performance guarantees for several
    capacity and throughput problems in the SINR model. The algorithms are all
    based on a novel LP formulation for capacity problems. First, we give a new
    constant-factor approximation algorithm for selecting the maximum subset of
    links that can be scheduled simultaneously, under any non-decreasing and
    sublinear power assignment. For the case of uniform power, we extend this to
    the case of variable QoS requirements and link-dependent noise terms.

  21. Exact Distance Oracles for Planar Graphs.

    Authors: Shay Mozes, Christian Sommer
    Subjects: Data Structures and Algorithms
    Abstract

    We present new and improved data structures that answer exact node-to-node
    distance queries in planar graphs. Such data structures are also known as
    distance oracles. For any directed planar graph on n nodes with non-negative
    lengths we obtain the following:

    * Given a desired space allocation $S\in[n\lg\lg n,n^2]$, we show how to
    construct in $\tilde O(S)$ time a data structure of size $O(S)$ that answers
    distance queries in $\tilde O(n/\sqrt S)$ time per query.

  22. Testing the Connectivity of Networks.

    Authors: Taha Sochi
    Subjects: Data Structures and Algorithms
    Abstract

    In this article we discuss general strategies and computer algorithms to test
    the connectivity of unstructured networks which consist of a number of segments
    connected through randomly distributed nodes.

  23. Near Linear-Work Parallel SDD Solvers, Low-Diameter Decomposition, and Low-Stretch Subgraphs.

    Authors: Anupam Gupta, Ioannis Koutis, Gary L. Miller, Richard Peng, Guy E. Blelloch, Kanat Tangwongsan
    Subjects: Data Structures and Algorithms
    Abstract

    We present the design and analysis of a near linear-work parallel algorithm
    for solving symmetric diagonally dominant (SDD) linear systems. On input of a
    SDD $n$-by-$n$ matrix $A$ with $m$ non-zero entries and a vector $b$, our
    algorithm computes a vector $\tilde{x}$ such that $\norm[A]{\tilde{x} - A^+b}
    \leq \vareps \cdot \norm[A]{A^+b}$ in $O(m\log^{O(1)}{n}\log{\frac1\epsilon})$
    work and $O(m^{1/3+\theta}\log \frac1\epsilon)$ depth for any fixed $\theta >
    0$.

  24. Tight Approximation of Image Matching.

    Authors: Simon Korman, Daniel Reichman, Gilad Tsur
    Subjects: Data Structures and Algorithms
    Abstract

    In this work we consider the {\em image matching} problem for two grayscale
    $n \times n$ images, $M_1$ and $M_2$ (where pixel values range from 0 to 1).
    Our goal is to find an affine transformation $T$ that maps pixels from $M_1$ to
    pixels in $M_2$ so that the differences over pixels $p$ between $M_1(p)$ and
    $M_2(T(p))$ is minimized.

  25. A Compressed Self-Index for Genomic Databases.

    Authors: Travis Gagie, Yakov Nekrich, Simon J. Puglisi, Juha Kärkkäinen
    Subjects: Data Structures and Algorithms
    Abstract

    Advances in DNA sequencing technology will soon result in databases of
    thousands of genomes. Within a species, individuals' genomes are almost exact
    copies of each other; e.g., any two human genomes are 99.9% the same. Relative
    Lempel-Ziv (RLZ) compression takes advantage of this property: it stores the
    first genome uncompressed or as an FM-index, then compresses the other genomes
    with a variant of LZ77 that copies phrases only from the first genome.

  26. Active Testing.

    Authors: Maria-Florina Balcan, Avrim Blum, Eric Blais, Liu Yang
    Subjects: Data Structures and Algorithms
    Abstract

    One of the motivations for property testing of boolean functions is the idea
    that testing can serve as a preprocessing step before learning. However, in
    most machine learning applications, the ability to query functions at arbitrary
    points in the input space is considered highly unrealistic. Instead, the
    dominant query paradigm in applied machine learning, called active learning, is
    one where the algorithm may ask for examples to be labeled, but only from among
    those that exist in nature.

  27. Perfectly Balanced Allocation With Estimated Average Using Approximately Constant Retries.

    Authors: Sourav Dutta, Souvik Bhattacherjee, Ankur Narang
    Subjects: Data Structures and Algorithms
    Abstract

    Balanced allocation of online balls-and-bins has long been an active area of
    research for efficient load balancing and hashing applications. There exists a
    large number of results in this domain with different settings, such as
    parallel allocations [1], multi-dimensional allocations [5], weighted balls [4]
    etc. For sequential multi-choice allocation where m balls are thrown into n
    bins with each ball choosing d (constant) bins independently uniformly at
    random, the maximum load of a bin is O(log log n)+m/n with high probability
    [3]. This offers the current best known allocation scheme.

  28. On the Value of Job Migration in Online Makespan Minimization.

    Authors: Matthias Hellwig, Susanne Albers
    Subjects: Data Structures and Algorithms
    Abstract

    Makespan minimization on identical parallel machines is a classical
    scheduling problem. We consider the online scenario where a sequence of $n$
    jobs has to be scheduled non-preemptively on $m$ machines so as to minimize the
    maximum completion time of any job. The best competitive ratio that can be
    achieved by deterministic online algorithms is in the range $[1.88,1.9201]$.
    Currently no randomized online algorithm with a smaller competitiveness is
    known, for general $m$.

  29. Multidimensional Balanced Allocation for Multiple Choice & (1 + Beta) Processes.

    Authors: Sourav Dutta, Souvik Bhattacherjee, Ankur Narang
    Subjects: Data Structures and Algorithms
    Abstract

    Allocation of balls into bins is a well studied abstraction for load
    balancing problems.The literature hosts numerous results for sequential(single
    dimensional) allocation case when m balls are thrown into n bins. In this paper
    we study the symmetric multiple choice process for both unweighted and weighted
    balls as well as for both multidimensional and scalar models.Additionally,we
    present the results on bounds on gap for (1+beta) choice process with
    multidimensional balls and bins.

  30. Improved Upper Bounds for Pairing Heaps.

    Authors: John Iacono
    Subjects: Data Structures and Algorithms
    Abstract

    Pairing heaps are shown to have constant amortized time Insert and Meld, thus
    showing that pairing heaps have the same amortized runtimes as Fibonacci heaps
    for all operations but Decrease-key.

  31. (1+eps)-approximate Sparse Recovery.

    Authors: Eric Price, David P. Woodruff
    Subjects: Data Structures and Algorithms
    Abstract

    The problem central to sparse recovery and compressive sensing is that of
    stable sparse recovery: we want a distribution of matrices A in R^{m\times n}
    such that, for any x \in R^n and with probability at least 2/3 over A, there is
    an algorithm to recover x* from Ax with

  32. Telling Two Distributions Apart: a Tight Characterization.

    Authors: Eyal Even Dar, Mark Sandler
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of distinguishing between two arbitrary black-box
    distributions defined over the domain [n], given access to $s$ samples from
    both. It is known that in the worst case O(n^{2/3}) samples is both necessary
    and sufficient, provided that the distributions have L1 difference of at least
    {\epsilon}. However, it is also known that in many cases fewer samples suffice.
    We identify a new parameter, that provides an upper bound on how many samples
    needed, and present an efficient algorithm that requires the number of samples
    independent of the domain size.

  33. Regularized Laplacian Estimation and Fast Eigenvector Approximation.

    Authors: Patrick O. Perry, Michael W. Mahoney
    Subjects: Data Structures and Algorithms
    Abstract

    Recently, Mahoney and Orecchia demonstrated that popular diffusion-based
    procedures to compute a quick \emph{approximation} to the first nontrivial
    eigenvector of a data graph Laplacian \emph{exactly} solve certain regularized
    Semi-Definite Programs (SDPs). In this paper, we extend that result by
    providing a statistical interpretation of their approximation procedure.

  34. On the strong chromatic index and maximum induced matching of tree-cographs and permutation graphs.

    Authors: Ton Kloks, Chin-Ting Ung, Yue-Li Wang
    Subjects: Data Structures and Algorithms
    Abstract

    We show that there exist linear-time algorithms that compute the strong
    chromatic index and a maximum induced matching of tree-cographs when the
    decomposition tree is a part of the input. We also show that there exists an
    efficient algorithm for the strong chromatic index of permutation graphs.

  35. Concave Generalized Flows with Applications to Market Equilibria.

    Authors: Laszlo A. Vegh
    Subjects: Data Structures and Algorithms
    Abstract

    We consider a nonlinear extension of the generalized network flow model, with
    the flow leaving an arc being an increasing concave function of the flow
    entering it, as proposed by Truemper and Shigeno. We give the first polynomial
    time combinatorial algorithm for solving corresponding optimization problems,
    finding a epsilon-approximate solution in O(m(m+\log n)\log(MUm/epsilon))
    arithmetic operations and value oracle queries, where M and U are upper bounds
    on simple parameters.

  36. Approximation Algorithms for Variable-Sized and Generalized Bin Covering.

    Authors: Alexander Souza, Matthias Hellwig
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we consider the \alg{Generalized Bin Covering} problem: We are
    given $m$ bin types, where each bin of type $i$ has profit $p_i$ and demand
    $d_i$. Furthermore, there are $n$ items, where item $j$ has size $s_j$. A bin
    of type $i$ is covered if the set of items assigned to it has total size at
    least the demand $d_i$. In that case, the profit of $p_i$ is earned and the
    objective is to maximize the total profit. To the best of our knowledge, only
    the cases $p_i = d_i = 1$ (\alg{Bin Covering}) and $p_i = d_i$
    (\alg{Variable-Sized Bin Covering}) have been treated before.

  37. Faster Approximate Pattern Matching in Compressed Repetitive Texts.

    Authors: Travis Gagie, Pawel Gawrychowski, Simon J. Puglisi
    Subjects: Data Structures and Algorithms
    Abstract

    Motivated by the imminent growth of massive, highly redundant genomic
    databases, we study the problem of compressing a string database while
    simultaneously supporting fast random access, substring extraction and pattern
    matching to the underlying string(s). Bille et al. (2011) recently showed how,
    given a straight-line program with $r$ rules for a string $s$ of length $n$, we
    can build an $\Oh{r}$-word data structure that allows us to extract any
    substring (s [i..j]) in $\Oh{\log n + j - i}$ time.

  38. Lossless data compression on GPGPU architectures.

    Authors: Axel Eirola
    Subjects: Data Structures and Algorithms
    Abstract

    Modern graphics processors provide exceptional computa- tional power, but
    only for certain computational models. While they have revolutionized
    computation in many fields, compression has been largely unnaffected. This
    paper aims to explain the current issues and possibili- ties in GPGPU
    compression. This is done by a high level overview of the GPGPU computational
    model in the context of compression algorithms; along with a more in-depth
    analysis of how one would implement bzip2 on a GPGPU architecture.

  39. Efficient Minimization of Higher Order Submodular Functions using Monotonic Boolean Functions.

    Authors: Srikumar Ramalingam, Chris Russell, Lubor Ladicky, Philip H.S. Torr
    Subjects: Data Structures and Algorithms
    Abstract

    Submodular function minimization is a key problem in a wide variety of
    applications in machine learning, economics, game theory, computer vision and
    many others. The general solver has a complexity of $O(n^6+n^5L)$ where $L$ is
    the time required to evaluate the function and $n$ is the number of variables
    \cite{orlin09}.

  40. A New Proposed Cost Model for List Accessing Problem using Buffering.

    Authors: Rakesh Mohanty, Sasmita Tripathy, Seetaya Bhoi
    Subjects: Data Structures and Algorithms
    Abstract

    There are many existing well known cost models for the list accessing
    problem. The standard cost model developed by Sleator and Tarjan is most widely
    used. In this paper, we have made a comprehensive study of the existing cost
    models and proposed a new cost model for the list accessing problem. In our
    proposed cost model, for calculating the processing cost of request sequence
    using a singly linked list, we consider the access cost, matching cost and
    replacement cost. The cost of processing a request sequence is the sum of
    access cost, matching cost and replacement cost.

  41. A Learning Theory Approach to Non-Interactive Database Privacy.

    Authors: Aaron Roth, Katrina Ligett, Avrim Blum
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we demonstrate that, ignoring computational constraints, it is
    possible to privately release synthetic databases that are useful for large
    classes of queries -- much larger in size than the database itself.
    Specifically, we give a mechanism that privately releases synthetic data for a
    class of queries over a discrete domain with error that grows as a function of
    the size of the smallest net approximately representing the answers to that
    class of queries.

  42. Characterization of Request Sequences for List Accessing Problem and New Theoretical Results for MTF Algorithm.

    Authors: Rakesh Mohanty, Burle Sharma, Sasmita Tripathy
    Subjects: Data Structures and Algorithms
    Abstract

    List Accessing Problem is a well studied research problem in the context of
    linear search. Input to the list accessing problem is an unsorted linear list
    of distinct elements along with a sequence of requests, where each request is
    an access operation on an element of the list. A list accessing algorithm
    reorganizes the list while processing a request sequence on the list in order
    to minimize the access cost.

  43. The Open Graph Archive: A Community-Driven Effort.

    Authors: Christian Bachmaier, Franz J. Brandenburg, Philip Effinger, Carsten Gutwenger, Jyrki Katajainen, Karsten Klein, Miro Spönemann, Matthias Stegmaier, Michael Wybrow
    Subjects: Data Structures and Algorithms
    Abstract

    In order to evaluate, compare, and tune graph algorithms, experiments on well
    designed benchmark sets have to be performed. Together with the goal of
    reproducibility of experimental results, this creates a demand for a public
    archive to gather and store graph instances. Such an archive would ideally
    allow annotation of instances or sets of graphs with additional information
    like graph properties and references to the respective experiments and results.
    Here we examine the requirements, and introduce a new community project with
    the aim of producing an easily accessible library of graphs.

  44. Linear Time Inference of Strings from Cover Arrays using a Binary Alphabet.

    Authors: M. Sohel Rahman, Tanaeem M. Moosa, Sumaiya Nazeen, Rezwana Reaz
    Subjects: Data Structures and Algorithms
    Abstract

    Covers being one of the most popular form of regularities in strings, have
    drawn much attention over time. In this paper, we focus on the problem of
    linear time inference of strings from cover arrays using the least sized
    alphabet possible. We present an algorithm that can reconstruct a string $x$
    over a two-letter alphabet whenever a valid cover array $C$ is given as an
    input. This algorithm uses several interesting combinatorial properties of
    cover arrays and an interesting relation between border array and cover array
    to achieve this. Our algorithm runs in linear time.

  45. Substring Range Reporting.

    Authors: Philip Bille, Inge Li Goertz
    Subjects: Data Structures and Algorithms
    Abstract

    We revisit various string indexing problems with range reporting features,
    namely, position-restricted substring searching, indexing substrings with gaps,
    and indexing substrings with intervals. We obtain the following main results.
    {itemize} We give efficient reductions for each of the above problems to a new
    problem, which we call \emph{substring range reporting}. Hence, we unify the
    previous work by showing that we may restrict our attention to a single problem
    rather than studying each of the above problems individually.

  46. Upward Point Set Embeddability for Convex Point Sets is in $P$.

    Authors: Tamara Mchedlidze, Antonios Symvonis, Michael Kaufmann
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we present a polynomial dynamic programming algorithm that
    tests whether a $n$-vertex directed tree $T$ has an upward planar embedding
    into a convex point-set $S$ of size $n$. Further, we extend our approach to the
    class of outerplanar digraphs. This nontrivial and surprising result implies
    that any given digraph can be efficiently tested for an upward planar embedding
    into a given convex point set.

  47. Weak Dominance Drawings and Linear Extension Diameter.

    Authors: Evgenios M. Kornaropoulos, Ioannis G. Tollis
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce the problem of Weak Dominance Drawing for general directed
    acyclic graphs and we show the connection with the linear extension diameter of
    a partial order P. We present complexity results and bounds.

  48. Conauto-2.0: Fast Isomorphism Testing and Automorphism Group Computation.

    Authors: Antonio Fernández Anta, José Luis López-Presa, Luis Núñez Chiroque
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we present an algorithm, called conauto-2.0, that can
    efficiently compute a set of generators of the automorphism group of a graph,
    and test whether two graphs are isomorphic, finding an isomorphism if they are.
    This algorithm uses the basic individualization/refinement technique, and is an
    improved version of the algorithm conauto, which has been shown to be very fast
    for random graphs and several families of hard graphs.

  49. Wireless Capacity With Arbitrary Gain Matrix.

    Authors: Magnus M. Halldorsson, Pradipta Mitra
    Subjects: Data Structures and Algorithms
    Abstract

    Given a set of wireless links, a fundamental problem is to find the largest
    subset that can transmit simultaneously, within the SINR model of interference.
    Significant progress on this problem has been made in recent years. In this
    note, we study the problem in the setting where we are given a fixed set of
    arbitrary powers each sender must use, and an arbitrary gain matrix defining
    how signals fade. This variation of the problem appears immune to most
    algorithmic approaches studied in the literature.

  50. Matroidal Degree-Bounded Minimum Spanning Trees.

    Authors: Rico Zenklusen
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the minimum spanning tree (MST) problem under the restriction
    that for every vertex v, the edges of the tree that are adjacent to v satisfy a
    given family of constraints. A famous example thereof is the classical
    degree-constrained MST problem, where for every vertex v, a simple upper bound
    on the degree is imposed. Iterative rounding/relaxation algorithms became the
    tool of choice for degree-bounded network design problems.

  51. Fully De-Amortized Cuckoo Hashing for Cache-Oblivious Dictionaries and Multimaps.

    Authors: Michael T. Goodrich, Michael Mitzenmacher, Justin Thaler, Daniel S. Hirschberg
    Subjects: Data Structures and Algorithms
    Abstract

    A dictionary (or map) is a key-value store that requires all keys be unique,
    and a multimap is a key-value store that allows for multiple values to be
    associated with the same key. We design hashing-based indexing schemes for
    dictionaries and multimaps that achieve worst-case optimal performance for
    lookups and updates, with a small or negligible probability the data structure
    will require a rehash operation, depending on whether we are working in the the
    external-memory (I/O) model or one of the well-known versions of the Random
    Access Machine (RAM) model.

  52. Lower Bounds for the Average and Smoothed Number of Pareto Optima.

    Authors: Luis Rademacher, Navin Goyal
    Subjects: Data Structures and Algorithms
    Abstract

    Smoothed analysis of multiobjective 0-1 linear optimization has drawn
    considerable attention recently. The number of Pareto-optimal solutions (i.e.,
    solutions with the property that no other solution is at least as good in all
    the coordinates and better in at least one) for multiobjective optimization
    problems is the central object of study. In this paper, we prove several lower
    bounds for the expected number of Pareto optima.

  53. Co-nondeterminism in compositions: A kernelization lower bound for a Ramsey-type problem.

    Authors: Stefan Kratsch
    Subjects: Data Structures and Algorithms
    Abstract

    Until recently, techniques for obtaining lower bounds for kernelization were
    one of the most sought after tools in the field of parameterized complexity.
    Now, after a strong influx of techniques, we are in the fortunate situation of
    having tools available that are even stronger than what has been required in
    their applications so far. Based on a result of Fortnow and Santhanam (JCSS
    2011), Bodlaender et al.

  54. On Polynomial Kernels for Structural Parameterizations of Odd Cycle Transversal.

    Authors: Stefan Kratsch, Bart M. P. Jansen
    Subjects: Data Structures and Algorithms
    Abstract

    The Odd Cycle Transversal problem (OCT) asks whether a given graph can be
    made bipartite (i.e., 2-colorable) by deleting at most l vertices. We study
    structural parameterizations of OCT with respect to their polynomial
    kernelizability, i.e., whether instances can be efficiently reduced to a size
    polynomial in the chosen parameter. It is a major open problem in parameterized
    complexity whether Odd Cycle Transversal admits a polynomial kernel when
    parameterized by l.

  55. K-sort: A new sorting algorithm that beats Heap sort for n <= 70 lakhs!.

    Authors: Kiran Kumar Sundararajan, Mita Pal, Soubhik Chakraborty, N.C. Mahanti
    Subjects: Data Structures and Algorithms
    Abstract

    Sundararajan and Chakraborty (2007) introduced a new version of Quick sort
    removing the interchanges. Khreisat (2007) found this algorithm to be competing
    well with some other versions of Quick sort. However, it uses an auxiliary
    array thereby increasing the space complexity. Here, we provide a second
    version of our new sort where we have removed the auxiliary array. This second
    improved version of the algorithm, which we call K-sort, is found to sort
    elements faster than Heap sort for an appreciably large array size (n <=
    70,00,000) for uniform U[0, 1] inputs.

  56. A Higher-Order Cheeger's Inequality.

    Authors: Shayan Oveis Gharan, Luca Trevisan
    Subjects: Data Structures and Algorithms
    Abstract

    A basic fact in algebraic graph theory is that the number of connected
    components in an undirected graph is equal to the multiplicity of the
    eigenvalue 1 in the normalized adjacency matrix of the graph. In particular,
    the graph is disconnected if and only if there are at least two eigenvalues
    equal to 1.

    Cheeger's inequality provides an "approximate" version of the latter fact,
    and it states that a graph has a sparse cut (it is "almost disconnected") if
    and only if there are at least two eigenvalues that are close to one.

  57. Learning Poisson Binomial Distributions.

    Authors: Ilias Diakonikolas, Constantinos Daskalakis, R. Servedio
    Subjects: Data Structures and Algorithms
    Abstract

    We consider a basic problem in unsupervised learning: learning an unknown
    \emph{Poisson Binomial Distribution} over $\{0,1,...,n\}$. A Poisson Binomial
    Distribution (PBD) is a sum $X = X_1 + ... + X_n$ of $n$ independent Bernoulli
    random variables which may have arbitrary expectations. We work in a framework
    where the learner is given access to independent draws from the distribution
    and must (with high probability) output a hypothesis distribution which has
    total variation distance at most $\eps$ from the unknown target PBD.

  58. Learning $k$-Modal Distributions via Testing.

    Authors: Ilias Diakonikolas, Constantinos Daskalakis, R. Servedio
    Subjects: Data Structures and Algorithms
    Abstract

    A $k$-modal probability distribution over the domain $\{1,...,n\}$ is one
    whose histogram has at most $k$ "peaks" and "valleys." Such distributions are
    natural generalizations of monotone ($k=0$) and unimodal ($k=1$) probability
    distributions, which have been intensively studied in probability theory and
    statistics.

  59. On the Approximability and Hardness of Minimum Topic Connected Overlay and Its Special Instances.

    Authors: Jun Hosoda, Juraj Hromkovic, Taisuke Izumi, Horotaka Ono, Monika Steinova, Koichi Wada
    Subjects: Data Structures and Algorithms
    Abstract

    In the context of designing a scalable overlay network to support
    decentralized topic-based pub/sub communication, the Minimum Topic-Connected
    Overlay problem (Min-TCO in short) has been investigated: Given a set of t
    topics and a collection of n users together with the lists of topics they are
    interested in, the aim is to connect these users to a network by a minimum
    number of edges such that every graph induced by users interested in a common
    topic is connected. It is known that Min-TCO is NP-hard and approximable within
    O(log t) in polynomial time.

  60. A Linear Time Algorithm for Seeds Computation.

    Authors: Marcin Kubica, Jakub Radoszewski, Wojciech Rytter, Tomasz Walen, Tomasz Kociumaka
    Subjects: Data Structures and Algorithms
    Abstract

    Periodicity in words is one of the most fundamental areas of text algorithms
    and combinatorics. Two classical and natural variations of periodicity are
    seeds and covers (also called quasiperiods). Linear-time algorithms are known
    for finding all the covers of a word, however in case of seeds, for the past 15
    years only an $O(n\log{n})$ time algorithm was known (Iliopoulos, Moore and
    Park, 1996). Finding an $o(n\log{n})$ time algorithm for the all-seeds problem
    was mentioned as one of the most important open problems related to repetitions
    in words in a survey by Smyth (2000).

  61. On Multiway Cut parameterized above lower bounds.

    Authors: Marek Cygan, Marcin Pilipczuk, Jakub Onufry Wojtaszczyk, Micha&#x142; Pilipczuk
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we consider two above lower bound parameterizations of the Node
    Multiway Cut problem - above the maximum separating cut and above a natural
    LP-relaxation - and prove them to be fixed-parameter tractable. Our results
    imply O*(4^k) algorithms for Vertex Cover above Maximum Matching and Almost
    2-SAT as well as an O*(2^k) algorithm for Node Multiway Cut with a standard
    parameterization by the solution size, improving previous bounds for these
    problems.

  62. Genome Halving by Block Interchange.

    Authors: Antoine Thomas, A&#xef;da Ouangraoua, Jean-St&#xe9;phane Varr&#xe9;
    Subjects: Data Structures and Algorithms
    Abstract

    We address the problem of finding the minimal number of block interchanges
    (exchange of two intervals) required to transform a duplicated linear genome
    into a tandem duplicated linear genome. We provide a formula for the distance
    as well as a polynomial time algorithm for the sorting problem.

  63. The traveling salesman problem on cubic and subcubic graphs.

    Authors: Ren&#xe9; Sitters, Sylvia Boyd, Suzanne van der Ster, Leen Stougie
    Subjects: Data Structures and Algorithms
    Abstract

    We study the Travelling Salesman Problem (TSP) on the metric completion of
    cubic and subcubic graphs, which is known to be NP-hard. The problem is of
    interest because of its relation to the famous 4/3 conjecture for metric TSP,
    which says that the integrality gap, i.e., the worst case ratio between the
    optimal values of the TSP and its linear programming relaxation (the subtour
    elimination relaxation), is 4/3. We present the first algorithm for cubic
    graphs with approximation ratio 4/3. The proof uses polyhedral techniques in a
    surprising way, which is of independent interest.

  64. Generalized Maneuvers in Route Planning.

    Authors: Petr Hlineny, Ondrej Moris
    Subjects: Data Structures and Algorithms
    Abstract

    We study an important practical aspect of the route planning problem in
    real-world road networks -- maneuvers. Informally, maneuvers are walks that
    might either decrease or increase the weight of a route by additional costs.
    One can model a wide range of route restrictions or traffic regulations by
    maneuvers. For instance turn-prohibitions, traffic light delays, round-abouts,
    forbidden passages and so on. A new approach to tackle maneuvers during route
    planning queries without prior adjustments of the underlying road network graph
    is presented.

  65. An Efficient Partitioning Oracle for Bounded-Treewidth Graphs.

    Authors: Krzysztof Onak, Avinatan Hassidim, Alan Edelman, Huy N. Nguyen
    Subjects: Data Structures and Algorithms
    Abstract

    Partitioning oracles were introduced by Hassidim et al. (FOCS 2009) as a
    generic tool for constant-time algorithms. For any epsilon > 0, a partitioning
    oracle provides query access to a fixed partition of the input bounded-degree
    minor-free graph, in which every component has size poly(1/epsilon), and the
    number of edges removed is at most epsilon*n, where n is the number of vertices
    in the graph.

  66. On vertex covers and matching number of trapezoid graphs.

    Authors: Aleksandar Ilic, Andreja Ilic
    Subjects: Data Structures and Algorithms
    Abstract

    The intersection graph of a collection of trapezoids with corner points lying
    on two parallel lines is called a trapezoid graph. Using binary indexed tree
    data structure, we improve algorithms for calculating the size and the number
    of minimum vertex covers (or independent sets), as well as the total number of
    vertex covers, and reduce the time complexity from $O (n^2)$ to $O (n \log n)$,
    where $n$ is the number of trapezoids.

  67. A simple algorithm for the evaluation of the hypergeometric series using quasi-linear time and linear space.

    Authors: Sergey V.Yakhontov
    Subjects: Data Structures and Algorithms
    Abstract

    A simple algorithm with quasi-linear time complexity and linear space
    complexity for the evaluation of the hypergeometric series with rational
    coefficients is constructed. It is shown that this algorithm is suitable in
    practical informatics for constructive analogues of often used constants of
    analysis.

  68. Unconstrained and Constrained Fault-Tolerant Resource Allocation.

    Authors: Kewen Liao, Hong Shen
    Subjects: Data Structures and Algorithms
    Abstract

    First, we study the Unconstrained Fault-Tolerant Resource Allocation (UFTRA)
    problem (a.k.a. FTFA problem in \cite{shihongftfa}). In the problem, we are
    given a set of sites equipped with an unconstrained number of facilities as
    resources, and a set of clients with set $\mathcal{R}$ as corresponding
    connection requirements, where every facility belonging to the same site has an
    identical opening (operating) cost and every client-facility pair has a
    connection cost. The objective is to allocate facilities from sites to satisfy
    $\mathcal{R}$ at a minimum total cost.

  69. A Library for Implementing the Multiple Hypothesis Tracking Algorithm.

    Authors: David Miguel Antunes, David Martins de Matos, Jos&#xe9; Gaspar
    Subjects: Data Structures and Algorithms
    Abstract

    The Multiple Hypothesis Tracking (MHT) algorithm is known to produce good
    results in difficult multi-target tracking situations. However, its
    implementation is not trivial, and is associated with a significant programming
    effort, code size and long implementation time. We propose a library which
    addresses these problems by providing a domain independent implementation of
    the most complex MHT operations. We also address the problem of applying
    clustering in domain independent manner.

  70. Approximating subset $k$-connectivity problems.

    Authors: Zeev Nutov
    Subjects: Data Structures and Algorithms
    Abstract

    A subset $T \subseteq V$ of terminals is $k$-connected to a root $s$ in a
    directed/undirected graph $J$ if $J$ has $k$ internally-disjoint $vs$-paths for
    every $v \in T$; $T$ is $k$-connected in $J$ if $T$ is $k$-connected to every
    $s \in T$. We consider the {\sf Subset $k$-Connectivity Augmentation} problem:
    given a graph $G=(V,E)$ with edge/node-costs, node subset $T \subseteq V$, and
    a subgraph $J=(V,E_J)$ of $G$ such that $T$ is $k$-connected in $J$, find a
    minimum-cost augmenting edge-set $F \subseteq E \setminus E_J$ such that $T$ is
    $(k+1)$-connected in $J \cup F$.

  71. Invitation to Algorithmic Uses of Inclusion-Exclusion.

    Authors: Thore Husfeldt
    Subjects: Data Structures and Algorithms
    Abstract

    I give an introduction to algorithmic uses of the principle of
    inclusion-exclusion. The presentation is intended to be be concrete and
    accessible, at the expense of generality and comprehensiveness.

  72. Incremental Cycle Detection, Topological Ordering, and Strong Component Maintenance.

    Authors: Bernhard Haeupler, Telikepalli Kavitha, Rogers Mathew, Siddhartha Sen, Robert Endre Tarjan
    Subjects: Data Structures and Algorithms
    Abstract

    We present two on-line algorithms for maintaining a topological order of a
    directed $n$-vertex acyclic graph as arcs are added, and detecting a cycle when
    one is created. Our first algorithm handles $m$ arc additions in $O(m^{3/2})$
    time. For sparse graphs ($m/n = O(1)$), this bound improves the best previous
    bound by a logarithmic factor, and is tight to within a constant factor among
    algorithms satisfying a natural {\em locality} property. Our second algorithm
    handles an arbitrary sequence of arc additions in $O(n^{5/2})$ time.

  73. LP-Based Approximation Algorithms for Traveling Salesman Path Problems.

    Authors: Hyung-Chan An, David B. Shmoys
    Subjects: Data Structures and Algorithms
    Abstract

    We present a (5/3 - epsilon)-approximation algorithm for some constant
    epsilon>0 for the traveling salesman path problem under the unit-weight
    graphical metric, and prove an upper bound on the integrality gap of the
    path-variant Held-Karp relaxation both under this metric and the general
    metric. Given a complete graph with the metric cost and two designated
    endpoints in the graph, the traveling salesman path problem is to find a
    minimum Hamiltonian path between these two endpoints. The best previously known
    performance guarantee for this problem was 5/3 and was due to Hoogeveen.

  74. Approximation Algorithms for Submodular Multiway Partition.

    Authors: Chandra Chekuri, Alina Ene
    Subjects: Data Structures and Algorithms
    Abstract

    We study algorithms for the Submodular Multiway Partition problem (SubMP). An
    instance of SubMP consists of a finite ground set $V$, a subset of $k$ elements
    $S = \{s_1,s_2,...,s_k\}$ called terminals, and a non-negative submodular set
    function $f:2^V\rightarrow \mathbb{R}_+$ on $V$ provided as a value oracle. The
    goal is to partition $V$ into $k$ sets $A_1,...,A_k$ such that for $1 \le i \le
    k$, $s_i \in A_i$ and $\sum_{i=1}^k f(A_i)$ is minimized.

  75. Submodular Cost Allocation Problem and Applications.

    Authors: Chandra Chekuri, Alina Ene
    Subjects: Data Structures and Algorithms
    Abstract

    We study the Minimum Submodular-Cost Allocation problem (MSCA). In this
    problem we are given a finite ground set $V$ and $k$ non-negative submodular
    set functions $f_1 ,..., f_k$ on $V$. The objective is to partition $V$ into
    $k$ (possibly empty) sets $A_1 ,..., A_k$ such that the sum $\sum_{i=1}^k
    f_i(A_i)$ is minimized. Several well-studied problems such as the non-metric
    facility location problem, multiway-cut in graphs and hypergraphs, and uniform
    metric labeling and its generalizations can be shown to be special cases of
    MSCA.

  76. Property Testing for Cyclic Groups and Beyond.

    Authors: Yuichi Yoshida, Francois Le Gall
    Subjects: Data Structures and Algorithms
    Abstract

    This paper studies the problem of testing if an input (Gamma,*), where Gamma
    is a finite set of unknown size and * is a binary operation over Gamma given as
    an oracle, is close to a specified class of groups. Friedl et al. [Efficient
    testing of groups, STOC'05] have constructed an efficient tester using
    poly(log|Gamma|) queries for the class of abelian groups.

  77. Improved Low-rank Matrix Decompositions via the Subsampled Randomized Hadamard Transform.

    Authors: Christos Boutsidis
    Subjects: Data Structures and Algorithms
    Abstract

    We comment on two randomized algorithms for constructing low-rank matrix
    decompositions. Both algorithms employ the Subsampled Randomized Hadamard
    Transform [14]. The first algorithm appeared recently in [9]; here, we provide
    a novel analysis that significantly improves the approximation bound obtained
    in [9]. A preliminary version of the second algorithm appeared in [7]; here, we
    present a mild modification of this algorithm that achieves the same
    approximation bound but significantly improves the corresponding running time.

  78. The maximum disjoint paths problem on multi-relations social networks.

    Authors: Bang Ye Wu
    Subjects: Data Structures and Algorithms
    Abstract

    Motivated by applications to social network analysis (SNA), we study the
    problem of finding the maximum number of disjoint uni-color paths in an
    edge-colored graph. We show the NP-hardness and the approximability of the
    problem, and both approximation and exact algorithms are proposed. Since short
    paths are much more significant in SNA, we also study the length-bounded
    version of the problem, in which the lengths of paths are required to be upper
    bounded by a fixed integer $l$. It is shown that the problem can be solved in
    polynomial time for $l=3$ and is NP-hard for $l\geq 4$.

  79. Random input helps searching predecessors.

    Authors: D. Belazzougui, A. C. Kaporis, P. G. Spirakis
    Subjects: Data Structures and Algorithms
    Abstract

    We solve the dynamic Predecessor Problem with high probability (whp) in
    constant time, using only $n^{1+\delta}$ bits of memory, for any constant
    $\delta > 0$. The input keys are random wrt a wider class of the well studied
    and practically important class of $(f_1, f_2)$-smooth distributions introduced
    in \cite{and:mat}. It achieves O(1) whp amortized time. Its worst-case time is
    $O(\sqrt{\frac{\log n}{\log \log n}})$. Also, we prove whp $O(\log \log \log
    n)$ time using only $n^{1+ \frac{1}{\log \log n}}= n^{1+o(1)}$ bits. Finally,
    we show whp $O(\log \log n)$ time using O(n) space.

  80. An improved approximation algorithm for the minimum-cost subset k-connected subgraph problem.

    Authors: Bundit Laekhanukit
    Subjects: Data Structures and Algorithms
    Abstract

    The minimum-cost subset $k$-connected subgraph problem is a cornerstone
    problem in the area of network design with vertex connectivity requirements. In
    this problem, we are given a graph $G=(V,E)$ with costs on edges and a set of
    terminals $T$. The goal is to find a minimum cost subgraph such that every pair
    of terminals are connected by $k$ openly (vertex) disjoint paths. In this
    paper, we present an approximation algorithm for the subset $k$-connected
    subgraph problem which improves on the previous best approximation guarantee of
    $O(k^2\log{k})$ by Nutov (FOCS 2009).

  81. Courcelle's Theorem - A Game-Theoretic Approach.

    Authors: Joachim Kneis, Peter Rossmanith, Alexander Langer
    Subjects: Data Structures and Algorithms
    Abstract

    Courcelle's Theorem states that every problem definable in Monadic
    Second-Order logic can be solved in linear time on structures of bounded
    treewidth, for example, by constructing a tree automaton that recognizes or
    rejects a tree decomposition of the structure. Existing, optimized software
    like the MONA tool can be used to build the corresponding tree automata, which
    for bounded treewidth are of constant size. Unfortunately, the constants
    involved can become extremely large - every quantifier alternation requires a
    power set construction for the automaton.

  82. Efficient Seeds Computation Revisited.

    Authors: Maxime Crochemore, Marcin Kubica, Jakub Radoszewski, Wojciech Rytter, Tomasz Walen, Michalis Christou, Costas S. Iliopoulos, Solon P. Pissis, Bartosz Szreder
    Subjects: Data Structures and Algorithms
    Abstract

    The notion of the cover is a generalization of a period of a string, and
    there are linear time algorithms for finding the shortest cover. The seed is a
    more complicated generalization of periodicity, it is a cover of a superstring
    of a given string, and the shortest seed problem is of much higher algorithmic
    difficulty. The problem is not well understood, no linear time algorithm is
    known. In the paper we give linear time algorithms for some of its versions ---
    computing shortest left-seed array, longest left-seed array and checking for
    seeds of a given length.

  83. Problems parameterized by treewidth tractable in single exponential time: a logical approach.

    Authors: Micha&#x142; Pilipczuk
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce a variant of modal logic, dubbed EXISTENTIAL COUNTING MODAL
    LOGIC (ECML), which captures a vast majority of problems known to be tractable
    in single exponential time when parameterized by treewidth. It appears that all
    these results can be subsumed by the theorem that model checking of ECML admits
    an algorithm with such complexity. We extend ECML by adding connectivity
    requirements and, using the Cut&Count technique introduced by Cygan et al.

  84. Confluent Persistence Revisited.

    Authors: Stefan Langerman, John Iacono, Sebastien Collette
    Subjects: Data Structures and Algorithms
    Abstract

    It is shown how to enhance any data structure in the pointer model to make it
    confluently persistent, with efficient query and update times and limited space
    overhead. Updates are performed in $O(\log n)$ amortized time, and following a
    pointer takes $O(\log c \log n)$ time where $c$ is the in-degree of a node in
    the data structure. In particular, this proves that confluent persistence can
    be achieved at a logarithmic cost in the bounded in-degree model used widely in
    previous work.

  85. A Note On Estimating the Spectral Norm of A Matrix Efficiently.

    Authors: Malik Magdon-Ismail
    Subjects: Data Structures and Algorithms
    Abstract

    We give an efficient algorithm which can obtain a relative error
    approximation to the spectral norm of a matrix, combining the power iteration
    method with some techniques from matrix reconstruction which use random
    sampling.

  86. Potent Tree Codes and their applications: Coding for Interactive Communication, revisited.

    Authors: Ran Gelles, Amit Sahai
    Subjects: Data Structures and Algorithms
    Abstract

    We study the fundamental problem of reliable interactive communication over a
    noisy channel. In a breakthrough sequence of papers published in 1992 and 1993,
    Schulman gave non-constructive proofs of the existence of general methods to
    emulate any two-party interactive protocol such that: (1) the emulation
    protocol takes a constant-factor longer than the original protocol, and (2) if
    the emulation protocol is executed over a noisy channel, then the probability
    that the emulation protocol fails is exponentially small in the total length of
    the protocol.

  87. A Note on: `Algorithms for Connected Set Cover Problem and Fault-Tolerant Connected Set Cover Problem'.

    Authors: Qing Zhao, Wei Ren
    Subjects: Data Structures and Algorithms
    Abstract

    A flaw in the greedy approximation algorithm proposed by Zhang et al. for
    minimum connected set cover problem is corrected, and a stronger result on the
    approximation ratio of the modified greedy algorithm is established. The
    results are now consistent with the existing results on connected dominating
    set problem which is a special case of the minimum connected set cover problem.

  88. Competitive Buffer Management with Class Segregation.

    Authors: Kamal Al-Bawani, Alexander Souza
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce a new model for buffer management of network switches with
    Quality of Service (QoS) requirements and give a tight analysis. Specifically,
    each network packet is associated with a numerical value, called Class of
    Service (CoS), which represents its priority. We are furthermore given a
    network switch with $m$ queues that have individual capacities. Each queue
    stores packets of a certain CoS, only. A stream of packets arrives over time at
    the switch and an online algorithm has to decide on the admission and
    transmission of packets.

  89. Acyclic and Star Colorings of Cographs.

    Authors: Andrew Lyons
    Subjects: Data Structures and Algorithms
    Abstract

    An \emph{acyclic coloring} of a graph is a proper vertex coloring such that
    the union of any two color classes induces a disjoint collection of trees. The
    more restricted notion of \emph{star coloring} requires that the union of any
    two color classes induces a disjoint collection of stars. We prove that every
    acyclic coloring of a cograph is also a star coloring and give a linear-time
    algorithm for finding an optimal acyclic and star coloring of a cograph. If the
    graph is given in the form of a cotree, the algorithm runs in O(n) time.

  90. Data-Oblivious External-Memory Algorithms for the Compaction, Selection, and Sorting of Outsourced Data.

    Authors: Michael T. Goodrich
    Subjects: Data Structures and Algorithms
    Abstract

    We present data-oblivious algorithms in the external-memory model for
    compaction, selection, and sorting. Motivation for such problems comes from
    clients who use outsourced data storage services and wish to mask their data
    access patterns. We show that compaction and selection can be done
    data-obliviously using $O(N/B)$ I/Os, and sorting can be done, with a high
    probability of success, using $O((N/B)\log_{M/B} (N/B))$ I/Os.

  91. Copy-on-write B-tree finally beaten.

    Authors: Andy Twigg, Andrew Byde, Grzegorz Milos, Tim Moreton, John Wilkes, Tom Wilkie
    Subjects: Data Structures and Algorithms
    Abstract

    A classic versioned data structure in storage and computer science is the
    copy-on-write (CoW) B-tree -- it underlies many of today's file systems and
    databases, including WAFL, ZFS, Btrfs and more. Unfortunately, it doesn't
    inherit the B-tree's optimality properties; it has poor space utilization,
    cannot offer fast updates, and relies on random IO to scale. Yet, nothing
    better has been developed since. We describe the `stratified B-tree', which
    beats the CoW B-tree in every way.

  92. Gossip PCA.

    Authors: Andrea Montanari, Sewoong Oh, Satish Babu Korada
    Subjects: Data Structures and Algorithms
    Abstract

    Eigenvectors of data matrices play an important role in many computational
    problems, ranging from signal processing to machine learning and control. For
    instance, algorithms that compute positions of the nodes of a wireless network
    on the basis of pairwise distance measurements require a few leading
    eigenvectors of the distances matrix.

  93. Incremental dimension reduction of tensors with random index.

    Authors: Fredrik Sandin, Blerim Emruli, Magnus Sahlgren
    Subjects: Data Structures and Algorithms
    Abstract

    We present an incremental, scalable and efficient dimension reduction
    technique for tensors that is based on sparse random linear coding. Data is
    stored in a compactified representation with fixed size, which makes memory
    requirements low and predictable. Component encoding and decoding are performed
    on-line without computationally expensive re-analysis of the data set. The
    range of tensor indices can be extended dynamically without modifying the
    component representation.

  94. Direction-Reversing Quasi-Random Rumor Spreading with Restarts.

    Authors: Carola Winzen
    Subjects: Data Structures and Algorithms
    Abstract

    In a recent work, Doerr and Fouz [\emph{Asymptotically Optimal Randomized
    Rumor Spreading}, in ArXiv] present a new quasi-random PUSH algorithm for the
    rumor spreading problem (also known as gossip spreading or message propagation
    problem). Their \emph{hybrid protocol} outperforms all known PUSH protocols.

  95. Improved space-time tradeoffs for approximate full-text indexing with one edit error.

    Authors: Djamal Belazzougui
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we are interested in indexing texts for susbstring matching
    with one edit error. That is given a text T of n characters over an alphabet of
    size sigma, we are asked to build a data structure that answers to the
    following query: find all the occ substrings of the text which are at edit
    distance at most 1 from a string p of length m. In this paper we show two new
    results for this problem. The first result suitable for arbitrary alphabet size
    sigma uses O(n(log^epsilon n+ log sigma)) words of space and answers to queries
    in time O(m+occ).

  96. A Randomized Algorithm Based on Threshold Accepting to Approximate the Star Discrepancy.

    Authors: Michael Gnewuch, Magnus Wahlstr&#xf6;m, Carola Winzen
    Subjects: Data Structures and Algorithms
    Abstract

    We present an improved heuristic for estimating the star discrepancy of a
    point set, building on the threshold accepting algorithm of Winker and Fang but
    with several improvements, including a non-uniform sampling strategy which is
    much more fitting for higher-dimensional inputs, and a rounding ("snapping")
    step which rounds points to critical boxes where the higher discrepancy values
    are guaranteed to be found.

  97. Hereditary biclique-Helly graphs: recognition and maximal biclique enumeration.

    Authors: Martiniano Egu&#xed;a, Francisco J. Soulignac
    Subjects: Data Structures and Algorithms
    Abstract

    A biclique is a set of vertices that induce a bipartite complete graph. A
    graph G is biclique-Helly when its family of maximal bicliques satisfies the
    Helly property. If every induced subgraph of G is also biclique-Helly, then G
    is hereditary biclique-Helly. A graph is C_4-dominated when every cycle of
    length 4 contains a vertex that is dominated by the vertex of the cycle that is
    not adjacent to it.

  98. Near-Optimal Column-Based Matrix Reconstruction.

    Authors: Petros Drineas, Malik Magdon-Ismail, Christos Boutsidis
    Subjects: Data Structures and Algorithms
    Abstract

    We consider low-rank reconstruction of a matrix using its columns and we
    present asymptotically optimal algorithms for both spectral norm and Frobenius
    norm reconstruction. The main tools we introduce to obtain our r esults are:
    (i) the use of fast approximate SVD-like decompositions for column
    reconstruction, and (ii) two deter ministic algorithms for selecting rows from
    matrices with orthonormal columns, building upon the sparse represen tation
    theorem for decompositions of the identity that appeared in \cite{BSS09}.

  99. Locating Depots for Capacitated Vehicle Routing.

    Authors: Viswanath Nagarajan, Inge Li Goertz
    Subjects: Data Structures and Algorithms
    Abstract

    We study a location-routing problem in the context of capacitated vehicle
    routing. The input is a set of demand locations in a metric space and a fleet
    of k vehicles each of capacity Q. The objective is to locate k depots, one for
    each vehicle, and compute routes for the vehicles so that all demands are
    satisfied and the total cost is minimized. Our main result is a constant-factor
    approximation algorithm for this problem. To achieve this result, we reduce to
    the k-median-forest problem, which generalizes both k-median and minimum
    spanning tree, and which might be of independent interest.

  100. Refinements of Miller's Algorithm over Weierstrass Curves Revisited.

    Authors: Duc-Phong Le, Chao-Liang Liu
    Subjects: Data Structures and Algorithms
    Abstract

    In 1986 Victor Miller described an algorithm for computing the Weil pairing
    in his unpublished manuscript. This algorithm has then become the core of all
    pairing-based cryptosystems. Many improvements of the algorithm have been
    presented. Most of them involve a choice of elliptic curves of a \emph{special}
    forms to exploit a possible twist during Tate pairing computation. Other
    improvements involve a reduction of the number of iterations in the Miller's
    algorithm. For the generic case, Blake, Murty and Xu proposed three refinements
    to Miller's algorithm over Weierstrass curves.

  101. Listing All Maximal Cliques in Large Sparse Real-World Graphs.

    Authors: David Eppstein, Darren Strash
    Subjects: Data Structures and Algorithms
    Abstract

    We implement a new algorithm for listing all maximal cliques in sparse graphs
    due to Eppstein, L\"offler, and Strash (ISAAC 2010) and analyze its performance
    on a large corpus of real-world graphs. Our analysis shows that this algorithm
    is the first to offer a practical solution to listing all maximal cliques in
    large sparse graphs. All other theoretically-fast algorithms for sparse graphs
    have been shown to be significantly slower than the algorithm of Tomita et al.
    (Theoretical Computer Science, 2006) in practice. However, the algorithm of
    Tomita et al.

  102. Approximation Algorithms for Union and Intersection Covering Problems.

    Authors: Marek Cygan, Marcin Pilipczuk, Fabrizio Grandoni, Piotr Sankowski, Stefano Leonardi, Marcin Mucha
    Subjects: Data Structures and Algorithms
    Abstract

    In a classical covering problem, we are given a set of requests that we need
    to satisfy (fully or partially), by buying a subset of items at minimum cost.
    For example, in the k-MST problem we want to find the cheapest tree spanning at
    least k nodes of an edge-weighted graph. Here nodes and edges represent
    requests and items, respectively.

  103. Improved RIP Analysis of Orthogonal Matching Pursuit.

    Authors: Ray Maleh
    Subjects: Data Structures and Algorithms
    Abstract

    Orthogonal Matching Pursuit (OMP) has long been considered a powerful
    heuristic for attacking compressive sensing problems; however, its theoretical
    development is, unfortunately, somewhat lacking. This paper presents an
    improved Restricted Isometry Property (RIP) based performance guarantee for
    T-sparse signal reconstruction that asymptotically approaches the conjectured
    lower bound given in Davenport et al. We also further extend the
    state-of-the-art by deriving reconstruction error bounds for the case of
    general non-sparse signals subjected to measurement noise.

  104. Approximation Algorithms for Correlated Knapsacks and Non-Martingale Bandits.

    Authors: Anupam Gupta, R. Ravi, Ravishankar Krishnaswamy, Marco Molinaro
    Subjects: Data Structures and Algorithms
    Abstract

    In the stochastic knapsack problem, we are given a knapsack of size B, and a
    set of jobs whose sizes and rewards are drawn from a known probability
    distribution. However, we know the actual size and reward only when the job
    completes. How should we schedule jobs to maximize the expected total reward?
    We know O(1)-approximations when we assume that (i) rewards and sizes are
    independent random variables, and (ii) we cannot prematurely cancel jobs. What
    can we say when either or both of these assumptions are changed?

  105. Analysis of multi-stage open shop processing systems.

    Authors: Gerhard J. Woeginger, Alexander Schrijver, Christian Eggermont
    Subjects: Data Structures and Algorithms
    Abstract

    We study algorithmic problems in multi-stage open shop processing systems
    that are centered around reachability and deadlock detection questions. We
    characterize safe and unsafe system states. We show that it is easy to
    recognize system states that can be reached from the initial state (where the
    system is empty), but that in general it is hard to decide whether one given
    system state is reachable from another given system state.

  106. Algorithms for Internal Validation Clustering Measures in the Post Genomic Era.

    Authors: Filippo Utro
    Subjects: Data Structures and Algorithms
    Abstract

    Inferring cluster structure in microarray datasets is a fundamental task for
    the -omic sciences. A fundamental question in Statistics, Data Analysis and
    Classification, is the prediction of the number of clusters in a dataset,
    usually established via internal validation measures. Despite the wealth of
    internal measures available in the literature, new ones have been recently
    proposed, some of them specifically for microarray data. In this dissertation,
    a study of internal validation measures is given, paying particular attention
    to the stability based ones.

  107. Multicriteria Steiner Tree Problem for Communication Network.

    Authors: Mark Sh. Levin, Rustem I. Nuriakhmetov
    Subjects: Data Structures and Algorithms
    Abstract

    This paper addresses combinatorial optimization scheme for solving the
    multicriteria Steiner tree problem for communication network topology design
    (e.g., wireless mesh network). The solving scheme is based on several models:
    multicriteria ranking, clustering, minimum spanning tree, and minimum Steiner
    tree problem. An illustrative numerical example corresponds to designing a
    covering long-distance Wi-Fi network (static Ad-Hoc network). The set of
    criteria (i.e., objective functions) involves the following: total cost, total
    edge length, overall throughput (capacity), and estimate of QoS.

  108. A Novel Sparsity-Promoted Approach to State Estimation from Dynamic Observation.

    Authors: Lianlin Li, B. Jafarpour
    Subjects: Data Structures and Algorithms
    Abstract

    The problem of time-varying signal estimation or state estimation from
    dynamic observation is encountered in many applications. Among well-developed
    approaches the Kalman filter-type approaches are the most popular and have
    played an important role of estimating dynamic signal from sequential
    observations. However, when the number of observations at each time step is
    highly limited, they usually take too many time steps to catch up with the true
    value with satisfied accuracy, even fail to do that in many cases.

  109. Graph Coalition Structure Generation.

    Authors: Thomas D. Voice, Maria Polukarov, Nicholas R. Jennings
    Subjects: Data Structures and Algorithms
    Abstract

    We give the first analysis of the computational complexity of {\it coalition
    structure generation over graphs}. Given an undirected graph $G=(N,E)$ and a
    valuation function $v:2^N\rightarrow\RR$ over the subsets of nodes, the problem
    is to find a partition of $N$ into connected subsets, that maximises the sum of
    the components' values.

  110. Algorithms for Jumbled Pattern Matching in Strings.

    Authors: Gabriele Fici, Ferdinando Cicalese, P&#xe9;ter Burcsi, Zsuzsanna Lipt&#xe1;k
    Subjects: Data Structures and Algorithms
    Abstract

    The Parikh vector p(s) of a string s is defined as the vector of
    multiplicities of the characters. Parikh vector q occurs in s if s has a
    substring t with p(t)=q. We present two novel algorithms for searching for a
    query q in a text s. One solves the decision problem over a binary text in
    constant time, using a linear size index of the text. The second algorithm, for
    a general finite alphabet, finds all occurrences of a given Parikh vector q and
    has sub-linear expected time complexity; we present two variants, which both
    use a linear size index of the text.

  111. Restructuring in Combinatorial Optimization.

    Authors: Mark Sh. Levin
    Subjects: Data Structures and Algorithms
    Abstract

    The paper addresses a new class of combinatorial problems which consist in
    restructuring of solutions (as structures) in combinatorial optimization. Two
    main features of the restructuring process are examined: (i) a cost of the
    restructuring, (ii) a closeness to a goal solution. This problem corresponds to
    redesign (improvement, upgrade) of modular systems or solutions. The
    restructuring approach is described and illustrated for the following
    combinatorial optimization problems: knapsack problem, multiple choice problem,
    assignment problem, spanning tree problems.

  112. Algorithms for Implicit Hitting Set Problems.

    Authors: Santosh Vempala, Karthekeyan Chandrasekaran, Richard Karp, Erick Moreno-Centeno
    Subjects: Data Structures and Algorithms
    Abstract

    A hitting set for a collection of sets is a set that has a non-empty
    intersection with each set in the collection; the hitting set problem is to
    find a hitting set of minimum cardinality. Motivated by instances of the
    hitting set problem where the number of sets to be hit is large, we introduce
    the notion of implicit hitting set problems. In an implicit hitting set problem
    the collection of sets to be hit is typically too large to list explicitly;
    instead, an oracle is provided which, given a set H, either determines that H
    is a hitting set or returns a set that H does not hit.

  113. Fault-Tolerant Spanners: Better and Simpler.

    Authors: Robert Krauthgamer, Michael Dinitz
    Subjects: Data Structures and Algorithms
    Abstract

    A natural requirement of many distributed structures is fault-tolerance:
    after some failures, whatever remains from the structure should still be
    effective for whatever remains from the network. In this paper we examine
    spanners of general graphs that are tolerant to vertex failures, and
    significantly improve their dependence on the number of faults $r$, for all
    stretch bounds.

  114. Connection errors in networks of linear features and the application of geometrical reduction in spatial data algorithms.

    Authors: Panteleimon Rodis
    Subjects: Data Structures and Algorithms
    Abstract

    The topic of this paper is to present a study of the connection errors in
    networks of linear features that are mapped in Geographical Information Systems
    (GIS), as well as algorithms that detect them and the notion of geometrical
    reduction and its use in spatial data algorithms. In datasets of networks
    usually there can be found errors, when a number of elements of the network are
    not connected according to the specifications of the network. This can occur
    due to errors in the digitization process of the network or because these
    elements are connected in reality in a not legal way.

  115. Maintaining Arrays of Contiguous Objects.

    Authors: Tom Kamphans, Nils Schweer, Michael A. Bender, S&#xe1;ndor P. Fekete
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we consider methods for dynamically storing a set of different
    objects ("modules") in a physical array. Each module requires one free
    contiguous subinterval in order to be placed. Items are inserted or removed,
    resulting in a fragmented layout that makes it harder to insert further
    modules. It is possible to relocate modules, one at a time, to another free
    subinterval that is contiguous and does not overlap with the current location
    of the module. These constraints clearly distinguish our problem from classical
    memory allocation.

  116. Linear-Space Data Structures for Range Mode Query in Arrays.

    Authors: Stephane Durocher, Jason Morrison
    Subjects: Data Structures and Algorithms
    Abstract

    A mode of a multiset $S$ is an element $a \in S$ of maximum multiplicity;
    that is, $a$ occurs at least as frequently as any other element in $S$. Given a
    list $A[1:n]$ of $n$ items, we consider the problem of constructing a data
    structure that efficiently answers range mode queries on $A$. Each query
    consists of an input pair of indices $(i, j)$ for which a mode of $A[i:j]$ must
    be returned. We present an $O(n^{2-2\epsilon})$-space static data structure
    that supports range mode queries in $O(n^\epsilon)$ time in the worst case, for
    any fixed $\epsilon \in [0,1/2]$.

  117. Self-Index Based on LZ77.

    Authors: Gonzalo Navarro, Sebastian Kreft
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce the first self-index based on the Lempel-Ziv 1977 compression
    format (LZ77). It is particularly competitive for highly repetitive text
    collections such as sequence databases of genomes of related species, software
    repositories, versioned document collections, and temporal text databases. Such
    collections are extremely compressible but classical self-indexes fail to
    capture that source of compressibility.

  118. Estimating the Average of a Lipschitz-Continuous Function from One Sample.

    Authors: David Kempe, Abhimanyu Das
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of estimating the average of a Lipschitz continuous
    function $f$ defined over a metric space, by querying $f$ at only a single
    point. More specifically, we explore the role of randomness in drawing this
    sample. Our goal is to find a distribution minimizing the expected estimation
    error against an adversarially chosen Lipschitz continuous function. Our work
    falls into the broad class of estimating aggregate statistics of a function
    from a small number of carefully chosen samples.

  119. Clustering Protein Sequences Given the Approximation Stability of the Min-Sum Objective Function.

    Authors: Shang-Hua Teng, Maria-Florina Balcan, Konstantin Voevodski, Heiko Roglin, Yu Xia
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of efficiently clustering protein sequences in a limited
    information setting. We assume that we do not know the distances between the
    sequences in advance, and must query them during the execution of the
    algorithm. Our goal is to find an accurate clustering while keeping the number
    of queries small. We model the problem as a point set S with an unknown metric
    d on S, and assume that we have access to one versus all distance queries that
    given a point s in S return the distances between s and all other points.

  120. Maximizing Submodular Set Functions Subject to Different Constraints: Combined Algorithms.

    Authors: Salman Fadaei, Mohammad Ali Safari, Mohammad Amin Fazli
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of maximizing constrained non-monotone submodular
    functions and provide approximation algorithms that improve existing algorithms
    in terms of either the approximation factor or simplicity. Our algorithms
    combine existing local search and greedy based algorithms. Different
    constraints that we study are (exact) cardinality and knapsack constraints. For
    the multiple-knapsack constraints we achieve a $(\frac{1}{4}-3\epsilon)$-factor
    algorithm.

  121. Approximations for Monotone and Non-monotone Submodular Maximization with Knapsack Constraints.

    Authors: Ariel Kulik, Hadas Shachnai, Tami Tamir
    Subjects: Data Structures and Algorithms
    Abstract

    Submodular maximization generalizes many fundamental problems in discrete
    optimization, including Max-Cut in directed/undirected graphs, maximum
    coverage, maximum facility location and marketing over social networks.

  122. Popular b-matchings.

    Authors: Katarzyna Paluch
    Subjects: Data Structures and Algorithms
    Abstract

    Suppose that each member of a set of agents has a preference list of a subset
    of houses, possibly involving ties and each agent and house has their capacity
    denoting the maximum number of correspondingly agents/houses that can be
    matched to him/her/it. We want to find a matching $M$, for which there is no
    other matching $M'$ such that more agents prefer $M'$ to $M$ than $M$ to $M'$.
    (What it means that an agent prefers one matching to the other is explained in
    the paper.) Popular matchings have been studied quite extensively, especially
    in the one-to-one setting.

  123. Quasirandom Rumor Spreading: An Experimental Analysis.

    Authors: Benjamin Doerr, Tobias Friedrich, Thomas Sauerwald, Marvin K&#xfc;nnemann
    Subjects: Data Structures and Algorithms
    Abstract

    We empirically analyze two versions of the well-known "randomized rumor
    spreading" protocol to disseminate a piece of information in networks. In the
    classical model, in each round each informed node informs a random neighbor. In
    the recently proposed quasirandom variant, each node has a (cyclic) list of its
    neighbors. Once informed, it starts at a random position of the list, but from
    then on informs its neighbors in the order of the list.

  124. Quasirandom Rumor Spreading.

    Authors: Benjamin Doerr, Tobias Friedrich, Thomas Sauerwald
    Subjects: Data Structures and Algorithms
    Abstract

    We propose and analyse a quasirandom analogue of the classical push model for
    disseminating information in networks ("randomized rumor spreading").

    In the classical model, in each round each informed vertex chooses a neighbor
    at random and informs it, if it was not before. It is known that this simple
    protocol succeeds in spreading a rumor from one vertex to all others within
    O(log n) rounds on complete graphs, hypercubes, random regular graphs,
    Erdos-Renyi random graph and Ramanujan graphs with high probability.

  125. Transforming a random graph drawing into a Lombardi drawing.

    Authors: Nicolaos Matsakis
    Subjects: Data Structures and Algorithms
    Abstract

    The visualization of any graph plays important role in various aspects, such
    as graph drawing software. Complex systems (like large databases or networks)
    that have a graph structure should be properly visualized in order to avoid
    obfuscation. One way to provide an aesthetic improvement to a graph
    visualization is to apply a force-directed drawing algorithm to it. This
    method, that emerged in the 60's views graphs as spring systems that exert
    forces (repulsive or attractive) to the nodes.

  126. Sublinear Time, Measurement-Optimal, Sparse Recovery For All.

    Authors: Ely Porat, Martin J. Strauss
    Subjects: Data Structures and Algorithms
    Abstract

    An approximate sparse recovery system in ell_1 norm formally consists of
    parameters N, k, epsilon an m-by-N measurement matrix, Phi, and a decoding
    algorithm, D. Given a vector, x, where x_k denotes the optimal k-term
    approximation to x, the system approximates x by hat_x = D(Phi.x), which must
    satisfy

    ||hat_x - x||_1 <= (1+epsilon)||x - x_k||_1.

    Among the goals in designing such systems are minimizing m and the runtime of
    D. We consider the "forall" model, in which a single matrix Phi is used for all
    signals x.

  127. On Tuning the Bad-Character Rule: the Worst-Character Rule.

    Authors: Domenico Cantone, Simone Faro
    Subjects: Data Structures and Algorithms
    Abstract

    In this note we present the worst-character rule, an efficient variation of
    the bad-character heuristic for the exact string matching problem, firstly
    introduced in the well-known Boyer-Moore algorithm. Our proposed rule selects a
    position relative to the current shift which yields the largest average
    advancement, according to the characters distribution in the text. Experimental
    results show that the worst-character rule achieves very good results
    especially in the case of long patterns or small alphabets in random texts and
    in the case of texts in natural languages.

  128. Almost Optimal Explicit Johnson-Lindenstrauss Transformations.

    Authors: Raghu Meka
    Subjects: Data Structures and Algorithms
    Abstract

    The Johnson-Lindenstrauss lemma is a fundamental result in probability with
    several applications in the design and analysis of algorithms in high
    dimensional geometry. Most known constructions of linear embeddings that
    satisfy the Johnson-Lindenstrauss property involve randomness.

  129. Contractions, Removals and How to Certify 3-Connectivity in Linear Time.

    Authors: Jens M. Schmidt
    Subjects: Data Structures and Algorithms
    Abstract

    It is well-known as an existence result that every 3-connected graph G=(V,E)
    on more than 4 vertices admits a sequence of contractions and a sequence of
    removal operations to K_4 such that every intermediate graph is 3-connected. We
    show that both sequences can be computed in optimal time, improving the
    previously best known running times of O(|V|^2) to O(|V|+|E|). This settles
    also the open question of finding a linear time 3-connectivity test that is
    certifying and extends to a certifying 3-edge-connectivity test in the same
    time.

  130. Computing the diameter polynomially faster than APSP.

    Authors: Raphael Yuster
    Subjects: Data Structures and Algorithms
    Abstract

    We present a new algorithm for computing the diameter of an unweighted
    directed graph. Our algorithm runs in $\Ot(n^{(\w^2+3)/(\w+1)})$ time, where
    $\w < 2.376$ is the exponent of fast matrix multiplication and $n$ is the
    number of vertices of the graph. In particular, our algorithm runs in
    $O(n^{2.561})$ time, and if $\w=2+o(1)$ it runs in $\Ot(n^{7/3})$ time. This is
    the first algorithm that computes the diameter of a graph polynomially faster
    than All-Pairs Shortest Paths.

  131. Steiner Transitive-Closure Spanners of d-Dimensional Posets.

    Authors: Elena Grigorescu, Sofya Raskhodnikova, Arnab Bhattacharyya, Piotr Berman, David Woodruff, Grigory Yaroslavtsev
    Subjects: Data Structures and Algorithms
    Abstract

    Given a directed graph G and an integer k >= 1, a
    k-transitive-closure-spanner (k-TCspanner) of G is a directed graph H that has
    (1) the same transitive-closure as G and (2) diameter at most k. In some
    applications, the shortcut paths added to the graph in order to obtain small
    diameter can use Steiner vertices, that is, vertices not in the original graph
    G. The resulting spanner is called a Steiner transitive-closure spanner
    (Steiner TC-spanner).

  132. New Algorithms on Wavelet Trees and Applications to Information Retrieval.

    Authors: Gonzalo Navarro, Travis Gagie, Simon J. Puglisi
    Subjects: Data Structures and Algorithms
    Abstract

    Wavelet trees are widely used in the representation of sequences,
    permutations, text collections, binary relations, discrete points, and other
    succinct data structures. We show, however, that this still falls short of
    exploiting all of the virtues of this versatile data structure. In particular
    we show how to use wavelet trees to solve fundamental algorithmic problems such
    as {\em range quantile} queries, {\em range next value} queries, and {\em range
    intersection} queries.

  133. Recursive Sketching For Frequency Moments.

    Authors: Vladimir Braverman, Rafail Ostrovsky
    Subjects: Data Structures and Algorithms
    Abstract

    In a ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to
    compute $F_k$ (for $k>2$) in space complexity $O(\mbox{\em poly-log}(n,m)\cdot
    n^{1-\frac2k})$, which is optimal up to (large) poly-logarithmic factors in $n$
    and $m$, where $m$ is the length of the stream and $n$ is the upper bound on
    the number of distinct elements in a stream. The best known lower bound for
    large moments is $\Omega(\log(n)n^{1-\frac2k})$.

  134. Streaming Algorithms from Precision Sampling.

    Authors: Krzysztof Onak, Alexandr Andoni, Robert Krauthgamer
    Subjects: Data Structures and Algorithms
    Abstract

    In STOC'05, Indyk and Woodruff introduced a new technique that yielded a
    near-optimal space algorithm for $F_k$ moment estimation problem. Since then,
    the technique has inspired a number of advances in streaming algorithmics. We
    show that at least several of these results follow easily from the application
    of a single probabilistic technique, called Precision Sampling. Using it, we
    obtain simple streaming algorithms that maintain a randomized sketch of an
    input vector $x=(x_1,...,x_n)$, useful for the following applications:
    Estimating the $F_k$ moment of $x$, for $k>2$.

  135. On a game theoretic approach to capacity maximization in wireless networks.

    Authors: Eyj&#xf3;lfur Ingi &#xc1;sgeirsson, Pradipta Mitra
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the capacity problem (or, the single slot scheduling problem) in
    wireless networks. Our goal is to maximize the number of successful connections
    in arbitrary wireless networks where a transmission is successful only if the
    signal-to-interference-plus-noise ratio at the receiver is greater than some
    threshold. We study a game theoretic approach towards capacity maximization
    introduced by Andrews and Dinitz (INFOCOM 2009) and Dinitz (INFOCOM 2010). We
    prove vastly improved bounds for the game theoretic algorithm.

  136. Subset feedback vertex set is fixed parameter tractable.

    Authors: Marek Cygan, Marcin Pilipczuk, Jakub Onufry Wojtaszczyk, Michal Pilipczuk
    Subjects: Data Structures and Algorithms
    Abstract

    The FEEDBACK VERTEX SET problem, given an undirected graph G and an integer
    k, asks whether there exists a set of at most k vertices that hits all cycles
    in the graph G. FEEDBACK VERTEX SET was intensively studied from the
    parameterized perspective. In this paper we consider the SUBSET FEEDBACK VERTEX
    SET problem, where an instance comes additionally with a subset of vertices S
    and we ask for a set of at most k vertices that hits all simple cycles passing
    through S. The parameterized complexity of SUBSET FEEDBACK VERTEX SET was
    posted as an open problem independently by K.

  137. A better tester for bipartiteness?.

    Authors: Andrej Bogdanov, Fan Li
    Subjects: Data Structures and Algorithms
    Abstract

    Alon and Krivelevich (SIAM J. Discrete Math. 15(2): 211-227 (2002)) show that
    if a graph is {\epsilon}-far from bipartite, then the subgraph induced by a
    random subset of O(1/{\epsilon}) vertices is bipartite with high probability.
    We conjecture that the induced subgraph is {\Omega}~({\epsilon})-far from
    bipartite with high probability. Gonen and Ron (RANDOM 2007) proved this
    conjecture in the case when the degrees of all vertices are at most
    O({\epsilon}n). We give a more general proof that works for any d-regular (or
    almost d-regular) graph for arbitrary degree d.

  138. Efficient Triangle Counting in Large Graphs via Degree-based Vertex Partitioning.

    Authors: Charalampos E. Tsourakakis, Gary L. Miller, Richard Peng, Mihail N. Kolountzakis
    Subjects: Data Structures and Algorithms
    Abstract

    The number of triangles is a computationally expensive graph statistic which
    is frequently used in complex network analysis (e.g., transitivity ratio), in
    various random graph models (e.g., exponential random graph model) and in
    important real world applications such as spam detection, uncovering of the
    hidden thematic structure of the Web and link recommendation. Counting
    triangles in graphs with millions and billions of edges requires algorithms
    which run fast, use small amount of space, provide accurate estimates of the
    number of triangles and preferably are parallelizable.

  139. Query Efficient PTAS for Minimum Feedback Arc-Set in Tournaments.

    Authors: Nir Ailon
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of Minimum Feedback Arc-Set in Tournaments (MFAST) in a
    setting where each edge from the input graph is given for a cost, and
    computations are done for free. We show that a recent PTAS by Kenyon and Schudy
    from 2007 in a standard computation model (input for free, computation costs)
    can be converted into a algorithm that requires observing only a sample of the
    input, with a similar approximation guarantee.

  140. Approximating Subdense Instances of Covering Problems.

    Authors: Jean Cardinal, Marek Karpinski, Richard Schmied, Claus Viehmann
    Subjects: Data Structures and Algorithms
    Abstract

    We study approximability of subdense instances of various covering problems
    on graphs, defined as instances in which the minimum or average degree is
    Omega(n/psi(n)) for some function psi(n)=omega(1) of the instance size. We
    design new approximation algorithms as well as new polynomial time
    approximation schemes (PTASs) for those problems and establish first
    approximation hardness results for them. Interestingly, in some cases we were
    able to prove optimality of the underlying approximation ratios, under usual
    complexity-theoretic assumptions.

  141. CUR from a Sparse Optimization Viewpoint.

    Authors: Michael W. Mahoney, Jacob Bien, Ya Xu
    Subjects: Data Structures and Algorithms
    Abstract

    The CUR decomposition provides an approximation of a matrix $X$ that has low
    reconstruction error and that is sparse in the sense that the resulting
    approximation lies in the span of only a few columns of $X$. In this regard, it
    appears to be similar to many sparse PCA methods. However, CUR takes a
    randomized algorithmic approach, whereas most sparse PCA methods are framed as
    convex optimization problems. In this paper, we try to understand CUR from a
    sparse optimization viewpoint.

  142. On Finding Similar Items in a Stream of Transactions.

    Authors: Andrea Campagna, Rasmus Pagh
    Subjects: Data Structures and Algorithms
    Abstract

    While there has been a lot of work on finding frequent itemsets in
    transaction data streams, none of these solve the problem of finding similar
    pairs according to standard similarity measures. This paper is a first attempt
    at dealing with this, arguably more important, problem.

  143. On Finding Frequent Patterns in Event Sequences.

    Authors: Andrea Campagna, Rasmus Pagh
    Subjects: Data Structures and Algorithms
    Abstract

    Given a directed acyclic graph with labeled vertices, we consider the problem
    of finding the most common label sequences ("traces") among all paths in the
    graph (of some maximum length m). Since the number of paths can be huge, we
    propose novel algorithms whose time complexity depends only on the size of the
    graph, and on the frequency epsilon of the most frequent traces. In addition,
    we apply techniques from streaming algorithms to achieve space usage that
    depends only on epsilon, and not on the number of distinct traces.

  144. Algorithmic and Statistical Perspectives on Large-Scale Data Analysis.

    Authors: Michael W. Mahoney
    Subjects: Data Structures and Algorithms
    Abstract

    In recent years, ideas from statistics and scientific computing have begun to
    interact in increasingly sophisticated and fruitful ways with ideas from
    computer science and the theory of algorithms to aid in the development of
    improved worst-case algorithms that are useful for large-scale scientific and
    Internet data analysis problems.

  145. Superselectors: Efficient Constructions and Applications.

    Authors: Ferdinando Cicalese, Ugo Vaccaro
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce a new combinatorial structure: the superselector. We show that
    superselectors subsume several important combinatorial structures used in the
    past few years to solve problems in group testing, compressed sensing,
    multi-channel conflict resolution and data security. We prove close upper and
    lower bounds on the size of superselectors and we provide efficient algorithms
    for their constructions.

  146. Implementing regularization implicitly via approximate eigenvector computation.

    Authors: Michael W. Mahoney, Lorenzo Orecchia
    Subjects: Data Structures and Algorithms
    Abstract

    Regularization is a powerful technique for extracting useful information from
    noisy data. Typically, it is implemented by adding some sort of norm constraint
    to an objective function and then exactly optimizing the modified objective
    function. This procedure typically leads to optimization problems that are
    computationally more expensive than the original problem, a fact that is
    clearly problematic if one is interested in large-scale applications.

  147. FAST: Kernelization based on Graph Modular Decomposition.

    Authors: Yixin Cao, Jianer Chen
    Subjects: Data Structures and Algorithms
    Abstract

    Kernelization algorithms, usually a preprocessing step before other more
    traditional algorithms, are very special in the sense that they return
    (reduced) instances, instead of final results. This characteristic excludes the
    freedom of applying a kernelization algorithm for the weighted version of a
    problem to its unweighted instances. Thus with only very few special cases,
    kernelization algorithms have to be studied separately for weigthed and
    unweighted versions of a single problem.

  148. Polynomial-time sortable stacks of burnt pancakes.

    Authors: Josef Cibulka, Anthony Labarre
    Subjects: Data Structures and Algorithms
    Abstract

    Pancake flipping, a famous open problem in computer science, can be
    formalised as the problem of sorting a permutation of positive integers using
    as few prefix reversals as possible. In that context, a prefix reversal of
    length k reverses the order of the first k elements of the permutation. The
    burnt variant of pancake flipping involves permutations of signed integers, and
    reversals in that case not only reverse the order of elements but also invert
    their signs.

  149. Comparative Performance of Tabu Search and Simulated Annealing Heuristics for the Quadratic Assignment Problem.

    Authors: Gerald Paul
    Subjects: Data Structures and Algorithms
    Abstract

    For almost two decades the question of whether tabu search (TS) or simulated
    annealing (SA) performs better for the quadratic assignment problem has been
    unresolved. To answer this question satisfactorily, we compare performance at
    various values of targeted solution quality, running each heuristic at its
    optimal number of iterations for each target. We find that for a number of
    varied problem instances, SA performs better for higher quality targets while
    TS performs better for lower quality targets.

  150. L1 Projections with Box Constraints.

    Authors: Mithun Das Gupta, Sanjeev Kumar, Jing Xiao
    Subjects: Data Structures and Algorithms
    Abstract

    We study the L1 minimization problem with additional box constraints. We
    motivate the problem with two different views of optimality considerations. We
    look into imposing such constraints in projected gradient techniques and
    propose a worst case linear time algorithm to perform such projections. We
    demonstrate the merits and effectiveness of our algorithms on synthetic as well
    as real experiments.

  151. Exact Analysis of Pattern Matching Algorithms with Probabilistic Arithmetic Automata.

    Authors: Tobias Marschall, Sven Rahmann
    Subjects: Data Structures and Algorithms
    Abstract

    We propose a framework for the exact probabilistic analysis of window-based
    pattern matching algorithms, such as Boyer-Moore, Horspool, Backward DAWG
    Matching, Backward Oracle Matching, and more. In particular, we show how to
    efficiently obtain the distribution of such an algorithm's running time cost
    for any given pattern in a random text model, which can be quite general, from
    simple uniform models to higher-order Markov models or hidden Markov models
    (HMMs).

  152. Fast Pseudo-Random Fingerprints.

    Authors: Ely Porat, Yoram Bachrach
    Subjects: Data Structures and Algorithms
    Abstract

    We propose a method to exponentially speed up computation of various
    fingerprints, such as the ones used to compute similarity and rarity in massive
    data sets. Rather then maintaining the full stream of $b$ items of a universe
    $[u]$, such methods only maintain a concise fingerprint of the stream, and
    perform computations using the fingerprints. The computations are done
    approximately, and the required fingerprint size $k$ depends on the desired
    accuracy $\epsilon$ and confidence $\delta$.

  153. List Factoring and Relative Worst Order Analysis.

    Authors: Martin R. Ehmsen, Jens S. Kohrt, Kim S. Larsen
    Subjects: Data Structures and Algorithms
    Abstract

    Relative worst order analysis is a supplement or alternative to competitive
    analysis which has been shown to give results more in accordance with observed
    behavior of online algorithms for a range of different online problems. The
    contribution of this paper is twofold. First, it adds the static list accessing
    problem to the collection of online problems where relative worst order
    analysis gives better results. Second, and maybe more interesting, it adds the
    non-trivial supplementary proof technique of list factoring to the theoretical
    toolbox for relative worst order analysis.

  154. Approximability of Capacitated Network Design.

    Authors: Sanjeev Khanna, Deeparnab Chakrabarty, Nitish Korula, Chandra Chekuri
    Subjects: Data Structures and Algorithms
    Abstract

    In the {\em capacitated} survivable network design problem (Cap-SNDP), we are
    given an undirected multi-graph where each edge has a capacity and a cost. The
    goal is to find a minimum cost subset of edges that satisfies a given set of
    pairwise minimum-cut requirements. Unlike its classical special case of SNDP
    when all capacities are unit, the approximability of Cap-SNDP is not well
    understood; even in very restricted settings no known algorithm achieves a
    $o(m)$ approximation, where $m$ is the number of edges in the graph.

  155. Testing Closeness of Discrete Distributions.

    Authors: Lance Fortnow, Tugkan Batu, Ronitt Rubinfeld, Warren D. Smith, Patrick White
    Subjects: Data Structures and Algorithms
    Abstract

    Given samples from two distributions over an $n$-element set, we wish to test
    whether these distributions are statistically close. We present an algorithm
    which uses sublinear in $n$, specifically, $O(n^{2/3}\epsilon^{-8/3}\log n)$,
    independent samples from each distribution, runs in time linear in the sample
    size, makes no assumptions about the structure of the distributions, and
    distinguishes the cases when the distance between the distributions is small
    (less than $\max\{\epsilon^{4/3}n^{-1/3}/32, \epsilon n^{-1/2}/4\}$) or large
    (more than $\epsilon$) in $\ell_1$ distance.

  156. A PTAS for Scheduling with Tree Assignment Restrictions.

    Authors: Ulrich M. Schwarz
    Subjects: Data Structures and Algorithms
    Abstract

    Scheduling with assignment restrictions is an important special case of
    scheduling unrelated machines which has attracted much attention in the recent
    past. While a lower bound on approximability of 3/2 is known for its most
    general setting, subclasses of the problem admit polynomial-time approximation
    schemes. This note provides a PTAS for tree-like hierarchical structures,
    improving on a recent 4/3-approximation by Huo and Leung.

  157. Testing Simultaneous Planarity when the Common Graph is 2-Connected.

    Authors: Bernhard Haeupler, Krishnam Raju Jampani, Anna Lubiw
    Subjects: Data Structures and Algorithms
    Abstract

    Two planar graphs G1 and G2 sharing some vertices and edges are
    `simultaneously planar' if they have planar drawings such that a shared vertex
    [edge] is represented by the same point [curve] in both drawings. It is an open
    problem whether simultaneous planarity can be tested efficiently. We give a
    linear-time algorithm to test simultaneous planarity when the two graphs share
    a 2-connected subgraph. Our algorithm extends to the case of k planar graphs
    where each vertex [edge] is either common to all graphs or belongs to exactly
    one of them.

  158. A Versatile Algorithm to Generate Various Combinatorial Structures.

    Authors: Abhiram Natarajan, Pramod Ganapathi, Rama B
    Subjects: Data Structures and Algorithms
    Abstract

    Algorithms to generate various combinatorial structures find tremendous
    importance in computer science. In this paper, we begin by reviewing an
    algorithm proposed by Rohl that generates all unique permutations of a list of
    elements which possibly contains repetitions, taking some or all of the
    elements at a time, in any imposed order. The algorithm uses an auxiliary array
    that maintains the number of occurrences of each unique element in the input
    list. We provide a proof of correctness of the algorithm.

  159. Popularity at Minimum Cost.

    Authors: Prajakta Nimbhorkar, Telikepalli Kavitha, Meghana Nasre
    Subjects: Data Structures and Algorithms
    Abstract

    We consider an extension of the {\em popular matching} problem in this paper.
    The input to the popular matching problem is a bipartite graph G = (A U B,E),
    where A is a set of people, B is a set of items, and each person a belonging to
    A ranks a subset of items in an order of preference, with ties allowed. The
    popular matching problem seeks to compute a matching M* between people and
    items such that there is no matching M where more people are happier with M
    than with M*. Such a matching M* is called a popular matching.

  160. Small Vertex Cover makes Petri Net Coverability and Boundedness Easier.

    Authors: M. Praveen
    Subjects: Data Structures and Algorithms
    Abstract

    The coverability and boundedness problems for Petri nets are known to be
    Expspace-complete. Given a Petri net, we associate a graph with it. With the
    vertex cover number k of this graph and the maximum arc weight W as parameters,
    we show that coverability and boundedness are in ParaPspace.

  161. Clustering, Encoding and Diameter Computation Algorithms for Multidimensional Data.

    Authors: Eliana-Dina Tirsa, Mugurel Ionut Andreica
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we present novel algorithms for several multidimensional data
    processing problems. We consider problems related to the computation of
    restricted clusters and of the diameter of a set of points using a new distance
    function. We also consider two string (1D data) processing problems, regarding
    an optimal encoding method and the computation of the number of occurrences of
    a substring within a string generated by a grammar. The algorithms have been
    thoroughly analyzed from a theoretical point of view and some of them have also
    been evaluated experimentally.

  162. Maximizing the Total Resolution of Graphs.

    Authors: Antonios Symvonis, Evmorfia N. Argyriou, Michael A. Bekos
    Subjects: Data Structures and Algorithms
    Abstract

    A major factor affecting the readability of a graph drawing is its
    resolution. In the graph drawing literature, the resolution of a drawing is
    either measured based on the angles formed by consecutive edges incident to a
    common node (angular resolution) or by the angles formed at edge crossings
    (crossing resolution). In this paper, we evaluate both by introducing the
    notion of "total resolution", that is, the minimum of the angular and crossing
    resolution. To the best of our knowledge, this is the first time where the
    problem of maximizing the total resolution of a drawing is studied.

  163. Fast Searching in Packed Strings.

    Authors: Philip Bille
    Subjects: Data Structures and Algorithms
    Abstract

    Given strings $P$ and $Q$ the (exact) string matching problem is to find all
    positions of substrings in $Q$ matching $P$. The classical Knuth-Morris-Pratt
    algorithm [SIAM J. Comput., 1977] solves the string matching problem in linear
    time which is optimal if we can only read one character at the time. However,
    most strings are stored in a computer in a packed representation with several
    characters in a single word, giving us the opportunity to read multiple
    characters simultaneously.

  164. A Branch-and-Reduce Algorithm for Finding a Minimum Independent Dominating Set.

    Authors: Serge Gaspers, Mathieu Liedloff
    Subjects: Data Structures and Algorithms
    Abstract

    An independent dominating set D of a graph G = (V,E) is a subset of vertices
    such that every vertex in V \ D has at least one neighbor in D and D is an
    independent set, i.e. no two vertices of D are adjacent in G. Finding a minimum
    independent dominating set in a graph is an NP-hard problem. Whereas it is hard
    to cope with this problem using parameterized and approximation algorithms,
    there is a simple exact O(1.4423^n)-time algorithm solving the problem by
    enumerating all maximal independent sets.

  165. A Quartic Kernel for Pathwidth-One Vertex Deletion.

    Authors: Geevarghese Philip, Venkatesh Raman, Yngve Villanger
    Subjects: Data Structures and Algorithms
    Abstract

    The pathwidth of a graph is a measure of how path-like the graph is. Given a
    graph G and an integer k, the problem of finding whether there exist at most k
    vertices in G whose deletion results in a graph of pathwidth at most one is NP-
    complete. We initiate the study of the parameterized complexity of this
    problem, parameterized by k.

  166. Extended h-Index Parameterized Data Structures for Computing Dynamic Subgraph Statistics.

    Authors: Michael T. Goodrich, David Eppstein, Darren Strash, Lowell Trott
    Subjects: Data Structures and Algorithms
    Abstract

    We present techniques for maintaining subgraph frequencies in a dynamic
    graph, using data structures that are parameterized in terms of h, the h-index
    of the graph. Our methods extend previous results of Eppstein and Spiro for
    maintaining statistics for undirected subgraphs of size three to directed
    subgraphs and to subgraphs of size four. For the directed case, we provide a
    data structure to maintain counts for all 3-vertex induced subgraphs in O(h)
    amortized time per update. For the undirected case, we maintain the counts of
    size-four subgraphs in O(h^2) amortized time per update.

  167. The Geometry of Scheduling.

    Authors: Nikhil Bansal, Kirk Pruhs
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the following general scheduling problem: The input consists of n
    jobs, each with an arbitrary release time, size, and a monotone function
    specifying the cost incurred when the job is completed at a particular time.
    The objective is to find a preemptive schedule of minimum aggregate cost. This
    problem formulation is general enough to include many natural scheduling
    objectives, such as weighted flow, weighted tardiness, and sum of flow squared.
    Our main result is a randomized polynomial-time algorithm with an approximation
    ratio O(log log nP), where P is the maximum job size.

  168. Evacuation of rectilinear polygons.

    Authors: Sandor P. Fekete, Chris Gray, Alexander Kroeller
    Subjects: Data Structures and Algorithms
    Abstract

    We investigate the problem of creating fast evacuation plans for buildings
    that are modeled as grid polygons, possibly containing exponentially many
    cells. We study this problem in two contexts: the ``confluent'' context in
    which the routes to exits remain fixed over time, and the ``non-confluent''
    context in which routes may change. Confluent evacuation plans are simpler to
    carry out, as they allocate contiguous regions to exits; non-confluent
    allocation can possibly create faster evacuation plans.

  169. Consecutive ones property testing: cut or swap.

    Authors: Mathieu Raffinot
    Subjects: Data Structures and Algorithms
    Abstract

    Let C be a finite set of $N elements and R = {R_1,R_2, ..,R_m} a family of M
    subsets of C. The family R verifies the consecutive ones property if there
    exists a permutation P of C such that each R_i in R is an interval of P. There
    already exist several algorithms to test this property in sum_{i=1}^m |R_i|
    time, all being involved. We present a simpler algorithm, based on a new
    partitioning scheme.

  170. Fully Automatic Trunk Packing with Free Placements.

    Authors: Ernst Althaus, Peter Hachenberger
    Subjects: Data Structures and Algorithms
    Abstract

    We present a new algorithm to compute the volume of a trunk according to the
    SAE J1100 standard. Our new algorithm uses state-of-the-art methods from
    computational geometry and from combinatorial optimization. It finds better
    solutions than previous approaches for small trunks.

  171. Prediction strategies without loss.

    Authors: Michael Kapralov, Rina Panigrahy
    Subjects: Data Structures and Algorithms
    Abstract

    Consider a sequence of bits where we are trying to predict the next bit from
    the previous bits. Assume we are allowed to say `predict 0' or `predict 1' or
    'no prediction'. We will say that for a sequence of bits the loss of a strategy
    is the number of wrong predictions minus the number of right predictions. In
    this paper we are interested in strategies that never have a (expected) loss
    over any string -- for example the strategy that always makes no prediction can
    never have a loss however it also never has a gain.

  172. An approximation algorithm for the total cover problem.

    Authors: Pooya Hatami
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce a $2$-approximation algorithm for the minimum total covering
    number problem.

  173. Polynomial-Time Approximation Schemes for Knapsack and Related Counting Problems using Branching Programs.

    Authors: Adam Klivans, Raghu Meka, Parikshit Gopalan
    Subjects: Data Structures and Algorithms
    Abstract

    We give a deterministic, polynomial-time algorithm for approximately counting
    the number of {0,1}-solutions to any instance of the knapsack problem. On an
    instance of length n with total weight W and accuracy parameter eps, our
    algorithm produces a (1 + eps)-multiplicative approximation in time poly(n,log
    W,1/eps).

  174. 2-FREE-FLOOD-IT is polynomial.

    Authors: Aurelie Lagoutte
    Subjects: Data Structures and Algorithms
    Abstract

    We study a discrete diffusion process introduced in some combinatorial games
    called FLOODIT and MADVIRUS that can be played online and whose computational
    complexity has been recently studied by Arthur et al (FUN'2010). The flooding
    dynamics used in those games can be defined for any colored graph.

  175. Minimum Entropy Combinatorial Optimization Problems.

    Authors: Jean Cardinal, Samuel Fiorini, Gwena&#xeb;l Joret
    Subjects: Data Structures and Algorithms
    Abstract

    We survey recent results on combinatorial optimization problems in which the
    objective function is the entropy of a discrete distribution. These include the
    minimum entropy set cover, minimum entropy orientation, and minimum entropy
    coloring problems.

  176. Runtime-Flexible Multi-dimensional Arrays and Views for C++98 and C++0x.

    Authors: Bjoern Andres, Ullrich Koethe, Thorben Kroeger, Fred A. Hamprecht
    Subjects: Data Structures and Algorithms
    Abstract

    Multi-dimensional arrays are among the most fundamental and most useful data
    structures of all. In C++, excellent template libraries exist for arrays whose
    dimension is fixed at runtime. Arrays whose dimension can change at runtime
    have been implemented in C. However, a generic object-oriented C++
    implementation of runtime-flexible arrays has so far been missing. In this
    article, we discuss our new implementation called Marray, a package of class
    templates that fills this gap. Marray is based on views as an underlying
    concept.

  177. Faster Radix Sort via Virtual Memory and Write-Combining.

    Authors: Peter Sanders, Jan Wassenberg
    Subjects: Data Structures and Algorithms
    Abstract

    Sorting algorithms are the deciding factor for the performance of common
    operations such as removal of duplicates or database sort-merge joins. This
    work focuses on 32-bit integer keys, optionally paired with a 32-bit value. We
    present a fast radix sorting algorithm that builds upon a
    microarchitecture-aware variant of counting sort. Taking advantage of virtual
    memory and making use of write-combining yields a per-pass throughput
    corresponding to at least 88 % of the system's peak memory bandwidth. Our
    implementation outperforms Intel's recently published radix sort by a factor of
    1.5.

  178. Fast Approximation Algorithms for Cut-based Problems in Undirected Graphs.

    Authors: Aleksander Madry
    Subjects: Data Structures and Algorithms
    Abstract

    We present a general method of designing fast approximation algorithms for
    cut-based minimization problems in undirected graphs. In particular, we develop
    a technique that given any such problem that can be approximated quickly on
    trees, allows approximating it almost as quickly on general graphs while only
    losing a poly-logarithmic factor in the approximation guarantee.

  179. A Deterministic Polynomial-time Approximation Scheme for Counting Knapsack Solutions.

    Authors: Eric Vigoda, Daniel Stefankovic, Santosh Vempala
    Subjects: Data Structures and Algorithms
    Abstract

    Given n elements with nonnegative integer weights w1,..., wn and an integer
    capacity C, we consider the counting version of the classic knapsack problem:
    find the number of distinct subsets whose weights add up to at most the given
    capacity. We give a deterministic algorithm that estimates the number of
    solutions to within relative error 1+-eps in time polynomial in n and 1/eps
    (fully polynomial approximation scheme). More precisely, our algorithm takes
    time O(n^3 (1/eps) log (n/eps)).

  180. Intrinsic Dimensionality.

    Authors: Vladimir Pestov
    Subjects: Data Structures and Algorithms
    Abstract

    This entry for the SIGSPATIL Special July 2010 issue on Similarity Searching
    in Metric Spaces discusses the notion of intrinsic dimensionality of data in
    the context of similarity search.

  181. Approximation Algorithms for Secondary Spectrum Auctions.

    Authors: Martin Hoefer, Thomas Kesselheim, Berthold V&#xf6;cking
    Subjects: Data Structures and Algorithms
    Abstract

    We study combinatorial auctions for the secondary spectrum market. In this
    market, short-term licenses shall be given to wireless nodes for communication
    in their local neighborhood. In contrast to the primary market, channels can be
    assigned to multiple bidders, provided that the corresponding devices are well
    separated such that the interference is sufficiently low.

  182. AntiJam: Efficient Medium Access despite Adaptive and Reactive Jamming.

    Authors: Andrea Richa, Christian Scheideler, Stefan Schmid, Jin Zhang
    Subjects: Data Structures and Algorithms
    Abstract

    Intentional interference constitutes a major threat for communication
    networks operating over a shared medium and where availability is imperative.
    Jamming attacks are often simple and cheap to implement. In particular, today's
    jammers can perform physical carrier sensing in order to disrupt communication
    more efficiently, specially in a network of simple wireless devices such as
    sensor nodes, which usually operate over a single frequency (or a limited
    frequency band) and which cannot benefit from the use of spread spectrum or
    other more advanced technologies.

  183. Finding Cycles and Trees in Sublinear Time.

    Authors: Asaf Shapira, Artur Czumaj, Oded Goldreich, Dana Ron, Christian Sohler
    Subjects: Data Structures and Algorithms
    Abstract

    We present sublinear-time (randomized) algorithms for finding simple cycles
    of length at least $k\geq 3$ and tree-minors in bounded-degree graphs. The
    complexity of these algorithms is related to the distance of the graph from
    being $C_k$-minor-free (resp., free from having the corresponding tree-minor).
    In particular, if the graph is far (i.e., $\Omega(1)$-far) {from} being
    cycle-free, i.e.

  184. On the (non-)existence of polynomial kernels for Pl-free edge modification problems.

    Authors: Christophe Paul, Anthony Perez, Sylvain Guillemot
    Subjects: Data Structures and Algorithms
    Abstract

    Given a graph G = (V,E) and an integer k, an edge modification problem for a
    graph property P consists in deciding whether there exists a set of edges F of
    size at most k such that the graph H = (V,E \vartriangle F) satisfies the
    property P. In the P edge-completion problem, the set F of edges is constrained
    to be disjoint from E; in the P edge-deletion problem, F is a subset of E; no
    constraint is imposed on F in the P edge-edition problem.

  185. Scheduling to Minimize Energy and Flow Time in Broadcast Scheduling.

    Authors: Benjamin Moseley
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we initiate the study of minimizing power consumption in the
    broadcast scheduling model. In this setting there is a wireless transmitter.
    Over time requests arrive at the transmitter for pages of information. Multiple
    requests may be for the same page. When a page is transmitted, all requests for
    that page receive the transmission simulteneously. The speed the transmitter
    sends data at can be dynamically scaled to conserve energy.

  186. Searching in Dynamic Catalogs on a Tree.

    Authors: Yakov Nekrich
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we consider the following modification of the iterative search
    problem. We are given a tree $T$, so that a dynamic catalog $C(v)$ is
    associated with every tree node $v$. For any $x$ and for any node-to-root path
    $\pi$ in $T$, we must find the predecessor of $x$ in $\cup_{v\in \pi} C(v)$. We
    present a linear space dynamic data structure that supports such queries in
    $O(t(n)+|\pi|)$ time, where $t(n)$ is the time needed to search in one catalog
    and $|\pi|$ denotes the number of nodes on path $\pi$.

  187. Lower Bounds on Query Complexity for Testing Bounded-Degree CSPs.

    Authors: Yuichi Yoshida
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we consider lower bounds on the query complexity for testing
    CSPs in the bounded-degree model.

  188. XML Reconstruction View Selection in XML Databases: Complexity Analysis and Approximation Scheme.

    Authors: Bin Fu, Artem Chebotko
    Subjects: Data Structures and Algorithms
    Abstract

    Query evaluation in an XML database requires reconstructing XML subtrees
    rooted at nodes found by an XML query. Since XML subtree reconstruction can be
    expensive, one approach to improve query response time is to use reconstruction
    views - materialized XML subtrees of an XML document, whose nodes are
    frequently accessed by XML queries. For this approach to be efficient, the
    principal requirement is a framework for view selection. In this work, we are
    the first to formalize and study the problem of XML reconstruction view
    selection.

  189. Ranking with Submodular Valuations.

    Authors: Iftah Gamzu, Yossi Azar
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of ranking with submodular valuations. An instance of
    this problem consists of a ground set $[m]$, and a collection of $n$ monotone
    submodular set functions $f^1, \ldots, f^n$, where each $f^i: 2^{[m]} \to R_+$.
    An additional ingredient of the input is a weight vector $w \in R_+^n$. The
    objective is to find a linear ordering of the ground set elements that
    minimizes the weighted cover time of the functions.

  190. Heapable Sequences and Subsequences.

    Authors: Brent Heeringa, Michael Mitzenmacher, Georgios Zervas, John Byers
    Subjects: Data Structures and Algorithms
    Abstract

    Let us call a sequence of numbers heapable if they can be sequentially
    inserted to form a binary tree with the heap property, where each insertion
    subsequent to the first occurs at a leaf of the tree, i.e. below a previously
    placed number. In this paper we consider a variety of problems related to
    heapable sequences and subsequences that do not appear to have been studied
    previously. Our motivation for introducing these concepts is two-fold. First,
    such problems correspond to natural extensions of the well-known secretary
    problem for hiring an organization with a hierarchical structure.

  191. Faster Replacement Paths.

    Authors: Virginia Vassilevska Williams
    Subjects: Data Structures and Algorithms
    Abstract

    The replacement paths problem for directed graphs is to find for given nodes
    s and t and every edge e on the shortest path between them, the shortest path
    between s and t which avoids e. For unweighted directed graphs on n vertices,
    the best known algorithm runtime was \tilde{O}(n^{2.5}) by Roditty and Zwick.
    For graphs with integer weights in {-M,...,M}, Weimann and Yuster recently
    showed that one can use fast matrix multiplication and solve the problem in
    O(Mn^{2.584}) time, a runtime which would be O(Mn^{2.33}) if the exponent
    \omega of matrix multiplication is 2.

  192. Complexity of Splits Reconstruction for Low-Degree Trees.

    Authors: Serge Gaspers, Mathieu Liedloff, Maya Stein, Karol Suchan
    Subjects: Data Structures and Algorithms
    Abstract

    Given a vertex-weighted tree T, the split of an edge xy in T is min(s_x, s_y)
    where s_x (respectively, s_y) is the sum of all weights of vertices that are
    closer to x than to y (respectively, closer to y than to x) in T. Given a set
    of weighted vertices V and a multiset of splits S, we consider the problem of
    constructing a tree on V whose splits correspond to S. The problem is known to
    be NP-complete even when all vertices have unit weight and the maximum vertex
    degree of T is required to be no more than 4.

  193. Matroid Secretary Problem in the Random Assignment Model.

    Authors: Jos&#xe9; A. Soto
    Subjects: Data Structures and Algorithms
    Abstract

    In the Matroid Secretary Problem, introduced by Babaioff et al. [SODA 2007],
    the elements of a given matroid are presented to an online algorithm in random
    order. When an element is revealed, the algorithm learns its weight and decides
    whether or not to select it under the restriction that the selected elements
    form an independent set in the matroid. The objective is to maximize the total
    weight of the chosen elements. In the most studied version of this problem, the
    algorithm has no information about the weights beforehand. We refer to this as
    the zero information model.

  194. Symmetric Submodular Function Minimization Under Hereditary Family Constraints.

    Authors: Michel X. Goemans, Jos&#xe9; A. Soto
    Subjects: Data Structures and Algorithms
    Abstract

    We present an efficient algorithm to find non-empty minimizers of a symmetric
    submodular function over any family of sets closed under inclusion. This for
    example includes families defined by a cardinality constraint, a knapsack
    constraint, a matroid independence constraint, or any combination of such
    constraints. Our algorithm make $O(n^3)$ oracle calls to the submodular
    function where $n$ is the cardinality of the ground set.

  195. Parameterizing by the Number of Numbers.

    Authors: Serge Gaspers, Michael R. Fellows, Frances A. Rosamond
    Subjects: Data Structures and Algorithms
    Abstract

    The usefulness of parameterized algorithmics has often depended on what
    Niedermeier has called, "the art of problem parameterization". In this paper we
    introduce and explore a novel but general form of parameterization: the number
    of numbers. Several classic numerical problems, such as Subset Sum, Partition,
    3-Partition, Numerical 3-Dimensional Matching, and Numerical Matching with
    Target Sums, have multisets of integers as input.

  196. Submodular Maximization by Simulated Annealing.

    Authors: Shayan Oveis Gharan, Jan Vondr&#xe1;k
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of maximizing a nonnegative (possibly non-monotone)
    submodular set function with or without constraints. Feige et al.

  197. A Fast Algorithm for Three-Dimensional Layers of Maxima Problem.

    Authors: Yakov Nekrich
    Subjects: Data Structures and Algorithms
    Abstract

    We show that the three-dimensional layers-of-maxima problem can be solved in
    $o(n\log n)$ time in the word RAM model. Our algorithm runs in $O(n(\log \log
    n)^3)$ time and uses $O(n)$ space. We also describe an algorithm that uses
    optimal $O(n)$ space and solves the three-dimensional layers-of-maxima problem
    in $O(n\log n)$ time in the pointer machine model.

  198. An Optimal Lower Bound for Buffer Management in Multi-Queue Switches.

    Authors: Marcin Bienkowski
    Subjects: Data Structures and Algorithms
    Abstract

    We consider a competitive analysis of the online packet buffering problem
    (also known as the unweighted FIFO variant of buffer management). In this
    scenario, we focus on a single network packet switching device, which has
    several input ports and one output port. This device forwards unit-size,
    unit-value packets from its input ports to the output port. The burstiness of
    the incoming traffic motivates the use of buffers of a certain size, attached
    to input ports, which can accumulate incoming packets and store them for later
    transmission.

  199. Quantum search by partial adiabatic evolution.

    Authors: Ying-Yu Zhang, Song-Feng Lu
    Subjects: Data Structures and Algorithms
    Abstract

    A quantum search algorithm based on the partial adiabatic
    evolution\cite{Tulsi2009} is provided. We calculate its time complexity by
    studying the Hamiltonian in a two-dimensional Hilbert space. It is found that
    the algorithm improves the time complexity, which is $O(\sqrt{N/M})$, of the
    local adiabatic search algorithm\cite{Roland2002}, to $O(\sqrt{N}/M)$.

  200. Flows in One-Crossing-Minor-Free Graphs.

    Authors: David Eppstein, Erin Chambers
    Subjects: Data Structures and Algorithms
    Abstract

    We study the maximum flow problem in directed H-minor-free graphs where H can
    be drawn in the plane with one crossing. If a structural decomposition of the
    graph as a clique-sum of planar graphs and graphs of constant complexity is
    given, we show that a maximum flow can be computed in O(n log n) time. In
    particular, maximum flows in directed K_{3,3}-minor-free graphs and directed
    K_5-minor-free graphs can be computed in O(n log n) time without additional
    assumptions.

  201. Online Vertex-Weighted Bipartite Matching and Single-bid Budgeted Allocations.

    Authors: Gagan Goel, Chinmay Karande, Gagan Aggarwal, Aranyak Mehta
    Subjects: Data Structures and Algorithms
    Abstract

    We study the following vertex-weighted online bipartite matching problem:
    $G(U, V, E)$ is a bipartite graph. The vertices in $U$ have weights and are
    known ahead of time, while the vertices in $V$ arrive online in an arbitrary
    order and have to be matched upon arrival. The goal is to maximize the sum of
    weights of the matched vertices in $U$. When all the weights are equal, this
    reduces to the classic \emph{online bipartite matching} problem for which Karp,
    Vazirani and Vazirani gave an optimal $\left(1-\frac{1}{e}\right)$-competitive
    algorithm in their seminal work~\cite{KVV90}.

  202. Efficient Sketches for the Set Query Problem.

    Authors: Eric Price
    Subjects: Data Structures and Algorithms
    Abstract

    We develop an algorithm for estimating the values of a vector x in R^n over a
    support S of size k from a randomized sparse binary linear sketch Ax of size
    O(k). Given Ax and S, we can recover x' with ||x' - x_S||_2 <= eps ||x -
    x_S||_2 with probability at least 1 - k^{-\Omega(1)}. The recovery takes O(k)
    time.

    While interesting in its own right, this primitive also has a number of
    applications. For example, we can:

  203. MapReduce Parallel Cuckoo Hashing and Oblivious RAM Simulations.

    Authors: Michael T. Goodrich, Michael Mitzenmacher
    Subjects: Data Structures and Algorithms
    Abstract

    We present an efficient algorithm for performing cuckoo hashing in the
    MapReduce parallel model of computation and we show how this result in turn
    leads to improved methods for performing data-oblivious RAM simulations.

  204. Narrow sieves for parameterized paths and packings.

    Authors: Andreas Bj&#xf6;rklund, Thore Husfeldt, Petteri Kaski, Mikko Koivisto
    Subjects: Data Structures and Algorithms
    Abstract

    We present randomized algorithms for some well-studied, hard combinatorial
    problems: the k-path problem, the p-packing of q-sets problem, and the
    q-dimensional p-matching problem. Our algorithms solve these problems with high
    probability in time exponential only in the parameter (k, p, q) and using
    polynomial space; the constant bases of the exponentials are significantly
    smaller than in previous works. For example, for the k-path problem the
    improvement is from 2 to 1.66.

  205. An Improved Neighbourhood for the Traveling Tournament Problem.

    Authors: Glenn Langford
    Subjects: Data Structures and Algorithms
    Abstract

    The Traveling Tournament Problem (TTP) is a challenging combinatorial
    optimization problem that has attracted the interest of researchers around the
    world. This paper proposes an improved search neighbourhood for the TTP that
    has been tested in a simulated annealing context. The neighbourhood encompasses
    both feasible and infeasible schedules, and can be generated efficiently.

  206. The Computational Complexity of Estimating Convergence Time.

    Authors: Elchanan Mossel, Nayantara Bhatnagar, Andrej Bogdanov
    Subjects: Data Structures and Algorithms
    Abstract

    An important problem in the implementation of Markov Chain Monte Carlo
    algorithms is to determine the convergence time, or the number of iterations
    before the chain is close to stationarity. For many Markov chains used in
    practice this time is not known. Even in cases where the convergence time is
    known to be polynomial, the theoretical bounds are often too crude to be
    practical. Thus, practitioners like to carry out some form of statistical
    analysis in order to assess convergence.

  207. Vertex Sparsifiers: New Results from Old Techniques.

    Authors: Kunal Talwar, Anupam Gupta, Robert Krauthgamer, Matthias Englert, Harald Raecke, Inbal Talgam
    Subjects: Data Structures and Algorithms
    Abstract

    Given a capacitated graph $G = (V,E)$ and a set of terminals $K \sse V$, how
    should we produce a graph $H$ only on the terminals $K$ so that every
    (multicommodity) flow between the terminals in $G$ could be supported in $H$
    with low congestion, and vice versa? (Such a graph $H$ is called a
    $flow-sparsifier$ for $G$.) What if we want $H$ to be a "simple" graph? What if
    we allow $H$ to be a convex combination of simple graphs?

  208. Algorithmic Solutions for Several Offline Constrained Resource Processing and Data Transfer Multicriteria Optimization Problems.

    Authors: Mugurel Ionut Andreica, Nicolae Tapus
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we present novel algorithmic solutions for several resource
    processing and data transfer multicriteria optimization problems. The results
    of most of the presented techniques are strategies which solve the considered
    problems (almost) optimally. Thus, the developed algorithms construct
    intelligent strategies which can be implemented by agents in specific
    situations. All the described solutions make use of the properties of the
    considered problems and, thus, they are not applicable to a very general class
    of problems.

  209. An Efficient Algorithm For Chinese Postman Walk on Bi-directed de Bruijn Graphs.

    Authors: Vamsi Kundeti, Sanguthevar Rajasekaran, Hieu Dinh
    Subjects: Data Structures and Algorithms
    Abstract

    Sequence assembly from short reads is an important problem in biology. It is
    known that solving the sequence assembly problem exactly on a bi-directed de
    Bruijn graph or a string graph is intractable. However finding a Shortest
    Double stranded DNA string (SDDNA) containing all the k-long words in the reads
    seems to be a good heuristic to get close to the original genome. This problem
    is equivalent to finding a cyclic Chinese Postman (CP) walk on the underlying
    un-weighted bi-directed de Bruijn graph built from the reads.

  210. A Method for Accelerating Conway's Doomsday Algorithm.

    Authors: Chamberlain Fong
    Subjects: Data Structures and Algorithms
    Abstract

    We propose a simplification of a key component in the Doomsday Algorithm for
    calculating the day-of-the-week of any given date.

  211. Prize-collecting Network Design on Planar Graphs.

    Authors: D&#xe1;niel Marx, MohammadHossein Bateni, MohammadTaghi Hajiaghayi
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we reduce Prize-Collecting Steiner TSP (PCTSP),
    Prize-Collecting Stroll (PCS), Prize-Collecting Steiner Tree (PCST),
    Prize-Collecting Steiner Forest (PCSF) and more generally Submodular
    Prize-Collecting Steiner Forest (SPCSF) on planar graphs (and more generally
    bounded-genus graphs) to the same problems on graphs of bounded treewidth.

  212. Prize-Collecting Steiner Tree and Forest in Planar Graphs.

    Authors: Nitish Korula, Chandra Chekuri, Alina Ene
    Subjects: Data Structures and Algorithms
    Abstract

    We obtain polynomial-time approximation-preserving reductions (up to a factor
    of 1 + \epsilon) from the prize-collecting Steiner tree and prize-collecting
    Steiner forest problems in planar graphs to the corresponding problems in
    graphs of bounded treewidth. We also give an exact algorithm for the
    prize-collecting Steiner tree problem that runs in polynomial time for graphs
    of bounded treewidth.

  213. Better size estimation for sparse matrix products.

    Authors: Andrea Campagna, Rasmus Pagh, Rasmus Resen Amossen
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of doing fast and reliable estimation of the number
    of non-zero entries in a sparse boolean matrix product. This problem has
    applications in databases and computer algebra. Let n denote the total number
    of non-zero entries in the input matrices. We show how to compute a 1 +-
    epsilon approximation (with small probability of error) in expected time O(n)
    for any epsilon > 4/\sqrt[4]{n}. The previously best estimation algorithm, due
    to Cohen (JCSS 1997), uses time O(n/epsilon^2).

  214. Should Static Search Trees Ever Be Unbalanced?.

    Authors: Prosenjit Bose, Karim Dou&#xef;eb
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we study the question of whether or not a static search tree
    should ever be unbalanced. We present several methods to restructure an
    unbalanced k-ary search tree $T$ into a new tree $R$ that preserves many of the
    properties of $T$ while having a height of $\log_k n +1$ which is one unit off
    of the optimal height. More specifically, we show that it is possible to ensure
    that the depth of the elements in $R$ is no more than their depth in $T$ plus
    at most $\log_k \log_k n +2$.

  215. Complexity dichotomy on partial grid recognition.

    Authors: Vin&#xed;cius G. P. de S&#xe1;, Guilherme D. da Fonseca, Raphael Machado, Celina M. H. de Figueiredo
    Subjects: Data Structures and Algorithms
    Abstract

    Deciding whether a graph can be embedded in a grid using only unit-length
    edges is NP-complete, even when restricted to binary trees. However, it is not
    difficult to devise a number of graph classes for which the problem is
    polynomial, even trivial. A natural step, outstanding thus far, was to provide
    a broad classification of graphs that make for polynomial or NP-complete
    instances. We provide such a classification based on the set of allowed vertex
    degrees in the input graphs, yielding a full dichotomy on the complexity of the
    problem.

  216. Quasirandom Load Balancing.

    Authors: Tobias Friedrich, Thomas Sauerwald, Martin Gairing
    Subjects: Data Structures and Algorithms
    Abstract

    We propose a simple distributed algorithm for balancing indivisible tokens on
    graphs. The algorithm is completely deterministic, though it tries to imitate
    (and enhance) a random algorithm by keeping the accumulated rounding errors as
    small as possible.

  217. Smoothed Analysis of Balancing Networks.

    Authors: Tobias Friedrich, Thomas Sauerwald, Dan Vilenchik
    Subjects: Data Structures and Algorithms
    Abstract

    In a balancing network each processor has an initial collection of unit-size
    jobs (tokens) and in each round, pairs of processors connected by balancers
    split their load as evenly as possible. An excess token (if any) is placed
    according to some predefined rule. As it turns out, this rule crucially affects
    the performance of the network. In this work we propose a model that studies
    this effect. We suggest a model bridging the uniformly-random assignment rule,
    and the arbitrary one (in the spirit of smoothed-analysis).

  218. Algebraic characterisation of one-way patterns.

    Authors: Vedran Dunjko, Elham Kashefi
    Subjects: Data Structures and Algorithms
    Abstract

    We give a complete structural characterisation of the map the positive branch
    of a one-way pattern implements. We start with the representation of the
    positive branch in terms of the phase map decomposition, which is then further
    analysed to obtain the primary structure of the matrix M, representing the
    phase map decomposition in the computational basis. Using this approach we
    obtain some preliminary results on the connection between the columns structure
    of a given unitary and the angles of measurements in a pattern that implements
    it.

  219. I/O Efficient Algorithms for Matrix Computations.

    Authors: Sraban Kumar Mohanty
    Subjects: Data Structures and Algorithms
    Abstract

    We analyse some QR decomposition algorithms, and show that the I/O complexity
    of the tile based algorithm is asymptotically the same as that of matrix
    multiplication. This algorithm, we show, performs the best when the tile size
    is chosen so that exactly one tile fits in the main memory. We propose a
    constant factor improvement, as well as a new recursive cache oblivious
    algorithm with the same asymptotic I/O complexity.

  220. On the Insertion Time of Cuckoo Hashing.

    Authors: Konstantinos Panagiotou, Angelika Steger, Nikolaos Fountoulakis
    Subjects: Data Structures and Algorithms
    Abstract

    Cuckoo hashing is an efficient technique for creating large hash tables with
    high space utilization and guaranteed constant access times. There, each item
    can be placed in a location given by any one out of k different hash functions.
    In this paper we investigate further the random walk heuristic for inserting in
    an online fashion new items into the hash table.

  221. Tight and simple Web graph compression.

    Authors: Szymon Grabowski, Wojciech Bieniecki
    Subjects: Data Structures and Algorithms
    Abstract

    Analysing Web graphs has applications in determining page ranks, fighting Web
    spam, detecting communities and mirror sites, and more. This study is however
    hampered by the necessity of storing a major part of huge graphs in the
    external memory, which prevents efficient random access to edge (hyperlink)
    lists.

  222. Reconstruction of Causal Networks by Set Covering.

    Authors: Nick Fyson, Tijl De Bie, Nello Cristianini
    Subjects: Data Structures and Algorithms
    Abstract

    We present a method for the reconstruction of networks, based on the order of
    nodes visited by a stochastic branching process. Our algorithm reconstructs a
    network of minimal size that ensures consistency with the data. Crucially, we
    show that global consistency with the data can be achieved through purely local
    considerations, inferring the neighbourhood of each node in turn. The
    optimisation problem solved for each individual node can be reduced to a Set
    Covering Problem, which is known to be NP-hard but can be approximated well in
    practice.

  223. Inferring Networks of Diffusion and Influence.

    Authors: Andreas Krause, Jure Leskovec, Manuel Gomez-Rodriguez
    Subjects: Data Structures and Algorithms
    Abstract

    Information diffusion and virus propagation are fundamental processes talking
    place in networks. While it is often possible to directly observe when nodes
    become infected, observing individual transmissions (i.e., who infects whom or
    who influences whom) is typically very difficult. Furthermore, in many
    applications, the underlying network over which the diffusions and propagations
    spread is actually unobserved. We tackle these challenges by developing a
    method for tracing paths of diffusion and influence through networks and
    inferring the networks over which contagions propagate.

  224. Bidimensionality and EPTAS.

    Authors: Venkatesh Raman, Saket Saurabh, Fedor V. Fomin, Daniel Lokshtanov
    Subjects: Data Structures and Algorithms
    Abstract

    Bidimensionality theory is a powerful framework for the development of
    metaalgorithmic techniques. It was introduced by Demaine et al. as a tool to
    obtain sub-exponential time parameterized algorithms for problems on H-minor
    free graphs. Demaine and Hajiaghayi extended the theory to obtain PTASs for
    bidimensional problems, and subsequently improved these results to EPTASs.
    Fomin et. al related the theory to the existence of linear kernels for
    parameterized problems.

  225. Almost Optimal Unrestricted Fast Johnson-Lindenstrauss Transform.

    Authors: Nir Ailon, Edo Liberty
    Subjects: Data Structures and Algorithms
    Abstract

    The problems of random projections and sparse reconstruction have much in
    common and individually received much attention. Surprisingly, until now they
    progressed in parallel and remained mostly separate. Here, we employ new tools
    from probability in Banach spaces that were successfully used in the context of
    sparse reconstruction to advance on an open problem in random pojection. In
    particular, we generalize and use an intricate result by Rudelson and Vershynin
    for sparse reconstruction which uses Dudley's theorem for bounding Gaussian
    processes.

  226. Local Search Algorithms for the Generalized Traveling Salesman Problem.

    Authors: Gregory Gutin, Daniel Karapetyan
    Subjects: Data Structures and Algorithms
    Abstract

    The Generalized Traveling Salesman Problem (GTSP) is a well-known
    combinatorial optimization problem with a host of applications. It is an
    extension of the Traveling Salesman Problem (TSP) where the set of cities is
    divided into so-called clusters, and the salesman have to visit each cluster
    exactly once.

  227. Succinct Representations of Dynamic Strings.

    Authors: J. Ian Munro, Meng He
    Subjects: Data Structures and Algorithms
    Abstract

    The rank and select operations over a string of length n from an alphabet of
    size $\sigma$ have been used widely in the design of succinct data structures.
    In many applications, the string itself need be maintained dynamically,
    allowing characters of the string to be inserted and deleted. Under the word
    RAM model with word size $w=\Omega(\lg n)$, we design a succinct representation
    of dynamic strings using $nH_0 + o(n)\lg\sigma + O(w)$ bits to support rank,
    select, insert and delete in $O(\frac{\lg n}{\lg\lg n}(\frac{\lg \sigma}{\lg\lg
    n}+1))$ time. When the alphabet size is small, i.e.

  228. Scheduling Packets with Values and Deadlines in Size-bounded Buffers.

    Authors: Fei Li
    Subjects: Data Structures and Algorithms
    Abstract

    Motivated by providing quality-of-service differentiated services in the
    Internet, we consider buffer management algorithms for network switches. We
    study a multi-buffer model. A network switch consists of multiple size-bounded
    buffers such that at any time, the number of packets residing in each
    individual buffer cannot exceed its capacity. Packets arrive at the network
    switch over time; they have values, deadlines, and designated buffers. In each
    time step, at most one pending packet is allowed to be sent and this packet can
    be from any buffer.

  229. Low Rank Matrix-Valued Chernoff Bounds and Applications.

    Authors: Anastasios Zouzias, Avner Magen
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we develop algorithms for approximating matrix multiplication
    with respect to the spectral norm. Let A\in{\RR^{n\times m}} and B\in{\RR^{n
    \times p}} be two matrices and \eps>0. We approximate the product A^\top B
    using two down-sampled sketches, \tilde{A}\in{\RR^{t\times m}} and
    \tilde{B}\in{\RR^{t\times p}}, such that \norm{\tilde{A}^\top \tilde{B} -
    A^\top B} \leq \eps \norm{A}\norm{B} with constant probability. We use two
    different sampling procedures for constructing \tilde{A} and \tilde{B}; one of
    them is done by i.i.d.

  230. A Simple Approach to Error Reconciliation in Quantum Key Distribution.

    Authors: Richard P. Brent
    Subjects: Data Structures and Algorithms
    Abstract

    We discuss the error reconciliation phase in quantum key distribution (QKD)
    and analyse a simple scheme in which blocks with bad parity (that is, blocks
    containing an odd number of errors) are discarded. We predict the performance
    of this scheme and show, using a simulation, that the prediction is accurate.

  231. Estimating small moments of data stream in nearly optimal space-time.

    Authors: Sumit Ganguly
    Subjects: Data Structures and Algorithms
    Abstract

    For each $p \in (0,2]$, we present a randomized algorithm that returns an
    $\epsilon$-approximation of the $p$th frequency moment of a data stream $F_p =
    \sum_{i = 1}^n \abs{f_i}^p$. The algorithm requires space $O(\epsilon^{-2} \log
    (mM)(\log n))$ and processes each stream update using time $O((\log n) (\log
    \epsilon^{-1}))$. It is nearly optimal in terms of space (lower bound
    $O(\epsilon^{-2} \log (mM))$ as well as time and is the first algorithm with
    these properties.

  232. Estimating small frequency moments of data stream: a characteristic function approach.

    Authors: Purushottam Kar, Sumit Ganguly
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of estimating the first moment of a data stream
    defined as $F_1 = \sum_{i \in \{1,2, ..., n\}} \abs{f_i}$ to within $1\pm
    \epsilon$-relative error with high probability. Several algorithms are
    well-known for this problem including the median estimator over $p$-stable
    sketches by Indyk \cite{indy:focs00}, the geometric means estimator over
    $p$-stable sketches by Li \cite{pinglib:2006} and the \Hss sketch based
    algorithm in \cite{gc:random07}.

  233. Round-Trip Voronoi Diagrams and Doubling Density in Geographic Networks.

    Authors: Michael T. Goodrich, Matthew T. Dickerson, Thomas D. Dickerson
    Subjects: Data Structures and Algorithms
    Abstract

    The round-trip distance function on a geographic network (such as a road
    network, flight network, or utility distribution grid) defines the "distance"
    from a single vertex to a pair of vertices as the minimum length tour visiting
    all three vertices and ending at the starting vertex. Given a geographic
    network and a subset of its vertices called "sites" (for example a road network
    with a list of grocery stores), a two-site round-trip Voronoi diagram labels
    each vertex in the network with the pair of sites that minimizes the round-trip
    distance from that vertex.

  234. Maximum flow is approximable by deterministic constant-time algorithm in sparse networks.

    Authors: Endre Cs&#xf3;ka
    Subjects: Data Structures and Algorithms
    Abstract

    We show a deterministic constant-time parallel algorithm for finding an
    almost maximum flow in multisource-multitarget networks with bounded degrees
    and bounded edge capacities. As a consequence, we show that the value of the
    maximum flow over the number of nodes is a testable parameter on these
    networks.

  235. Graph Sparsification by Edge-Connectivity and Random Spanning Trees.

    Authors: Nicholas J. A. Harvey, Wai Shing Fung
    Subjects: Data Structures and Algorithms
    Abstract

    We present two new approaches to constructing graph sparsifiers - weighted
    subgraphs for which every cut has the same value as the original graph, up to a
    factor of $(1 \pm \epsilon)$. The first approach independently samples each
    edge $uv$ with probability inversely proportional to the edge-connectivity
    between $u$ and $v$. The fact that this approach produces a sparsifier resolves
    an open question of Bencz\'ur and Karger. Concurrent work of Hariharan and
    Panigrahi (2010) also resolves this question. The second approach samples
    uniformly random spanning trees.

  236. On the (Im)possibility of Preserving Utility and Privacy in Personalized Social Recommendations.

    Authors: Atish Das Sarma, Ashwin Machanavajjhala, Aleksandra Korolova
    Subjects: Data Structures and Algorithms
    Abstract

    With the recent surge of social networks like Facebook, new forms of
    recommendations have become possible -- personalized recommendations of ads,
    content, and even new social and product connections based on one's social
    interactions. In this paper, we study whether "social recommendations", or
    recommendations that utilize a user's social network, can be made without
    disclosing sensitive links between users. More precisely, we quantify the loss
    in utility when existing recommendation algorithms are modified to satisfy a
    strong notion of privacy called differential privacy.

  237. Parallel algorithms in linear algebra.

    Authors: Richard P. Brent
    Subjects: Data Structures and Algorithms
    Abstract

    This report provides an introduction to algorithms for fundamental linear
    algebra problems on various parallel computer architectures, with the emphasis
    on distributed-memory MIMD machines. To illustrate the basic concepts and key
    issues, we consider the problem of parallel solution of a nonsingular linear
    system by Gaussian elimination with partial pivoting. This problem has come to
    be regarded as a benchmark for the performance of parallel machines.

  238. The stubborn problem is stubborn no more (a polynomial algorithm for 3-compatible colouring and the stubborn list partition problem).

    Authors: Marek Cygan, Marcin Pilipczuk, Jakub Onufry Wojtaszczyk, Michal Pilipczuk
    Subjects: Data Structures and Algorithms
    Abstract

    One of the driving problems in the CSP area is the Dichotomy Conjecture,
    formulated in 1993 by Feder and Vardi [STOC'93]: for any fixed relational
    structure G the Constraint Satisfaction Problem CSP(G) is either in P or
    NP-complete. A large amount of research has gone into checking various specific
    cases of this conjecture. One such variant which attracted a lot of attention
    in the recent years is the LIST MATRIX PARTITION (LMP) problem. In 2004 Cameron
    et al. [SODA'04] classified almost all LMP variants for matrices of size at
    most four.

  239. Bandwidth and Distortion Revisited.

    Authors: Marek Cygan, Marcin Pilipczuk
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we merge recent developments on exact algorithms for finding an
    ordering of vertices of a given graph that minimizes bandwidth (the BANDWIDTH
    problem) and for finding an embedding of a given graph into a line that
    minimizes distortion (the DISTORTION problem). For both problems we develop
    algorithms that work in O(9.363^n) time and polynomial space. For BANDWIDTH,
    this improves O^*(10^n) algorithm by Feige and Kilian from 2000, for DISTORTION
    this is the first polynomial space exact algorithm that works in O(c^n) time we
    are aware of.

  240. Modern Computer Arithmetic (version 0.5.1).

    Authors: Richard P. Brent, Paul Zimmermann
    Subjects: Data Structures and Algorithms
    Abstract

    This is a draft of a book about algorithms for performing arithmetic, and
    their implementation on modern computers. We are concerned with software more
    than hardware - we do not cover computer architecture or the design of computer
    hardware. Instead we focus on algorithms for efficiently performing arithmetic
    operations such as addition, multiplication and division, and their connections
    to topics such as modular arithmetic, greatest common divisors, the Fast
    Fourier Transform (FFT), and the computation of elementary and special
    functions.

  241. Optimal Data Placement on Networks With Constant Number of Clients.

    Authors: Eric Angel, Evripidis Bampis, Gerasimos G. Pollatos, Vassilis Zissimopoulos
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce optimal algorithms for the problems of data placement (DP) and
    page placement (PP) in networks with a constant number of clients each of which
    has limited storage availability and issues requests for data objects. The
    objective for both problems is to efficiently utilize each client's storage
    (deciding where to place replicas of objects) so that the total incurred access
    and installation cost over all clients is minimized. In the PP problem an extra
    constraint on the maximum number of clients served by a single client must be
    satisfied.

  242. A General Framework for Graph Sparsification.

    Authors: Ramesh Hariharan, Debmalya Panigrahi
    Subjects: Data Structures and Algorithms
    Abstract

    Given a weighted graph $G$ and an error parameter $\epsilon > 0$, the {\em
    graph sparsification} problem requires sampling edges in $G$ and giving the
    sampled edges appropriate weights to obtain a sparse graph $G_{\epsilon}$
    (containing O(n\log n) edges in expectation) with the following property: the
    weight of every cut in $G_{\epsilon}$ is within a factor of $(1\pm \epsilon)$
    of the weight of the corresponding cut in $G$.

  243. Efficient volume sampling for row/column subset selection.

    Authors: Amit Deshpande, Luis Rademacher
    Subjects: Data Structures and Algorithms
    Abstract

    We give efficient algorithms for volume sampling, i.e., for picking
    $k$-subsets of the rows of any given matrix with probabilities proportional to
    the squared volumes of the simplices defined by them and the origin (or the
    squared volumes of the parallelepipeds defined by these subsets of rows). This
    solves an open problem from the monograph on spectral algorithms by Kannan and
    Vempala.

  244. Some linear-time algorithms for systolic arrays.

    Authors: Richard P. Brent, Franklin T. Luk, H. T. Kung
    Subjects: Data Structures and Algorithms
    Abstract

    We survey some results on linear-time algorithms for systolic arrays. In
    particular, we show how the greatest common divisor (GCD) of two polynomials of
    degree n over a finite field can be computed in time O(n) on a linear systolic
    array of O(n) cells; similarly for the GCD of two n-bit binary numbers. We show
    how n * n Toeplitz systems of linear equations can be solved in time O(n) on a
    linear array of O(n) cells, each of which has constant memory size (independent
    of n).

  245. Approximating the minimum directed tree cover.

    Authors: Viet Hung Nguyen
    Subjects: Data Structures and Algorithms
    Abstract

    Given a directed graph $G$ with non negative cost on the arcs, a directed
    tree cover of $G$ is a directed tree such that either head or tail (or both of
    them) of every arc in $G$ is touched by $T$. The minimum directed tree cover
    problem (DTCP) is to find a directed tree cover of minimum cost. The problem is
    known to be $NP$-hard. In this paper, we show that the weighted Set Cover
    Problem (SCP) is a special case of DTCP. Hence, one can expect at best to
    approximate DTCP with the same factor as for SCP.

  246. A Polynomial Time Algorithm for Hamilton Cycle and Its Proof.

    Authors: Lizhi Du
    Subjects: Data Structures and Algorithms
    Abstract

    We present a polynomial time algorithm for finding a Hamilton Cycle(Path) in
    an undirected graph and proves its correctness. A program is developed
    according to this algorithm and it works very well. This paper declares the
    algorithm, its proof, and the experiment data. Even only our experiment data is
    a breakthrough.

  247. On Practical Algorithms for Entropy Estimation and the Improved Sample Complexity of Compressed Counting.

    Authors: Ping Li
    Subjects: Data Structures and Algorithms
    Abstract

    Estimating the p-th frequency moment of data stream is a very heavily studied
    problem. The problem is actually trivial when p = 1, assuming the strict
    Turnstile model. The sample complexity of our proposed algorithm is essentially
    O(1) near p=1. This is a very large improvement over the previously believed
    O(1/eps^2) bound. The proposed algorithm makes the long-standing problem of
    entropy estimation an easy task, as verified by the experiments included in the
    appendix.

  248. Uses of randomness in computation.

    Authors: Richard P. Brent
    Subjects: Data Structures and Algorithms
    Abstract

    Random number generators are widely used in practical algorithms. Examples
    include simulation, number theory (primality testing and integer
    factorization), fault tolerance, routing, cryptography, optimization by
    simulated annealing, and perfect hashing. Complexity theory usually considers
    the worst-case behaviour of deterministic algorithms, but it can also consider
    average-case behaviour if it is assumed that the input data is drawn randomly
    from a given distribution.

  249. Faster Algorithms for Semi-Matching Problems.

    Authors: Danupon Nanongkai, Jittat Fakcharoenphol, Bundit Lekhanukit
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of finding \textit{semi-matching} in bipartite graphs
    which is also extensively studied under various names in the scheduling
    literature. We give faster algorithms for both weighted and unweighted case.

    For the weighted case, we give an $O(nm\log n)$-time algorithm, where $n$ is
    the number of vertices and $m$ is the number of edges, by exploiting the
    geometric structure of the problem. This improves the classical $O(n^3)$
    algorithms by Horn [Operations Research 1973] and Bruno, Coffman and Sethi
    [Communications of the ACM 1974].

  250. Faster Approximation Schemes and Parameterized Algorithms on H-Minor-Free and Odd-Minor-Free Graphs.

    Authors: Siamak Tazari
    Subjects: Data Structures and Algorithms
    Abstract

    We improve the running time of the general algorithmic technique known as
    Baker's approach (1994) on H-minor-free graphs from O(n^{f(|H|)}) to O(f(|H|)
    n^{O(1)}). The numerous applications include e.g. a 2-approximation for
    coloring and PTASes for various problems such as dominating set and max-cut,
    where we obtain similar improvements.

  251. Empirical Comparison of Algorithms for Network Community Detection.

    Authors: Jure Leskovec, Kevin J. Lang, Michael W. Mahoney
    Subjects: Data Structures and Algorithms
    Abstract

    Detecting clusters or communities in large real-world graphs such as large
    social or information networks is a problem of considerable interest. In
    practice, one typically chooses an objective function that captures the
    intuition of a network cluster as set of nodes with better internal
    connectivity than external connectivity, and then one applies approximation
    algorithms or heuristics to extract sets of nodes that are related to the
    objective function and that "look like" good communities for the application of
    interest.

  252. Fast normal random number generators on vector processors.

    Authors: Richard P. Brent
    Subjects: Data Structures and Algorithms
    Abstract

    We consider pseudo-random number generators suitable for vector processors.
    In particular, we describe vectorised implementations of the Box-Muller and
    Polar methods, and show that they give good performance on the Fujitsu VP2200.
    We also consider some other popular methods, e.g. the Ratio method of Kinderman
    and Monahan (1977) (as improved by Leva (1992)), and the method of Von Neumann
    and Forsythe, and show why they are unlikely to be competitive with the Polar
    method on vector processors.

  253. A fast vectorised implementation of Wallace's normal random number generator.

    Authors: Richard P. Brent
    Subjects: Data Structures and Algorithms
    Abstract

    Wallace has proposed a new class of pseudo-random generators for normal
    variates. These generators do not require a stream of uniform pseudo-random
    numbers, except for initialisation. The inner loops are essentially
    matrix-vector multiplications and are very suitable for implementation on
    vector processors or vector/parallel processors such as the Fujitsu VPP300.

  254. Some long-period random number generators using shifts and xors.

    Authors: Richard P. Brent
    Subjects: Data Structures and Algorithms
    Abstract

    Marsaglia recently introduced a class of xorshift random number generators
    (RNGs) with periods 2n-1 for n = 32, 64, etc. Here we give a generalisation of
    Marsaglia's xorshift generators in order to obtain fast and high-quality RNGs
    with extremely long periods. RNGs based on primitive trinomials may be
    unsatisfactory because a trinomial has very small weight. In contrast, our
    generators can be chosen so that their minimal polynomials have large weight
    (number of nonzero terms). A computer search using Magma has found good
    generators for n a power of two up to 4096.

  255. Streaming Graph Computations with a Helpful Advisor.

    Authors: Michael Mitzenmacher, Graham Cormode, Justin Thaler
    Subjects: Data Structures and Algorithms
    Abstract

    Motivated by the trend to outsource work to commercial cloud computing
    services, we consider a variation of the streaming paradigm where a streaming
    algorithm can be assisted by a powerful helper that can provide annotations to
    the data stream. We extend previous work on such {\em annotation models} by
    considering a number of graph streaming problems.

  256. One Tree Suffices: A Simultaneous O(1)-Approximation for Single-Sink Buy-at-Bulk.

    Authors: Ashish Goel, Ian Post
    Subjects: Data Structures and Algorithms
    Abstract

    We study the single-sink buy-at-bulk problem with an unknown cost function.
    We want to route flow from a set of demand nodes to a root node, where the cost
    of routing x total flow along an edge is proportional to f(x) for some concave,
    non-decreasing function f satisfying f(0)=0. We present a simple, fast,
    deterministic, combinatorial algorithm that takes a set of demands and
    constructs a single tree T such that for all f the cost f(T) is a
    49.48-approximation of the optimal cost for that f.

  257. On the Continuous CNN Problem.

    Authors: Nick Gravin, John Augustine
    Subjects: Data Structures and Algorithms
    Abstract

    In the (discrete) CNN problem, online requests appear as points in $\R^2$.
    Each request must be served before the next one is revealed. We have a server
    that can serve a request simply by aligning either its $x$ or $y$ coordinate
    with the request. The goal of the online algorithm is to minimize the total
    $L_1$ distance traveled by the server to serve all the requests.

  258. An O(M(n) log n) algorithm for the Jacobi symbol.

    Authors: Richard P. Brent, Paul Zimmermann
    Subjects: Data Structures and Algorithms
    Abstract

    The best known algorithm to compute the Jacobi symbol of two n-bit integers
    runs in time O(M(n) log n), using Sch\"onhage's fast continued fraction
    algorithm combined with an identity due to Gauss. We give a different O(M(n)
    log n) algorithm based on the binary recursive gcd algorithm of Stehl\'e and
    Zimmermann. Our implementation - which to our knowledge is the first to run in
    time O(M(n) log n) - is faster than GMP's quadratic implementation for inputs
    larger than about 10000 decimal digits.

  259. On Feedback Vertex Set, New Measure and New Structures.

    Authors: Yixin Cao, Jianer Chen, Yang Liu
    Subjects: Data Structures and Algorithms
    Abstract

    We study the parameterized complexity of the {\sc feedback vertex set}
    problem ({\sc fvs}) on undirected graphs. We approach the problem by
    considering a variation of it, the {\sc disjoint feedback vertex set} problem
    ({\sc disjoint-fvs}), which finds a disjoint feedback vertex set of size $k$
    when a feedback vertex set of a graph is given.

  260. Feasibility Analysis of Sporadic Real-Time Multiprocessor Task Systems.

    Authors: Vincenzo Bonifaci, Alberto Marchetti-Spaccamela
    Subjects: Data Structures and Algorithms
    Abstract

    We give the first algorithm for testing the feasibility of a system of
    sporadic real-time tasks on a set of identical processors, solving one major
    open problem in the area of multiprocessor real-time scheduling [S.K. Baruah
    and K. Pruhs, Journal of Scheduling, 2009]. We also investigate the related
    notion of schedulability and a notion that we call online feasibility. Finally,
    we show that discrete-time schedules are as powerful as continuous-time
    schedules, which answers another open question in the above mentioned survey.

  261. Clustering with Spectral Norm and the k-means Algorithm.

    Authors: Ravindran Kannan, Amit Kumar
    Subjects: Data Structures and Algorithms
    Abstract

    There has been much progress on efficient algorithms for clustering data
    points generated by a mixture of $k$ probability distributions under the
    assumption that the means of the distributions are well-separated, i.e., the
    distance between the means of any two distributions is at least $\Omega(k)$
    standard deviations. These results generally make heavy use of the generative
    model and particular properties of the distributions. In this paper, we show
    that a simple clustering algorithm works without assuming any generative
    (probabilistic) model.

  262. All Ternary Permutation Constraint Satisfaction Problems Parameterized Above Average Have Polynomial Kernels.

    Authors: Gregory Gutin, Leo van Iersel, Matthias Mnich, Anders Yeo
    Subjects: Data Structures and Algorithms
    Abstract

    A ternary Permutation-CSP is specified by a subset $\Pi$ of the symmetric
    group $\mathcal S_3$. An instance of such a problem consists of a set of
    variables $V$ and a multiset of constraints, which are ordered triples of
    distinct variables of $V.$ The objective is to find a linear ordering $\alpha$
    of $V$ that maximizes the number of triples whose ordering (under $\alpha$)
    follows a permutation in $\Pi$. We prove that all ternary Permutation-CSPs
    parameterized above average have polynomial-size kernels.

  263. Backyard Cuckoo Hashing: Constant Worst-Case Operations with a Succinct Representation.

    Authors: Yuriy Arbitman, Moni Naor, Gil Segev
    Subjects: Data Structures and Algorithms
    Abstract

    The performance of a dynamic dictionary is measured mainly by its update
    time, lookup time, and space consumption. In terms of update time and lookup
    time there are known constructions that guarantee constant-time operations in
    the worst case with high probability, and in terms of space consumption there
    are known constructions that use essentially optimal space. However, although
    the first analysis of a dynamic dictionary dates back more than 45 years ago
    (when Knuth analyzed linear probing in 1963), the trade-off between these
    aspects of performance is still not completely understood.

  264. Unified Compression-Based Acceleration of Edit-Distance Computation.

    Authors: Oren Weimann, Danny Hermelin, Gad M. Landau, Shir Landau
    Subjects: Data Structures and Algorithms
    Abstract

    The edit distance problem is a classical fundamental problem in computer
    science in general, and in combinatorial pattern matching in particular. The
    standard dynamic programming solution for this problem computes the
    edit-distance between a pair of strings of total length O(N) in O(N^2) time. To
    this date, this quadratic upper-bound has never been substantially improved for
    general strings. However, there are known techniques for breaking this bound in
    case the strings are known to compress well under a particular compression
    scheme.

  265. A Deterministic Algorithm for the Vertex Connectivity Survivable Network Design Problem.

    Authors: Pushkar Tripathi
    Subjects: Data Structures and Algorithms
    Abstract

    In the vertex connectivity survivable network design problem we are given an
    undirected graph G = (V,E) and connectivity requirement r(u,v) for each pair of
    vertices u,v. We are also given a cost function on the set of edges. Our goal
    is to find the minimum cost subset of edges such that for every pair (u,v) of
    vertices we have r(u,v) vertex disjoint paths in the graph induced by the
    chosen edges. Recently, Chuzhoy and Khanna presented a randomized algorithm
    that achieves a factor of O(k^3 log n) for this problem where k is the maximum
    connectivity requirement.

  266. Spectral Methods for Matrices and Tensors.

    Authors: Ravindran Kannan
    Subjects: Data Structures and Algorithms
    Abstract

    While Spectral Methods have long been used for Principal Component Analysis,
    this survey focusses on work over the last 15 years with three salient
    features: (i) Spectral methods are useful not only for numerical problems, but
    also discrete optimization problems (Constraint Optimization Problems - CSP's)
    like the max. cut problem and similar mathematical considerations underlie both
    areas. (ii) Spectral methods can be extended to tensors.

  267. Relaxation-based coarsening and multiscale graph organization.

    Authors: Ilya Safro, Dorit Ron, Achi Brandt
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we generalize and improve the multiscale organization of graphs
    by introducing a new measure that quantifies the "closeness" between two nodes.
    The calculation of the measure is linear in the number of edges in the graph
    and involves just a small number of relaxation sweeps. A similar notion of
    distance is then calculated and used at each coarser level. We demonstrate the
    use of this measure in multiscale methods for several important combinatorial
    optimization problems and discuss the multiscale graph organization.

  268. Range Quantile Queries: Another Virtue of Wavelet Trees.

    Authors: Travis Gagie, Simon J. Puglisi, Andrew Turpin
    Subjects: Data Structures and Algorithms
    Abstract

    We show how to use a balanced wavelet tree as a data structure that stores a
    list of numbers and supports efficient {\em range quantile queries}. A range
    quantile query takes a rank and the endpoints of a sublist and returns the
    number with that rank in that sublist. For example, if the rank is half the
    sublist's length, then the query returns the sublist's median. We also show how
    these queries can be used to support space-efficient {\em coloured range
    reporting} and {\em document listing}.

  269. Strong Fault-Tolerance for Self-Assembly with Fuzzy Temperature.

    Authors: David Doty, Robert T. Schweller, Matthew J. Patitz, Scott M. Summers, Dustin Reishus
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of fault-tolerance in nanoscale algorithmic
    self-assembly. We employ a variant of Winfree's abstract Tile Assembly Model
    (aTAM), the two-handed aTAM, in which square "tiles" -- a model of molecules
    constructed from DNA for the purpose of engineering self-assembled
    nanostructures -- aggregate according to specific binding sites of varying
    strengths, and in which large aggregations of tiles may attach to each other,
    in contrast to the seeded aTAM, in which tiles aggregate one at a time to a
    single specially-designated "seed" assembly.

  270. The Graph Traversal Pattern.

    Authors: Marko A. Rodriguez, Peter Neubauer
    Subjects: Data Structures and Algorithms
    Abstract

    A graph is a structure composed of a set of vertices (i.e.nodes, dots)
    connected to one another by a set of edges (i.e.links, lines). The concept of a
    graph has been around since the late 19$^\text{th}$ century, however, only in
    recent decades has there been a strong resurgence in both theoretical and
    applied graph research in mathematics, physics, and computer science. In
    applied computing, since the late 1960s, the interlinked table structure of the
    relational database has been the predominant information storage and retrieval
    model.

  271. On building minimal automaton for subset matching queries.

    Authors: Kimmo Fredriksson
    Subjects: Data Structures and Algorithms
    Abstract

    We address the problem of building an index for a set $D$ of $n$ strings,
    where each string location is a subset of some finite integer alphabet, so that
    we can answer efficiently if a given simple query string (where each string
    location is a single symbol) $p$ occurs in the set. That is, we need to
    efficiently find a string $d \in D$ such that $p[i] \in d[i]$ for every $i$. We
    show how to build such index in $O(n^{\log_{\sigma/\Delta}(\sigma)}\log(n))$
    average time and space, where $\Delta$ is the average size of the subsets.

  272. Permuted Common Supersequence.

    Authors: Alexandru Popa, Rapha&#xeb;l Clifford, Zvi Gotthilf, Moshe Lewenstein
    Subjects: Data Structures and Algorithms
    Abstract

    The Shortest Common Supersequence (\textit{SCS}) is a well studied problem
    having a wide range of applications. In this paper we first introduce a new
    problem closely related to the SCS, denoted as the \textit{PCS} problem. Given
    a set $L$ of $k$ strings, $s_1$, $s_2$,..., $s_k$, and a text $t$, we look for
    a permutation $\pi(t)$ of the text $t$, such that as many input strings of the
    set $L$ are subsequences of $\pi(t)$.

  273. Linear-Number-of-Variables Kernel for Unit-Conflict-Free-Max-Sat Parameterized Above Expectation.

    Authors: R. Crowston, G. Gutin, M. Jones, A. Yeo
    Subjects: Data Structures and Algorithms
    Abstract

    A formula $F$ in conjunctive normal form (CNF) is unit-conflict free (UCF) if
    for no variable $x$ both clauses, $(x)$ and $(\bar{x})$ appear in $F$. It is
    well-known that for a UCF CNF with $n$ variables and $m$ clauses, it is always
    possible to satisfy at least $p\cdot m$ clauses, where $p=(\sqrt{5}-1)/2$. The
    parameterized problem UCF-Max-Sat-AE asks whether for a given UCF CNF formula
    it is possible to satisfy at least $p\cdot m + k$ clauses, where integer $k$ is
    the parameter. Mahajan and Raman (J.

  274. Faster Approximation Schemes for Fractional Multicommodity Flow Problems via Dynamic Graph Algorithms.

    Authors: Aleksander Madry
    Subjects: Data Structures and Algorithms
    Abstract

    We combine the work of Garg and Konemann, and Fleischer with ideas from
    dynamic graph algorithms to obtain faster (1-eps)-approximation schemes for
    various versions of the multicommodity flow problem. In particular, if eps is
    moderately small and the size of every number used in the input instance is
    polynomially bounded, the running times of our algorithms match - up to
    poly-logarithmic factors and some provably optimal terms - the Omega(mn)
    flow-decomposition barrier for single-commodity flow.

  275. Lin-Kernighan Heuristic Adaptation for the Generalized Traveling Salesman Problem.

    Authors: Gregory Gutin, Daniel Karapetyan
    Subjects: Data Structures and Algorithms
    Abstract

    Lin-Kernighan heuristic is known to be one of the most successful heuristics
    for the Traveling Salesman Problem (TSP). It has also proven its efficiency in
    application to some other problems. However, it was never applied to the the
    Generalized Traveling Salesman Problem (GTSP) though it has the same nature as
    TSP. In this paper we discuss possible adaptations of TSP heuristics for GTSP
    and focus on the case of Lin-Kernighan algorithm. At first, we provide an
    easy-to-understand description of the original Lin-Kernighan heuristic.

  276. Angle Tree: Nearest Neighbor Search in High Dimensions with Low Intrinsic Dimensionality.

    Authors: Ilia Zvedeniouk, Sanjay Chawla
    Subjects: Data Structures and Algorithms
    Abstract

    We propose an extension of tree-based space-partitioning indexing structures
    for data with low intrinsic dimensionality embedded in a high dimensional
    space. We call this extension an Angle Tree. Our extension can be applied to
    both classical kd-trees as well as the more recent rp-trees. The key idea of
    our approach is to store the angle (the "dihedral angle") between the data
    region (which is a low dimensional manifold) and the random hyperplane that
    splits the region (the "splitter").

  277. A Note on Integer Factorization Using Lattices.

    Authors: Antonio Ignacio Vera
    Subjects: Data Structures and Algorithms
    Abstract

    We revisit Schnorr's lattice-based integer factorization algorithm, now with
    an effective point of view. We present effective versions of Theorem 2 of
    Schnorr's "Factoring integers and computing discrete logarithms via diophantine
    approximation" paper, as well as new elementary properties of the Prime Number
    Lattice bases of Schnorr and Adleman.

  278. Approximate Dynamic Programming for Fast Denoising of aCGH Data.

    Authors: Charalampos E. Tsourakakis, Gary L. Miller, Richard Peng, Russell Schwartz
    Subjects: Data Structures and Algorithms
    Abstract

    DNA sequence copy number is the number of copies of DNA at a region of a
    genome. Identifying genomic regions whose DNA copy number deviates from the
    normal is a crucial task in understanding cancer evolution. Array-based
    comparative genomic hybridization (aCGH) is a high-throughput technique for
    identifying DNA gain or loss. Due to the high level of noise in microarray
    data, however, interpretation of aCGH output is a difficult and error-prone
    task.

  279. A New Approach to Population Sizing for Memetic Algorithms: A Case Study for the Multidimensional Assignment Problem.

    Authors: Gregory Gutin, Daniel Karapetyan
    Subjects: Data Structures and Algorithms
    Abstract

    Memetic Algorithms are known to be a powerful technique in solving hard
    optimization problems. To design a memetic algorithm one needs to make a host
    of decisions; selecting a population size is one of the most important among
    them. Most algorithms in the literature fix the population size to a certain
    constant value. This reduces the algorithm's quality since the optimal
    population size varies for different instances, local search procedures and
    running times. In this paper we propose an adjustable population size.

  280. Graph Iterators: Decoupling Graph Structures from Algorithms.

    Authors: Marco Nissen
    Subjects: Data Structures and Algorithms
    Abstract

    I will present a way to implement graph algorithms which is different from
    traditional methods. This work was motivated by the belief that some ideas from
    software engineering should be applied to graph algorithms. Re-usability of
    software is an important and difficult problem in general, and this is
    particularly true for graph algorithms. The scientific literature demonstrates
    plenty of applications of graph algorithms as subroutines for other algorithms.
    Moreover, many practical problems from various domains may be modeled as graph
    problems and hence solved by means of graph algorithms.

  281. Phase Synchronization in Railway Timetables.

    Authors: Matthias M&#xfc;ller-Hannemann, Christoph Fretter, Lachezar Krumov, Karsten Weihe, Marc-Thorsten H&#xfc;tt
    Subjects: Data Structures and Algorithms
    Abstract

    Timetable construction belongs to the most important optimization problems in
    public transport. Finding optimal or near-optimal timetables under the
    subsidiary conditions of minimizing travel times and other criteria is a
    targeted contribution to the functioning of public transport. In addition to
    efficiency (given, e.g., by minimal average travel times), a significant
    feature of a timetable is its robustness against delay propagation.

  282. Simple heuristics for the assembly line worker assignment and balancing problem.

    Authors: Mayron C&#xe9;sar O. Moreira, Alysson M. Costa, Marcus Ritt, Antonio A. Chaves
    Subjects: Data Structures and Algorithms
    Abstract

    We propose simple heuristics for the assembly line worker assignment and
    balancing problem. This problem typically occurs in assembly lines in sheltered
    work centers for the disabled and differs from the classical simple assembly
    line balancing problem in the fact that task execution times vary according to
    the assigned worker. We develop a constructive heuristic based on task
    assignment priority rules defining the order the tasks should be assigned to
    the workstations.

  283. Exponential Lower Bounds For Policy Iteration.

    Authors: John Fearnley
    Subjects: Data Structures and Algorithms
    Abstract

    We study policy iteration for infinite-horizon Markov decision processes. It
    has recently been shown policy iteration style algorithms have exponential
    lower bounds in a two player game setting. We extend these lower bounds to
    Markov decision processes with the total reward and average-reward optimality
    criteria.

  284. The Distribution and Deposition Algorithm for Multiple Sequences Sets.

    Authors: Kang Ning, Hon Wai Leong
    Subjects: Data Structures and Algorithms
    Abstract

    Sequences set is a mathematical model used in many applications. As the
    number of the sequences becomes larger, single sequence set model is not
    appropriate for the rapidly increasing problem sizes. For example, more and
    more text processing applications separate a single big text file into multiple
    files before processing. For these applications, the underline mathematical
    model is multiple sequences sets (MSS). Though there is increasing use of MSS,
    there is little research on how to process MSS efficiently.

  285. On the Border Length Minimization Problem (BLMP) on a Square Array.

    Authors: Vamsi Kundeti, Sanguthevar Rajasekaran, Hieu Dinh
    Subjects: Data Structures and Algorithms
    Abstract

    Protein/Peptide microarrays are rapidly gaining momentum in the diagnosis of
    cancer. High-density and highthroughput peptide arrays are being extensively
    used to detect tumor biomarkers, examine kinase activity, identify antibodies
    having low serum titers and locate antibody signatures. Improving the yield of
    microarray fabrication involves solving a hard combinatorial optimization
    problem called the Border Length Minimization Problem (BLMP). An important
    question that remained open for the past seven years is if the BLMP is
    tractable or not.

  286. Approaching optimality for solving SDD systems.

    Authors: Ioannis Koutis, Gary L. Miller, Richard Peng
    Subjects: Data Structures and Algorithms
    Abstract

    We present an algorithm that on input a graph $G$ with $n$ vertices and
    $m+n-1$ edges and a value $k$, produces an {\em incremental sparsifier}
    $\hat{G}$ with $n-1 + m/k$ edges, such that the condition number of $G$ with
    $\hat{G}$ is bounded above by $\tilde{O}(k\log^2 n)$, with probability $1-p$.
    The algorithm runs in time

    $$\tilde{O}((m \log{n} + n\log^2{n})\log(1/p)).$$

  287. On Generalizations of Network Design Problems with Degree Bounds.

    Authors: Viswanath Nagarajan, Nikhil Bansal, Rohit Khandekar, Jochen Konemann, Britta Peis
    Subjects: Data Structures and Algorithms
    Abstract

    Iterative rounding and relaxation have arguably become the method of choice
    in dealing with unconstrained and constrained network design problems. In this
    paper we extend the scope of the iterative relaxation method in two directions:
    (1) by handling more complex degree constraints in the minimum spanning tree
    problem (namely, laminar crossing spanning tree), and (2) by incorporating
    `degree bounds' in other combinatorial optimization problems such as matroid
    intersection and lattice polyhedra.

  288. Efficient Parallel and Out of Core Algorithms for Constructing Large Bi-directed de Bruijn Graphs.

    Authors: Vamsi Kundeti, Sanguthevar Rajasekaran, Hieu Dinh
    Subjects: Data Structures and Algorithms
    Abstract

    Assembling genomic sequences from a set of overlapping reads is one of the
    most fundamental problems in computational biology. Algorithms addressing the
    assembly problem fall into two broad categories -- based on the data structures
    which they employ. The first class uses an overlap/string graph and the second
    type uses a de Bruijn graph. However with the recent advances in short read
    sequencing technology, de Bruijn graph based algorithms seem to play a vital
    role in practice.

  289. Minimum Spanning Tree on Spatio-Temporal Networks.

    Authors: Arnab Bhattacharya, Viswanath Gunturi, Shashi Shekhar
    Subjects: Data Structures and Algorithms
    Abstract

    Given a spatio-temporal network (ST network) whose edge properties vary with
    time, a time-sub-interval minimum spanning tree (TSMST) is a collection of
    distinct minimum spanning trees of the ST network. The TSMST computation
    problem aims to identify a collection of distinct minimum spanning trees and
    their respective time-sub-intervals under the constraint that the edge weight
    functions are piecewise linear. This is an important problem in ST network
    application domains such as wireless sensor networks (e.g., energy efficient
    routing).

  290. Parameter-Free Deterministic Global Search with Central Force Optimization.

    Authors: Richard A. Formato
    Subjects: Data Structures and Algorithms
    Abstract

    This note describes a parameter-free implementation of Central Force
    Optimization for deterministic multidimensional search and optimization. The
    user supplies only one input: the objective function to be maximized, nothing
    more. The CFO equations of motion are simplified by assigning specific values
    to CFO's basic parameters, and this particular algorithmic implementation also
    includes hardwired internal parameters so that none is user-specified.

  291. Approximation Algorithms for Optimal Decision Trees and Adaptive TSP Problems.

    Authors: Anupam Gupta, Viswanath Nagarajan, R. Ravi, Ravishankar Krishnaswamy
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of constructing optimal decision trees: given a
    collection of tests which can disambiguate between a set of $m$ possible
    diseases, each test having a cost, and the a-priori likelihood of the patient
    having any particular disease, what is a good adaptive strategy to perform
    these tests to minimize the expected cost to identify the disease? We settle
    the approximability of this problem by giving a tight $O(\log m)$-approximation
    algorithm. We also consider a more substantial generalization, the Adaptive TSP
    problem.

  292. Multi-dimensional Boltzmann Sampling of context-free Languages.

    Authors: Olivier Bodini, Yann Ponty
    Subjects: Data Structures and Algorithms
    Abstract

    This paper addresses the uniform random generation of words from a
    context-free language (over an alphabet of size $k$), while constraining every
    letter to a targeted frequency of occurrence. Our approach consists in an
    extended -- multidimensional -- version of the classic Boltzmann samplers
    \cite{Duchon2004}.

  293. An O(loglog n)-Competitive Binary Search Tree with Optimal Worst-Case Access Times.

    Authors: Vida Dujmovic, Prosenjit Bose, Karim Dou&#xef;eb, Rolf Fagerberg
    Subjects: Data Structures and Algorithms
    Abstract

    We present the zipper tree, an $O(\log \log n)$-competitive online binary
    search tree that performs each access in $O(\log n)$ worst-case time. This
    shows that for binary search trees, optimal worst-case access time and
    near-optimal amortized access time can be guaranteed simultaneously.

  294. When LP is the Cure for Your Matching Woes: Approximating Stochastic Matchings.

    Authors: Anupam Gupta, Viswanath Nagarajan, Atri Rudra, Nikhil Bansal
    Subjects: Data Structures and Algorithms
    Abstract

    Consider a random graph model where each possible edge $e$ is present
    independently with some probability $p_e$. We are just given these numbers
    $p_e$, and want to build a large/heavy matching in the randomly generated
    graph. However, the only way we can find out whether an edge is present or not
    is to query it--and if the edge is indeed present in the graph, we are forced
    to add it to our matching. Further, each vertex $i$ is allowed to be queried at
    most $t_i$ times. How should we adaptively query the edges to maximize the
    expected weight of the matching?

  295. Multidimensional Divide-and-Conquer and Weighted Digital Sums.

    Authors: Y. K. Cheung, Philippe Flajolet, Mordecai Golin, C. Y. James Lee
    Subjects: Data Structures and Algorithms
    Abstract

    This paper studies three types of functions arising separately in the
    analysis of algorithms that we analyze exactly using similar Mellin transform
    techniques. The first is the solution to a Multidimensional Divide-and-Conquer
    (MDC) recurrence that arises when solving problems on points in $d$-dimensional
    space. The second involves weighted digital sums. Write $n$ in its binary
    representation $n=(b_i b_{i-1}... b_1 b_0)_2$ and set $S_M(n) = \sum_{t=0}^i
    t^{\bar{M}} b_t 2^t$. We analyze the average $TS_M(n) = \frac{1}{n}\sum_{j<n}
    S_M(j)$.

  296. Mergeable Dictionaries.

    Authors: John Iacono, &#xd6;zg&#xfc;r &#xd6;zkan
    Subjects: Data Structures and Algorithms
    Abstract

    A data structure is presented for the Mergeable Dictionary abstract data
    type, which supports the following operations on a collection of disjoint sets
    of totally ordered data: Predecessor-Search, Split and Merge. While
    Predecessor-Search and Split work in the normal way, the novel operation is
    Merge. While in a typical mergeable dictionary (e.g. 2-4 Trees), the Merge
    operation can only be performed on sets that span disjoint intervals in
    keyspace, the structure here has no such limitation, and permits the merging of
    arbitrarily interleaved sets.

  297. Defining and Computing Alternative Routes in Road Networks.

    Authors: Peter Sanders, Robert Geisberger, Jonathan Dees, Roland Bader
    Subjects: Data Structures and Algorithms
    Abstract

    Every human likes choices. But today's fast route planning algorithms usually
    compute just a single route between source and target. There are beginnings to
    compute alternative routes, but this topic has not been studied thoroughly.
    Often, the aspect of meaningful alternative routes is neglected from a human
    point of view. We fill in this gap by suggesting mathematical definitions for
    such routes. As a second contribution we propose heuristics to compute them, as
    this is NP-hard in general.

  298. A weakly universal cellular automaton in the hyperbolic 3D space with three states.

    Authors: Margenstern Maurice
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we significantly improve a previous result by the same author
    showing the existence of a weakly universal cellular automaton with five states
    living in the hyperbolic 3D-space. Here, we get such a cellular automaton with
    three states only.

  299. Deciding the finiteness of the number of simple permutations contained in a wreath-closed class is polynomial.

    Authors: Fr&#xe9;d&#xe9;rique Bassino, Mathilde Bouvel, Adeline Pierrot, Dominique Rossin
    Subjects: Data Structures and Algorithms
    Abstract

    We present an algorithm running in time O(n ln n) which decides if a
    wreath-closed permutation class Av(B) given by its finite basis B contains a
    finite number of simple permutations. The method we use is based on an article
    of Brignall, Ruskuc and Vatter which presents a decision procedure (of high
    complexity) for solving this question, without the assumption that Av(B) is
    wreath-closed.

  300. Multi-Objective Geometric Programming Problem Being Cost Coefficients as Continous Function with Weighted Mean Method.

    Authors: A. K. Ojha, A.K. Das
    Subjects: Data Structures and Algorithms
    Abstract

    Geometric programming problems occur frequently in engineering design and
    management. In multiobjective optimization, the trade-off information between
    different objective functions is probably the most important piece of
    information in a solution process to reach the most preferred solution. In this
    paper we have discussed the basic concepts and principles of multiple objective
    optimization problems and developed a solution procedure to solve this
    optimization problem where the cost coefficients are continuous functions using
    weighted method to obtain the non-inferior solutions.

  301. Improved NSGA-II Based on a Novel Ranking Scheme.

    Authors: Rio G. L. D&#x27;Souza, K. Chandra Sekaran, A. Kandasamy
    Subjects: Data Structures and Algorithms
    Abstract

    Non-dominated Sorting Genetic Algorithm (NSGA) has established itself as a
    benchmark algorithm for Multiobjective Optimization. The determination of
    pareto-optimal solutions is the key to its success. However the basic algorithm
    suffers from a high order of complexity, which renders it less useful for
    practical applications. Among the variants of NSGA, several attempts have been
    made to reduce the complexity. Though successful in reducing the runtime
    complexity, there is scope for further improvements, especially considering
    that the populations involved are frequently of large size.

  302. On constant factor approximation for earth mover distance over doubling metrics.

    Authors: Shi Li
    Subjects: Data Structures and Algorithms
    Abstract

    Given a metric space $(X,d_X)$, the earth mover distance between two
    distributions over $X$ is defined as the minimum cost of a bipartite matching
    between the two distributions. The doubling dimension of a metric $(X, d_X)$ is
    the smallest value $\alpha$ such that every ball in $X$ can be covered by
    $2^\alpha$ ball of half the radius. We study efficient algorithms for
    approximating earth mover distance over metrics with bounded doubling
    dimension.

  303. Improved bounds for stochastic matching.

    Authors: Jian Li, Juli&#xe1;n Mestre
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we study stochastic matching problems that are motivated by
    applications in online dating and kidney exchange programs. We consider two
    probing models: edge probing and matching probing.

    Our main result is an algorithm that finds a matching-probing strategy
    attaining a small constant approximation ratio. An interesting aspect of our
    approach is that we compare the cost our solution to the best edge-probing
    strategy. Thus, we indirectly show that the best matching-probing strategy is
    only a constant factor away from the best edge-probing strategy.

  304. On Cut Dimension of $\ell_1$ Metrics and Volumes, and Related Sparsification Techniques.

    Authors: Ilan Newman, Yuri Rabinovich
    Subjects: Data Structures and Algorithms
    Abstract

    We show that for any $\epsilon >0$ and any $\ell_1$ metric on $n$ points $d$
    there is a metric $d'$ that approximates $d$ by at most $1+ \epsilon$ and $d'$
    can be written as a non-negative combination of $O(n/\e^2)$ cuts metrics. This
    implies that any $\ell_1$ metric on $n$ points can be embedded into
    $O(n/\e^2)$-dimensional $\ell_1$-space, with distortion bounded by $1+ \e$.

    We then show that the methods used have a much wider range of applicability,
    and develop basic tools to analyze their performance in the general settings.

  305. Range Reporting for Moving Points on a Grid.

    Authors: Marek Karpinski, Yakov Nekrich, J. Ian Munro
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we describe a new data structure that supports orthogonal range
    reporting queries on a set of points that move along linear trajectories on a
    $U\times U$ grid. The assumption that points lie on a $U\times U$ grid enables
    us to significantly decrease the query time in comparison to the standard
    kinetic model. Our data structure answers queries in $O(\sqrt{\log U/\log \log
    U}+k)$ time, where $k$ denotes the number of points in the answer. The above
    improves over the $\Omega(\log n)$ lower bound that is valid in the
    infinite-precision kinetic model.

  306. Comparative Results: Group Search Optimizer and Central Force Optimization.

    Authors: Richard A. Formato
    Subjects: Data Structures and Algorithms
    Abstract

    This note compares the performance of two multidimensional search and
    optimization algorithms: Group Search Optimizer and Central Force Optimization.
    GSO is a new state-of-the-art algorithm that has gained some notoriety,
    consequently providing an excellent yardstick for measuring the performance of
    other algorithms. CFO is a novel deterministic metaheuristic that has performed
    well against GSO in previous tests. The CFO implementation reported here
    includes architectural improvements in errant probe retrieval and decision
    space adaptation that result in even better performance.

  307. Optimal Prefix Codes with Fewer Distinct Codeword Lengths are Easier to Construct.

    Authors: Ahmed Belal, Amr Elmasry
    Subjects: Data Structures and Algorithms
    Abstract

    A new method for constructing minimum-redundancy prefix codes is described.
    This method does not explicitly build a Huffman tree; instead it uses a
    property of optimal codes to find the codeword length of each weight. The
    running time of the algorithm is shown to be $O(n k)$, which is asymptotically
    faster than Huffman's algorithm when $k=o(\log{n})$, where $n$ is the number of
    weights and $k$ is the number of distinct codeword lengths.

  308. A Sublogarithmic Approximation for Highway and Tollbooth Pricing.

    Authors: Iftah Gamzu, Danny Segev
    Subjects: Data Structures and Algorithms
    Abstract

    An instance of the tollbooth problem consists of an undirected network and a
    collection of single-minded customers, each of which is interested in
    purchasing a fixed path subject to an individual budget constraint. The
    objective is to assign a per-unit price to each edge in a way that maximizes
    the collective revenue obtained from all customers. The revenue generated by
    any customer is equal to the overall price of the edges in her desired path,
    when this cost falls within her budget; otherwise, that customer will not
    purchase any edge.

  309. Optimization with More than One Budget.

    Authors: Fabrizio Grandoni, Rico Zenklusen
    Subjects: Data Structures and Algorithms
    Abstract

    A natural way to deal with multiple, partially conflicting objectives is
    turning all the objectives but one into budget constraints. Some classical
    polynomial-time optimization problems, such as spanning tree and forest,
    shortest path, (perfect) matching, independent set (basis) in a matroid or in
    the intersection of two matroids, become NP-hard even with one budget
    constraint. Still, for most of these problems deterministic and randomized
    polynomial-time approximation schemes are known.

  310. Approximability of Sparse Integer Programs.

    Authors: Deeparnab Chakrabarty, David Pritchard
    Subjects: Data Structures and Algorithms
    Abstract

    The main focus of this paper is a pair of new approximation algorithms for
    certain integer programs. First, for covering integer programs {min cx: Ax >=
    b, 0 <= x <= d} where A has at most k nonzeroes per row, we give a
    k-approximation algorithm. (We assume A, b, c, d are nonnegative.) For any k >=
    2 and eps>0, if P != NP this ratio cannot be improved to k-1-eps, and under the
    unique games conjecture this ratio cannot be improved to k-eps.

  311. Estimating and Sampling Graphs with Multidimensional Random Walks.

    Authors: Bruno Ribeiro, Don Towsley
    Subjects: Data Structures and Algorithms
    Abstract

    Estimating characteristics of large graphs via sampling is a vital part of
    the study of complex networks. Current sampling methods such as (independent)
    random vertex and random walks are useful but have drawbacks. Random vertex
    sampling may require too many resources (time, bandwidth, or money). Random
    walks, which normally require fewer resources per sample, can suffer from large
    estimation errors in the presence of disconnected or quasi-disconnected
    subgraphs. In this work we propose a new $m$-dimensional random walk that uses
    $m$ dependent random walkers.

  312. Mod/Resc Parsimony Inference.

    Authors: Danny Hermelin, Igor Nor, Sylvain Charlat, Jan Engelstadter, Max Reuter, Olivier Duron, Marie-France Sagot
    Subjects: Data Structures and Algorithms
    Abstract

    We address in this paper a new computational biology problem that aims at
    understanding a mechanism that could potentially be used to genetically
    manipulate natural insect populations infected by inherited, intra-cellular
    parasitic bacteria. In this problem, that we denote by \textsc{Mod/Resc
    Parsimony Inference}, we are given a boolean matrix and the goal is to find two
    other boolean matrices with a minimum number of columns such that an
    appropriately defined operation on these matrices gives back the input.

  313. Heuristic Contraction Hierarchies with Approximation Guarantee.

    Authors: Robert Geisberger
    Subjects: Data Structures and Algorithms
    Abstract

    We present a new heuristic point-to-point routing algorithm based on
    contraction hierarchies (CH). Given an epsilon >= 0, we can prove that the
    length of the path computed by our algorithm is at most (1+epsilon) times the
    length of the optimal (shortest) path. CH is based on node contraction:
    removing nodes from a network and adding shortcut edges to preserve shortest
    path distances. Our algorithm tries to avoid shortcuts even when a replacement
    path is epsilon times longer.

  314. Treewidth reduction for constrained separation and bipartization problems.

    Authors: Barry O&#x27;Sullivan, D&#xe1;niel Marx, Igor Razgon
    Subjects: Data Structures and Algorithms
    Abstract

    We present a method for reducing the treewidth of a graph while preserving
    all the minimal $s-t$ separators. This technique turns out to be very useful
    for establishing the fixed-parameter tractability of constrained separation and
    bipartization problems. To demonstrate the power of this technique, we prove
    the fixed-parameter tractability of a number of well-known separation and
    bipartization problems with various additional restrictions (e.g., the vertices
    being removed from the graph form an independent set).

  315. Dynamic sharing of a multiple access channel.

    Authors: Marcin Bienkowski, Marek Klonowski, Miroslaw Korzeniowski, Dariusz R. Kowalski
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we consider the mutual exclusion problem on a multiple access
    channel. Mutual exclusion is one of the fundamental problems in distributed
    computing. In the classic version of this problem, n processes perform a
    concurrent program which occasionally triggers some of them to use shared
    resources, such as memory, communication channel, device, etc. The goal is to
    design a distributed algorithm to control entries and exits to/from the shared
    resource in such a way that in any time there is at most one process accessing
    it.

  316. Minimum and maximum against k lies.

    Authors: Ji&#x159;&#xed; Matou&#x161;ek, Yoshio Okamoto, Michael Hoffmann, Philipp Zumstein
    Subjects: Data Structures and Algorithms
    Abstract

    A neat 1972 result of Pohl asserts that [3n/2]-2 comparisons are sufficient,
    and also necessary in the worst case, for finding both the minimum and the
    maximum of an n-element totally ordered set. The set is accessed via an oracle
    for pairwise comparisons. More recently, the problem has been studied in the
    context of the Renyi-Ulam liar games, where the oracle may give up to k false
    answers. For large k, an upper bound due to Aigner shows that (k+O(\sqrt{k}))n
    comparisons suffice. We improve on this by providing an algorithm with at most
    (k+1+C)n+O(k^3) comparisons for some constant C.

  317. An Optimal Algorithm for the Indirect Covering Subtree Problem.

    Authors: Joachim Spoerhase
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the indirect covering subtree problem (Kim et al., 1996). The
    input is an edge weighted tree graph along with customers located at the nodes.
    Each customer is associated with a radius and a penalty. The goal is to locate
    a tree-shaped facility such that the sum of setup and penalty cost is
    minimized. The setup cost equals the sum of edge lengths taken by the facility
    and the penalty cost is the sum of penalties of all customers whose distance to
    the facility exceeds their radius.

  318. Online Stochastic Ad Allocation: Efficiency and Fairness.

    Authors: Monika Henzinger, Jon Feldman, Nitish Korula, Vahab S. Mirrokni, Cliff Stein
    Subjects: Data Structures and Algorithms
    Abstract

    We study the efficiency and fairness of online stochastic display ad
    allocation algorithms from a theoretical and practical standpoint. In
    particular, we study the problem of maximizing efficiency in the presence of
    stochastic information. In this setting, each advertiser has a maximum demand
    for impressions of display ads that will arrive online. In our model, inspired
    by the concept of free disposal in economics, we assume that impressions that
    are given to an advertiser above her demand are given to her for free.

  319. Grammar-Based Compression in a Streaming Model.

    Authors: Travis Gagie, Pawel Gawrychowski
    Subjects: Data Structures and Algorithms
    Abstract

    We show that, given a string $s$ of length $n$, with constant memory and
    logarithmic passes over a constant number of streams we can build a
    context-free grammar that generates $s$ and only $s$ and whose size is within
    an $\Oh{\min (g \log g, \sqrt{n \log g})}$-factor of the minimum $g$. This
    stands in contrast to our previous result that, with polylogarithmic memory and
    polylogarithmic passes over a single stream, we cannot build such a grammar
    whose size is within any polynomial of $g$.

  320. The Highest Expected Reward Decoding for HMMs with Application to Recombination Detection.

    Authors: Michal N&#xe1;n&#xe1;si, Tom&#xe1;&#x161; Vina&#x159;, Bro&#x148;a Brejov&#xe1;
    Subjects: Data Structures and Algorithms
    Abstract

    Hidden Markov models are traditionally decoded by the Viterbi algorithm which
    finds the highest probability state path in the model. In recent years, several
    limitations of the Viterbi decoding have been demonstrated, and new algorithms
    have been developed to address them
    \citep{Kall2005,Brejova2007,Gross2007,Brown2010}.

  321. The Complexity of Flood Filling Games.

    Authors: Ashley Montanaro, David Arthur, Raphael Clifford, Markus Jalsenius, Benjamin Sach
    Subjects: Data Structures and Algorithms
    Abstract

    We study the complexity of the popular one player combinatorial game known as
    Flood-It. In this game the player is given an n by n board of tiles where each
    tile is allocated one of c colours. The goal is to make the colours of all
    tiles equal via the shortest possible sequence of flooding operations. In the
    standard version, a flooding operation consists of the player choosing a colour
    k, which then changes the colour of all the tiles in the monochromatic region
    connected to the top left tile to k.

  322. Connected searching of weighted trees.

    Authors: Dariusz Dereniowski
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we consider the problem of connected edge searching of weighted
    trees. It is shown that there exists a polynomial-time algorithm for finding
    optimal connected search strategy for bounded degree trees with arbitrary
    weights on the edges and vertices of the tree. The problem is NP-complete for
    general node-weighted trees (the weight of each edge is 1).

  323. Fixed-Parameter Algorithms for Computing Kemeny Scores - Theory and Practice.

    Authors: Robert Bredereck
    Subjects: Data Structures and Algorithms
    Abstract

    The central problem in this work is to compute a ranking of a set of elements
    which is "closest to" a given set of input rankings of the elements. We define
    "closest to" in an established way as having the minimum sum of Kendall-Tau
    distances to each input ranking. Unfortunately, the resulting problem Kemeny
    consensus is NP-hard for instances with n input rankings, n being an even
    integer greater than three. Nevertheless this problem plays a central role in
    many rank aggregation problems.

  324. On Fast Algorithm for Computing Even-Length DCT.

    Authors: Yuriy A. Reznik
    Subjects: Data Structures and Algorithms
    Abstract

    We study recursive algorithm for computing DCT of lengths $N=q 2^m$ ($m,q \in
    \mathbb{N}$, $q$ is odd) due to C.W.Kok. We show that this algorithm has the
    same multiplicative complexity as theoretically achievable by the prime factor
    decomposition, when $m \leqslant 2$. We also show that C.W.Kok's factorization
    allows a simple conversion to a scaled form. We analyze complexity of such a
    scaled factorization, and show that for some lengths it achieves lower
    multiplicative complexity than one of known prime factor-based scaled
    transforms.

  325. Posynomial Geometric Programming Problems with Multiple Parameters.

    Authors: A. K.Ojha, K.K.Biswal
    Subjects: Data Structures and Algorithms
    Abstract

    Geometric programming problem is a powerful tool for solving some special
    type non-linear programming problems. It has a wide range of applications in
    optimization and engineering for solving some complex optimization problems.
    Many applications of geometric programming are on engineering design problems
    where parameters are estimated using geometric programming. When the parameters
    in the problems are imprecise, the calculated objective value should be
    imprecise as well.

  326. Minimum Vertex Cover in Rectangle Graphs.

    Authors: Dror Rawitz, Reuven Bar-Yehuda, Danny Hermelin
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the Vertex Cover problem in intersection graphs of axis-parallel
    rectangles on the plane. We present two algorithms: The first is an EPTAS for
    non-crossing rectangle families, rectangle families $\calR$ where $R_1
    \setminus R_2$ is connected for every pair of rectangles $R_1,R_2 \in \calR$.
    This algorithm extends to intersection graphs of pseudo-disks. The second
    algorithm achieves a factor of $(1.5 + \varepsilon)$ in general rectangle
    families, for any fixed $\varepsilon > 0$, and works also for the weighted
    variant of the problem.

  327. Optimal Gossip-Based Aggregate Computation.

    Authors: Gopal Pandurangan, Jen-Yeu Chen
    Subjects: Data Structures and Algorithms
    Abstract

    We present the first provably almost-optimal gossip-based algorithms for
    aggregate computation that are both time optimal and message-optimal. Given a
    $n$-node network, our algorithms guarantee that all the nodes can compute the
    common aggregates (such as Min, Max, Count, Sum, Average, Rank etc.) of their
    values in optimal $O(\log n)$ time and using $O(n \log \log n)$ messages. Our
    result improves on the algorithm of Kempe et al. \cite{kempe} that is
    time-optimal, but uses $O(n \log n)$ messages as well as on the algorithm of
    Kashyap et al.

  328. Detecting High Log-Densities -- an O(n^1/4) Approximation for Densest k-Subgraph.

    Authors: Uriel Feige, Aditya Bhaskara, Aravindan Vijayaraghavan, Moses Charikar, Eden Chlamtac
    Subjects: Data Structures and Algorithms
    Abstract

    In the Densest k-Subgraph problem, given a graph G and a parameter k, one
    needs to find a subgraph of G induced on k vertices that contains the largest
    number of edges. There is a significant gap between the best known upper and
    lower bounds for this problem. It is NP-hard, and does not have a PTAS unless
    NP has subexponential time algorithms. On the other hand, the current best
    known algorithm of Feige, Kortsarz and Peleg, gives an approximation ratio of
    n^(1/3-epsilon) for some specific epsilon > 0 (estimated at around 1/60).

  329. Succinct Dictionary Matching With No Slowdown.

    Authors: Djamal Belazzougui
    Subjects: Data Structures and Algorithms
    Abstract

    The problem of dictionary matching is a classical problem in string matching:
    given a set S of d strings of total length n characters over an (not
    necessarily constant) alphabet of size sigma, build a data structure so that we
    can match in a any text T all occurrences of strings belonging to S. The
    classical solution for this problem is the Aho-Corasick automaton which finds
    all occ occurrences in a text T in time O(|T| + occ) using a data structure
    that occupies O(m log m) bits of space where m <= n + 1 is the number of states
    in the automaton.

  330. Computing the Matrix p-norm.

    Authors: Aditya Bhaskara, Aravindan Vijayaraghavan
    Subjects: Data Structures and Algorithms
    Abstract

    A matrix is said to be positive if all its entries are >0. We consider n x n
    positive matrices where the ratio of the largest to smallest entry is at most
    N, for some parameter N. We show that for any p>1, the p-norm of the matrix,
    which is defined to be |A|_p = Max_x ||Ax||_p, where ||x||_p=1 can be computed
    to a factor of (1+$\delta$) in time polynomial in $\log(1/\delta)$, N and the
    dimension of the matrix n.

  331. Sampled Longest Common Prefix Array.

    Authors: Jouni Sir&#xe9;n
    Subjects: Data Structures and Algorithms
    Abstract

    When augmented with the longest common prefix (LCP) array and some other
    structures, the suffix array can solve many string processing problems in
    optimal time and space. A compressed representation of the LCP array is also
    one of the main building blocks in many compressed suffix tree proposals. In
    this paper, we describe a new compressed LCP representation: the sampled LCP
    array. We show that when used with a compressed suffix array (CSA), the sampled
    LCP array often offers better time/space trade-offs than the existing
    alternatives.

  332. Algorithmic Differentiation of Linear Algebra Functions with Application in Optimum Experimental Design (Extended Version).

    Authors: S.F. Walter, L. Lehmann
    Subjects: Data Structures and Algorithms
    Abstract

    We derive algorithms for higher order derivative computation of the
    rectangular $QR$ and eigenvalue decomposition of symmetric matrices with
    distinct eigenvalues in the forward and reverse mode of algorithmic
    differentiation (AD) using univariate Taylor propagation of matrices (UTPM).
    Linear algebra functions are regarded as elementary functions and not as
    algorithms. The presented algorithms are implemented in the BSD licensed AD
    tool \texttt{ALGOPY}. Numerical tests show that the UTPM algorithms derived in
    this paper produce results close to machine precision accuracy.

  333. New Constructive Aspects of the Lovasz Local Lemma.

    Authors: Bernhard Haeupler, Barna Saha, Aravind Srinivasan
    Subjects: Data Structures and Algorithms
    Abstract

    The Lov\'{a}sz Local Lemma (LLL) states that the probability that none of a
    set of "bad" events happens is nonzero if the probability of each event is
    small compared to the number of bad events it depends on. A series of results
    have provided algorithms to efficiently construct structures whose existence is
    (non-constructively) guaranteed by the full asymmetric LLL, culminating in the
    recent breakthrough of Moser & Tardos. We show that the output distribution of
    the Moser-Tardos procedure has sufficient randomness, leading to two classes of
    algorithmic applications.

  334. Approximate Shortest Paths Avoiding a Failed Vertex: Optimal Size Data Structures for Unweighted Graph.

    Authors: Neelesh Khanna Surender Baswana
    Subjects: Data Structures and Algorithms
    Abstract

    Let $G=(V,E)$ be any undirected graph on $V$ vertices and $E$ edges. A path
    $\textbf{P}$ between any two vertices $u,v\in V$ is said to be $t$-approximate
    shortest path if its length is at most $t$ times the length of the shortest
    path between $u$ and $v$. We consider the problem of building a compact data
    structure for a given graph $G$ which is capable of answering the following
    query for any $u,v,z\in V$ and $t>1$:

    Report $t$-approximate shortest path between $u$ and $v$ when vertex $z$
    fails

  335. Beyond Bidimensionality: Parameterized Subexponential Algorithms on Directed Graphs.

    Authors: Venkatesh Raman, Saket Saurabh, Frederic Dorn, Fedor V. Fomin, Daniel Lokshtanov
    Subjects: Data Structures and Algorithms
    Abstract

    We develop two different methods to achieve subexponential time parameterized
    algorithms for problems on sparse directed graphs. We exemplify our approaches
    with two well studied problems.

  336. How to meet asynchronously (almost) everywhere.

    Authors: Jurek Czyzowicz, Arnaud Labourel, Andrzej Pelc
    Subjects: Data Structures and Algorithms
    Abstract

    Two mobile agents (robots) with distinct labels have to meet in an arbitrary,
    possibly in?nite, unknown connected graph or in an unknown connected terrain in
    the plane. Agents are modeled as points, and the route of each of them only
    depends on its label and on the unknown environment. The actual walk of each
    agent also depends on an asynchronous adversary that may arbitrarily vary the
    speed of the agent, stop it, or even move it back and forth, as long as the
    walk of the agent in each segment of its route is continuous, does not leave it
    and covers all of it.

  337. Online Correlation Clustering.

    Authors: Warren Schudy, Claire Mathieu, Ocan Sankur
    Subjects: Data Structures and Algorithms
    Abstract

    We study the online clustering problem where data items arrive in an online
    fashion. The algorithm maintains a clustering of data items into similarity
    classes. Upon arrival of v, the relation between v and previously arrived items
    is revealed, so that for each u we are told whether v is similar to u. The
    algorithm can create a new cluster for v and merge existing clusters.

    When the objective is to minimize disagreements between the clustering and
    the input, we prove that a natural greedy algorithm is O(n)-competitive, and
    this is optimal.

  338. Computing a Frobenius Coin Problem decision problem in O(n^2).

    Authors: Charles Sauerbier
    Subjects: Data Structures and Algorithms
    Abstract

    Expanding on recent results of another an algorithm is presented that
    provides solution to the Frobenius Coin Problem in worst case O(n^2) in the
    magnitude of the largest denomination.

  339. Lipschitz Unimodal and Isotonic Regression on Paths and Trees.

    Authors: Jeff M. Phillips, Pankaj K. Agarwal, Bardia Sadri
    Subjects: Data Structures and Algorithms
    Abstract

    We describe algorithms for finding the regression of t, a sequence of values,
    to the closest sequence s by mean squared error, so that s is always increasing
    (isotonicity) and so the values of two consecutive points do not increase by
    too much (Lipschitz). The isotonicity constraint can be replaced with a
    unimodular constraint, where there is exactly one local maximum in s. These
    algorithm are generalized from sequences of values to trees of values. For each
    scenario we describe near-linear time algorithms.

  340. Continuous Monitoring of Distributed Data Streams over a Time-based Sliding Window.

    Authors: Ho-Leung Chan, Tak-Wah Lam, Lap-Kei Lee, Hing-Fung Ting
    Subjects: Data Structures and Algorithms
    Abstract

    The past decade has witnessed many interesting algorithms for maintaining
    statistics over a data stream. This paper initiates a theoretical study of
    algorithms for monitoring distributed data streams over a time-based sliding
    window (which contains a variable number of items and possibly out-of-order
    items). The concern is how to minimize the communication between individual
    streams and the root, while allowing the root, at any time, to be able to
    report the global statistics of all streams within a given error bound.

  341. A Simplified Proof For The Application Of Freivalds' Technique to Verify Matrix Multiplication.

    Authors: Vamsi K. Kundeti
    Subjects: Data Structures and Algorithms
    Abstract

    Fingerprinting is a well known technique, which is often used in designing
    Monte Carlo algorithms for verifying identities involving ma- trices, integers
    and polynomials. The book by Motwani and Raghavan [1] shows how this technique
    can be applied to check the correctness of matrix multiplication -- check if AB
    = C where A, B and C are three nxn matrices. The result is a Monte Carlo
    algorithm running in time $Theta(n^2)$ with an exponentially decreasing error
    probability after each indepen- dent iteration. In this paper we give a simple
    alternate proof addressing the same problem.

  342. Computing the Prime Factoring of an Integer in O(n^4).

    Authors: Charles Sauerbier
    Subjects: Data Structures and Algorithms
    Abstract

    A deterministic algorithm computing the prime factoring of an integer value
    with worst case complexity of O(n^4) in the magnitude of the value to be
    factored is presented. Empirical data within the limits of available resources
    is presented that suggest the worst case complexity of the algorithm in context
    of bits of representation of the value to be factored is approximate to O(n^4).

  343. Computing Least Fixed Points of Probabilistic Systems of Polynomials.

    Authors: Andreas Gaiser, Javier Esparza, Stefan Kiefer
    Subjects: Data Structures and Algorithms
    Abstract

    We study systems of equations of the form X1 = f1(X1, ..., Xn), ..., Xn =
    fn(X1, ..., Xn), where each fi is a polynomial with nonnegative coefficients
    that add up to 1. The least nonnegative solution, say mu, of such equation
    systems is central to problems from various areas, like physics, biology,
    computational linguistics and probabilistic program verification. We give a
    simple and strongly polynomial algorithm to decide whether mu=(1, ..., 1)
    holds. Furthermore, we present an algorithm that computes reliable sequences of
    lower and upper bounds on mu, converging linearly to mu.

  344. Fast Fraction-Integer Method for Computing Multiplicative Inverse.

    Authors: Hani M. AL-Matari, Sattar J. Aboud, Nidal F. Shilbayeh
    Subjects: Data Structures and Algorithms
    Abstract

    Multiplicative inverse is a crucial operation in public key cryptography, and
    been widely used in cryptography. Public key cryptography has given rise to
    such a need, in which we need to generate a related public and private pair of
    numbers, each of which is the inverse of the other. The basic method to find
    multiplicative inverses is Extended-Euclidean method. In this paper we will
    propose a new algorithm for computing the inverse, based on continues subtract
    fraction from integer and divide by fraction to obtain integer that will be
    used to compute the inverse d.

  345. Robust Fault Tolerant uncapacitated facility location.

    Authors: David Peleg, Shiri Chechik
    Subjects: Data Structures and Algorithms
    Abstract

    In the uncapacitated facility location problem, given a graph, a set of
    demands and opening costs, it is required to find a set of facilities R, so as
    to minimize the sum of the cost of opening the facilities in R and the cost of
    assigning all node demands to open facilities. This paper concerns the robust
    fault-tolerant version of the uncapacitated facility location problem (RFTFL).
    In this problem, one or more facilities might fail, and each demand should be
    supplied by the closest open facility that did not fail.

  346. Relaxed spanners for directed disk graphs.

    Authors: David Peleg, Liam Roditty
    Subjects: Data Structures and Algorithms
    Abstract

    Let $(V,\delta)$ be a finite metric space, where $V$ is a set of $n$ points
    and $\delta$ is a distance function defined for these points. Assume that
    $(V,\delta)$ has a constant doubling dimension $d$ and assume that each point
    $p\in V$ has a disk of radius $r(p)$ around it. The disk graph that corresponds
    to $V$ and $r(\cdot)$ is a \emph{directed} graph $I(V,E,r)$, whose vertices are
    the points of $V$ and whose edge set includes a directed edge from $p$ to $q$
    if $\delta(p,q)\leq r(p)$.

  347. Combining Partial Order Alignment and Progressive Near-Optimal Alignment.

    Authors: Dai Tri Man Le
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, I proposed to utilize partial-order alignment technique as a
    heuristic method to cope with the state-space explosion problem in progressive
    near-optimal alignment. The key idea of my approach is a formal treatment of
    progressive partial order alignment based on the graph product construction.

  348. Faster Algorithms for Finding and Counting Subgraphs.

    Authors: Venkatesh Raman, Saket Saurabh, Fedor V. Fomin, Daniel Lokshtanov, B. V. Raghavendra Rao
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we study a natural generalization of both {\sc $k$-Path} and
    {\sc $k$-Tree} problems, namely, the {\sc Subgraph Isomorphism} problem.

  349. Space Constrained Dynamic Covering.

    Authors: Anish Das Sarma, Shaddin Dughmi, Ioannis Antonellis
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we identify a fundamental algorithmic problem that we term
    space-constrained dynamic covering (SCDC), arising in many modern-day web
    applications, including ad-serving and online recommendation systems in eBay
    and Netflix. Roughly speaking, SCDC applies two restrictions to the
    well-studied Max-Coverage problem: Given an integer k, X={1,2,...,n} and
    I={S_1, ..., S_m}, S_i a subset of X, find a subset J of I, such that |J| <= k
    and the union of S in J is as large as possible.

  350. Construction Sequences and Certifying 3-Connectedness.

    Authors: Jens M. Schmidt
    Subjects: Data Structures and Algorithms
    Abstract

    Tutte proved that every 3-connected graph on more than 4 nodes has a
    contractible edge. Barnette and Gruenbaum proved the existence of a removable
    edge in the same setting. We show that the sequence of contractions and the
    sequence of removals from G to the K_4 can be computed in O(|V|^2) time by
    extending Barnette and Gruenbaum's theorem. As an application, we derive a
    certificate for the 3-connectedness of graphs that can be easily computed and
    verified.

  351. Renewal theory in analysis of tries and strings.

    Authors: Svante Janson
    Subjects: Data Structures and Algorithms
    Abstract

    We give a survey of a number of simple applications of renewal theory to
    problems on random strings and tries: insertion depth, size, insertion mode and
    imbalance of tries; variations for b-tries and Patricia tries; Khodak and
    Tunstall codes.

  352. Computing a Discrete Logarithm in O(n^3).

    Authors: Charles Sauerbier
    Subjects: Data Structures and Algorithms
    Abstract

    This paper presents a means with time complexity of at worst O(n^3) to
    compute the discrete logarithm on cyclic finite groups of integers modulo p.

  353. Algorithms and Hardness for Subspace Approximation.

    Authors: Amit Deshpande, Kasturi Varadarajan, Madhur Tulsiani, Nisheeth K. Vishnoi
    Subjects: Data Structures and Algorithms
    Abstract

    The subspace approximation problem Subspace($k$,$p$) asks for a
    $k$-dimensional linear subspace that fits a given set of points optimally,
    where the error for fitting is a generalization of the least squares fit and
    uses the $\ell_{p}$ norm instead. Most of the previous work on subspace
    approximation has focused on small or constant $k$ and $p$, using coresets and
    sampling techniques from computational geometry.

  354. Euclidean Prize-collecting Steiner Forest.

    Authors: MohammadHossein Bateni, MohammadTaghi Hajiaghayi
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we consider Steiner forest and its generalizations,
    prize-collecting Steiner forest and k-Steiner forest, when the vertices of the
    input graph are points in the Euclidean plane and the lengths are Euclidean
    distances. First, we present a simpler analysis of the polynomial-time
    approximation scheme (PTAS) of Borradaile et al. [12] for the Euclidean Steiner
    forest problem. This is done by proving a new structural property and modifying
    the dynamic programming by adding a new piece of information to each dynamic
    programming state.

  355. Thresholded Covering Algorithms for Robust and Max-Min Optimization.

    Authors: Anupam Gupta, Viswanath Nagarajan, R. Ravi
    Subjects: Data Structures and Algorithms
    Abstract

    The general problem of robust optimization is this: one of several possible
    scenarios will appear tomorrow, but things are more expensive tomorrow than
    they are today. What should you anticipatorily buy today, so that the
    worst-case cost (summed over both days) is minimized? Feige et al. and
    Khandekar et al. considered the k-robust model where the possible outcomes
    tomorrow are given by all demand-subsets of size k, and gave algorithms for the
    set cover problem, and the Steiner tree and facility location problems in this
    model, respectively.

  356. Practical Algorithmic Techniques for Several String Processing Problems.

    Authors: Mugurel Ionut Andreica, Nicolae Tapus
    Subjects: Data Structures and Algorithms
    Abstract

    The domains of data mining and knowledge discovery make use of large amounts
    of textual data, which need to be handled efficiently. Specific problems, like
    finding the maximum weight ordered common subset of a set of ordered sets or
    searching for specific patterns within texts, occur frequently in this context.
    In this paper we present several novel and practical algorithmic techniques for
    processing textual data (strings) in order to efficiently solve multiple
    problems.

  357. New Algorithmic Approaches for Computing Optimal Network Paths with Several Types of QoS Constraints.

    Authors: Mugurel Ionut Andreica, Nicolae Tapus
    Subjects: Data Structures and Algorithms
    Abstract

    The problem of efficiently delivering data within networks is very important
    nowadays, especially in the context of the large volumes of data which are
    being produced each year and of the increased data access needs of the users.
    Efficient data delivery strategies must satisfy several types of Quality of
    Service (QoS) constraints which are imposed by the data consumers. One
    possibility of achieving this goal (particularly in the case of in-order data
    transfers) is to choose a satisfactory network delivery path.

  358. Submodular Functions: Extensions, Distributions, and Algorithms. A Survey.

    Authors: Shaddin Dughmi
    Subjects: Data Structures and Algorithms
    Abstract

    Submodularity is a fundamental phenomenon in combinatorial optimization.
    Submodular functions occur in a variety of combinatorial settings such as
    coverage problems, cut problems, welfare maximization, and many more.
    Therefore, a lot of work has been concerned with maximizing or minimizing a
    submodular function, often subject to combinatorial constraints. Many of these
    algorithmic results exhibit a common structure.

  359. Tight Thresholds for Cuckoo Hashing via XORSAT.

    Authors: Andrea Montanari, Rasmus Pagh, Martin Dietzfelbinger, Andreas Goerdt, Michael Mitzenmacher, Michael Rink
    Subjects: Data Structures and Algorithms
    Abstract

    We settle the question of tight thresholds for offline cuckoo hashing. The
    problem can be stated as follows: we have n keys to be hashed into m buckets
    each capable of holding a single key. Each key has k >= 3 (distinct) associated
    buckets chosen uniformly at random and independently of the choices of other
    keys. A hash table can be constructed successfully if each key can be placed
    into one of its buckets.

  360. Optimal lower bounds for locality sensitive hashing (except when q is tiny).

    Authors: Yuan Zhou, Ryan O&#x27;Donnell, Yi Wu
    Subjects: Data Structures and Algorithms
    Abstract

    We study lower bounds for Locality Sensitive Hashing (LSH) in the strongest
    setting: point sets in {0,1}^d under the Hamming distance. Recall that here H
    is said to be an (r, cr, p, q)-sensitive hash family if all pairs x, y in
    {0,1}^d with dist(x,y) at most r have probability at least p of collision under
    a randomly chosen h in H, whereas all pairs x, y in {0,1}^d with dist(x,y) at
    least cr have probability at most q of collision. Typically, one considers d
    tending to infinity, with c fixed and q bounded away from 0.

  361. Approximate Sparse Recovery: Optimizing Time and Measurements.

    Authors: Ely Porat, Yi Li, Anna C. Gilbert, Martin J. Strauss
    Subjects: Data Structures and Algorithms
    Abstract

    An approximate sparse recovery system consists of parameters $k,N$, an
    $m$-by-$N$ measurement matrix, $\Phi$, and a decoding algorithm, $\mathcal{D}$.
    Given a vector, $x$, the system approximates $x$ by $\widehat x
    =\mathcal{D}(\Phi x)$, which must satisfy $\| \widehat x - x\|_2\le C \|x -
    x_k\|_2$, where $x_k$ denotes the optimal $k$-term approximation to $x$. For
    each vector $x$, the system must succeed with probability at least 3/4. Among
    the goals in designing such systems are minimizing the number $m$ of
    measurements and the runtime of the decoding algorithm, $\mathcal{D}$.

  362. Note on Max Lin-2 above Average.

    Authors: Gregory Gutin, Robert Crowston, Mark Jones
    Subjects: Data Structures and Algorithms
    Abstract

    In the Max Lin-2 problem we are given a system $S$ of $m$ linear equations in
    $n$ variables over $\mathbb{F}_2$ in which Equation $j$ is assigned a positive
    integral weight $w_j$ for each $j$. We wish to find an assignment of values to
    the variables which maximizes the total weight of satisfied equations. This
    problem generalizes Max Cut. The expected weight of satisfied equations is
    $W/2$, where $W=w_1+... +w_m$; $W/2$ is a tight lower bound on the optimal
    solution of Max Lin-2.

  363. Faster and simpler approximation of stable matchings.

    Authors: Katarzyna Paluch
    Subjects: Data Structures and Algorithms
    Abstract

    We give a 3/2-approximation algorithm for stable matchings that runs in
    $O(m)$ time. The previously best known algorithm by McDermid has the same
    approximation ratio but runs in $O(n^{3/2}m)$ time, where $n$ denotes the
    number of vertices and $m$ the number of edges. Also the algorithm and the
    analysis are much simpler.

  364. Alphabet Partitioning for Compressed Rank/Select with Applications.

    Authors: Gonzalo Navarro, Jeremy Barbay, Travis Gagie, Yakov Nekrich
    Subjects: Data Structures and Algorithms
    Abstract

    We show show how, if we have a data structure that efficiently supports
    \access, \rank and \select queries on strings in compressed form, and another
    that supports those queries efficiently on strings over large alphabets, we can
    combine their strengths via alphabet partitioning. Specifically, we present a
    data structure that stores a string (s [1..n]) over alphabet $[1..\sigma]$ in
    (n H_0 (s) + o (n) (H_0 (s) + 1)) bits, where $H_0(s)$ is the zero-order
    entropy of $s$, and supports \access and \rank queries in time $\Oh{\log \log
    \sigma}$ and \select queries in $\Oh{1}$ time.

  365. Approximation Schemes for Steiner Forest on Planar Graphs and Graphs of Bounded Treewidth.

    Authors: D&#xe1;niel Marx, MohammadHossein Bateni, MohammadTaghi Hajiaghayi
    Subjects: Data Structures and Algorithms
    Abstract

    We give the first polynomial-time approximation scheme (PTAS) for the Steiner
    forest problem on planar graphs and, more generally, on graphs of bounded
    genus. As a first step, we show how to build a Steiner forest spanner for such
    graphs. The crux of the process is a clustering procedure called
    prize-collecting clustering that breaks down the input instance into separate
    subinstances which are easier to handle; moreover, the terminals in different
    subinstances are far from each other.

  366. An $O(n^2)$ Algorithm for Computing Longest Common Cyclic Subsequence.

    Authors: Masud Hasan, M. Sohel Rahman, Shihabur Rahman Chowdhury, Sumaiya Iqbal
    Subjects: Data Structures and Algorithms
    Abstract

    The {\em longest common subsequence (LCS)} problem is a classic and
    well-studied problem in computer science. LCS is a central problem in
    stringology and finds broad applications in text compression, error-detecting
    codes and biological sequence comparison. However, in numerous contexts, words
    represent cyclic sequences of symbols and LCS must be generalized to consider
    all circular shifts of the strings.

  367. Faster FAST(Feedback Arc Set in Tournaments).

    Authors: Uriel Feige
    Subjects: Data Structures and Algorithms
    Abstract

    We present an algorithm that finds a feedback arc set of size $k$ in a
    tournament in time $n^{O(1)}2^{O(\sqrt{k})}$. This is asymptotically faster
    than the running time of previously known algorithms for this problem.

  368. Efficient Higher Order Derivatives of Objective Functions Composed of Matrix Operations.

    Authors: Sebastian F. Walter
    Subjects: Data Structures and Algorithms
    Abstract

    This paper is concerned with the efficient evaluation of higher-order
    derivatives of functions $f$ that are composed of matrix operations. I.e., we
    want to compute the $D$-th derivative tensor $\nabla^D f(X) \in \mathbb
    R^{N^D}$, where $f:\mathbb R^{N} \to \mathbb R$ is given as an algorithm that
    consists of many matrix operations. We propose a method that is a combination
    of two well-known techniques from Algorithmic Differentiation (AD): univariate
    Taylor propagation on scalars (UTPS) and first-order forward and reverse on
    matrices.

  369. Improved Algorithm for Degree Bounded Survivable Network Design Problem.

    Authors: Anand Louis, Nisheeth Vishnoi
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the Degree-Bounded Survivable Network Design Problem: the
    objective is to find a minimum cost subgraph satisfying the given connectivity
    requirements as well as the degree bounds on the vertices. If we denote the
    upper bound on the degree of a vertex v by b(v), then we present an algorithm
    that finds a solution whose cost is at most twice the cost of the optimal
    solution while the degree of a degree constrained vertex v is at most 2b(v) +
    2. This improves upon the results of Lau and Singh and that of Lau, Naor,
    Salavatipour and Singh.

  370. Randomized Interior Point methods for Sampling and Optimization.

    Authors: Hariharan Narayanan
    Subjects: Data Structures and Algorithms
    Abstract

    All known Markov Chains for sampling a general convex set have mixing times
    that depend upon the aspect ratio of the convex set, a measure of which is the
    ratio between the radius of the smallest sphere circumscribed around $K$ and
    the radius of the largest sphere inscribed in $K$. Extending earlier work on
    polytopes, we present a Markov chain for sampling from a convex body using a
    self-concordant barrier, whose mixing time does not depend on its aspect ratio
    or diameter.

  371. A $(\log n)^{\Omega(1)}$ integrality gap for the Sparsest Cut SDP.

    Authors: Jeff Cheeger, Bruce Kleiner, Assaf Naor
    Subjects: Data Structures and Algorithms
    Abstract

    We show that the Goemans-Linial semidefinite relaxation of the Sparsest Cut
    problem with general demands has integrality gap $(\log n)^{\Omega(1)}$. This
    is achieved by exhibiting $n$-point metric spaces of negative type whose $L_1$
    distortion is $(\log n)^{\Omega(1)}$. Our result is based on quantitative
    bounds on the rate of degeneration of Lipschitz maps from the Heisenberg group
    to $L_1$ when restricted to cosets of the center.

  372. Recognizing well-parenthesized expressions in the streaming model.

    Authors: F. Magniez, C. Mathieu, A. Nayak
    Subjects: Data Structures and Algorithms
    Abstract

    Motivated by a concrete problem and with the goal of understanding the sense
    in which the complexity of streaming algorithms is related to the complexity of
    formal languages, we investigate the problem Dyck(s) of checking matching
    parentheses, with $s$ different types of parenthesis.

  373. A Minimal Periods Algorithm with Applications.

    Authors: Zhi Xu
    Subjects: Data Structures and Algorithms
    Abstract

    Kosaraju in ``Computation of squares in a string'' briefly described a
    linear-time algorithm for computing the minimal squares starting at each
    position in a word. Using the same construction of suffix trees, we generalize
    his result and describe in detail how to compute in O(k|w|)-time the minimal
    k-th power, with period of length larger than s, starting at each position in a
    word w for arbitrary exponent $k\geq2$ and integer $s\geq0$. We provide the
    complete proof of correctness of the algorithm, which is somehow not completely
    clear in Kosaraju's original paper.

  374. Synthesizing Minimal Tile Sets for Patterned DNA Self-Assembly.

    Authors: Mika G&#xf6;&#xf6;s, Pekka Orponen
    Subjects: Data Structures and Algorithms
    Abstract

    The Pattern self-Assembly Tile set Synthesis (PATS) problem is to determine a
    set of coloured tiles that self-assemble to implement a given rectangular
    colour pattern. We give an exhaustive branch-and-bound algorithm to find tile
    sets of minimum cardinality for the PATS problem. Our algorithm makes use of a
    search tree in the lattice of partitions of the ambient rectangular grid, and
    an efficient bounding function to prune this search tree. Empirical data on the
    performance of the algorithm shows that it compares favourably to previously
    presented heuristic solutions to the problem.

  375. PageRank Optimization by Edge Selection.

    Authors: Bal&#xe1;zs Csan&#xe1;d Cs&#xe1;ji, Rapha&#xeb;l M. Jungers, Vincent D. Blondel
    Subjects: Data Structures and Algorithms
    Abstract

    The importance of a node in a directed graph can be measured by its PageRank.
    The PageRank of a node is used in a number of application contexts - including
    web-page ranking - and can be interpreted as the average portion of time spent
    at the node by an infinite random walk in the graph. We consider the problem of
    maximizing the PageRank of a node by selecting some of the edges from a set of
    edges that are under our control. By applying results from Markov decision
    theory, we show that an optimal solution to this problem can be found in
    polynomial time.

  376. Approximation Schemes for the Betweenness Problem in Tournaments and Related Ranking Problems.

    Authors: Marek Karpinski, Warren Schudy
    Subjects: Data Structures and Algorithms
    Abstract

    We design the first polynomial time approximation schemes (PTASs) for the
    Minimum Betweenness problem in tournaments and some related higher arity
    ranking problems. This settles the approximation status of the Betweenness
    problem in tournaments along with other ranking problems which were open for
    some time now. We also show fixed parameter tractability of Betweenness in
    tournaments and improved fixed parameter algorithms for Feedback Arc Set in
    tournaments and Kemeny Rank Aggregation.

  377. GEDI: Scalable Algorithms for Genotype Error Detection and Imputation.

    Authors: Justin Kennedy, Ion I. Mandoiu, Bogdan Pasaniuc
    Subjects: Data Structures and Algorithms
    Abstract

    Genome-wide association studies generate very large datasets that require
    scalable analysis algorithms. In this report we describe the GEDI software
    package, which implements efficient algorithms for performing several common
    tasks in the analysis of population genotype data, including genotype error
    detection and correction, imputation of both randomly missing and untyped
    genotypes, and genotype phasing. Experimental results show that GEDI achieves
    high accuracy with a runtime scaling linearly with the number of markers and
    samples.

  378. A Faster Exact Algorithm for the Directed Maximum Leaf Spanning Tree Problem.

    Authors: Henning Fernau, Daniel Raible
    Subjects: Data Structures and Algorithms
    Abstract

    Given a directed graph $G=(V,A)$, the Directed Maximum Leaf Spanning Tree
    problem asks to compute a directed spanning tree (i.e., an out-branching) with
    as many leaves as possible. By designing a Branch-and-Reduced algorithm
    combined with the Measure & Conquer technique for running time analysis, we
    show that the problem can be solved in time $\Oh^*(1.9043^n)$ using polynomial
    space. Hitherto, there have been only few examples. Provided exponential space
    this run time upper bound can be lowered to $\Oh^*(1.8139^n)$.

  379. Belief Propagation and Loop Calculus for Permanent of a Non-Negative Matrix.

    Authors: Yusuke Watanabe, Michael Chertkov
    Subjects: Data Structures and Algorithms
    Abstract

    We consider computation of permanent of a positive N times N non-negative
    matrix, $P=((p_i^j)^{1/T})$, or equivalently the problem of weighted counting
    of perfect matchings (evaluation of the Partition Function) over a fully
    connected bipartite graph, $K_{N,N}$. The problem is known to be #P hard.
    Stated as a graphical model, this problem allows exact Loop Calculus
    representation [Chertkov, Chernyak '06] in terms of a minimum of the Bethe Free
    Energy functional over $N^2$ marginal beliefs.

  380. Approximating the Statistics of various Properties in Randomly Weighted Graphs.

    Authors: Amos Korman, Yuval Emek, Yuval Shavitt
    Subjects: Data Structures and Algorithms
    Abstract

    Consider the setting of \emph{randomly weighted graphs}, namely, graphs whose
    edge weights are chosen independently according to probability distributions
    with finite support over the non-negative reals. Under this setting, weighted
    graph properties typically become random variables and we are interested in
    computing their statistical features. Unfortunately, this turns out to be
    computationally hard for some weighted graph properties albeit the problem of
    computing the properties per se in the traditional setting of algorithmic graph
    theory is tractable.

  381. Sharp Dichotomies for Regret Minimization in Metric Spaces.

    Authors: Aleksandrs Slivkins, Robert Kleinberg
    Subjects: Data Structures and Algorithms
    Abstract

    The Lipschitz multi-armed bandit (MAB) problem generalizes the classical
    multi-armed bandit problem by assuming one is given side information consisting
    of a priori upper bounds on the difference in expected payoff between certain
    pairs of strategies. Classical results of (Lai and Robbins 1985) and (Auer et
    al. 2002) imply a logarithmic regret bound for the Lipschitz MAB problem on
    finite metric spaces. Recent results on continuum-armed bandit problems and
    their generalizations imply lower bounds of $\sqrt{t}$, or stronger, for many
    infinite metric spaces such as the unit interval.

  382. Tractable hypergraph properties for constraint satisfaction and conjunctive queries.

    Authors: D&#xe1;niel Marx
    Subjects: Data Structures and Algorithms
    Abstract

    An important question in the study of constraint satisfaction problems (CSP)
    is understanding how the graph or hypergraph describing the incidence structure
    of the constraints influences the complexity of the problem. For binary CSP
    instances (i.e., where each constraint involves only two variables), the
    situation is well understood: the complexity of the problem essentially depends
    on the treewidth of the graph of the constraints.

  383. Research report : Collaborative Peer 2 Peer Edition: Avoiding Conflicts is Better than Solving Conflicts.

    Authors: St&#xe9;phane Martin, Denis Lugiez
    Subjects: Data Structures and Algorithms
    Abstract

    Collaborative edition is achieved by distinct sites that work independently
    on (a copy of) a shared document. Conflicts may arise during this process and
    must be solved by the collaborative editor. In pure Peer to Peer collaborative
    editing, no centralization nor locks nor time-stamps are used which make
    conflict resolution difficult. We propose an algorithm which relies on the
    notion or semantics dependence and avoids the need of any integration
    transformation to solve conflicts.

  384. A O(E) Time Shortest Path Algorithm For Non Negative Weighted Undirected Graphs.

    Authors: Sohail Safdar, Muhammad Aasim Qureshi, Rehan Akbar, Dr. Fadzil B. Hassan
    Subjects: Data Structures and Algorithms
    Abstract

    In most of the shortest path problems like vehicle routing problems and
    network routing problems, we only need an efficient path between two points
    source and destination, and it is not necessary to calculate the shortest path
    from source to all other nodes. This paper concentrates on this very idea and
    presents an algorithm for calculating shortest path for (i) nonnegative
    weighted undirected graphs (ii) unweighted undirected graphs.

  385. Exact algorithms for OWA-optimization in multiobjective spanning tree problems.

    Authors: Lucie Galand, Olivier Spanjaard
    Subjects: Data Structures and Algorithms
    Abstract

    This paper deals with the multiobjective version of the optimal spanning tree
    problem. More precisely, we are interested in determining the optimal spanning
    tree according to an Ordered Weighted Average (OWA) of its objective values. We
    first show that the problem is weakly NP-hard. In the case where the weights of
    the OWA are strictly decreasing, we then propose a mixed integer programming
    formulation, and provide dedicated optimality conditions yielding an important
    reduction of the size of the program.

  386. Kernels for Feedback Arc Set In Tournaments.

    Authors: Saket Saurabh, Fedor V. Fomin, St&#xe9;phane Bessy, Serge Gaspers, Christophe Paul, Anthony Perez, St&#xe9;phan Thomass&#xe9;
    Subjects: Data Structures and Algorithms
    Abstract

    A tournament T=(V,A) is a directed graph in which there is exactly one arc
    between every pair of distinct vertices. Given a digraph on n vertices and an
    integer parameter k, the Feedback Arc Set problem asks whether the given
    digraph has a set of k arcs whose removal results in an acyclic digraph. The
    Feedback Arc Set problem restricted to tournaments is known as the k-Feedback
    Arc Set in Tournaments (k-FAST) problem. In this paper we obtain a linear
    vertex kernel for k-FAST.

  387. Maximum Matchings in Random Bipartite Graphs and the Space Utilization of Cuckoo Hashtables.

    Authors: Alan Frieze, P&#xe1;ll Melsted
    Subjects: Data Structures and Algorithms
    Abstract

    We study the the following question in Random Graphs. We are given two
    disjoint sets $L,R$ with $|L|=n=\alpha m$ and $|R|=m$. We construct a random
    graph $G$ by allowing each $x\in L$ to choose $d$ random neighbours in $R$. The
    question discussed is as to the size $\mu(G)$ of the largest matching in $G$.
    When considered in the context of Cuckoo Hashing, one key question is as to
    when is $\mu(G)=n$ whp? We answer this question exactly when $d$ is at least
    four.

  388. Vector Bin Packing with Multiple-Choice.

    Authors: Boaz Patt-Shamir, Dror Rawitz
    Subjects: Data Structures and Algorithms
    Abstract

    We consider a variant of bin packing called multiple-choice vector bin
    packing. In this problem we are given a set of items, where each item can be
    selected in one of several $D$-dimensional incarnations. We are also given $T$
    bin types, each with its own cost and $D$-dimensional size. Our goal is to pack
    the items in a set of bins of minimum overall cost. The problem is motivated by
    scheduling in networks with guaranteed quality of service (QoS), but due to its
    general formulation it has many other applications as well.

  389. Sharp Load Thresholds for Cuckoo Hashing.

    Authors: Konstantinos Panagiotou, Nikolaos Fountoulakis
    Subjects: Data Structures and Algorithms
    Abstract

    The paradigm of many choices has influenced significantly the design of
    efficient data structures and, most notably, hash tables. Cuckoo hashing is a
    technique that extends this concept. There,we are given a table with $n$
    locations, and we assume that each location can hold one item. Each item to be
    inserted chooses randomly k>1 locations and has to be placed in any one of
    them. How much load can cuckoo hashing handle before collisions prevent the
    successful assignment of the available items to the chosen locations?

  390. Semi-local string comparison: algorithmic techniques and applications.

    Authors: Alexander Tiskin
    Subjects: Data Structures and Algorithms
    Abstract

    The longest common subsequence (LCS) problem is a classical problem in
    computer science. The semi-local LCS problem is a generalisation of the LCS
    problem, arising naturally in the context of string comparison.

  391. The 1.375 Approximation Algorithm for Sorting by Transpositions Can Run in $O(n\log n)$ Time.

    Authors: Jesun Sahariar Firoz, Masud Hasan, Ashik Zinnat Khan, M. Sohel Rahman
    Subjects: Data Structures and Algorithms
    Abstract

    Sorting a Permutation by Transpositions (SPbT) is an important problem in
    Bioinformtics. In this paper, we improve the running time of the best known
    approximation algorithm for SPbT. We use the permutation tree data structure of
    Feng and Zhu and improve the running time of the 1.375 Approximation Algorithm
    for SPbT of Elias and Hartman to $O(n\log n)$. The previous running time of EH
    algorithm was $O(n^2)$.

  392. Searching a bitstream in linear time for the longest substring of any given density.

    Authors: Benjamin A. Burton
    Subjects: Data Structures and Algorithms
    Abstract

    Given an arbitrary bitstream, we consider the problem of finding the longest
    substring whose ratio of ones to zeroes equals a given value. The central
    result of this paper is an algorithm that solves this problem in linear time.
    The method involves (i) reformulating the problem as a constrained walk through
    a sparse matrix, and then (ii) developing a data structure for this sparse
    matrix that allows us to perform each step of the walk in amortised constant
    time.

  393. Testing Distribution Identity Efficiently.

    Authors: Krzysztof Onak
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the problem of testing distribution identity. Given a sequence of
    independent samples from an unknown distribution on a domain of size n, the
    goal is to check if the unknown distribution approximately equals a known
    distribution on the same domain. While Batu, Fortnow, Fischer, Kumar,
    Rubinfeld, and White (FOCS 2001) proved that the sample complexity of the
    problem is O~(sqrt(n) * poly(1/epsilon)), the running time of their tester is
    much higher: O(n) + O~(sqrt(n) * poly(1/epsilon)). We modify their tester to
    achieve a running time of O~(sqrt(n) * poly(1/epsilon)).

  394. b-Bit Minwise Hashing.

    Authors: Ping Li, Arnd Christian Konig
    Subjects: Data Structures and Algorithms
    Abstract

    This paper establishes the theoretical framework of b-bit minwise hashing.
    The original minwise hashing method has become a standard technique for
    estimating set similarity (e.g., resemblance) with applications in information
    retrieval, data management, social networks and computational advertising.

  395. Parameterized Complexity of the k-anonymity Problem.

    Authors: Paola Bonizzoni, Gianluca Della Vedova, Riccardo Dondi, Yuri Pirola
    Subjects: Data Structures and Algorithms
    Abstract

    The problem of publishing personal data without giving up privacy is becoming
    increasingly important. An interesting formalization that has been recently
    proposed is the $k$-anonymity. This approach requires that the rows of a table
    are partitioned in clusters of size at least $k$ and that all the rows in a
    cluster become the same tuple, after the suppression of some entries.

  396. Wee LCP.

    Authors: Johannes Fischer
    Subjects: Data Structures and Algorithms
    Abstract

    We prove that longest common prefix (LCP) information can be stored in much
    less space than previously known. More precisely, we show that in the presence
    of the text and the suffix array, o(n) additional bits are sufficient to answer
    LCP-queries asymptotically in the same time that is needed to retrieve an entry
    from the suffix array. This yields the smallest compressed suffix tree with
    sub-logarithmic navigation time.

  397. Scalable Distributed-Memory External Sorting.

    Authors: Peter Sanders, Mirko Rahn, Johannes Singler
    Subjects: Data Structures and Algorithms
    Abstract

    We engineer algorithms for sorting huge data sets on massively parallel
    machines. The algorithms are based on the multiway merging paradigm. We first
    outline an algorithm whose I/O requirement is close to a lower bound. Thus, in
    contrast to naive implementations of multiway merging and all other approaches
    known to us, the algorithm works with just two passes over the data even for
    the largest conceivable inputs. A second algorithm reduces communication
    overhead and uses more conventional specifications of the result at the cost of
    slightly increased I/O requirements.

  398. An algorithm for computing cutpoints in finite metric spaces.

    Authors: V. Moulton, A. Dress, K. T. Huber, J. Koolen, A. Spillner
    Subjects: Data Structures and Algorithms
    Abstract

    The theory of the tight span, a cell complex that can be associated to every
    metric $D$, offers a unifying view on existing approaches for analyzing
    distance data, in particular for decomposing a metric $D$ into a sum of simpler
    metrics as well as for representing it by certain specific edge-weighted
    graphs, often referred to as realizations of $D$. Many of these approaches
    involve the explicit or implicit computation of the so-called cutpoints of (the
    tight span of) $D$, such as the algorithm for computing the "building blocks"
    of optimal realizations of $D$ recently presented by A.

  399. On the Sample Complexity of Compressed Counting.

    Authors: Ping Li
    Subjects: Data Structures and Algorithms
    Abstract

    Compressed Counting (CC), based on maximally skewed stable random
    projections, was recently proposed for estimating the p-th frequency moments of
    data streams. The case p->1 is extremely useful for estimating Shannon entropy
    of data streams. In this study, we provide a very simple algorithm based on the
    sample minimum estimator and prove a much improved sample complexity bound,
    compared to prior results.

  400. Approximation Algorithms for Constrained Knapsack Problems.

    Authors: Glencora Borradaile, Brent Heeringa, Gordon Wilfong
    Subjects: Data Structures and Algorithms
    Abstract

    We study constrained versions of the knapsack problem in which dependencies
    between items are given by a graph. In one version, an item can be selected
    only if one of its neighbours is also selected. In the other version, an item
    can be selected only when all its neighbours are also selected. These problems
    generalize and unify several problems including the prize collecting and
    budgeted maximum coverage problems.

  401. Combining Approximation Algorithms for the Prize-Collecting TSP.

    Authors: Michel X. Goemans
    Subjects: Data Structures and Algorithms
    Abstract

    We present a 1.91457-approximation algorithm for the prize-collecting
    travelling salesman problem. This is obtained by combining a randomized variant
    of a rounding algorithm of Bienstock et al. and a primal-dual algorithm of
    Goemans and Williamson.

  402. Improved Analysis of a Max Cut Algorithm Based on Spectral Partitioning.

    Authors: Jos&#xe9; Soto
    Subjects: Data Structures and Algorithms
    Abstract

    Luca Trevisan presented an approximation algorithm for Max Cut based on
    spectral partitioning techniques. He proved that the algorithm has an
    approximation ratio of at least 0.531. We improve this bound up to 0.6142. We
    also define and extend this result for the more general Maximum Colored Cut
    problem.

  403. Exact Covers via Determinants.

    Authors: Andreas Bj&#xf6;rklund
    Subjects: Data Structures and Algorithms
    Abstract

    Given a k-uniform hypergraph on n vertices, partitioned in k equal parts such
    that every hyperedge includes one vertex from each part, the k-dimensional
    matching problem asks whether there is a disjoint collection of the hyperedges
    which covers all vertices. We show it can be solved by a randomized polynomial
    space algorithm in time O*(2^(n(k-2)/k)). The O*() notation hides factors
    polynomial in n and k.

  404. A 4/3-competitive randomised algorithm for online scheduling of packets with agreeable deadlines.

    Authors: Lukasz Jez
    Subjects: Data Structures and Algorithms
    Abstract

    In 2005 Li et al. gave a phi-competitive deterministic online algorithm for
    scheduling of packets with agreeable deadlines with a very interesting
    analysis. This is known to be optimal due to a lower bound by Hajek. We claim
    that the algorithm by Li et al. can be slightly simplified, while retaining its
    competitive ratio. Then we introduce randomness to the modified algorithm and
    argue that the competitive ratio against oblivious adversary is at most 4/3.
    Note that this still leaves a gap between the best known lower bound of 5/4 by
    Chin et al.

  405. Fast evaluation of interlace polynomials on graphs of bounded treewidth.

    Authors: Markus Bl&#xe4;ser, Christian Hoffmann
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the multivariate interlace polynomial introduced by Courcelle
    (2008), which generalizes several interlace polynomials defined by Arratia,
    Bollobas, and Sorkin (2004) and by Aigner and van der Holst (2004). We present
    an algorithm to evaluate the multivariate interlace polynomial of a graph with
    n vertices given a tree decomposition of the graph of width k. The best
    previously known result (Courcelle 2008) employs a general logical framework
    and leads to an algorithm with running time f(k)*n, where f(k) is doubly
    exponential in k.

  406. Algorithms for finding dispensable variables.

    Authors: Mikolas Janota, Joao Marques-Silva, Radu Grigore
    Subjects: Data Structures and Algorithms
    Abstract

    This short note reviews briefly three algorithms for finding the set of
    dispensable variables of a boolean formula. The presentation is light on proofs
    and heavy on intuitions.

  407. Finding Associations and Computing Similarity via Biased Pair Sampling.

    Authors: Andrea Campagna, Rasmus Pagh
    Subjects: Data Structures and Algorithms
    Abstract

    Sampling-based methods have previously been proposed for the problem of
    finding interesting associations in data, even for low-support items. While
    these methods do not guarantee precise results, they can be vastly more
    efficient than approaches that rely on exact counting. However, for many
    similarity measures no such methods have been known. In this paper we show how
    a wide variety of measures can be supported by a simple biased sampling method.
    The method also extends to find high-confidence association rules.

  408. Improved Hardness of Approximation for Stackelberg Shortest-Path Pricing.

    Authors: Sanjeev Khanna, Patrick Briest
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the Stackelberg shortest-path pricing problem, which is defined
    as follows. Given a graph G with fixed-cost and pricable edges and two distinct
    vertices s and t, we may assign prices to the pricable edges. Based on the
    predefined fixed costs and our prices, a customer purchases a cheapest s-t-path
    in G and we receive payment equal to the sum of prices of pricable edges
    belonging to the path. Our goal is to find prices maximizing the payment
    received from the customer.

  409. GPU sample sort.

    Authors: Nikolaj Leischner, Vitaly Osipov, Peter Sanders
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we present the design of a sample sort algorithm for manycore
    GPUs. Despite being one of the most efficient comparison-based sorting
    algorithms for distributed memory architectures its performance on GPUs was
    previously unknown. For uniformly distributed keys our sample sort is at least
    25% and on average 68% faster than the best comparison-based sorting algorithm,
    GPU Thrust merge sort, and on average more than 2 times faster than GPU
    quicksort.

  410. Finding Induced Subgraphs via Minimal Triangulations.

    Authors: Fedor V. Fomin, Yngve Villanger
    Subjects: Data Structures and Algorithms
    Abstract

    Potential maximal cliques and minimal separators are combinatorial objects
    which were introduced and studied in the realm of minimal triangulations
    problems including Minimum Fill-in and Treewidth.

  411. Fast Set Intersection and Two Patterns Matching.

    Authors: Hagai Cohen, Ely Porat
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we present a new problem, the fast set intersection problem,
    which is to preprocess a collection of sets in order to efficiently report the
    intersection of any two sets in the collection. In addition we suggest new
    solutions for the two-dimensional substring indexing problem and the document
    listing problem for two patterns by reduction to the fast set intersection
    problem.

  412. MACH: Fast Randomized Tensor Decompositions.

    Authors: Charalampos E. Tsourakakis
    Subjects: Data Structures and Algorithms
    Abstract

    Tensors naturally model many real world processes which generate multi-aspect
    data. Such processes appear in many different research disciplines, e.g,
    chemometrics, computer vision, psychometrics and neuroimaging analysis. Tensor
    decompositions such as the Tucker decomposition are used to analyze
    multi-aspect data and extract latent factors, which capture the multilinear
    data structure. Such decompositions are powerful mining tools, for extracting
    patterns from large data volumes.

  413. Combinatiorial Algorithms for Wireless Information Flow.

    Authors: Javad Ebrahimi, Christina Fragouli
    Subjects: Data Structures and Algorithms
    Abstract

    A long-standing open question in information theory is to characterize the
    unicast capacity of a wireless relay network. The difficulty arises due to the
    complex signal interactions induced in the network, since the wireless channel
    inherently broadcasts the signals and there is interference among
    transmissions.

  414. Planar Subgraph Isomorphism Revisited.

    Authors: Frederic Dorn
    Subjects: Data Structures and Algorithms
    Abstract

    The problem of Subgraph Isomorphism is defined as follows: Given a pattern H
    and a host graph G on n vertices, does G contain a subgraph that is isomorphic
    to H? Eppstein [SODA 95, J'GAA 99] gives the first linear time algorithm for
    subgraph isomorphism for a fixed-size pattern, say of order k, and arbitrary
    planar host graph, improving upon the O(n^\sqrt{k})-time algorithm when using
    the ``Color-coding'' technique of Alon et al [J'ACM 95]. Eppstein's algorithm
    runs in time k^O(k) n, that is, the dependency on k is superexponential.

  415. Exploration of Periodically Varying Graphs.

    Authors: Paola Flocchini, Bernard Mans, Nicola Santoro
    Subjects: Data Structures and Algorithms
    Abstract

    We study the computability and complexity of the exploration problem in a
    class of highly dynamic graphs: periodically varying (PV) graphs, where the
    edges exist only at some (unknown) times defined by the periodic movements of
    carriers. These graphs naturally model highly dynamic infrastructure-less
    networks such as public transports with fixed timetables, low earth orbiting
    (LEO) satellite systems, security guards' tours, etc. We establish necessary
    conditions for the problem to be solved.

  416. Breaking the 2^n-Barrier for Irredundance: A Parameterized Route to Solving Exact Puzzles.

    Authors: Ljiljana Brankovic, Henning Fernau, Joachim Kneis, Dieter Kratsch Alexander Langer Mathieu Liedloff Daniel Raible Peter Rossmanith
    Subjects: Data Structures and Algorithms
    Abstract

    The lower and the upper irredundance numbers of a graph $G$, denoted $ir(G)$
    and $IR(G)$ respectively, are conceptually linked to domination and
    independence numbers and have numerous relations to other graph parameters. It
    is a long-standing open question whether determining these numbers for a graph
    $G$ on $n$ vertices admits exact algorithms running in time less than the
    trivial $\Omega(2^n)$ enumeration barrier. We solve these open problems by
    devising parameterized algorithms for the dual of the natural parameterizations
    of the problems with running times faster than $O^*(4^{k})$.

  417. Beyond O*(2^n) in domination-type problems.

    Authors: Marek Cygan, Marcin Pilipczuk, Jakub Onufry Wojtaszczyk
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we provide algorithms faster than O*(2^n) for several
    NP-complete domination-type problems. More precisely, we provide: an algorithm
    for CAPACITATED DOMINATING SET that solves it in O(1.89^n), a branch-and-reduce
    algorithm solving LARGEST IRREDUNDANT SET in O(1.9657^n) time and a simple
    iterative-DFS algorithm for SMALLEST INCLUSION-MAXIMAL IRREDUNDANT SET that
    solves it in O(1.999956^n) time.

  418. Large-girth roots of graphs.

    Authors: Anna Adamaszek, Michal Adamaszek
    Subjects: Data Structures and Algorithms
    Abstract

    We study the problem of recognizing graph powers and computing roots of
    graphs. We provide a polynomial time recognition algorithm for r-th powers of
    graphs of girth at least 2r+3, thus improving a bound conjectured by Farzad et
    al. (STACS 2009). Our algorithm also finds all r-th roots of a given graph that
    have girth at least 2r+3 and no degree one vertices, which is a step towards a
    recent conjecture of Levenshtein that such root should be unique. On the
    negative side, we prove that recognition becomes an NP-complete problem when
    the bound on girth is about twice smaller.

  419. Efficient and Error-Correcting Data Structures for Membership and Polynomial Evaluation.

    Authors: Victor Chen, Elena Grigorescu, Ronald de Wolf
    Subjects: Data Structures and Algorithms
    Abstract

    We construct efficient data structures that are resilient against a constant
    fraction of adversarial noise. Our model requires that the decoder answers most
    queries correctly with high probability and for the remaining queries, the
    decoder with high probability either answers correctly or declares "don't
    know." Furthermore, if there is no noise on the data structure, it answers all
    queries correctly with high probability. Our model is the common generalization
    of a model proposed recently by de Wolf and the notion of "relaxed locally
    decodable codes" developed in the PCP literature.

  420. Packet Scheduling in a Size-Bounded Buffer.

    Authors: Fei Li
    Subjects: Data Structures and Algorithms
    Abstract

    We consider algorithms to schedule packets with values and deadlines in a
    size-bounded buffer. At any time, the buffer can store at most B packets.
    Packets arrive over time. Each packet has a non-negative value and an integer
    deadline. In each time step, at most one packet can be sent. The objective is
    to maximize the total value gained by delivering packets no later than their
    respective deadlines. This model generalizes the well-studied bounded-delay
    model (Hajek. CISS 2001. Kesselman et al. STOC 2001).

  421. Characterizing Truthful Multi-Armed Bandit Mechanisms.

    Authors: Moshe Babaioff, Yogeshwer Sharma, Aleksandrs Slivkins
    Subjects: Data Structures and Algorithms
    Abstract

    We consider a multi-round auction setting motivated by pay-per-click auctions
    for Internet advertising. In each round the auctioneer selects an advertiser
    and shows her ad, which is then either clicked or not. An advertiser derives
    value from clicks; the value of a click is her private information. Initially,
    neither the auctioneer nor the advertisers have any information about the
    likelihood of clicks on the advertisements. The auctioneer's goal is to design
    a (dominant strategies) truthful mechanism that (approximately) maximizes the
    social welfare.

  422. Perfect Matchings in O(n \log n) Time in Regular Bipartite Graphs.

    Authors: Ashish Goel, Michael Kapralov, Sanjeev Khanna
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we consider the well-studied problem of finding a perfect
    matching in a d-regular bipartite graph on 2n nodes with m=nd edges. The
    best-known algorithm for general bipartite graphs (due to Hopcroft and Karp)
    takes time O(m\sqrt{n}). In regular bipartite graphs, however, a matching is
    known to be computable in O(m) time (due to Cole, Ost and Schirra). In a recent
    line of work by Goel, Kapralov and Khanna the O(m) time algorithm was improved
    first to \tilde O(min{m, n^{2.5}/d}) and then to \tilde O(min{m, n^2/d}).

  423. AMS Without 4-Wise Independence on Product Domains.

    Authors: Vladimir Braverman, Rafail Ostrovsky
    Subjects: Data Structures and Algorithms
    Abstract

    Measuring independence and $k$-wise independence is a fundamental problem
    that has multiple applications and was the subject of intensive research during
    the last decade; In the streaming environment this problem was pioneered by
    Indyk and McGregor \cite{mi}. Indyk and McGregor state, as an explicit open
    question in their paper, the question of whether one can estimate $k$-wise
    independence on $k$-tuples for any $k> 2$.

  424. A note on the data-driven capacity of P2P networks.

    Authors: Jacob Chakareski, Pascal Frossard, Herv&#xe9; Kerivin, Jimmy Leblet, Gwendal Simon
    Subjects: Data Structures and Algorithms
    Abstract

    We consider two capacity problems in P2P networks. In the first one, the
    nodes have an infinite amount of data to send and the goal is to optimally
    allocate their uplink bandwidths such that the demands of every peer in terms
    of receiving data rate are met. We solve this problem through a mapping from a
    node-weighted graph featuring two labels per node to a max flow problem on an
    edge-weighted bipartite graph. In the second problem under consideration, the
    resource allocation is driven by the availability of the data resource that the
    peers are interested in sharing.

  425. FPT Algorithms for Connected Feedback Vertex Set.

    Authors: Neeldhara Misra, Geevarghese Philip, Venkatesh Raman, Saket Saurabh, Somnath Sikdar
    Subjects: Data Structures and Algorithms
    Abstract

    We study the recently introduced Connected Feedback Vertex Set (CFVS) problem
    from the view-point of parameterized algorithms. CFVS is the connected variant
    of the classical Feedback Vertex Set problem and is defined as follows: given a
    graph G=(V,E) and an integer k, decide whether there exists a subset F of V, of
    size at most k, such that G[V F] is a forest and G[F] is connected. We show
    that Connected Feedback Vertex Set can be solved in time $O(2^{O(k)}n^{O(1)})$
    on general graphs and in time $O(2^{O(\sqrt{k}\log k)}n^{O(1)})$ on graphs
    excluding a fixed graph H as a minor.

  426. Constant factor approximation to the bounded genus instances of ATSP.

    Authors: Amin Saberi, Shayan Oveis Gharan
    Subjects: Data Structures and Algorithms
    Abstract

    We give a constant factor approximation algorithm for the asymmetric
    traveling salesman problem when the underlying undirected graph of the
    Held-Karp linear programming relaxation of the problem has orientable bounded
    genus. Our result also implies the weak version Goddyn's conjecture on the
    existence of thin trees on graphs with orientable bounded genus.

  427. Memory Consistency Conditions for Self-Assembly Programming.

    Authors: Aaron Sterling
    Subjects: Data Structures and Algorithms
    Abstract

    Perhaps the two most significant theoretical questions about the programming
    of self-assembling agents are: (1) necessary and sufficient conditions to
    produce a unique terminal assembly, and (2) error correction. We address both
    questions, by reducing two well-studied models of tile assembly to models of
    distributed shared memory (DSM), in order to obtain results from the memory
    consistency systems induced by tile assembly systems when simulated in the DSM
    setting.

  428. Representations of Stream Processors Using Nested Fixed Points.

    Authors: Neil Ghani, Peter Hancock, Dirk Pattinson
    Subjects: Data Structures and Algorithms
    Abstract

    We define representations of continuous functions on infinite streams of
    discrete values, both in the case of discrete-valued functions, and in the case
    of stream-valued functions. We define also an operation on the representations
    of two continuous functions between streams that yields a representation of
    their composite.

  429. Crossing-Free Acyclic Hamiltonian Path Completion for Planar st-Digraphs.

    Authors: Tamara Mchedlidze, Antonios Symvonis
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper we study the problem of existence of a crossing-free acyclic
    hamiltonian path completion (for short, HP-completion) set for embedded upward
    planar digraphs. In the context of book embeddings, this question becomes:
    given an embedded upward planar digraph $G$, determine whether there exists an
    upward 2-page book embedding of $G$ preserving the given planar embedding.

  430. An Optimal Labeling Scheme for Ancestry Queries.

    Authors: Pierre Fraigniaud, Amos Korman
    Subjects: Data Structures and Algorithms
    Abstract

    An ancestry labeling scheme assigns labels (bit strings) to the nodes of
    rooted trees such that ancestry queries between any two nodes in a tree can be
    answered merely by looking at their corresponding labels. The quality of an
    ancestry labeling scheme is measured by its label size, that is the maximal
    number of bits in a label of a tree node.

  431. Generating All Partitions: A Comparison Of Two Encodings.

    Authors: Jerome Kelleher, Barry O&#x27;Sullivan
    Subjects: Data Structures and Algorithms
    Abstract

    Integer partitions may be encoded as either ascending or descending
    compositions for the purposes of systematic generation. Many algorithms exist
    to generate all descending compositions, yet none have previously been
    published to generate all ascending compositions. We develop three new
    algorithms to generate all ascending compositions and compare these with
    descending composition generators from the literature. We analyse the new
    algorithms and provide new and more precise analyses for the descending
    composition generators.

  432. Simple implementation of deletion from open-address hash table.

    Authors: Maxim Kolosovskiy
    Subjects: Data Structures and Algorithms
    Abstract

    Deletion from open-address hash table is not so easy as deletion from chained
    hash table, because in open-address table we can't simply mark a slot
    containing deleted key as empty. Search for keys may become incorrect. The
    classical method to implement deletion is to mark slots in hash table by three
    values: "free", "busy", "deleted". That method is easy to implement, but there
    are some disadvantages. In this article we consider alternative method of
    deletion keys, where we avoid using the mark "deleted". The article contains
    the implementation of the method in Java.

  433. Approximate Nearest Neighbor Search through Comparisons.

    Authors: Dominique Tschopp, Suhas Diggavi
    Subjects: Data Structures and Algorithms
    Abstract

    This paper addresses the problem of finding the nearest neighbor (or one of
    the R-nearest neighbors) of a query object q in a database of n objects. In
    contrast with most existing approaches, we can only access the ``hidden'' space
    in which the objects live through a similarity oracle. The oracle, given two
    reference objects and a query object, returns the reference object closest to
    the query object. The oracle attempts to model the behavior of human users,
    capable of making statements about similarity, but not of assigning meaningful
    numerical values to distances between objects.

  434. Straggler Identification in Round-Trip Data Streams via Newton's Identities and Invertible Bloom Filters.

    Authors: Michael T. Goodrich, David Eppstein
    Subjects: Data Structures and Algorithms
    Abstract

    We introduce the straggler identification problem, in which an algorithm must
    determine the identities of the remaining members of a set after it has had a
    large number of insertion and deletion operations performed on it, and now has
    relatively few remaining members. The goal is to do this in o(n) space, where n
    is the total number of identities. The straggler identification problem has
    applications, for example, in determining the set of unacknowledged packets in
    a high-bandwidth multicast data stream.

  435. Deterministic approximation for the cover time of trees.

    Authors: Uriel Feige, Ofer Zeitouni
    Subjects: Data Structures and Algorithms
    Abstract

    We present a deterministic algorithm that given a tree T with n vertices, a
    starting vertex v and a slackness parameter epsilon > 0, estimates within an
    additive error of epsilon the cover and return time, namely, the expected time
    it takes a simple random walk that starts at v to visit all vertices of T and
    return to v. The running time of our algorithm is polynomial in n/epsilon, and
    hence remains polynomial in n also for epsilon = 1/n^{O(1)}. We also show how
    the algorithm can be extended to estimate the expected cover (without return)
    time on trees.

  436. Computing alignment plots efficiently.

    Authors: Peter Krusche, Alexander Tiskin
    Subjects: Data Structures and Algorithms
    Abstract

    Dot plots are a standard method for local comparison of biological sequences.
    In a dot plot, a substring to substring distance is computed for all pairs of
    fixed-size windows in the input strings. Commonly, the Hamming distance is used
    since it can be computed in linear time. However, the Hamming distance is a
    rather crude measure of string similarity, and using an alignment-based edit
    distance can greatly improve the sensitivity of the dot plot method.

  437. Paired approximation problems and incompatible inapproximabilities.

    Authors: David Eppstein
    Subjects: Data Structures and Algorithms
    Abstract

    This paper considers pairs of optimization problems that are defined from a
    single input and for which it is desired to find a good approximation to either
    one of the problems. In many instances, it is possible to efficiently find an
    approximation of this type that is better than known inapproximability lower
    bounds for either of the two individual optimization problems forming the pair.
    In particular, we find either a $(1+\epsilon)$-approximation to $(1,2)$-TSP or
    a $1/\epsilon$-approximation to maximum independent set, from a given graph, in
    linear time.

  438. Fast C-K-R Partitions of Sparse Graphs.

    Authors: Manor Mendel, Chaya Schwob
    Subjects: Data Structures and Algorithms
    Abstract

    We present fast algorithms for constructing probabilistic embeddings and
    approximate distance oracles in sparse graphs. The main ingredient is a fast
    algorithm for sampling the probabilistic partitions of Calinescu, Karloff, and
    Rabani in sparse graphs.

  439. Randomized Shellsort: A Simple Oblivious Sorting Algorithm.

    Authors: Michael T. Goodrich
    Subjects: Data Structures and Algorithms
    Abstract

    In this paper, we describe randomized Shellsort--a simple, randomized,
    data-oblivious version of the Shellsort algorithm that always runs in O(n log
    n) time and, as we show, succeeds in sorting any given input permutation with
    very high probability. Thus, randomized Shellsort is simultaneously simple,
    time-optimal, and data-oblivious. Taken together, these properties imply
    applications in the design of new efficient privacy-preserving computations
    based on the secure multi-party computation (SMC) paradigm.

  440. The Euler Path to Static Level-Ancestors.

    Authors: Amir M. Ben-Amram
    Subjects: Data Structures and Algorithms
    Abstract

    Suppose that a rooted tree T is given for preprocessing. The Level-Ancestor
    Problem is to answer quickly queries of the following form. Given a vertex v
    and an integer i > 0, find the i-th vertex on the path from the root to v.
    Algorithms that achieve a linear time bound for preprocessing and a constant
    time bound for a query have been published by Dietz (1991), Alstrup and Holm
    (2000), and Bender and Farach (2002). The first two algorithms address dynamic
    versions of the problem; the last addresses the static version only and is the
    simplest so far.

  441. A Randomized Rounding Algorithm for the Asymmetric Traveling Salesman Problem.

    Authors: Michel X. Goemans, Nicholas J. A. Harvey, Kamal Jain, Mohit Singh
    Subjects: Data Structures and Algorithms
    Abstract

    We present an algorithm for the asymmetric traveling salesman problem on
    instances which satisfy the triangle inequality. Like several existing
    algorithms, it achieves approximation ratio O(log n). Unlike previous
    algorithms, it uses randomized rounding.

  442. Constraint Minimum Vertex Cover in K Partite Graph, Approximation Algorithm and Complexity Analysis.

    Authors: Kamanashis Biswas, S. A. M. Harun
    Subjects: Data Structures and Algorithms
    Abstract

    Generally, a graph G, an independent set is a subset S of vertices in G such
    that no two vertices in S are adjacent (connected by an edge) and a vertex
    cover is a subset S of vertices such that each edge of G has at least one of
    its endpoints in S. Again, the minimum vertex cover problem is to find a vertex
    cover with the smallest number of vertices. This study shows that the
    constrained minimum vertex cover problem in k-partite graph (MIN CVCK) is
    NP-Complete which is an important property of k partite graph.

  443. Online Algorithms for Self-Organizing Sequential Search - A Survey.

    Authors: Rakesh Mohanty, N. S. Narayanaswamy
    Subjects: Data Structures and Algorithms
    Abstract

    The main objective of this survey is to present the important theoretical and
    experimental results contributed till date in the area of online algorithms for
    the self organizing sequential search problem, also popularly known as the List
    Update Problem(LUP) in a chronological way. The survey includes competitiveness
    results of deterministic and randomized online algorithms and complexity
    results of optimal off line algorithms for the list update problem.

  444. An Oblivious O(1)-Approximation for Single Source Buy-at-Bulk.

    Authors: Ashish Goel, Ian Post
    Subjects: Data Structures and Algorithms
    Abstract

    We consider the single-source (or single-sink) buy-at-bulk problem with an
    unknown concave cost function. We want to route a set of demands along a graph
    to or from a designated root node, and the cost of routing x units of flow
    along an edge is proportional to some concave, non-decreasing function f such
    that f(0) = 0. We present a polynomial time algorithm that finds a distribution
    over trees such that the expected cost of a tree for any f is within an
    O(1)-factor of the optimum cost for that f.

  445. Notes on large angle crossing graphs.

    Authors: Vida Dujmovic, Joachim Gudmundsson, Pat Morin, Thomas Wolle
    Subjects: Data Structures and Algorithms
    Abstract

    A graph G is an a-angle crossing (aAC) graph if every pair of crossing edges
    in G intersect at an angle of at least a. The concept of right angle crossing
    (RAC) graphs (a=Pi/2) was recently introduced by Didimo et. al. It was shown
    that any RAC graph with n vertices has at most 4n-10 edges and that there are
    infinitely many values of n for which there exists a RAC graph with n vertices
    and 4n-10 edges. In this paper, we give upper and lower bounds for the number
    of edges in aAC graphs for all 0 < a < Pi/2.

  446. In-packet Bloom filters: Design and networking applications.

    Authors: Christian Esteve Rothenberg, Carlos Macapuna, Alexander Wiesmaier
    Subjects: Data Structures and Algorithms
    Abstract

    Bloom filters are well-known space-efficient data structures that answer set
    membership queries with some probability of false positives. In an attempt to
    solve many of the limitations of current inter-networking architectures, some
    recent proposals rely on including small Bloom filters in packet headers for
    routing, security, accountability or other purposes that push application
    states into the packets themselves. In this paper, we consider the design of
    in-packet Bloom filters (iBF).

RSS-материал