Distributed

  1. Iterative Approximate Byzantine Consensus in Arbitrary Directed Graphs.

    Authors: Guanfeng Liang, Nitin Vaidya, Lewis Tseng
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we explore the problem of iterative approximate Byzantine
    consensus in arbitrary directed graphs. In particular, we prove a necessary and
    sufficient condition for the existence of iterative byzantine consensus
    algorithms. Additionally, we use our sufficient condition to examine whether
    such algorithms exist for some specific graphs.

  2. A Study on Using Uncertain Time Series Matching Algorithms in Map-Reduce Applications.

    Authors: Nikzad Babaii Rizvandi, Javid Taheri, Albert Y. Zomaya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we study CPU utilization time patterns of several Map-Reduce
    applications. After extracting running patterns of several applications, the
    patterns with their statistical information are saved in a reference database
    to be later used to tweak system parameters to efficiently execute unknown
    applications in future. To achieve this goal, CPU utilization patterns of new
    applications along with its statistical information are compared with the
    already known ones in the reference database to find/predict their most
    probable execution patterns.

  3. Relativistic causality and clockless circuits.

    Authors: Philippe Matherat, Marc-Thierry Jaekel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Time plays a crucial role in the performance of computing systems. The
    accurate modelling of logical devices, and of their physical implementations,
    requires an appropriate representation of time and of all properties that
    depend on this notion. The need for a proper model, particularly acute in the
    design of clockless delay-insensitive (DI) circuits, leads one to reconsider
    the classical descriptions of time and of the resulting order and causal
    relations satisfied by logical operations.

  4. PC-Cluster based Storage System Architecture for Cloud Storage.

    Authors: Tin Tin Yee, Thinn Thu Naing
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Design and architecture of cloud storage system plays a vital role in cloud
    computing infrastructure in order to improve the storage capacity as well as
    cost effectiveness. Usually cloud storage system provides users to efficient
    storage space with elasticity feature. One of the challenges of cloud storage
    system is difficult to balance the providing huge elastic capacity of storage
    and investment of expensive cost for it.

  5. Programmable Cellular Automata Based Efficient Parallel AES Encryption Algorithm.

    Authors: Debasis Das, Rajiv Misra
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cellular Automata(CA) is a discrete computing model which provides simple,
    flexible and efficient platform for simulating complicated systems and
    performing complex computation based on the neighborhoods information. CA
    consists of two components 1) a set of cells and 2) a set of rules .
    Programmable Cellular Automata(PCA) employs some control signals on a Cellular
    Automata(CA) structure.

  6. Distributed Signal Processing via Chebyshev Polynomial Approximation.

    Authors: Pascal Frossard, Pierre Vandergheynst, David I Shuman
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Unions of graph multiplier operators are an important class of linear
    operators for processing signals defined on graphs. We present a novel method
    to efficiently distribute the application of these operators to
    high-dimensional signals. The proposed method features approximations of the
    graph multipliers by shifted Chebyshev polynomials, whose recurrence relations
    make them readily amenable to distributed computation.

  7. Pretty Private Group Management.

    Authors: Olivier Heen, Erwan Le Merrer, Christoph Neumann, Stéphane Onno
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Group management is a fundamental building block of today's Internet
    applications. Mailing lists, chat systems, collaborative document edition but
    also online social networks such as Facebook and Twitter use group management
    systems. In many cases, group security is required in the sense that access to
    data is restricted to group members only. Some applications also require
    privacy by keeping group members anonymous and unlinkable. Group management
    systems routinely rely on a central authority that manages and controls the
    infrastructure and data of the system.

  8. A Low-Energy Fast Cyber Foraging Mechanism for Mobile Devices.

    Authors: Somayeh Kafaie, Omid Kashefi, Mohsen Sharifi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The ever increasing demands for using resource-constrained mobile devices for
    running more resource intensive applications nowadays has initiated the
    development of cyber foraging solutions that offload parts or whole
    computational intensive tasks to more powerful surrogate stationary computers
    and run them on behalf of mobile devices as required.

  9. Data Integrity and Dynamic Storage Way in Cloud Computing.

    Authors: C. Dinesh
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    It is not an easy task to securely maintain all essential data where it has
    the need in many applications for clients in cloud. To maintain our data in
    cloud, it may not be fully trustworthy because client doesn't have copy of all
    stored data. But any authors don't tell us data integrity through its user and
    CSP level by comparison before and after the data update in cloud. So we have
    to establish new proposed system for this using our data reading protocol
    algorithm to check the integrity of data before and after the data insertion in
    cloud.

  10. Secured Data Consistency and Storage Way in Untrusted Cloud using Server Management Algorithm.

    Authors: C. Dinesh
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    It is very challenging part to keep safely all required data that are needed
    in many applications for user in cloud. Storing our data in cloud may not be
    fully trustworthy. Since client doesn't have copy of all stored data, he has to
    depend on Cloud Service Provider.

  11. Game Theoretic Iterative Partitioning for Dynamic Load Balancing in Distributed Network Simulation.

    Authors: Christopher Griffin, George Kesidis, Aditya Kurve, David J. Miller
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    High fidelity simulation of large-sized complex networks can be realized on a
    distributed computing platform that leverages the combined resources of
    multiple processors or machines. In a discrete event driven simulation, the
    assignment of logical processes (LPs) to machines is a critical step that
    affects the computational and communication burden on the machines, which in
    turn affects the simulation execution time of the experiment. We study a
    network partitioning game wherein each node (LP) acts as a selfish player.

  12. Computing Optimal Cycle Mean in Parallel on CUDA.

    Authors: Jiří Barnat, Luboš Brim, Milan Češka, Petr Bauch
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Computation of optimal cycle mean in a directed weighted graph has many
    applications in program analysis, performance verification in particular. In
    this paper we propose a data-parallel algorithmic solution to the problem and
    show how the computation of optimal cycle mean can be efficiently accelerated
    by means of CUDA technology. We show how the problem of computation of optimal
    cycle mean is decomposed into a sequence of data-parallel graph computation
    primitives and show how these primitives can be implemented and optimized for
    CUDA computation.

  13. Parallel implematation of flow and matching algorithms.

    Authors: Agnieszka Łupińska
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In our work we present two parallel algorithms and their lock-free
    implementations using a popular GPU environment Nvidia CUDA. The first
    algorithm is the push-relabel method for the flow problem in grid graphs. The
    second is the cost scaling algorithm for the assignment problem in complete
    bipartite graphs.

  14. Parallel Binomial American Option Pricing with (and without) Transaction Costs.

    Authors: Alet Roux, Tomasz Zastawniak, Nan Zhang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a parallel algorithm that computes the ask and bid prices of an
    American option when proportional transaction costs apply to the trading of the
    underlying asset. The algorithm computes the prices on recombining binomial
    trees, and is designed for modern multi-core processors. Although parallel
    option pricing has been well studied, none of the existing approaches takes
    transaction costs into consideration. The algorithm that we propose partitions
    a binomial tree into blocks.

  15. Failure Detectors in Homonymous Distributed Systems (with an Application to Consensus).

    Authors: Antonio Fernández Anta, Sergio Arévalo, Damien Imbs, Ernesto Jiménez, Michel Raynal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper addresses the consensus problem in homonymous distributed systems
    where processes are prone to crash failures and have no initial knowledge of
    the system membership ("homonymous" means that several processes may have the
    same identifier). New classes of failure detectors suited to these systems are
    first defined. Among them, the classes H\Omega\ and H\Sigma\ are introduced
    that are the homonymous counterparts of the classes \Omega\ and \Sigma,
    respectively.

  16. Optimal Joint Multiple Resource Allocation Method for Cloud Computing Environments.

    Authors: Shin-ichi Kuribayashi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing is a model for enabling convenient, on-demand network access
    to a shared pool of configurable computing resources. To provide cloud
    computing services economically, it is important to optimize resource
    allocation under the assumption that the required resource can be taken from a
    shared resource pool. In addition, to be able to provide processing ability and
    storage capacity, it is necessary to allocate bandwidth to access them at the
    same time.

  17. Scalable Grid Resource Discovery through Distributed Search.

    Authors: Alexander Ferworn, Fouad Butt, Syed Saadat Bokhari, Abdolreza Abhari
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper proposes a simple and scalable web-based model for grid resource
    discovery for the Internet. The resource discovery model contains the metadata
    and resource finder web services. The information of resource finder web
    services is kept in the repositories that are distributed in the application
    layer of Internet. The resource finder web services will be discovered by
    sending queries to the repositories in a similar way as the DNS protocol. The
    underlying technology for implementation of the two architectures of this model
    is introduced.

  18. A Non-MDS Erasure Code Scheme For Storage Applications.

    Authors: Soroush Akhlaghi, Abbas Kiani
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper investigates the use of redundancy and self repairing against node
    failures in distributed storage systems, using various strategies. In
    replication method, access to one replication node is sufficient to reconstruct
    a lost node, while in MDS erasure coded systems which are optimal in terms of
    redundancy-reliability tradeoff, a single node failure is repaired after
    recovering the entire stored data. Moreover, regenerating codes yield a
    tradeoff curve between storage capacity and repair bandwidth. The current paper
    aims at investigating a new storage code.

  19. An Application Driven Analysis of the ParalleX Execution Model.

    Authors: Matthew Anderson, Maciej Brodowicz, Hartmut Kaiser, Thomas Sterling
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Exascale systems, expected to emerge by the end of the next decade, will
    require the exploitation of billion-way parallelism at multiple hierarchical
    levels in order to achieve the desired sustained performance. The task of
    assessing future machine performance is approached by identifying the factors
    which currently challenge the scalability of parallel applications.

  20. Improving the scalability of parallel N-body applications with an event driven constraint based execution model.

    Authors: Chirag Dekate, Matthew Anderson, Maciej Brodowicz, Hartmut Kaiser, Bryce Adelstein-Lelbach, Thomas Sterling
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The scalability and efficiency of graph applications are significantly
    constrained by conventional systems and their supporting programming models.
    Technology trends like multicore, manycore, and heterogeneous system
    architectures are introducing further challenges and possibilities for emerging
    application domains such as graph applications. This paper explores the space
    of effective parallel execution of ephemeral graphs that are dynamically
    generated using the Barnes-Hut algorithm to exemplify dynamic workloads.

  21. Cost of Virtual Machine Live Migration in Clouds: A Performance Evaluation.

    Authors: Rajkumar Buyya, William Voorsluys, James Broberg, Srikumar Venugopal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Virtualization has become commonplace in modern data centers, often referred
    as "computing clouds". The capability of virtual machine live migration brings
    benefits such as improved performance, manageability and fault tolerance, while
    allowing workload movement with a short service downtime. However, service
    levels of applications are likely to be negatively affected during a live
    migration. For this reason, a better understanding of its effects on system
    performance is desirable.

  22. Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments.

    Authors: Aydin Buluc, John Gilbert
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Generalized sparse matrix-matrix multiplication (or SpGEMM) is a key
    primitive for many high performance graph algorithms as well as for some linear
    solvers, such as algebraic multigrid. Here we show that SpGEMM also yields
    efficient algorithms for general sparse-matrix indexing in distributed memory,
    provided that the underlying SpGEMM implementation is sufficiently flexible and
    scalable.

  23. An Overview of Codes Tailor-made for Networked Distributed Data Storage.

    Authors: Frederique Oggier, Anwitaman Datta
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The continuously increasing amount of digital data generated by today's
    society asks for better storage solutions.

  24. Optimization and Evaluation of a Multimedia Streaming Service on Hybrid Telco cloud.

    Authors: Trong Duong Quoc, Heiko Perkuhn, Daniel Catrein, Uwe Naumann, Toni Anwar
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    With recent developments in cloud computing, a paradigm shift from rather
    static deployment of resources to more dynamic, on-demand practices means more
    flexibility and better utilization of resources. This demands new ways to
    efficiently configure networks.

  25. Fast Clustering using MapReduce.

    Authors: Alina Ene, Benjamin Moseley, Sungjin Im
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Clustering problems have numerous applications and are becoming more
    challenging as the size of the data increases. In this paper, we consider
    designing clustering algorithms that can be used in MapReduce, the most popular
    programming environment for processing large datasets. We focus on the
    practical and popular clustering problems, $k$-center and $k$-median. We
    develop fast clustering algorithms with constant factor approximation
    guarantees.

  26. Quality Up in Polynomial Homotopy Continuation by Multithreaded Path Tracking.

    Authors: Jan Verschelde, Genady Yoffe
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Speedup measures how much faster we can solve the same problem using many
    cores. If we can afford to keep the execution time fixed, then quality up
    measures how much better the solution will be computed using many cores. In
    this paper we describe our multithreaded implementation to track one solution
    path defined by a polynomial homotopy. Limiting quality to accuracy and
    confusing accuracy with precision, we strive to offset the cost of
    multiprecision arithmetic running multithreaded code on many cores.

  27. Information Dissemination in Unknown Radio networks with Large Labels.

    Authors: Shailesh Vaya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider the problems of deterministic broadcasting and gossiping in
    completely unknown ad-hoc radio networks. We assume that nothing is known to
    the nodes about the topology or even the size of the network, $n$, except that
    $n > 1$. Protocols for vanilla model, when $n$ is known, may be run for
    increasingly larger estimates $2^i$ on the size of the network, but one cannot
    determine when such a protocol should terminate. Thus, to carry this design
    paradigm, successful completion or in-completion of the process should be
    detected, and this knowledge circulated in the network.

  28. Edge-preserving self-healing: keeping network backbones densely connected.

    Authors: Atish Das Sarma, Amitabh Trehan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Healing algorithms play a crucial part in distributed P2P networks where
    failures occur continuously and frequently. Several self-healing algorithms
    have been suggested recently [IPDPS'08, PODC'08, PODC'09, PODC'11] in a line of
    work that has yielded gradual improvements in the properties ensured on the
    graph. This work motivates a strong general phenomenon of edge-preserving
    healing that aims at obtaining self-healing algorithms with the constraint that
    all original edges in the graph (not deleted by the adversary), be retained in
    every intermediate graph.

  29. Network Localization on Unit Disk Graphs.

    Authors: Jing Li, Phisan Kaewprapha, Nattakan Puttarak
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We study the problem of cooperative localization of a large network of nodes
    in integer-coordinated unit disk graphs, a simplified but useful version of
    general random graph. Exploiting the property that the radius $r$ sets clear
    cut on the connectivity of two nodes, we propose an essential philosophy that
    "no connectivity is also useful information just like the information being
    connected" in unit disk graphs.

  30. High-Performance Pseudo-Random Number Generation on Graphics Processing Units.

    Authors: Richard P. Brent, Nimalan Nandapalan, Lawrence M. Murray, Alistair Rendell
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This work considers the deployment of pseudo-random number generators (PRNGs)
    on graphics processing units (GPUs), developing an approach based on the
    xorgens generator to rapidly produce pseudo-random numbers of high statistical
    quality. The chosen algorithm has configurable state size and period, making it
    ideal for tuning to the GPU architecture. We present a comparison of both speed
    and statistical quality with other common parallel, GPU-based PRNGs,
    demonstrating favourable performance of the xorgens-based approach.

  31. Context-Capture Multi-Valued Decision Fusion With Fault Tolerant Capability For Wireless Sensor Networks.

    Authors: Jun Wu, Shigeru Shimamoto
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Wireless sensor networks (WSNs) are usually utilized to perform decision
    fusion of event detection. Current decision fusion schemes are based on binary
    valued decision and do not consider bursty contextcapture. However, bursty
    context and multi-valued data are important characteristics of WSNs. One on
    hand, the local decisions from sensors usually have bursty and contextual
    characteristics. Fusion center must capture the bursty context information from
    the sensors.

  32. Distance-based Node Sampling using Drifting Random Walks.

    Authors: Antonio Fernández Anta, Andrés Sevilla, Alberto Mozo
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Sampling a large network with a given distribution has been identified as a
    useful operation to build network overlays. For example, sampling nodes with
    uniform probability is the cornerstone of epidemic information spreading, and
    constructing small world network topologies can be done by sampling with a
    probability that depends on the distance to a given node. In this paper we
    describe distributed algorithms for sampling networks, so that a node is
    selected with a probability that is a function of the distance of the node to a
    special node, called the \emph{source}.

  33. Consensus vs Broadcast in Communication Networks with Arbitrary Mobile Omission Faults.

    Authors: Emmanuel Godard, Joseph Peters
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We compare the solvability of the Consensus and Broadcast problems in
    synchronous communication networks in which the delivery of messages is not
    reliable. The failure model is the mobile omission faults model. During each
    round, some messages can be lost and the set of possible simultaneous losses is
    the same for each round. We investigate these problems for the first time for
    arbitrary sets of possible failures. Previously, these sets were defined by
    bounding the numbers of failures.

  34. Byzantine Fault Tolerance of Regenerating Codes.

    Authors: Frédérique Oggier, Anwitaman Datta
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Recent years have witnessed a slew of coding techniques custom designed for
    networked storage systems. Network coding inspired regenerating codes are the
    most prolifically studied among these new age storage centric codes. A lot of
    effort has been invested in understanding the fundamental achievable trade-offs
    of storage and bandwidth usage to maintain redundancy in presence of different
    models of failures, showcasing the efficacy of regenerating codes with respect
    to traditional erasure coding techniques.

  35. Workload Classification & Software Energy Measurement for Efficient Scheduling on Private Cloud Platforms.

    Authors: Ian Sommerville, James W. Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    At present there are a number of barriers to creating an energy efficient
    workload scheduler for a Private Cloud based data center. Firstly, the
    relationship between different workloads and power consumption must be
    investigated. Secondly, current hardware-based solutions to providing energy
    usage statistics are unsuitable in warehouse scale data centers where low cost
    and scalability are desirable properties.

  36. Parallel and Distributed Simulation from Many Cores to the Public Cloud (Extended Version).

    Authors: Gabriele D'Angelo
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this tutorial paper, we will firstly review some basic simulation concepts
    and then introduce the parallel and distributed simulation techniques in view
    of some new challenges of today and tomorrow. More in particular, in the last
    years there has been a wide diffusion of many cores architectures and we can
    expect this trend to continue. On the other hand, the success of cloud
    computing is strongly promoting the everything as a service paradigm. Is
    parallel and distributed simulation ready for these new challenges?

  37. A Cloud-based Approach for Context Information Provisioning.

    Authors: Elarbi Badidi, Larbi Esmahi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As a result of the phenomenal proliferation of modern mobile Internet-enabled
    devices and the widespread utilization of wireless and cellular data networks,
    mobile users are increasingly requiring services tailored to their current
    context. High-level context information is typically obtained from context
    services that aggregate raw context information sensed by various sensors and
    mobile devices.

  38. Chebyshev Polynomial Approximation for Distributed Signal Processing.

    Authors: Pascal Frossard, Pierre Vandergheynst, David I Shuman
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Unions of graph Fourier multipliers are an important class of linear
    operators for processing signals defined on graphs. We present a novel method
    to efficiently distribute the application of these operators to the
    high-dimensional signals collected by sensor networks. The proposed method
    features approximations of the graph Fourier multipliers by shifted Chebyshev
    polynomials, whose recurrence relations make them readily amenable to
    distributed computation.

  39. Dissecting a Small InfiniBand Application Using the Verbs API.

    Authors: Gregory Kerr
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    InfiniBand is a switched fabric interconnect. The InfiniBand specification
    does not define an API. However the OFED package, libibverbs, has become the
    default API on Linux and Solaris systems. Sparse documentation exists for the
    verbs API. The simplest InfiniBand program provided by OFED, ibv_rc_pingpong,
    is about 800 lines long. The semantics of using the verbs API for this program
    is not obvious to the first time reader. This paper will dissect the
    ibv_rc_pingpong program in an attempt to make clear to users how to interact
    with verbs.

  40. Performance improvement of an optical network providing services based on multicast.

    Authors: Vincent Reinhard, Johanne Cohen, Joanna Tomasik, Dominique Barth, Marc-Antoine Weisser
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Operators of networks covering large areas are confronted with demands from
    some of their customers who are virtual service providers. These providers may
    call for the connectivity service which fulfils the specificity of their
    services, for instance a multicast transition with allocated bandwidth. On the
    other hand, network operators want to make profit by trading the connectivity
    service of requested quality to their customers and to limit their
    infrastructure investments (or do not invest anything at all).

  41. LIKWID: Lightweight Performance Tools.

    Authors: Jan Treibig, Gerhard Wellein, Georg Hager
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Exploiting the performance of today's microprocessors requires intimate
    knowledge of the microarchitecture as well as an awareness of the ever-growing
    complexity in thread and cache topology. LIKWID is a set of command line
    utilities that addresses four key problems: Probing the thread and cache
    topology of a shared-memory node, enforcing thread-core affinity on a program,
    measuring performance counter metrics, and microbenchmarking for reliable upper
    performance bounds.

  42. Parallel Breadth-First Search on Distributed Memory Systems.

    Authors: Aydin Buluc, Kamesh Madduri
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Data-intensive, graph-based computations are pervasive in several scientific
    applications, and are known to to be quite challenging to implement on
    distributed memory systems. In this work, we explore the design space of
    parallel algorithms for Breadth-First Search (BFS), a key subroutine in several
    graph algorithms.

  43. Platforms for Building and Deploying Applications for Cloud Computing.

    Authors: Rajkumar Buyya, Karthik Sukumar
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing is rapidly emerging as a new paradigm for delivering IT
    services as utlity-oriented services on subscription-basis. The rapid
    development of applications and their deployment in Cloud computing
    environments in efficient manner is a complex task.

  44. Communication Optimalement Stabilisante sur Canaux non Fiables et non FIFO.

    Authors: Swan Dubois, Sébastien Tixeuil, Shlomi Dolev, Maria Potop-Butucaru
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A self-stabilizing protocol has the capacity to recover a legitimate behavior
    whatever is its initial state. The majority of works in self-stabilization
    assume a shared memory model or a communication using reliable and FIFO
    channels. In this article, we interest in self-stabilizing systems using
    bounded but non reliable and non FIFO channels. We propose a stabilizing
    communication protocol with optimal fault resilience. In more details, this
    protocol simulates a reliable and FIFO channel and ensures a minimal number of
    looses, duplications, creations, and re-ordering of messages.

  45. Extending and Implementing the Self-adaptive Virtual Processor for Distributed Memory Architectures.

    Authors: Michiel W. van Tol, Juha Koivisto
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Many-core architectures of the future are likely to have distributed memory
    organizations and need fine grained concurrency management to be used
    effectively. The Self-adaptive Virtual Processor (SVP) is an abstract
    concurrent programming model which can provide this, but the model and its
    current implementations assume a single address space shared memory.

  46. Parallel calculation of the median and order statistics on GPUs with application to robust regression.

    Authors: Gleb Beliakov
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present and compare various approaches to a classical selection problem on
    Graphics Processing Units (GPUs). The selection problem consists in selecting
    the $k$-th smallest element from an array of size $n$, called $k$-th order
    statistic. We focus on calculating the median of a sample, the $n/2$-th order
    statistic. We introduce a new method based on minimization of a convex
    function, and show its numerical superiority when calculating the order
    statistics of very large arrays on GPUs.

  47. Capacity of Byzantine Consensus with Capacity-Limited Point-to-Point Links.

    Authors: Guanfeng Liang, Nitin Vaidya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider the problem of maximizing the throughput of Byzantine consensus,
    when communication links have finite capacity. Byzantine consensus is a
    classical problem in distributed computing. In existing literature, the
    communication links are implicitly assumed to have infinite capacity. The
    problem changes significantly when the capacity of links is finite. We define
    the throughput and capacity of consensus, and identify upper bound of
    achievable consensus throughput.

  48. The Space Complexity of Long-lived and One-Shot Timestamp Implementations.

    Authors: Maryam Helmi, Lisa Higham, Eduardo Pacheco, Philipp Woelfel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper is concerned with the problem of implementing an unbounded
    timestamp object from multi-writer atomic registers, in an asynchronous
    distributed system of n processors with distinct identifiers where timestamps
    are taken from an arbitrary universe. Ellen, Fatourou and Ruppert (2008) showed
    that sqrt{n}/2-O(1) registers are required for any obstruction-free
    implementation of long-lived timestamp systems from atomic registers (meaning
    processors can repeatedly get timestamps). We improve this existing lower bound
    in two ways.

  49. Agent Based Processing of Global Evaluation Function.

    Authors: M. Shahriar Hossain, M. Muztaba Fuad, Md. Mahbubul Alam Joarder
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Load balancing across a networked environment is a monotonous job. Moreover,
    if the job to be distributed is a constraint satisfying one, the distribution
    of load demands core intelligence. This paper proposes parallel processing
    through Global Evaluation Function by means of randomly initialized agents for
    solving Constraint Satisfaction Problems.

  50. Triangular Dynamic Architecture for Distributed Computing in a LAN Environment.

    Authors: M. Shahriar Hossain, Kazi Muhammad Najmul Hasan Khan, M. Muztaba Fuad, Debzani Deb
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A computationally intensive large job, granulized to concurrent pieces and
    operating in a dynamic environment should reduce the total processing time.
    However, distributing jobs across a networked environment is a tedious and
    difficult task. Job distribution in a Local Area Network based on Triangular
    Dynamic Architecture (TDA) is a mechanism that establishes a dynamic
    environment for job distribution, load balancing and distributed processing
    with minimum interaction from the user.

  51. Analysis of Randomized Work Stealing with False Sharing.

    Authors: Richard Cole, Vijaya Ramachandran
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper analyzes the cache miss cost of algorithms when scheduled using
    randomized work stealing (RWS) in a parallel environment, taking into account
    the effects of false sharing.

    First, prior analyses (due to Acar et al.) are extended to incorporate false
    sharing. However, to control the possible delays due to false sharing, some
    restrictions on the algorithms seem necessary. Accordingly, the class of
    Hierarchical Tree algorithms is introduced and their performance analyzed.

  52. Self-Stabilization, Byzantine Containment, and Maximizable Metrics: Necessary Conditions.

    Authors: Swan Dubois, Sébastien Tixeuil, Toshimitsu Masuzawa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Self-stabilization is a versatile approach to fault-tolerance since it
    permits a distributed system to recover from any transient fault that
    arbitrarily corrupts the contents of all memories in the system. Byzantine
    tolerance is an attractive feature of distributed systems that permits to cope
    with arbitrary malicious behaviors. We consider the well known problem of
    constructing a maximum metric tree in this context. Combining these two
    properties leads to some impossibility results.

  53. On the Cost of Concurrency in Transactional Memory.

    Authors: Petr Kuznetsov, Srivatsan Ravi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The crux of software transactional memory (STM) is to combine an easy-to-use
    programming interface with an efficient utilization of the concurrent computing
    abilities provided by modern machines. But does this combination come with an
    inherent cost? We evaluate the cost of concurrency by measuring the amount of
    expensive synchronization that must be employed in an STM implementation that
    ensures positive concurrency, i.e., allows for concurrent transaction
    processing in some executions. We consider two popular progress conditions:
    progressiveness and permissiveness.

  54. Towards Autonomic Service Provisioning Systems.

    Authors: Michele Mazzucco
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper discusses our experience in building SPIRE, an autonomic system
    for service provision. The architecture consists of a set of hosted Web
    Services subject to QoS constraints, and a certain number of servers used to
    run session-based traffic. Customers pay for having their jobs run, but require
    in turn certain quality guarantees: there are different SLAs specifying charges
    for running jobs and penalties for failing to meet promised performance
    metrics. The system is driven by an utility function, aiming at optimizing the
    average earned revenue per unit time.

  55. Maximizing Cloud Providers Revenues via Energy Aware Allocation Policies.

    Authors: Michele Mazzucco, Dmytro Dyachuk, Ralph Deters
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud providers, like Amazon, offer their data centers' computational and
    storage capacities for lease to paying customers. High electricity consumption,
    associated with running a data center, not only reflects on its carbon
    footprint, but also increases the costs of running the data center itself. This
    paper addresses the problem of maximizing the revenues of Cloud providers by
    trimming down their electricity costs. As a solution allocation policies which
    are based on the dynamic powering servers on and off are introduced and
    evaluated.

  56. A Tight Lower Bound on Distributed Random Walk Computation.

    Authors: Danupon Nanongkai, Atish Das Sarma, Gopal Pandurangan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider the problem of performing a random walk in a distributed network.
    Given bandwidth constraints, the goal of the problem is to minimize the number
    of rounds required to obtain a random walk sample. Das Sarma et al. [PODC'10]
    show that a random walk of length $\ell$ on a network of diameter $D$ can be
    performed in $\tilde O(\sqrt{\ell D}+D)$ time. A major question left open is
    whether there exists a faster algorithm, especially whether the multiplication
    of $\sqrt{\ell}$ and $\sqrt{D}$ is necessary.

  57. Using Planetlab to Implement Multicast at the Application Level.

    Authors: Genge Bela, Haller Piroska
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Application-layer multicast implements the multicast functionality at the
    application layer. The main goal of application-layer multicast is to construct
    and maintain efficient distribution structures between endhosts. In this paper
    we focus on the implementation of an application-layer multicast network using
    PlanetLab. We observe that the total time required to measure network latency
    over TCP is influenced dramatically by the TCP connection time.

  58. The Computing of Digital Ecosystems.

    Authors: Gerard Briscoe, Philippe De Wilde
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A primary motivation for our research in digital ecosystems is the desire to
    exploit the self-organising properties of biological ecosystems. Ecosystems are
    thought to be robust, scalable architectures that can automatically solve
    complex, dynamic problems.

  59. Parallelization Strategies for Ant Colony Optimisation on GPUs.

    Authors: Martyn Amos, Jose M. Cecilia, Jose M. Garcia, Manuel Ujaldon, Andy Nisbet
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Ant Colony Optimisation (ACO) is an effective population-based meta-heuristic
    for the solution of a wide variety of problems. As a population-based
    algorithm, its computation is intrinsically massively parallel, and it is
    there- fore theoretically well-suited for implementation on Graphics Processing
    Units (GPUs). The ACO algorithm comprises two main stages: Tour construction
    and Pheromone update. The former has been previously implemented on the GPU,
    using a task-based parallelism approach. However, up until now, the latter has
    always been implemented on the CPU.

  60. A comprehensive operational semantics of the SCOOP programming model.

    Authors: Bertrand Meyer, Sebastian Nanz, Benjamin Morandi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Operational semantics has established itself as a flexible but rigorous means
    to describe the meaning of programming languages. Oftentimes, it is felt
    necessary to keep a semantics small, for example to facilitate its use for
    model checking by avoiding state space explosion. However, omitting many
    details in a semantics typically makes results valid for a limited core
    language only, leaving a wide gap towards any real implementation.

  61. Computer Simulation Center in Internet.

    Authors: G.A. Tarnavsky, E.V. Vorozhtsov
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The general description of infrastructure and content of SciShop.ru computer
    simulation center is given. This resource is a new form of knowledge generation
    and remote education using modern Cloud Computing technologies.

  62. High Performance Gravitational N-body Simulations on a Planet-wide Distributed Supercomputer.

    Authors: Derek Groen, Simon Portegies Zwart, Tomoaki Ishiyama, Junichiro Makino
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We report on the performance of our cold-dark matter cosmological N-body
    simulation which was carried out concurrently using supercomputers across the
    globe. We ran simulations on 60 to 750 cores distributed over a variety of
    supercomputers in Amsterdam (the Netherlands, Europe), in Tokyo (Japan, Asia),
    Edinburgh (UK, Europe) and Espoo (Finland, Europe). Regardless the network
    latency of 0.32 seconds and the communication over 30.000 km of optical network
    cable we are able to achieve about 87% of the performance compared to an equal
    number of cores on a single supercomputer.

  63. Performance Evaluation of Parallel Message Passing and Thread Programming Model on Multicore Architectures.

    Authors: A.B. Mutiara, D.T. Hasta
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The current trend of multicore architectures on shared memory systems
    underscores the need of parallelism. While there are some programming model to
    express parallelism, thread programming model has become a standard to support
    these system such as OpenMP, and POSIX threads. MPI (Message Passing Interface)
    which remains the dominant model used in high-performance computing today faces
    this challenge.

  64. New Row-grouped CSR format for storing the sparse matrices on GPU with implementation in CUDA.

    Authors: Tomáš Oberhuber, Atsushi Suzuki, Jan Vacata
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this article we present a new format for storing sparse matrices. The
    format is designed to perform well mainly on the GPU devices. We present its
    implementation in CUDA. The performance has been tested on 1,600 different
    types of matrices and we compare our format with the Hybrid format. We give
    detailed comparison of both formats and show their strong and weak parts.

  65. A Robust and Efficient Trust Management Scheme for Peer-to-Peer Networks.

    Authors: Jaydip Sen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Studies on the large scale peer-to-peer (P2P) network like Gnutella have
    shown the presence of large number of free riders. Moreover, the open and
    decentralized nature of P2P network is exploited by malicious users who
    distribute unauthentic or harmful contents. Despite the existence of a number
    of trust management schemes in the literature for combating against free riding
    and distribution of malicious files, these mechanisms are not scalable due to
    their high computational, communication and storage overhead.

  66. Simplicial models for concurrency.

    Authors: Peter Bubenik
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We model both concurrent programs and the possible executions from one state
    to another in a concurrent program using simplices. The latter are calculated
    using necklaces of simplices in the former. For these models, the appropriate
    setting is the not the traditional approach to simplicial sets, but a more
    recent one due to Joyal.

  67. Dependability in Aggregation by Averaging.

    Authors: Paulo Jesus, Carlos Baquero, Paulo Sérgio Almeida
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Aggregation is an important building block of modern distributed
    applications, allowing the determination of meaningful properties (e.g. network
    size, total storage capacity, average load, majorities, etc.) that are used to
    direct the execution of the system. However, the majority of the existing
    aggregation algorithms exhibit relevant dependability issues, when prospecting
    their use in real application environments. In this paper, we reveal some
    dependability issues of aggregation algorithms based on iterative averaging
    techniques, giving some directions to solve them.

  68. Multiparty Symmetric Sum Types.

    Authors: Nobuko Yoshida, Lasse Nielsen, Kohei Honda
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper introduces a new theory of multiparty session types based on
    symmetric sum types, by which we can type non-deterministic orchestration
    choice behaviours. While the original branching type in session types can
    represent a choice made by a single participant and accepted by others
    determining how the session proceeds, the symmetric sum type represents a
    choice made by agreement among all the participants of a session.

  69. Stochastic Analysis of a Churn-Tolerant Structured Peer-to-Peer Scheme.

    Authors: Gopal Pandurangan, Tim Jacobs
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present and analyze a simple and general scheme to build a churn
    (fault)-tolerant structured Peer-to-Peer (P2P) network. Our scheme shows how to
    ``convert" a static network into a dynamic distributed hash table(DHT)-based
    P2P network such that all the good properties of the static network are
    guaranteed with high probability (w.h.p). Applying our scheme to a
    cube-connected cycles network, for example, yields a $O(\log N)$ degree
    connected network, in which every search succeeds in $O(\log N)$ hops w.h.p.,
    using $O(\log N)$ messages, where $N$ is the expected stable network size.

  70. Slightly smaller splitter networks.

    Authors: James Aspnes
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The classic renaming protocol of Moir and Anderson (1995) uses a network of
    Theta(n^2) splitters to assign unique names to n processes with unbounded
    initial names. We show how to reduce this bound to Theta(n^{3/2}) splitters.

  71. A Trust Model Based on Service Classification in Mobile Services.

    Authors: Feng Xia, Yang Liu, Zhikui Chen, Xiaoning Lv, Fanyu Bu
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Internet of Things (IoT) and B3G/4G communication are promoting the pervasive
    mobile services with its advanced features. However, security problems are also
    baffled the development. This paper proposes a trust model to protect the
    user's security. The billing or trust operator works as an agent to provide a
    trust authentication for all the service providers. The services are classified
    by sensitive value calculation. With the value, the user's trustiness for
    corresponding service can be obtained.

  72. A High-confidence Cyber-Physical Alarm System: Design and Implementation.

    Authors: Feng Xia, Ming Xu, Longhua Ma, Tengkai Yuan, Meng Shao, Jun Yao
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Most traditional alarm systems cannot address security threats in a
    satisfactory manner. To alleviate this problem, we developed a high-confidence
    cyber-physical alarm system (CPAS), a new kind of alarm systems. This system
    establishes the connection of the Internet (i.e. TCP/IP) through GPRS/CDMA/3G.
    It achieves mutual communication control among terminal equipments, human
    machine interfaces and users by using the existing mobile communication
    network. The CPAS will enable the transformation in alarm mode from traditional
    one-way alarm to two-way alarm.

  73. Leakage-Aware Reallocation for Periodic Real-Time Tasks on Multicore Processors.

    Authors: Feng Xia, Guowei Wu, Hongtao Huang, Jijie Wang, Siyu Lei
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    It is an increasingly important issue to reduce the energy consumption of
    computing systems. In this paper, we consider partition based energy-aware
    scheduling of periodic real-time tasks on multicore processors. The scheduling
    exploits dynamic voltage scaling (DVS) and core sleep scheduling to reduce both
    dynamic and leakage energy consumption. If the overhead of core state switching
    is non-negligible, however, the performance of this scheduling strategy in
    terms of energy efficiency might degrade.

  74. Distributed Verification and Hardness of Distributed Approximation.

    Authors: Amos Korman, Danupon Nanongkai, Atish Das Sarma, Gopal Pandurangan, David Peleg, Stephan Holzer, Liah Kor, Roger Wattenhofer
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We study the {\em verification} problem in distributed networks, stated as
    follows. Let $H$ be a subgraph of a network $G$ where each vertex of $G$ knows
    which edges incident on it are in $H$. We would like to verify whether $H$ has
    some properties, e.g., if it is a tree or if it is connected. We would like to
    perform this verification in a decentralized fashion via a distributed
    algorithm. The time complexity of verification is measured as the number of
    rounds of distributed communication.

  75. Multiscale Gossip for Efficient Decentralized Averaging in Wireless Packet Networks.

    Authors: Michael G. Rabbat, Konstantinos I. Tsianos
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper describes and analyzes a hierarchical gossip algorithm for solving
    the distributed average consensus problem in wireless sensor networks. The
    network is recursively partitioned into subnetworks. Initially, nodes at the
    finest scale gossip to compute local averages. Then, using geographic routing
    to enable gossip between nodes that are not directly connected, these local
    averages are progressively fused up the hierarchy until the global average is
    computed.

  76. Parallelization of Weighted Sequence Comparison by using EBWT.

    Authors: Binay Kumar Pandey, Shashank Srikant, Dr. Rajdeep Niyogi, Dr. Ankush Mittal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The Extended Burrows Wheeler transform (EBWT) helps to find the distance
    between two sequences. Implementation of an existing algorithm takes
    considerable amount of time for small size sequences. In this paper, we give a
    parallel implementation of this algorithm using NVIDIA Compute Unified Device
    Architecture (CUDA). We have obtained, on an average, a 2X improvement in the
    performance.

  77. Edge- and Node-Disjoint Paths in P Systems.

    Authors: Michael J. Dinneen, Yun-Bum Kim, Radu Nicolescu
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we continue our development of algorithms used for topological
    network discovery. We present native P system versions of two fundamental
    problems in graph theory: finding the maximum number of edge- and node-disjoint
    paths between a source node and target node. We start from the standard
    depth-first-search maximum flow algorithms, but our approach is totally
    distributed, when initially no structural information is available and each P
    system cell has to even learn its immediate neighbors.

  78. Parallel Sparse Matrix Solver on the GPU Applied to Simulation of Electrical Machines.

    Authors: Antonio Wendell De Oliveira Rodrigues, Frédéric Guyomarch, Yvonnick Le Menach, Jean-Luc Dekeyser
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Nowadays, several industrial applications are being ported to parallel
    architectures. In fact, these platforms allow acquire more performance for
    system modelling and simulation. In the electric machines area, there are many
    problems which need speed-up on their solution. This paper examines the
    parallelism of sparse matrix solver on the graphics processors. More
    specifically, we implement the conjugate gradient technique with input matrix
    stored in CSR, and Symmetric CSR and CSC formats.

  79. A strong law for the rate of growth of long latency periods in cloud computing service.

    Authors: Souvik Ghosh, Soumyadip Ghosh
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud-computing shares a common pool of resources across customers at a scale
    that is orders of magnitude larger than traditional multi-user systems.
    Constituent physical compute servers are allocated multiple "virtual machines"
    (VM) to serve simultaneously. Each VM user should ideally be unaffected by
    others' demand. Naturally, this environment produces new challenges for the
    service providers in meeting customer expectations while extracting an
    efficient utilization from server resources. We study a new cloud service
    metric that measures prolonged latency or delay suffered by customers.

  80. Parallel Sorted Neighborhood Blocking with MapReduce.

    Authors: Andreas Thor, Erhard Rahm, Lars Kolb
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud infrastructures enable the efficient parallel execution of
    data-intensive tasks such as entity resolution on large datasets. We
    investigate challenges and possible solutions of using the MapReduce
    programming model for parallel entity resolution. In particular, we propose and
    evaluate two MapReduce-based implementations for Sorted Neighborhood blocking
    that either use multiple MapReduce jobs or apply a tailored data replication.

  81. A Reusable Component for Communication and Data Synchronization in Mobile Distributed Interactive Applications.

    Authors: Antoine Beugnard, Abdul Malik Khan, Sophie Chabridon
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In Distributed Interactive Applications (DIA) such as multiplayer games,
    where many participants are involved in a same game session and communicate
    through a network, they may have an inconsistent view of the virtual world
    because of the communication delays across the network. This issue becomes even
    more challenging when communicating through a cellular network while executing
    the DIA client on a mobile terminal. Consistency maintenance algorithms may be
    used to obtain a uniform view of the virtual world.

  82. Behavioural Models for Group Communications.

    Authors: Rabéa Ameur-Boulifa, Ludovic Henrio, Eric Madelaine
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Group communication is becoming a more and more popular infrastructure for
    efficient distributed applications. It consists in representing locally a group
    of remote objects as a single object accessed in a single step; communications
    are then broadcasted to all members. This paper provides models for automatic
    verification of group-based applications, typically for detecting deadlocks or
    checking message ordering. We show how to encode group communication, together
    with different forms of synchronisation for group results.

  83. Distributed Deterministic Edge Coloring using Bounded Neighborhood Independence.

    Authors: Leonid Barenboim, Michael Elkin
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We study the {edge-coloring} problem in the message-passing model of
    distributed computing. This is one of the most fundamental and well-studied
    problems in this area. Currently, the best-known deterministic algorithms for
    (2Delta -1)-edge-coloring requires O(Delta) + log-star n time \cite{PR01},
    where Delta is the maximum degree of the input graph.

  84. On Designing Multicore-aware Simulators for Biological Systems.

    Authors: Marco Aldinucci, Massimo Torquati, Angelo Troina, Mario Coppo, Ferruccio Damiani, Maurizio Drocco
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The stochastic simulation of biological systems is an increasingly popular
    technique in bioinformatics. It often is an enlightening technique, which may
    however result in being computational expensive. We discuss the main
    opportunities to speed it up on multi-core platforms, which pose new challenges
    for parallelisation techniques.

  85. Modified Bully Algorithm using Election Commission.

    Authors: Muhammad Mahbubur Rahman, Afroza Nahar
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Electing leader is a vital issue not only in distributed computing but also
    in communication network [1, 2, 3, 4, 5], centralized mutual exclusion
    algorithm [6, 7], centralized control IPC, etc. A leader is required to make
    synchronization between different processes. And different election algorithms
    are used to elect a coordinator among the available processes in the system
    such a way that there will be only one coordinator at any time. Bully election
    algorithm is one of the classical and well-known approaches in coordinator
    election process.

  86. Astronomy in the Cloud: Using MapReduce for Image Coaddition.

    Authors: YongChul Kwon, Keith Wiley, Andrew Connolly, Jeff Gardner, Simon Krughof, Magdalena Balazinska, Bill Howe, YingYi Bu
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In the coming decade, astronomical surveys of the sky will generate tens of
    terabytes of images and detect hundreds of millions of sources every night. The
    study of these sources will involve computation challenges such as anomaly
    detection and classification, and moving object tracking. Since such studies
    benefit from the highest quality data, methods such as image coaddition
    (stacking) will be a critical preprocessing step prior to scientific
    investigation.

  87. A Gossip-based optimistic replication for efficient delay-sensitive streaming using an interactive middleware support system.

    Authors: Constandinos X. Mavromoustakis, Helen D. Karatza
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    While sharing resources the efficiency is substantially degraded as a result
    of the scarceness of availability of the requested resources in a multiclient
    support manner. These resources are often aggravated by many factors like the
    temporal constraints for availability or node flooding by the requested
    replicated file chunks. Thus replicated file chunks should be efficiently
    disseminated in order to enable resource availability on-demand by the mobile
    users.

  88. D$^2$-Tree: A New Overlay with Deterministic Bounds.

    Authors: G.S. Brodal, S. Sioutas, K. Tsichlas, C. Zaroliagis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a new overlay, called the {\em Deterministic Decentralized tree}
    ($D^2$-tree).

  89. CloneCloud: Boosting Mobile Device Applications Through Cloud Clone Execution.

    Authors: Petros Maniatis, Byung-Gon Chun, Sunghwan Ihm, Mayur Naik
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Mobile applications are becoming increasingly ubiquitous and provide ever
    richer functionality on mobile devices. At the same time, such devices often
    enjoy strong connectivity with more powerful machines ranging from laptops and
    desktops to commercial clouds. This paper presents the design and
    implementation of CloneCloud, a system that automatically transforms mobile
    applications to benefit from the cloud.

  90. Concurrent Processing Memory.

    Authors: Chengpu Wang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A novel memory with limited processing power and internal connectivity at
    each element is proposed. This memory carries out parallel processing within
    itself. Many common algorithms using this memory are discussed. For an array of
    N items, it reduces the total instruction cycle count of universal operations
    such as insertion and match finding to ~ 1, local operations such as filtering
    and pattern recognition to ~ local operation size, and global operations such
    as sum and sorting to ~ sqrt(N) instruction cycles.

  91. Component Specification in the Cactus Framework: The Cactus Configuration Language.

    Authors: Gabrielle Allen, Tom Goodale, Frank Löffler, David Rideout, Erik Schnetter, Eric L. Seidel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Component frameworks are complex systems that rely on many layers of
    abstraction to function properly. One essential requirement is a consistent
    means of describing each individual component and how it relates to both other
    components and the whole framework. As component frameworks are designed to be
    flexible by nature, the description method should be simultaneously powerful,
    lead to efficient code, and be easy to use, so that new users can quickly adapt
    their own code to work with the framework.

  92. A Competitive Analysis for Balanced Transactional Memory Workloads.

    Authors: Gokarna Sharma, Costas Busch
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider transactional memory contention management in the context of
    balanced workloads, where if a transaction is writing, the number of write
    operations it performs is a constant fraction of its total reads and writes. We
    explore the theoretical performance boundaries of contention management in
    balanced workloads from the worst-case perspective by presenting and analyzing
    two new contention management algorithms. The first algorithm Clairvoyant is
    O(\surd s)-competitive, where s is the number of shared resources. This
    algorithm depends on explicitly knowing the conflict graph.

  93. Managing Clouds in Cloud Platforms.

    Authors: Hassan Gobjuka, Kamal A. Ahmat
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Managing cloud services is a fundamental challenge in todays virtualized
    environments. These challenges equally face both providers and consumers of
    cloud services. The issue becomes even more challenging in virtualized
    environments that support mobile clouds. Cloud computing platforms such as
    Amazon EC2 provide customers with flexible, on demand resources at low cost.
    However, they fail to provide seamless infrastructure management and monitoring
    capabilities that many customers may need.

  94. Deterministic Consensus Algorithm with Linear Per-Bit Complexity.

    Authors: Guanfeng Liang, Nitin Vaidya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this report, building on the deterministic multi-valued one-to-many
    Byzantine agreement (broadcast) algorithm in our recent technical report [2],
    we introduce a deterministic multi-valued all-to-all Byzantine agreement
    algorithm (consensus), with linear complexity per bit agreed upon. The
    discussion in this note is not self-contained, and relies heavily on the
    material in [2] - please refer to [2] for the necessary background.

  95. The Two Quadrillionth Bit of Pi is 0! Distributed Computation of Pi with Apache Hadoop.

    Authors: Tsz-Wo Sze
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a new record on computing specific bits of Pi, the mathematical
    constant, and discuss performing such computations on Apache Hadoop clusters.
    The specific bits represented in hexadecimal are 0E6C1294 AED40403 F56D2D76
    4026265B CA98511D 0FCFFAA1 0F4D28B1 BB5392B8. These 256 bits end at the
    2,000,000,000,000,252nd bit position, which doubles the previous known record.
    The position of the first bit is 1,999,999,999,999,997 and the value of the two
    quadrillionth bit is 0. The computation is carried out by a MapReduce program
    called DistBbp.

  96. Optimal Gradient Clock Synchronization in Dynamic Networks.

    Authors: Fabian Kuhn, Christoph Lenzen, Thomas Locher, Rotem Oshman
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We study the problem of clock synchronization in highly dynamic networks,
    where communication links can appear or disappear at any time. The nodes in the
    network are equipped with hardware clocks, but the rate of the hardware clocks
    can vary arbitrarily within specific bounds, and the estimates that nodes can
    obtain about the clock values of other nodes are inherently inaccurate.

  97. A Light-Weight Communication Library for Distributed Computing.

    Authors: Derek Groen, Steven Rieder, Paola Grosso, Cees de Laat, Simon Portegies Zwart
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present MPWide, a platform independent communication library for
    performing message passing between computers. Our library allows coupling of
    several local MPI applications through a long distance network and is
    specifically optimized for such communications. The implementation is
    deliberately kept light-weight, platform independent and the library can be
    installed and used without administrative privileges.

  98. Immune System Inspired Strategies for Distributed Systems.

    Authors: Soumya Banerjee, Melanie Moses
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Many components of the IS are constructed as modular units which do not need
    to communicate with each other such that the number of components increases but
    the size remains constant. However, a sub-modular IS architecture in which
    lymph node number and size both increase sublinearly with body size is shown to
    efficiently balance the requirements of communication and migration, consistent
    with experimental data. We hypothesize that the IS architecture optimizes the
    tradeoff between local search for pathogens and global response using
    antibodies.

  99. The Cloud Adoption Toolkit: Supporting Cloud Adoption Decisions in the Enterprise.

    Authors: Ali Khajeh-Hosseini, Ian Sommerville, David Greenwood, James W. Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing promises a radical shift in the provisioning of computing
    resource within the enterprise. This paper describes the challenges that
    decision makers face when assessing the feasibility of the adoption of cloud
    computing in their organisations, and describes our Cloud Adoption Toolkit,
    which has been developed to support this process. The toolkit provides a
    framework to support decision makers in identifying their concerns, and
    matching these concerns to appropriate tools/techniques that can be used to
    address them.

  100. Self-repairing Homomorphic Codes for Distributed Storage Systems.

    Authors: Frederique Oggier, Anwitaman Datta
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Erasure codes provide a storage efficient alternative to replication based
    redundancy in (networked) storage systems. They however entail high
    communication overhead for maintenance, when some of the encoded fragments are
    lost and need to be replenished. Such overheads arise from the fundamental need
    to recreate (or keep separately) first a copy of the whole object before any
    individual encoded fragment can be generated and replenished. There has been
    recently intense interest to explore alternatives, most prominent ones being
    regenerating codes (RGC) and hierarchical codes (HC).

  101. Parallel and distributed Gr\"obner bases computation in JAS.

    Authors: Heinz Kredel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper considers parallel Gr\"obner bases algorithms on distributed
    memory parallel computers with multi-core compute nodes. We summarize three
    different Gr\"obner bases implementations: shared memory parallel, pure
    distributed memory parallel and distributed memory combined with shared memory
    parallelism. The last algorithm, called distributed hybrid, uses only one
    control communication channel between the master node and the worker nodes and
    keeps polynomials in shared memory on a node.

  102. Self-Recovering Sensor-Actor Networks.

    Authors: Maryam Kamali, Linas Laibinis, Luigia Petre, Kaisa Sere
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Wireless sensor-actor networks are a recent development of wireless networks
    where both ordinary sensor nodes and more sophisticated and powerful nodes,
    called actors, are present. In this paper we formalize a recently introduced
    algorithm that recovers failed actor communication links via the existing
    sensor infrastructure. We prove via refinement that the recovery is terminating
    in a finite number of steps and is distributed, thus self-performed by the
    actors.

  103. Implementing Distributed Controllers for Systems with Priorities.

    Authors: Imene Ben-Hafaiedh, Susanne Graf, Hammadi Khairallah
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Implementing a component-based system in a distributed way so that it ensures
    some global constraints is a challenging problem. We consider here abstract
    specifications consisting of a composition of components and a controller given
    in the form of a set of interactions and a priority order amongst them. In the
    context of distributed systems, such a controller must be executed in a
    distributed fashion while still respecting the global constraints imposed by
    interactions and priorities.

  104. Simplified Distributed Programming with Micro Objects.

    Authors: Jan-Mark S. Wams, Maarten van Steen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Developing large-scale distributed applications can be a daunting task.
    object-based environments have attempted to alleviate problems by providing
    distributed objects that look like local objects. We advocate that this
    approach has actually only made matters worse, as the developer needs to be
    aware of many intricate internal details in order to adequately handle partial
    failures. The result is an increase of application complexity. We present an
    alternative in which distribution transparency is lessened in favor of clearer
    semantics.

  105. PowerTracer: Tracing requests in multi-tier services to save cluster power consumption.

    Authors: Lei Wang, Lin Yuan, Jianfeng Zhan, Bo Sang, Haining Wang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As energy proportional computing has extended the success of DVFS (Dynamic
    voltage and frequency scaling) to the entire system, DVFS control algorithms
    will play a key role in reducing server clusters' power consumption. The focus
    of this paper is to provide accurate cluster-level DVFS control for power
    saving in a server cluster. To achieve this goal, we propose a request tracing
    approach that online classifies the major causal path patterns and monitors
    their performance data as a guide for accurate DVFS control.

  106. Short Note on Complexity of Multi-Value Byzantine Agreement.

    Authors: Guanfeng Liang, Nitin Vaidya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Randomized algorithm that achieves multi-valued Byzantine agreement with high
    probability, and achieves optimal complexity.

  107. A Fault-Resistant Asynchronous Clock Function.

    Authors: Danny Dolev, Ezra N. Hoch, Michael Ben-Or
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Consider an asynchronous network in a shared-memory environment consisting of
    n nodes. Assume that up to f of the nodes might be Byzantine (n > 12f), where
    the adversary is full-information and dynamic (sometimes called adaptive). In
    addition, the non-Byzantine nodes may undergo transient failures. Nodes advance
    in atomic steps, which consist of reading all registers, performing some
    calculation and writing to all registers.

  108. Double Circulant Minimum Storage Regenerating Codes.

    Authors: Bernat Gaston Braso, Jaume Pujol Capdevila
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Storage optimization in distributed environments is a major concern when
    talking about reliability in this kind of schemes. Although replication is the
    most used option, erasure coding is a more optimized one.

    However, erasure coding uses a lot of bandwidth to replace one node. In a
    dynamic scheme, where nodes enter and leave the system frequently, bandwidth
    use could be an important drawback.

  109. A Random Search Framework for Convergence Analysis of Distributed Beamforming with Feedback.

    Authors: C. Lin, V. V. Veeravalli, S. Meyn
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The focus of this work is on the analysis of transmit beamforming schemes
    with a low-rate feedback link in wireless sensor/relay networks, where nodes in
    the network need to implement beamforming in a distributed manner.
    Specifically, the problem of distributed phase alignment is considered, where
    neither the transmitters nor the receiver has perfect channel state
    information, but there is a low-rate feedback link from the receiver to the
    transmitters. In this setting, a framework is proposed for systematically
    analyzing the performance of distributed beamforming schemes.

  110. MalStone: Towards A Benchmark for Analytics on Large Data Clouds.

    Authors: Collin Bennett, Robert L. Grossman, David Locke, Jonathan Seidman, Steve Vejcik
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Developing data mining algorithms that are suitable for cloud computing
    platforms is currently an active area of research, as is developing cloud
    computing platforms appropriate for data mining. Currently, the most common
    benchmark for cloud computing is the Terasort (and related) benchmarks.
    Although the Terasort Benchmark is quite useful, it was not designed for data
    mining per se.

  111. A Jamming-Resistant MAC Protocol for Multi-Hop Wireless Networks.

    Authors: Andrea Richa, Christian Scheideler, Stefan Schmid, Jin Zhang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents a simple local medium access control protocol, called
    \textsc{Jade}, for multi-hop wireless networks with a single channel that is
    provably robust against adaptive adversarial jamming. The wireless network is
    modeled as a unit disk graph on a set of nodes distributed arbitrarily in the
    plane. In addition to these nodes, there are adversarial jammers that know the
    protocol and its entire history and that are allowed to jam the wireless
    channel at any node for an arbitrary $(1-\epsilon)$-fraction of the time steps,
    where $0<\epsilon<1$ is an arbitrary constant.

  112. On the Power of Impersonation Attacks.

    Authors: Michael Okun
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we consider a synchronous message passing system in which in
    every round an external adversary is able to send each processor up to k
    messages with falsified sender identities and arbitrary content. It is formally
    shown that this impersonation model is slightly stronger than the asynchronous
    message passing model with crash failures. In particular, we prove that
    (k+1)-set agreement can be solved in this model, while k-set agreement is
    impossible, for any k>=1.

  113. Simple Gradecast Based Algorithms.

    Authors: Danny Dolev, Ezra N. Hoch, Michael Ben-Or
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Gradecast is a simple three-round algorithm presented by Feldman and Micali.
    The current work presents a very simple algorithm that utilized Gradecast to
    achieve Byzantine agreement. Two small variations of the presented algorithm
    lead to improved algorithms for solving the Approximate agreement problem and
    the Multi-consensus problem.

    An optimal approximate agreement algorithm was presented by Fekete, which
    supports up to 1/4 n Byzantine nodes and has message complexity of O(n^k),
    where n is the number of nodes and k is the number of rounds.

  114. End-Host Distribution in Application-Layer Multicast: Main Issues and Solutions.

    Authors: Bela Genge, Piroska Haller
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Application-layer multicast implements the multicast functionality at the
    application layer. The main goal of application-layer multicast is to construct
    and maintain efficient distribution structures between end-hosts. In this paper
    we focus on the implementation of an application-layer multicast distribution
    algorithm. We observe that the total time required to measure network latency
    over TCP is influenced dramatically by the TCP connection time.

  115. Transmission Line Inspires A New Distributed Algorithm to Solve the Nonlinear Dynamical System of Physical Circuit.

    Authors: Fei Wei, Huazhong Yang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As known, physical circuits, e.g. integrated circuits or power system, work
    in a distributed manner, but these circuits could not be easily simulated in a
    distributed way. This is mainly because that the dynamical system of physical
    circuits is nonlinear and the linearized system of physical circuits is
    non-symmetrical. This paper proposes a simple and natural strategy to simulate
    the physical circuit in parallel, by mimicking the internal wires or
    interconnects inside the circuits by distributed numerical algorithm.

  116. Second Set of Spaces.

    Authors: Graham Kirby, Alan Dearle, Ron Morrison, Andrew McCarthy, Paddy Nixon, Evangelos Zirintsis, Ben Allen, Rob MacInnis, Andrew Jamieson, Chris Nicholson, Steven Harris
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This document describes the Gloss infrastructure supporting implementation of
    location-aware services. The document is in two parts. The first part describes
    software architecture for the smart space. As described in D8, a local
    architecture provides a framework for constructing Gloss applications, termed
    assemblies, that run on individual physical nodes, whereas a global
    architecture defines an overlay network for linking individual assemblies. The
    second part outlines the hardware installation for local sensing.

  117. An Information Flow Architecture for Global Smart Spaces.

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy, Juan-Carlos Diaz y Carballo
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we describe an architecture which: Permits the deployment and
    execution of components in appropriate geographical locations. Provides
    security mechanisms that prevent misuse of the architecture. Supports a
    programming model that is familiar to application programmers. Permits
    installed components to share data. Permits the deployed components to
    communicate via communication channels. Provides evolution mechanisms
    permitting the dynamic rearrangement of inter-connection topologies the
    components that they connect.

  118. RoboCast: Asynchronous Communication in Robot Networks.

    Authors: S&#xe9;bastien Tixeuil, Shlomi Dolev, Maria Potop-Butucaru, Zohir Bouzid
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper introduces the \emph{RoboCast} communication abstraction. The
    RoboCast allows a swarm of non oblivious, anonymous robots that are only
    endowed with visibility sensors but do not share a common coordinate system, to
    asynchronously exchange information. We propose a generic framework that covers
    a large class of asynchronous broadcast-based algorithms and show how our
    framework can be used to implement fundamental building blocks in robot
    networks such as gathering or stigmergy.

  119. Greedy Routing on Augmented Ring Graphs.

    Authors: R. Seth Terashima, James D. Fix
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Random ring-based overlay networks have been used to study the small world
    phenomenon and model fault-tolerant peer-to-peer systems (Kleinberg, 2006). It
    has been shown that when each of $n$ nodes has $\ell = O(\log n)$ links,
    assigning contacts according to an inverse power-law distance distribution
    allows greedy routing to perform in $O(\log^2 n / \ell)$ steps (Aspnes et al.
    2002). In this paper, we generalize this result by showing the same upper bound
    holds when nodes are assigned a random number of links according to an
    arbitrary distribution with mean $\ell$.

  120. Architectural Support for Global Smart Spaces.

    Authors: Graham Kirby, Alan Dearle, Ron Morrison, Andrew McCarthy, Richard Connor, Kevin Mullen, Yanyan Yang, Paula Welen, Andy Wilson
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A GLObal Smart Space (GLOSS) provides support for interaction amongst people,
    artefacts and places while taking account of both context and movement on a
    global scale. Crucial to the definition of a GLOSS is the provision of a set of
    location-aware services that detect, convey, store and exploit location
    information. We use one of these services, hearsay, to illustrate the
    implementation dimensions of a GLOSS. The focus of the paper is on both local
    and global software architecture to support the implementation of such
    services.

  121. Active Architecture for Pervasive Contextual Services.

    Authors: Graham Kirby, Alan Dearle, Ron Morrison, Mark Dunlop, Richard Connor, Paddy Nixon
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Pervasive services may be defined as services that are available "to any
    client (anytime, anywhere)". Here we focus on the software and network
    infrastructure required to support pervasive contextual services operating over
    a wide area. One of the key requirements is a matching service capable of
    as-similating and filtering information from various sources and determining
    matches relevant to those services.

  122. A Middleware Framework for Constraint-Based Deployment and Autonomic Management of Distributed Applications.

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a middleware framework for deployment and subsequent autonomic
    management of component-based distributed applications. An initial deployment
    goal is specified using a declarative constraint language, expressing
    constraints over aspects such as component-host mappings and component
    interconnection topology. A constraint solver is used to find a configuration
    that satisfies the goal, and the configuration is deployed automatically.

  123. A Framework for Constraint-Based Deployment and Autonomic Management of Distributed Applications (Extended Abstract).

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a framework for the deployment and subsequent autonomic management
    of component-based distributed applications. An initial deployment goal is
    specified using a declarative constraint language, expressing constraints over
    aspects such as component-host mappings and component interconnection topology.
    A constraint solver is used to find a configuration that satisfies the goal,
    and the configuration is deployed automatically. The deployed application is
    instrumented to allow subsequent autonomic management.

  124. A Survey on Peer-to-Peer and DHT.

    Authors: Siamak Sarmady
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Distributed systems with different levels of dependence to central services
    have been designed and used during recent years. Pure peer-to-peer systems
    among distributed systems have no dependence on a central resource. DHT is one
    of the main techniques behind these systems resulting into failure tolerant
    systems which are also able to isolate continuous changes to the network to a
    small section of it and therefore making it possible to scale up such networks
    to millions of nodes. This survey takes a look at P2P in general and DHT
    algorithms and implementations in more detail.

  125. Formal Derivation of Concurrent Garbage Collectors.

    Authors: Dusko Pavlovic, Peter Pepper, Douglas R. Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Concurrent garbage collectors are notoriously difficult to implement
    correctly. Previous approaches to the issue of producing correct collectors
    have mainly been based on posit-and-prove verification or on the application of
    domain-specific templates and transformations. We show how to derive the upper
    reaches of a family of concurrent garbage collectors by refinement from a
    formal specification, emphasizing the application of domain-independent design
    theories and transformations.

  126. A Peer-to-Peer Middleware Framework for Resilient Persistent Programming.

    Authors: Graham Kirby, Alan Dearle, Stuart Norcross, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The persistent programming systems of the 1980s offered a programming model
    that integrated computation and long-term storage. In these systems, reliable
    applications could be engineered without requiring the programmer to write
    translation code to manage the transfer of data to and from non-volatile
    storage. More importantly, it simplified the programmer's conceptual model of
    an application, and avoided the many coherency problems that result from
    multiple cached copies of the same information.

  127. Applying Constraint Solving to the Management of Distributed Applications.

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present our approach for deploying and managing distributed
    component-based applications. A Desired State Description (DSD), written in a
    high-level declarative language, specifies requirements for a distributed
    application. Our infrastructure accepts a DSD as input, and from it
    automatically configures and deploys the distributed application. Subsequent
    violations of the original requirements are detected and, where possible,
    automatically rectified by reconfiguration and redeployment of the necessary
    application components.

  128. Snap-Stabilizing Linear Message Forwarding.

    Authors: Franck Petit, Swan Dubois, Anissa Lamani, Alain Cournier, Vincent Villain
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we present the first snap-stabilizing message forwarding
    protocol that uses a number of buffers per node being inde- pendent of any
    global parameter, that is 4 buffers per link. The protocol works on a linear
    chain of nodes, that is possibly an overlay on a large- scale and dynamic
    system, e.g., Peer-to-Peer systems, Grids. . . Provided that the topology
    remains a linear chain and that nodes join and leave "neatly", the protocol
    tolerates topology changes. We expect that this protocol will be the base to
    get similar results on more general topologies.

  129. Generating a Family of Byzantine Tolerant Protocol Implementations Using a Meta-Model Architecture.

    Authors: Graham Kirby, Alan Dearle, Stuart Norcross
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe an approach to modelling a Byzantine tolerant distributed
    algorithm as a family of related finite state machines, generated from a single
    meta-model. Various artefacts are generated from each state machine, including
    diagrams and source-level protocol implementations. The approach allows a state
    machine formulation to be applied to problems for which it would not otherwise
    be suitable, increasing confidence in correctness.

  130. Fast Self-Stabilizing Minimum Spanning Tree Construction.

    Authors: L&#xe9;lia Blin, Shlomi Dolev, Maria Potop-Butucaru, Stephane Rovedakis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a novel self-stabilizing algorithm for minimum spanning tree (MST)
    construction. The space complexity of our solution is $O(\log^2n)$ bits and it
    converges in $O(n^2)$ rounds. Thus, this algorithm improves the convergence
    time of all previously known self-stabilizing asynchronous MST algorithms by a
    multiplicative factor $\Theta(n)$, to the price of increasing the best known
    space complexity by a factor $O(\log n)$.

  131. Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters.

    Authors: Jan Treibig, Gerhard Wellein, Georg Hager, Markus Wittmann
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Bandwidth-starved multicore chips have become ubiquitous. It is well known
    that the performance of stencil codes can be improved by temporal blocking,
    lessening the pressure on the memory interface. We introduce a new pipelined
    approach that makes explicit use of shared caches in multicore environments and
    minimizes synchronization and boundary overhead. Benchmark results are
    presented for three current x86-based microprocessors, showing clearly that our
    optimization works best on designs with high-speed shared caches and low memory
    bandwidth per core.

  132. Proceedings First International Workshop on Decentralized Coordination of Distributed Processes.

    Authors: Tom Van Cutsem, Mark Miller
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This volume contains the papers presented at the 1st International Workshop
    on "Decentralized Coordination of Distributed Processes", DCDP 2010, held in
    Amsterdam, The Netherlands on June 10th, 2010 in conjunction with the 5th
    International Federated Conferences on Distributed Computing Techniques,
    DisCoTec 2010. The central theme of the workshop is the decentralized
    coordination of distributed processes. Decentralized: there is no single
    authority in the network that everything is vulnerable to.

  133. Efficient Resource Matching in Heterogeneous Grid Using Resource Vector.

    Authors: Srirangam V Addepallil, Per Andersen, George L Barnes
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, a method for efficient scheduling to obtain optimum job
    throughput in a distributed campus grid environment is presented; Traditional
    job schedulers determine job scheduling using user and job resource attributes.
    User attributes are related to current usage, historical usage, user priority
    and project access. Job resource attributes mainly comprise of soft
    requirements (compilers, libraries) and hard requirements like memory, storage
    and interconnect. A job scheduler dispatches jobs to a resource if a job's hard
    and soft requirements are met by a resource.

  134. Survey on the Event Orderings Semantics Used for Distributed System.

    Authors: Yaser Miaji Osman Gazali, Suhaidi Hassan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Event ordering in distributed system (DS) is disputable and proactive subject
    in DS particularly with the emergence of multimedia synchronization. According
    to the literature, different type of event ordering is used for different DS
    mode such as asynchronous or synchronous. Recently, there are several novel
    implementation of these types introduced to fulfill the demand for establishing
    a certain order according to a specific criterion in DS with lighter
    complexity.

  135. Implementation of a Cloud Data Server (CDS) for Providing Secure Service in E-Business.

    Authors: D. Kesavaraja, R. Balasubramanian, D. Sasireka
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud Data Servers is the novel approach for providing secure service to
    e-business .Millions of users are surfing the Cloud for various purposes,
    therefore they need highly safe and persistent services. Usually hackers target
    particular Operating Systems or a Particular Controller. Inspiteof several
    ongoing researches Conventional Web Servers and its Intrusion Detection System
    might not be able to detect such attacks. So we implement a Cloud Data Server
    with Session Controller Architecture using Redundancy and Disconnected Data
    Access Mechanism.

  136. A Low Overhead Minimum Process Global Snapshop Collection Algorithm for Mobile Distributed System.

    Authors: Parveen Kumar, Surender Kumar, R.K.Chauhan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Coordinated checkpointing is an effective fault tolerant technique in
    distributed system as it avoids the domino effect and require minimum storage
    requirement. Most of the earlier coordinated checkpoint algorithms block their
    computation during checkpointing and forces minimum-process or non-blocking but
    forces all nodes to takes checkpoint even though many of them may not be
    necessary or non-blocking minimum-process but takes useless checkpoints or
    reduced useless checkpoint but has higher synchronization message overhead or
    has high checkpoint request propagation time.

  137. Improvement Cache Efficiency of Explicit Finite Element Procedure and its Application to Parallel Casting Solidification Simulation.

    Authors: Ruhollah Tavakoli
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A simple method for improving cache efficiency of serial and parallel
    explicit finite procedure with application to casting solidification simulation
    over three-dimensional complex geometries is presented. The method is based on
    division of the global data to smaller blocks and treating each block
    independently from others at each time step. A novel parallel finite element
    algorithm for non-overlapped element-base decomposed domain is presented for
    implementation of serial and parallel version of the presented method. Effect
    of mesh reordering on the efficiency is also investigated.

  138. An Integrated Framework for Performance Analysis and Tuning in Grid Environment.

    Authors: Sarbani Roy, Nandini Mukherjee, Ajanta De Sarkar, Sudipto Biswas
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In a heterogeneous, dynamic environment, like Grid, post-mortem analysis is
    of no use and data needs to be collected and analysed in real time. Novel
    techniques are also required for dynamically tuning the application's
    performance and resource brokering in order to maintain the desired QoS. The
    objective of this paper is to propose an integrated framework for performance
    analysis and tuning of the application, and rescheduling the application, if
    necessary, to some other resources in order to adapt to the changing resource
    usage scenario in a dynamic environment.

  139. A Multi-agent Framework for Performance Tuning in Distributed Environment.

    Authors: Sarbani Roy, Saikat Halder, Nandini Mukherjee
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents the overall design of a multi-agent framework for tuning
    the performance of an application executing in a distributed environment. The
    multi-agent framework provides services like resource brokering, analyzing
    performance monitoring data, local tuning and also rescheduling in case of any
    performance problem on a specific resource provider. The paper also briefly
    describes the implementation of some part of the framework. In particular, job
    migration on the basis of performance monitoring data is particularly
    highlighted in this paper.

  140. Cloud Computing: Exploring the scope.

    Authors: Abhinav Pandey, Akash Pandey, Ankit Tandon, Brajesh Kr Maurya, Upendra Kushwaha
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing refers a paradigm shift to overall IT solutions while raising
    the accessibility, scalability and effectiveness through its enabling
    technologies. However, migrated cloud platforms and services cost benefits as
    well as performances are neither clear nor summarized. Globalization and the
    recessionary economic times have not only raised the bar of a better IT
    delivery models but also have given access to technology enabled services via
    internet.

  141. An Efficient and Secure Routing Protocol for Mobile Ad-Hoc Networks.

    Authors: N. Ch. Sriman Narayana Iyengar, Syed Mohammad Ansar Sachin kumar, Piyush Nagar, Siddharth Sharma, Akshay Atrey
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Efficiency and simplicity of random algorithms have made them a lucrative
    alternative for solving complex problems in the domain of communication
    networks. This paper presents a random algorithm for handling the routing
    problem in Mobile Ad hoc Networks [MANETS].The performance of most existing
    routing protocols for MANETS degrades in terms of packet delay and congestion
    caused as the number of mobile nodes increases beyond a certain level or their
    speed passes a certain level.

  142. A Real Time Optimistic Strategy to achieve Concurrency Control in Mobile Environments Using On-demand Multicasting.

    Authors: Salman Abdul Moiz, Dr. Lakshmi Rajamani
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In mobile database environments, multiple users may access similar data items
    irrespective of their physical location leading to concurrent access anomalies.
    As disconnections and mobility are the common characteristics in mobile
    environment, performing concurrent access to a particular data item leads to
    inconsistency. Most of the approaches use locking mechanisms to achieve
    concurrency control.

  143. L-Resilient Adversaries and Hitting Sets.

    Authors: Eli Gafni, Petr Kuznetsov
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The condition of $t$-resilience stipulates that an $n$-process program is
    only obliged to make progress when at least $n-t$ processes are correct. Put
    another way, the \emph{live sets}, the collection of process sets such that
    progress is required if all the processes in one of these sets are correct, are
    all sets with at least $n-t$ processes. In this paper we study what happens
    when the live sets are any arbitrary collection of sets $\L$.

  144. The Impact of Topology on Byzantine Containment in Stabilization.

    Authors: Swan Dubois, S&#xe9;bastien Tixeuil, Toshimitsu Masuzawa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Self-stabilization is an versatile approach to fault-tolerance since it
    permits a distributed system to recover from any transient fault that
    arbitrarily corrupts the contents of all memories in the system. Byzantine
    tolerance is an attractive feature of distributed system that permits to cope
    with arbitrary malicious behaviors. We consider the well known problem of
    constructing a maximum metric tree in this context.

  145. Improving Overhead Computation and pre-processing Time for Grid Scheduling System.

    Authors: Asgarali Bouyer, Mohammad Javad hoseyni, Abdul Hanan Abdullah
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Computational Grid is enormous environments with heterogeneous resources and
    stable infrastructures among other Internet-based computing systems. However,
    the managing of resources in such systems has its special problems. Scheduler
    systems need to get last information about participant nodes from information
    centers for the purpose of firmly job scheduling. In this paper, we focus on
    online updating resource information centers with processed and provided data
    based on the assumed hierarchical model.

  146. Construction auto-stabilisante d'arbre couvrant en d\'epit d'actions malicieuses.

    Authors: Swan Dubois, S&#xe9;bastien Tixeuil, Toshimitsu Masuzawa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A self-stabilizing protocol provides by definition a tolerance to transient
    failures. Recently, a new class of self-stabilizing protocols appears. These
    protocols provides also a tolerance to a given number of permanent failures. In
    this article, we are interested in self-stabilizing protocols that deal with
    Byzantines failures. We prove that, for some problems which not allow strict
    stabilization (see [Nesterenko,Arora,2002]), there exist solutions that
    tolerates Byzantine faults if we define a new criteria of tolerance.

  147. The Accuracy of Tree-based Counting in Dynamic Networks.

    Authors: Erik Aurell, Supriya Krishnamurthy, John Ardelius, Mads Dam, Rolf Stadler, Fetahi Wuhib
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Tree-based protocols are ubiquitous in distributed systems. They are
    flexible, they perform generally well, and, in static conditions, their
    analysis is mostly simple. Under churn, however, node joins and failures can
    have complex global effects on the tree overlays, making analysis surprisingly
    subtle. To our knowledge, few prior analytic results for performance estimation
    of tree based protocols under churn are currently known. We study a simple
    Bellman-Ford-like protocol which performs network size estimation over a
    tree-shaped overlay.

  148. On the Impact of the Migration Topology on the Island Model.

    Authors: Francesco Biscani, Dario Izzo, Marek Ruci&#x144;ski
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Parallel Global Optimization Algorithms (PGOA) provide an efficient way of
    dealing with hard optimization problems. One method of parallelization of GOAs
    that is frequently applied and commonly found in the contemporary literature is
    the so-called Island Model (IM). In this paper we analyze the impact of the
    migration topology on the performance of a PGOA which uses the Island Model. In
    particular we consider parallel Differential Evolution and Simulated Annealing
    with Adaptive Neighborhood and draw first conclusions that emerge from the
    conducted experiments.

  149. LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments.

    Authors: Jan Treibig, Gerhard Wellein, Georg Hager
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Exploiting the performance of today's processors requires, apart from an
    intimate knowledge of the microarchitecture, taking into account the influence
    of an ever-growing complexity in thread and cache topology. LIKWID is a
    collection of small command line applications that support inexperienced as
    well as seasoned programmers in developing and running software in an efficient
    way. The development of LIKWID is targeted on providing access to
    performance-oriented tooling in a transparent and easy manner.

  150. A Global Optimisation Toolbox for Massively Parallel Engineering Optimisation.

    Authors: Francesco Biscani, Dario Izzo, Chit Hong Yam
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A software platform for global optimisation, called PaGMO, has been developed
    within the Advanced Concepts Team (ACT) at the European Space Agency, and was
    recently released as an open-source project.

  151. Exact Sparse Matrix-Vector Multiplication on GPU's and Multicore Architectures.

    Authors: Jean-Guillaume Dumas, Brice Boyer, Pascal Giorgi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose different implementations of the sparse matrix--dense vector
    multiplication (\spmv{}) for finite fields and rings $\Zb/m\Zb$. We take
    advantage of graphic card processors (GPU) and multi-core architectures. Our
    aim is to improve the speed of \spmv{} in the \linbox library, and henceforth
    the speed of its black box algorithms. Besides, we use this and a new
    parallelization of the sigma-basis algorithm in a parallel block Wiedemann rank
    implementation over finite fields.

  152. Passively Mobile Communicating Logarithmic Space Machines.

    Authors: Paul G. Spirakis, Ioannis Chatzigiannakis, Othon Michail, Stavros Nikolaou, Andreas Pavlogiannis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a new theoretical model for passively mobile Wireless Sensor
    Networks. We call it the PALOMA model, standing for PAssively mobile
    LOgarithmic space MAchines. The main modification w.r.t. the Population
    Protocol model is that agents now, instead of being automata, are Turing
    Machines whose memory is logarithmic in the population size n. Note that the
    new model is still easily implementable with current technology. We focus on
    complete communication graphs.

  153. Boosting Multi-Core Reachability Performance with Shared Hash Tables.

    Authors: Jaco van de Pol, Alfons Laarman, Michael Weber
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper focuses on data structures for multi-core reachability, which is a
    key component in model checking algorithms and other verification methods. A
    cornerstone of an efficient solution is the storage of visited states. In
    related work, static partitioning of the state space was combined with
    thread-local storage and resulted in reasonable speedups, but left open whether
    improvements are possible.

  154. Formal Relationships Between Geometrical and Classical Models for Concurrency.

    Authors: Samuel Mimram, Eric Goubault
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A wide variety of models for concurrent programs has been proposed during the
    past decades, each one focusing on various aspects of computations: trace
    equivalence, causality between events, conflicts and schedules due to resource
    accesses, etc. More recently, models with a geometrical flavor have been
    introduced, based on the notion of cubical set. These models are very rich and
    expressive since they can represent commutation between any bunch of events,
    thus generalizing the principle of true concurrency.

  155. Addressing the P2P Bootstrap Problem for Small Networks.

    Authors: David Isaac Wolinsky, P. Oscar Boykin, Renato Figueiredo, Pierre St. Juste
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    P2P overlays provide a framework for building distributed applications
    consisting of few to many resources with features including self-configuration,
    scalability, and resilience to node failures. Such systems have been
    successfully adopted in large-scale services for content delivery networks,
    file sharing, and data storage. In small-scale systems, they can be useful to
    address privacy concerns and for network applications that lack dedicated
    servers. The bootstrap problem, finding an existing peer in the overlay,
    remains a challenge to enabling these services for small-scale P2P systems.

  156. The Low Latency Fault Tolerance System.

    Authors: Wenbing Zhao, P. M. Melliar-Smith, L. E. Moser
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The Low Latency Fault Tolerance (LLFT) system provides fault tolerance for
    distributed applications, using the leader-follower replication technique. The
    LLFT system provides application-transparent replication, with strong replica
    consistency, for applications that involve multiple interacting processes or
    threads.

  157. Internet ware cloud computing :Challenges.

    Authors: S Qamar, Niranjan Lal, Mrityunjay Singh
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    After decades of engineering development and infrastructural investment,
    Internet connections have become commodity product in many countries, and
    Internet scale "cloud computing" has started to compete with traditional
    software business through its technological advantages and economy of scale.
    Cloud computing is a promising enabling technology of Internet ware Cloud
    Computing is termed as the next big thing in the modern corporate world.

  158. In Cloud, Do MTC or HTC Service Providers Benefit from the Economies of Scale?.

    Authors: Lei Wang, Lin Yuan, Jianfeng Zhan, Weisong Shi, Yi Liang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we intend to answer one key question to the success of cloud
    computing: in cloud, do many task computing (MTC) or high throughput computing
    (HTC) service providers, which offer the corresponding computing service to end
    users, benefit from the economies of scale? Our research contributions are
    three-fold: first, we propose an innovative usage model, called dynamic service
    provision (DSP) model, for MTC or HTC service providers.

  159. In Cloud, Can Scientific Communities Benefit from the Economies of Scale?.

    Authors: Lei Wang, Jianfeng Zhan, Weisong Shi, Yi Liang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The basic idea behind Cloud computing is that resource providers offer
    elastic resources to end users. In this paper, we intend to answer one key
    question to the success of Cloud computing: in Cloud, can small or medium-scale
    scientific computing communities benefit from the economies of scale?

  160. Engineering a Scalable High Quality Graph Partitioner.

    Authors: Peter Sanders, Manuel Holtgrewe, Christian Schulz
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe an approach to parallel graph partitioning that scales to
    hundreds of processors and produces a high solution quality. For example, for
    many instances from Walshaw's benchmark collection we improve the best known
    partitioning. We use the well known framework of multi-level graph
    partitioning.

  161. Effects of component-subscription network topology on large-scale data centre performance scaling.

    Authors: Ilango Sriram, Dave Cliff
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Modern large-scale date centres, such as those used for cloud computing
    service provision, are becoming ever-larger as the operators of those data
    centres seek to maximise the benefits from economies of scale. With these
    increases in size comes a growth in system complexity, which is usually
    problematic. There is an increased desire for automated "self-star"
    configuration, management, and failure-recovery of the data-centre
    infrastructure, but many traditional techniques scale much worse than linearly
    as the number of nodes to be managed increases.

  162. Importance of Explicit Vectorization for CPU and GPU Software Performance.

    Authors: Kamran Karimi, Neil G. Dickson, Firas Hamze
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Much of the current focus in high-performance computing is on
    multi-threading, multi-computing, and graphics processing unit (GPU) computing.
    However, vectorization and non-parallel optimization techniques, which can
    often be employed additionally, are less frequently discussed. In this paper,
    we present an analysis of several optimizations done on both central processing
    unit (CPU) and GPU implementations of a particular computationally intensive
    Metropolis Monte Carlo algorithm.

  163. High-Performance Physics Simulations Using Multi-Core CPUs and GPGPUs in a Volunteer Computing Context.

    Authors: Kamran Karimi, Neil G. Dickson, Firas Hamze
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents two conceptually simple methods for parallelizing a
    Parallel Tempering Monte Carlo simulation in a distributed volunteer computing
    context, where computers belonging to the general public are used. The first
    method uses conventional multi-threading. The second method uses CUDA, a
    graphics card computing system. Parallel Tempering is described, and challenges
    such as parallel random number generation and mapping of Monte Carlo chains to
    different threads are explained.

  164. Precoded Turbo Equalizer for Power Line Communication Systems.

    Authors: Kai Xie, Jing
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Power line communication continues to draw increasing interest by promising a
    wide range of applications including cost-free last-mile communication
    solution. However, signal transmitted through the power lines deteriorates
    badly due to the presence of severe inter-symbol interference (ISI) and harsh
    random pulse noise. This work proposes a new precoded turbo equalization scheme
    specifically designed for the PLC channels.

  165. A Characterization of Combined Traces Using Labeled Stratified Order Structures.

    Authors: Dai Tri Man Le
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper defines a class of labeled stratified order structures that
    characterizes exactly the notion of combined traces (i.e., comtraces) proposed
    by Janicki and Koutny in 1995. Our main technical contributions are the
    representation theorems showing that comtrace quotient monoid, combined
    dependency graph (Kleijn and Koutny 2008) and our labeled stratified order
    structure characterization are three different and yet equivalent ways to
    represent comtraces.

  166. Scalable Group Management in Large-Scale Virtualized Clusters.

    Authors: Lei Wang, Lin Yuan, Jianfeng Zhan, Dan Meng, Wei Zhou
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    To save cost, recently more and more users choose to provision virtual
    machine resources in cluster systems, especially in data centres. Maintaining a
    consistent member view is the foundation of reliable cluster managements, and
    it also raises several challenge issues for large scale cluster systems
    deployed with virtual machines (which we call virtualized clusters). In this
    paper, we introduce our experiences in design and implementation of scalable
    member view management on large-scale virtual clusters.

  167. An efficient algorithm for the parallel solution of high-dimensional differential equations.

    Authors: Tuhin Sahai, Stefan Klus, Cong Liu, Michael Dellnitz
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The study of high-dimensional differential equations is challenging and
    difficult due to the analytical and computational intractability. Here, we
    significantly improve the speed of waveform relaxation (WR), a method to
    simulate high-dimensional differential-algebraic equations. This new method
    termed adaptive waveform relaxation (AWR) is tested on a communication network
    example. Further we analyze different heuristics for computing graph partitions
    tailored to adaptive waveform relaxation.

  168. Gossip Algorithms for Distributed Signal Processing.

    Authors: Alexandros G. Dimakis, Soummya Kar, Michael G. Rabbat, Jose M.F. Moura, Anna Scaglione
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Gossip algorithms are attractive for in-network processing in sensor networks
    because they do not require any specialized routing, there is no bottleneck or
    single point of failure, and they are robust to unreliable wireless network
    conditions. Recently, there has been a surge of activity in the computer
    science, control, signal processing, and information theory communities,
    developing faster and more robust gossip algorithms and deriving theoretical
    performance guarantees. This article presents an overview of recent work in the
    area.

  169. Improving Waiting Time of Tasks Scheduled Under Preemptive Round Robin Using Changeable Time Quantum.

    Authors: Samih Mohemmed Mostafa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Minimizing waiting time for tasks waiting in the queue for execution is one
    of the important scheduling cri-teria which took a wide area in scheduling
    preemptive tasks. In this paper we present Changeable Time Quan-tum (CTQ)
    approach combined with the round-robin algorithm, we try to adjust the time
    quantum according to the burst times of the tasks in the ready queue.

  170. InterCloud: Utility-Oriented Federation of Cloud Computing Environments for Scaling of Application Services.

    Authors: Rajkumar Buyya, Rajiv Ranjan, Rodrigo N. Calheiros
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing providers have setup several data centers at different
    geographical locations over the Internet in order to optimally serve needs of
    their customers around the world. However, existing systems do not support
    mechanisms and policies for dynamically coordinating load distribution among
    different Cloud-based data centers in order to determine optimal location for
    hosting application services to achieve reasonable QoS levels.

  171. Cloud Computing.

    Authors: N. V. Kalyankar, Shivaji P. Mirashe
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Computing as you know it is about to change, your applications and documents
    are going to move from the desktop into the cloud. I'm talking about cloud
    computing, where applications and files are hosted on a "cloud" consisting of
    thousands of computers and servers, all linked together and accessible via the
    Internet. With cloud computing, everything you do is now web based instead of
    being desktop based. You can access all your programs and documents from any
    computer that's connected to the Internet. How will cloud computing change the
    way you work?

  172. A Security Based Data Mining Approach in Data Grid.

    Authors: S. Vidhya, S. Karthikeyan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Grid computing is the next logical step to distributed computing. Main
    objective of grid computing is an innovative approach to share resources such
    as CPU usage; memory sharing and software sharing. Data Grids provide
    transparent access to semantically related data resources in a heterogeneous
    system. The system incorporates both data mining and grid computing techniques
    where Grid application reduces the time for sending results to several clients
    at the same time and Data mining application on computational grids gives fast
    and sophisticated results to users.

  173. The Cloud Adoption Toolkit: Addressing the Challenges of Cloud Adoption in Enterprise.

    Authors: Ali Khajeh-Hosseini, Ian Sommerville, David Greenwood, James Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing promises a radical shift in the provisioning of computing
    resource within enterprise.

  174. Parallel Generation of Massive Scale-Free Graphs.

    Authors: Andy Yoo, Keith Henderson
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    One of the biggest huddles faced by researchers studying algorithms for
    massive graphs is the lack of large input graphs that are essential for the
    development and test of the graph algorithms. This paper proposes two efficient
    and highly scalable parallel graph generation algorithms that can produce
    massive realistic graphs to address this issue. The algorithms, designed to
    achieve high degree of parallelism by minimizing inter-processor
    communications, are two of the fastest graph generators which are capable of
    generating scale-free graphs with billions of vertices and edges.

  175. Towards trusted volunteer grid environments.

    Authors: Maher Khemakhem, Abdelfettah Belghith, Sousse University, Tunisia, Manouba University, Tunisia)
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Intensive experiences show and confirm that grid environments can be
    considered as the most promising way to solve several kinds of problems
    relating either to cooperative work especially where involved collaborators are
    dispersed geographically or to some very greedy applications which require
    enough power of computing or/and storage. Such environments can be classified
    into two categories; first, dedicated grids where the federated computers are
    solely devoted to a specific work through its end.

  176. Asynchronous Bounded Expected Delay Networks.

    Authors: Jun Pang, Wan Fokkink, Rena Bakhshi, J&#xf6;rg Endrullis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The commonly used asynchronous bounded delay (ABD) network models assume a
    fixed bound on message delay. We propose a probabilistic network model, called
    asynchronous bounded expected delay (ABE) model. Instead of a strict bound, the
    ABE model requires only a bound on the expected message delay. While the
    conditions of ABD networks restrict the set of possible executions, in ABE
    networks all asynchronous executions are possible, but executions with
    extremely long delays are less probable. In contrast to ABD networks, ABE
    networks cannot be synchronised efficiently.

  177. Parallel structurally-symmetric sparse matrix-vector products on multi-core processors.

    Authors: Vicente H. F. Batista, George O. Ainsworth Jr., Fernando L. B. Ribeiro
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider the problem of developing an efficient multi-threaded
    implementation of the matrix-vector multiplication algorithm for sparse
    matrices with structural symmetry. Matrices are stored using the compressed
    sparse row-column format (CSRC), designed for profiting from the symmetric
    non-zero pattern observed in global finite element matrices. Unlike classical
    compressed storage formats, performing the sparse matrix-vector product using
    the CSRC requires thread-safe access to the destination vector. To avoid race
    conditions, we have implemented two partitioning strategies.

  178. Algorithms For Extracting Timeliness Graphs.

    Authors: Carole Delporte-Gallet, St&#xe9;phane Devismes, Hugues Fauconnier, Mikel Larrea
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider asynchronous message-passing systems in which some links are
    timely and processes may crash. Each run defines a timeliness graph among
    correct processes: (p; q) is an edge of the timeliness graph if the link from p
    to q is timely (that is, there is bound on communication delays from p to q).
    The main goal of this paper is to approximate this timeliness graph by graphs
    having some properties (such as being trees, rings,...).

  179. Decreasing log data of multi-tier services for effective request tracing.

    Authors: Jianfeng Zhan, Bo Sang, Guanhua Tian
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Previous work shows request tracing systems help understand and debug the
    performance problems of multi-tier services. However, for large-scale data
    centers, more than hundreds of thousands of service instances provide online
    service at the same time. Previous work such as white-box or black box tracing
    systems will produce large amount of log data, which would be correlated into
    large quantities of causal paths for performance debugging. In this paper, we
    propose an innovative algorithm to eliminate valueless logs of multitiers
    services.

  180. PhoenixCloud: Provisioning Runtime Environments for Heterogeneous Cloud Workloads.

    Authors: Lei Wang, Jianfeng Zhan, Weisong Shi, Shimin Gong, Xiutao Zang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As more and more service providers choose Cloud platforms, a resource
    provider needs to provision runtime environments (REs) for heterogeneous
    workloads in different scenarios. Previous work fails to resolve this issue in
    several ways: (1) it fails to pay attention to diverse RE requirements, and
    does not enable creating coordinated REs on demand; (2) few work investigates
    coordinated resource provisioning for heterogeneous workloads.

  181. Precise Request Tracing and Performance Debugging for Multi-tier Services of Black Boxes.

    Authors: Lei Wang, Jianfeng Zhan, Dan Meng, Yong Li, Zhihong Zhang, Bo Sang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As more and more multi-tier services are developed from commercial components
    or heterogeneous middleware without the source code available, both developers
    and administrators need a precise request tracing tool to help understand and
    debug performance problems of large concurrent services of black boxes.
    Previous work fails to resolve this issue in several ways: they either accept
    the imprecision of probabilistic correlation methods, or rely on knowledge of
    protocols to isolate requests in pursuit of tracing accuracy.

  182. Accelerating sequential programs using FastFlow and self-offloading.

    Authors: Marco Aldinucci, Massimo Torquati, Massimiliano Meneghin, Marco Danelutto, Peter Kilpatrick
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    FastFlow is a programming environment specifically targeting cache-coherent
    shared-memory multi-cores. FastFlow is implemented as a stack of C++ template
    libraries built on top of lock-free (fence-free) synchronization mechanisms. In
    this paper we present a further evolution of FastFlow enabling programmers to
    offload part of their workload on a dynamically created software accelerator
    running on unused CPUs. The offloaded function can be easily derived from
    pre-existing sequential code.

  183. An Approach to Ad hoc Cloud Computing.

    Authors: Graham Kirby, Alan Dearle, Angus Macdonald, Alvaro Fernandes
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider how underused computing resources within an enterprise may be
    harnessed to improve utilization and create an elastic computing
    infrastructure. Most current cloud provision involves a data center model, in
    which clusters of machines are dedicated to running cloud infrastructure
    software. We propose an additional model, the ad hoc cloud, in which
    infrastructure software is distributed over resources harvested from machines
    already in existence within an enterprise.

  184. Breaking the O(n^2) Bit Barrier: Scalable Byzantine agreement with an Adaptive Adversary.

    Authors: Valerie King, Jared Saia
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe an algorithm for Byzantine agreement that is scalable in the
    sense that each processor sends only $\tilde{O}(\sqrt{n})$ bits, where $n$ is
    the total number of processors. Our algorithm succeeds with high probability
    against an \emph{adaptive adversary}, which can take over processors at any
    time during the protocol, up to the point of taking over arbitrarily close to a
    1/3 fraction. We assume synchronous communication but a \emph{rushing}
    adversary.

  185. Deterministic Sample Sort For GPUs.

    Authors: Frank Dehne, Hamidreza Zaboli
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present and evaluate GPU Bucket Sort, a parallel deterministic sample sort
    algorithm for many-core GPUs. Our method is considerably faster than Thrust
    Merge (Satish et.al., Proc. IPDPS 2009), the best comparison-based sorting
    algorithm for GPUs, and it is as fast as the new randomized sample sort for
    GPUs by Leischner et.al. (to appear in Proc. IPDPS 2010). Our deterministic
    sample sort has the advantage that bucket sizes are guaranteed and therefore
    its running time does not have the input data dependent fluctuations that can
    occur for randomized sample sort.

  186. Exploring the Limits of GPUs With Parallel Graph Algorithms.

    Authors: Frank Dehne, Kumanan Yogaratnam
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we explore the limits of graphics processors (GPUs) for
    general purpose parallel computing by studying problems that require highly
    irregular data access patterns: parallel graph algorithms for list ranking and
    connected components. Such graph problems represent a worst case scenario for
    coalescing parallel memory accesses on GPUs which is critical for good GPU
    performance. Our experimental study indicates that PRAM algorithms are a good
    starting point for developing efficient parallel GPU methods but require
    non-trivial modifications to ensure good GPU performance.

  187. Phoenix Cloud : Consolidating Heterogeneous Workloads of Large Organizations on Cloud Computing Platforms.

    Authors: Lei Wang, Jianfeng Zhan, Bibo Tu, Dan Meng, Yong Li, Peng Wang, Wei Zhou
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    For a large organization, different departments often maintain dedicated
    cluster systems for different workloads, for example parallel batch jobs or Web
    services. In this paper, we design and implement an innovative cloud computing
    system software, Phoenix Cloud, to consolidate heterogeneous workloads of the
    same organization on cloud computing platforms. For Phoenix Cloud, we propose
    cooperative resource provision and management polices for the affiliated
    departments of a large organization to share cluster systems.

  188. Automatic Performance Debugging of SPMD Parallel Programs.

    Authors: Xu Liu, Lin Yuan, Jianfeng Zhan, Bibo Tu, Dan Meng
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Automatic performance debugging of parallel applications usually involves two
    steps: automatic detection of performance bottlenecks and uncovering their root
    causes for performance optimization. Previous work fails to resolve this
    challenging issue in several ways: first, several previous efforts automate
    analysis processes, but present the results in a confined way that only
    identifies performance problems with apriori knowledge; second, several tools
    take exploratory or confirmatory data analysis to automatically discover
    relevant performance data relationships.

  189. A Cluster-based Approach for Outlier Detection in Dynamic Data Streams (KORM: k-median OutlieR Miner).

    Authors: Parneeta Dhaliwal, M.P.S. Bhatia, Priti Bansal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Outlier detection in data streams has gained wide importance presently due to
    the increasing cases of fraud in various applications of data streams. The
    techniques for outlier detection have been divided into either statistics
    based, distance based, density based or deviation based. Till now, most of the
    work in the field of fraud detection was distance based but it is incompetent
    from computational point of view. In this paper we introduced a new clustering
    based approach, which divides the stream in chunks and clusters each chunk
    using kmedian into variable number of clusters.

  190. Window-Based Greedy Contention Management for Transactional Memory.

    Authors: Gokarna Sharma, Brett Estrade, Costas Busch
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider greedy contention managers for transactional memory for M x N
    execution windows of transactions with M threads and N transactions per thread.
    Assuming that each transaction conflicts with at most C other transactions
    inside the window, a trivial greedy contention manager can schedule them within
    CN time. In this paper, we show that there are much better schedules.

  191. Fast Flooding over Manhattan.

    Authors: Andrea Clementi, Angelo Monti, Riccardo Silvestri
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider a Mobile Ad-hoc NETwork (MANET) formed by n agents that move at
    speed V according to the Manhattan Random-Way Point model over a square region
    of side length L. The resulting stationary (agent) spatial probability
    distribution is far to be uniform: the average density over the "central zone"
    is asymptotically higher than that over the "suburb". Agents exchange data iff
    they are at distance at most R within each other.

  192. Generalized Adaptive Network Coded Cooperation (GANCC): A Unified Framework for Network Coding and Channel Coding.

    Authors: Xingkai Bao, Jing Li
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper considers distributed coding for multi-source single-sink data
    collection wireless networks. A unified framework for network coding and
    channel coding, termed "generalized adaptive network coded cooperation"
    (GANCC), is proposed. Key ingredients of GANCC include: matching code graphs
    with the dynamic network graphs on-the-fly, and integrating channel coding with
    network coding through circulant low-density parity-check codes. Several code
    constructing methods and several families of sparse-graph codes are proposed,
    and information theoretical analysis is performed.

  193. COTAR: An Accurate, Cost-Effective Cooperative Wireless Localization Strategy for Mobile Nodes.

    Authors: Xingkai Bao, Jing Li
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper considers N mobile nodes that move together in the vicinity of
    each other, whose initial poses as well as subsequent movements must be
    accurately tracked in real time with the assist of M(>=3) reference nodes. By
    engaging the neighboring mobile nodes in a simple but effective cooperation,
    and by exploiting both the time-of-arrival (TOA) information (between mobile
    nodes and reference nodes) and the received-signal-strength (RSS) information
    (between mobile nodes), an effective new localization strategy, termed
    cooperative TOA and RSS (COTAR), is developed.

  194. Cloud Migration: A Case Study of Migrating an Enterprise IT System to IaaS.

    Authors: Ali Khajeh-Hosseini, Ian Sommerville, David Greenwood
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This case study illustrates the potential benefits and risks associated with
    the migration of an IT system in the oil & gas industry from an in-house data
    center to Amazon EC2 from a broad variety of stakeholder perspectives across
    the enterprise, thus transcending the typical, yet narrow, financial and
    technical analysis offered by providers. Our results show that the system
    infrastructure in the case study would have cost 37% less over 5 years on EC2,
    and using cloud computing could have potentially eliminated 21% of the support
    calls for this system.

  195. Optimization and Analysis of Distributed Averaging with Short Node Memory.

    Authors: Boris N. Oreshkin, Mark J. Coates, Michael G. Rabbat
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we demonstrate, both theoretically and by numerical examples,
    that adding a local prediction component to the update rule can significantly
    improve the convergence rate of distributed averaging algorithms. We focus on
    the case where the local predictor is a linear combination of the node's two
    previous values (i.e., two memory taps), and our update rule computes a
    combination of the predictor and the usual weighted linear combination of
    values received from neighbouring nodes.

  196. Virtual Machine Support for Many-Core Architectures: Decoupling Abstract from Concrete Concurrency Models.

    Authors: Stefan Marr, Michael Haupt, Stijn Timbermont, Bram Adams, Theo D&#x27;Hondt, Pascal Costanza, Wolfgang De Meuter
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The upcoming many-core architectures require software developers to exploit
    concurrency to utilize available computational power. Today's high-level
    language virtual machines (VMs), which are a cornerstone of software
    development, do not provide sufficient abstraction for concurrency concepts. We
    analyze concrete and abstract concurrency models and identify the challenges
    they impose for VMs. To provide sufficient concurrency support in VMs, we
    propose to integrate concurrency operations into VM instruction sets.

  197. Simplifying Parallelization of Scientific Codes by a Function-Centric Approach in Python.

    Authors: Jon K. Nilsen, Xing Cai, Bjorn Hoyland, Hans Petter Langtangen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The purpose of this paper is to show how existing scientific software can be
    parallelized using a separate thin layer of Python code where all parallel
    communication is implemented. We provide specific examples on such layers of
    code, and these examples may act as templates for parallelizing a wide set of
    serial scientific codes. The use of Python for parallelization is motivated by
    the fact that the language is well suited for reusing existing serial codes
    programmed in other languages.

  198. Performance and Stability of the Chelonia Storage Cloud.

    Authors: Jon K. Nilsen, Salman Toor, Zsombor Nagy, Bjarte Mohn, Alex L. Read
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we present the Chelonia storage cloud middleware. It was
    designed to fill the requirements gap between those of large, sophisticated
    scientific collaborations which have adopted the grid paradigm for their
    distributed storage needs, and of corporate business communities which are
    gravitating towards the cloud paradigm. The similarities to and differences
    between Chelonia and several well-known grid- and cloud-based storage solutions
    are commented.

  199. Local algorithms in (weakly) coloured graphs.

    Authors: Matti &#xc5;strand, Valentin Polishchuk, Joel Rybicki, Jukka Suomela, Jara Uitto
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A local algorithm is a distributed algorithm that completes after a constant
    number of synchronous communication rounds. We present local approximation
    algorithms for the minimum dominating set problem and the maximum matching
    problem in 2-coloured and weakly 2-coloured graphs. In a weakly 2-coloured
    graph, both problems admit a local algorithm with the approximation factor
    $(\Delta+1)/2$, where $\Delta$ is the maximum degree of the graph. We also give
    a matching lower bound proving that there is no local algorithm with a better
    approximation factor for either of these problems.

  200. A Multi-Stage CUDA Kernel for Floyd-Warshall.

    Authors: Ben Lund, Justin W Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a new implementation of the Floyd-Warshall All-Pairs Shortest
    Paths algorithm on CUDA. Our algorithm runs approximately 5 times faster than
    the previously best reported algorithm. In order to achieve this speedup, we
    applied a new technique to reduce usage of on-chip shared memory and allow the
    CUDA scheduler to more effectively hide instruction latency.

  201. Optimization of Multiple Vehicle Routing Problems using Approximation Algorithms.

    Authors: R. Nallusamy, K. Duraiswamy, R. Dhanalaksmi, P. Parthiban
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper deals with generating of an optimized route for multiple Vehicle
    routing Problems (mVRP). We used a methodology of clustering the given cities
    depending upon the number of vehicles and each cluster is allotted to a
    vehicle. k- Means clustering algorithm has been used for easy clustering of the
    cities. In this way the mVRP has been converted into VRP which is simple in
    computation compared to mVRP. After clustering, an optimized route is generated
    for each vehicle in its allotted cluster.

  202. Severity Prediction of Drought in A Large Geographical Area Using Distributed Wireless Sensor Networks.

    Authors: M. Vaidehi, T.R. Gopalakrsihnan Nair, Satish.G. Dappin, G. Nithya Nair
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, the severity prediction of drought through the implementation
    of modern sensor networks is discussed. We describe how to design a drought
    prediction system using wireless sensor networks. This paper will describe a
    terrestrial interconnected wireless sensor network paradigm for the prediction
    of severity of drought over a vast area of 10,000 sq km. The communication
    architecture for sensor network is outlined and the protocols developed for
    each layer is explored. The data integration model and sensor data analysis at
    the central computer is explained.

  203. Performance and Fault Tolerance in the StoreTorrent Parallel Filesystem.

    Authors: Federico D. Sacerdoti
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    With a goal of supporting the timely and cost-effective analysis of Terabyte
    datasets on commodity components, we present and evaluate StoreTorrent, a
    simple distributed filesystem with integrated fault tolerance for efficient
    handling of small data records. Our contributions include an application-OS
    pipelining technique and metadata structure to increase small write and read
    performance by a factor of 1-10, and the use of peer-to-peer communication of
    replica-location indexes to avoid transferring data during parallel analysis
    even in a degraded state.

  204. Termination Detection of Local Computations.

    Authors: Emmanuel Godard, Yves M&#xe9;tivier, Gerard Tel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Contrary to the sequential world, the processes involved in a distributed
    system do not necessarily know when a computation is globally finished. This
    paper investigates the problem of the detection of the termination of local
    computations. We define four types of termination detection: no detection,
    detection of the local termination, detection by a distributed observer,
    detection of the global termination.

  205. Practical Parallel External Memory Algorithms via Simulation of Parallel Algorithms.

    Authors: David E. Robillard
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This thesis introduces PEMS2, an improvement to PEMS (Parallel External
    Memory System). PEMS executes Bulk-Synchronous Parallel (BSP) algorithms in an
    External Memory (EM) context, enabling computation with very large data sets
    which exceed the size of main memory. Many parallel algorithms have been
    designed and implemented for Bulk-Synchronous Parallel models of computation.
    Such algorithms generally assume that the entire data set is stored in main
    memory at once.

  206. Research Agenda in Cloud Technologies.

    Authors: Ilango Sriram, Ali Khajeh-Hosseini
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing is the latest effort in delivering computing resources as a
    service. It represents a shift away from computing as a product that is
    purchased, to computing as a service that is delivered to consumers over the
    internet from large-scale data centres - or "clouds". Whilst cloud computing is
    gaining growing popularity in the IT industry, academia appeared to be lagging
    behind the rapid developments in this field.

  207. Research Challenges for Enterprise Cloud Computing.

    Authors: Ilango Sriram, Ali Khajeh-Hosseini, Ian Sommerville
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing represents a shift away from computing as a product that is
    purchased, to computing as a service that is delivered to consumers over the
    internet from large-scale data centers - or "clouds". This paper discusses some
    of the research challenges for cloud computing from an enterprise or
    organizational perspective, and puts them in context by reviewing the existing
    body of literature in cloud computing.

  208. Towards Transactional Load over XtreemFS.

    Authors: Roman Talyansky, Adolf Hohl, Bernd Scheuermann, Bjorn Kolbeck, Erich Focht
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose using trace-based assessment of the performance of distributed
    file systems (DFS) under transactional IO load. The assessment includes
    simulations and experiments using the IO traces. Our experiments suggest that
    DFS, and specifically XtreemFS have a good potential to support transactional
    IO load in distributed environments: they demonstrate good performance, high
    availability and scalability, while at the same time opening the way to TCO
    reduction.

  209. Classifying Application Phases in Asymmetric Chip Multiprocessors.

    Authors: A. Z. Jooya, M. Analoui
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In present study, in order to improve the performance and reduce the amount
    of power which is dissipated in heterogeneous multicore processors, the ability
    of detecting the program execution phases is investigated. The programs
    execution intervals have been classified in different phases based on their
    throughput and the utilization of the cores. The results of implementing the
    phase detection technique are investigated on a single core processor and also
    on a multicore processor.

  210. Multiprocessor Scheduling For Tasks With Priority Using GA.

    Authors: Dr.G.Padmavathi, Mrs.S.R.Vijayalakshmi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Multiprocessors have emerged as a powerful computing means for running
    realtime applications, especially where a uniprocessor system would not be
    sufficient enough to execute all the tasks. The high performance and
    reliability of multiprocessors have made them a powerful computing resource.
    Such computing environment requires an efficient algorithm to determine when
    and on which processor a given task should execute.

  211. MapReduce for Integer Factorization.

    Authors: Javier Tordable
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Integer factorization is a very hard computational problem. Currently no
    efficient algorithm for integer factorization is publicly known. However, this
    is an important problem on which it relies the security of many real world
    cryptographic systems.

    I present an implementation of a fast factorization algorithm on MapReduce.
    MapReduce is a programming model for high performance applications developed
    originally at Google. The quadratic sieve algorithm is split into the different
    MapReduce phases and compared against a standard implementation.

  212. A distributed file system for a wide-area high performance computing infrastructure.

    Authors: Edward Walker
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe our work in implementing a wide-area distributed file system for
    the NSF TeraGrid. The system, called XUFS, allows private distributed name
    spaces to be created for transparent access to personal files across over 9000
    computer nodes. XUFS builds on many principles from prior distributed file
    systems research, but extends key design goals to support the workflow of
    computational science researchers.

  213. QR Factorization of Tall and Skinny Matrices in a Grid Computing Environment.

    Authors: Emmanuel Agullo, Camille Coti, Jack Dongarra, Thomas Herault, Julien Langou
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Previous studies have reported that common dense linear algebra operations do
    not achieve speed up by using multiple geographical sites of a computational
    grid. Because such operations are the building blocks of most scientific
    applications, conventional supercomputers are still strongly predominant in
    high-performance computing and the use of grids for speeding up large-scale
    scientific problems is limited to applications exhibiting parallelism at a
    higher level.

  214. DiVinE-CUDA - A Tool for GPU Accelerated LTL Model Checking.

    Authors: Ji&#x159;&#xed; Barnat, Lubo&#x161; Brim, Milan &#x10c;e&#x161;ka
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we present a tool that performs CUDA accelerated LTL Model
    Checking. The tool exploits parallel algorithm MAP adjusted to the NVIDIA CUDA
    architecture in order to efficiently detect the presence of accepting cycles in
    a directed graph. Accepting cycle detection is the core algorithmic procedure
    in automata-based LTL Model Checking. We demonstrate that the tool outperforms
    non-accelerated version of the algorithm and we discuss where the limits of the
    tool are and what we intend to do in the future to avoid them.

  215. Formal Aspects of Grid Brokering.

    Authors: Attila Kert&#xe9;sz, Zsolt N&#xe9;meth
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Coordination in distributed environments, like Grids, involves selecting the
    most appropriate services, resources or compositions to carry out the planned
    activities. Such functionalities appear at various levels of the infrastructure
    and in various means forming a blurry domain, where it is hard to see how the
    participating components are related and what their relevant properties are. In
    this paper we focus on a subset of these problems: resource brokering in Grid
    middleware.

  216. Parallelizing Deadlock Resolution in Symbolic Synthesis of Distributed Programs.

    Authors: Fuad Abujarad, Borzoo Bonakdarpour, Sandeep S. Kulkarni
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Previous work has shown that there are two major complexity barriers in the
    synthesis of fault-tolerant distributed programs: (1) generation of fault-span,
    the set of states reachable in the presence of faults, and (2) resolving
    deadlock states, from where the program has no outgoing transitions. Of these,
    the former closely resembles with model checking and, hence, techniques for
    efficient verification are directly applicable to it. Hence, we focus on
    expediting the latter with the use of multi-core technology.

  217. Survey of clustering algorithms for MANET.

    Authors: Dr. Mahesh Motwani, Ratish Agarwal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Many clustering schemes have been proposed for ad hoc networks. A systematic
    classification of these clustering schemes enables one to better understand and
    make improvements. In mobile ad hoc networks, the movement of the network nodes
    may quickly change the topology resulting in the increase of the overhead
    message in topology maintenance. Protocols try to keep the number of nodes in a
    cluster around a pre-defined threshold to facilitate the optimal operation of
    the medium access control protocol.

  218. Efficient Gaussian Elimination on a 2D SIMD Array of Processors without Column Broadcasts.

    Authors: Mugurel Ionut Andreica
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents an efficient method for implementing the Gaussian
    elimination technique for an nxm (m>=n) matrix, using a 2D SIMD array of nxm
    processors. The described algorithm consists of 2xn-1=O(n) iterations, which
    provides an optimal speed-up over the serial version. A particularity of the
    algorithm is that it only requires broadcasts on the rows of the processor
    matrix and not on its columns. The paper also presents several extensions and
    applications of the Gaussian elimination algorithm.

  219. Peer-to-Peer Cloud Provisioning: Service Discovery and Load-Balancing.

    Authors: Anna Liu, Rajiv Ranjan, Liang Zhao, Xiaomin Wu
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This chapter presents: (i) a layered peer-to-peer Cloud provisioning
    architecture; (ii) a summary of the current state-of-the-art in Cloud
    provisioning with particular emphasis on service discovery and load-balancing;
    (iii) a classification of the existing peer-to-peer network management model
    with focus on extending the DHTs for indexing and managing complex provisioning
    information; and (iv) the design and implementation of novel, extensible
    software fabric (Cloud peer) that combines public/private clouds, overlay
    networking and structured peer-to-peer indexing techniques for supporting
    s

  220. Self-Stabilizing Byzantine Asynchronous Unison.

    Authors: Swan Dubois, Maria Gradinariu Potop-Butucaru, Mikhail Nesterenko, S&#xe9;bastien Tixeuil
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We explore asynchronous unison in the presence of systemic transient and
    permanent Byzantine faults in shared memory. We observe that the problem is not
    solvable under less than strongly fair scheduler or for system topologies with
    maximum node degree greater than two. We present a self-stabilizing
    Byzantine-tolerant solution to asynchronous unison for chain and ring
    topologies. Our algorithm has minimum possible containment radius and optimal
    stabilization time.

  221. Building and Installing a Hadoop/MapReduce Cluster from Commodity Components.

    Authors: Jochen L. Leidner, Gary Berosik
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This tutorial presents a recipe for the construction of a compute cluster for
    processing large volumes of data, using cheap, easily available personal
    computer hardware (Intel/AMD based PCs) and freely available open source
    software (Ubuntu Linux, Apache Hadoop).

  222. Checkpointing vs. Migration for Post-Petascale Machines.

    Authors: Franck Cappello, Henri Casanova, Yves Robert
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We craft a few scenarios for the execution of sequential and parallel jobs on
    future generation machines. Checkpointing or migration, which technique to
    choose?

  223. Lattice QCD Thermodynamics on the Grid.

    Authors: Jakub T. Mo&#x15b;cicki, Maciej Wo&#x15b;, Massimo Lamanna, Philippe de Forcrand, Owe Philipsen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe how we have used simultaneously ${\cal O}(10^3)$ nodes of the
    EGEE Grid, accumulating ca. 300 CPU-years in 2-3 months, to determine an
    important property of Quantum Chromodynamics. We explain how Grid resources
    were exploited efficiently and with ease, using user-level overlay based on
    Ganga and DIANE tools above standard Grid software stack. Application-specific
    scheduling and resource selection based on simple but powerful heuristics
    allowed to improve efficiency of the processing to obtain desired scientific
    results by a specified deadline.

  224. A Semantic Grid Oriented to E-Tourism.

    Authors: Xiao Ming Zhang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    With increasing complexity of tourism business models and tasks, there is a
    clear need of the next generation e-Tourism infrastructure to support flexible
    automation, integration, computation, storage, and collaboration. Currently
    several enabling technologies such as semantic Web, Web service, agent and grid
    computing have been applied in the different e-Tourism applications, however
    there is no a unified framework to be able to integrate all of them.

  225. Best-effort Group Service in Dynamic Networks.

    Authors: Bertrand Ducourthial, Sofiane Khalfallah, Franck Petit
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a group membership service for dynamic ad hoc networks. It
    maintains as long as possible the existing groups and ensures that each group
    diameter is always smaller than a constant, ?xed according to the application
    using the groups. The proposed protocol is self-stabilizing and works in
    dynamic distributed systems. Moreover, it ensures a kind of continuity in the
    service o?er to the application while the system is converging, except if too
    strong topology changes happen.

  226. PyCUDA: GPU Run-Time Code Generation for High-Performance Computing.

    Authors: Andreas Kl&#xf6;ckner, Nicolas Pinto, Yunsup Lee, Bryan Catanzaro, Paul Ivanov, Ahmed Fasih
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    High-performance scientific computing has recently seen a surge of interest
    in heterogeneous systems, with an emphasis on modern Graphics Processing Units
    (GPUs). These devices offer tremendous potential for performance and efficiency
    in important large-scale applications of computational science. However,
    exploiting this potential can be challenging, as one must adapt to the
    specialized and rapidly evolving computing environment currently exhibited by
    GPUs. One way of addressing this challenge is to embrace better techniques and
    develop tools tailored to their needs.

  227. Near-Optimal Sublinear Time Bounds for Distributed Random Walks.

    Authors: Prasad Tetali, Danupon Nanongkai, Atish Das Sarma, Gopal Pandurangan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We focus on the problem of performing random walks efficiently in a
    distributed network. Given bandwidth constraints, the goal is to minimize the
    number of rounds required to obtain a random walk sample on an undirected
    network. Despite the widespread use of random walks in distributed computing,
    most algorithms that compute a random walk sample of length $\ell$ naively,
    i.e., in $O(\ell)$ rounds.

  228. Fault-Tolerance through Message-logging and Check-pointing: Disaster Recovery for CORBA-based Distributed Bank Servers.

    Authors: Emil Vassev, Que Thu Dung Nguyen, Heng Kuang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This report presents results of our endeavor towards developing a
    failure-recovery variant of a CORBA-based bank server that provides fault
    tolerance features through message logging and checkpoint logging.

  229. Global communications in multiprocessor simulations of flames.

    Authors: V. Karlin
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we investigate performance of global communications in a
    particular parallel code. The code simulates dynamics of expansion of premixed
    spherical flames using an asymptotic model of Sivashinsky type and a spectral
    numerical algorithm. As a result, the code heavily relies on global all-to-all
    interprocessor communications implementing transposition of the distributed
    data array in which numerical solution to the problem is stored.

  230. Domain Decomposition Based High Performance Parallel Computing.

    Authors: Mandhapati P. Raju, Siddhartha Khaitan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The study deals with the parallelization of finite element based
    Navier-Stokes codes using domain decomposition and state-ofart sparse direct
    solvers. There has been significant improvement in the performance of sparse
    direct solvers. Parallel sparse direct solvers are not found to exhibit good
    scalability. Hence, the parallelization of sparse direct solvers is done using
    domain decomposition techniques. A highly efficient sparse direct solver
    PARDISO is used in this study. The scalability of both Newton and modified
    Newton algorithms are tested.

  231. Distributed Abstract Optimization via Constraints Consensus: Theory and Applications.

    Authors: Francesco Bullo, Giuseppe Notarstefano
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Distributed abstract programs are a novel class of distributed optimization
    problems where (i) the number of variables is much smaller than the number of
    constraints and (ii) each constraint is associated to a network node. Abstract
    optimization programs are a generalization of linear programs that captures
    numerous geometric optimization problems.

  232. SPECI, a simulation tool exploring cloud-scale data centres.

    Authors: Ilango Sriram
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    There is a rapid increase in the size of data centres (DCs) used to provide
    cloud computing services. It is commonly agreed that not all properties in the
    middleware that manages DCs will scale linearly with the number of components.
    Further, "normal failure" complicates the assessment of the per-formance of a
    DC. However, unlike in other engineering domains, there are no well established
    tools that allow the prediction of the performance and behav-iour of future
    generations of DCs.

  233. Optimised access to user analysis data using the gLite DPM.

    Authors: Sam Skipsey, Greig Cowan, Mike Kenyon, Graeme Stewart, Stuart Purdie
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The ScotGrid distributed Tier-2 now provides more that 4MSI2K and 500TB for
    LHC computing, which is spread across three sites at Durham, Edinburgh and
    Glasgow. Tier-2 sites have a dual role to play in the computing models of the
    LHC VOs. Firstly, their CPU resources are used for the generation of Monte
    Carlo event data. Secondly, the end user analysis data is distributed across
    the grid to the site's storage system and held on disk ready for processing by
    physicists' analysis jobs.

  234. ScotGrid: Providing an Effective Distributed Tier-2 in the LHC Era.

    Authors: Sam Skipsey, David Ambrose-Griffith, Greig Cowan, Mike Kenyon, Orlando Richards, Phil Roffe, Graeme Stewart
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    ScotGrid is a distributed Tier-2 centre in the UK with sites in Durham,
    Edinburgh and Glasgow. ScotGrid has undergone a huge expansion in hardware in
    anticipation of the LHC and now provides more than 4MSI2K and 500TB to the LHC
    VOs. Scaling up to this level of provision has brought many challenges to the
    Tier-2 and we show in this paper how we have adopted new methods of organising
    the centres, from fabric management and monitoring to remote management of
    sites to management and operational procedures, to meet these challenges.

  235. Modeling and Verification for Timing Satisfaction of Fault-Tolerant Systems with Finiteness.

    Authors: Chih-Hong Cheng, Christian Buckl, Javier Esparza, Alois Knoll
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The increasing use of model-based tools enables further use of formal
    verification techniques in the context of distributed real-time systems. To
    avoid state explosion, it is necessary to construct verification models that
    focus on the aspects under consideration.

  236. Critical Analysis of Middleware Architectures for Large Scale Distributed Systems.

    Authors: Alexandru Costan, Corina Stratan, Eliana-Dina Tirsa, Mugurel Ionut Andreica, Valentin Cristea, Ciprian Mihai Dobre, Florin Pop
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Distributed computing is increasingly being viewed as the next phase of Large
    Scale Distributed Systems (LSDSs). However, the vision of large scale resource
    sharing is not yet a reality in many areas - Grid computing is an evolving area
    of computing, where standards and technology are still being developed to
    enable this new paradigm. Hence, in this paper we analyze the current
    development of middleware tools for LSDS, from multiple perspectives:
    architecture, applications and market research.

  237. DILAND: An Algorithm for Distributed Sensor Localization with Noisy Distance Measurements.

    Authors: Usman A. Khan, Soummya Kar, Jose M. F. Moura
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this correspondence, we present an algorithm for distributed sensor
    localization with noisy distance measurements (DILAND) that extends and makes
    the DLRE more robust. DLRE is a distributed sensor localization algorithm in
    $\mathbb{R}^m$ $(m\geq1)$ introduced in \cite{usman_loctsp:08}. DILAND operates
    when (i) the communication among the sensors is noisy; (ii) the communication
    links in the network may fail with a non-zero probability; and (iii) the
    measurements performed to compute distances among the sensors are corrupted
    with noise.

  238. DILAND: An Algorithm for Distributed Sensor Localization with Noisy Distance Measurements.

    Authors: Usman A. Khan, Soummya Kar, Jose M. F. Moura
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this correspondence, we present an algorithm for distributed sensor
    localization with noisy distance measurements (DILAND) that extends and makes
    the DLRE more robust. DLRE is a distributed sensor localization algorithm in
    $\mathbb{R}^m$ $(m\geq1)$ introduced in \cite{usman_loctsp:08}. DILAND operates
    when (i) the communication among the sensors is noisy; (ii) the communication
    links in the network may fail with a non-zero probability; and (iii) the
    measurements performed to compute distances among the sensors are corrupted
    with noise.

  239. Towards a Unified Approach to (In)Decision: Routing, Games, Circuits, Consensus, and Beyond.

    Authors: Aaron D. Jaggard, Michael Schapira, Rebecca N. Wright
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we explore a unified treatment of the difficulty of reaching a
    decision in constrained distributed computing environments in which there is a
    lack of global coordination or knowledge. We show a general impossibility
    result for a broad class of decision protocols. Importantly, our impossibility
    result holds, in particular, for "asynchronous, distributed, historyless
    computation", in which each computational node's selection of actions only
    depends on the current actions of other nodes, even under the assumption that
    no node can be faulty.

  240. High availability using virtualization.

    Authors: Federico Calzolari
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    High availability has always been one of the main problems for a data center.
    Till now high availability was achieved by host per host redundancy, a highly
    expensive method in terms of hardware and human costs. A new approach to the
    problem can be offered by virtualization.

  241. Robust Failure Detection Architecture for Large Scale Distributed Systems.

    Authors: Alexandru Costan, Mugurel Ionut Andreica, Valentin Cristea, Ciprian Mihai Dobre, Florin Pop
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Failure detection is a fundamental building block for ensuring fault
    tolerance in large scale distributed systems. There are lots of approaches and
    implementations in failure detectors. Providing flexible failure detection in
    off-the-shelf distributed systems is difficult. In this paper we present an
    innovative solution to this problem. Our approach is based on adaptive,
    decentralized failure detectors, capable of working asynchronous and
    independent on the application flow.

  242. Towards a Grid Platform for Scientific Workflows Management.

    Authors: Alexandru Costan, Corina Stratan, Eliana-Dina Tirsa, Mugurel Ionut Andreica, Valentin Cristea
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Workflow management systems allow the users to develop complex applications
    at a higher level, by orchestrating functional components without handling the
    implementation details. Although a wide range of workflow engines are developed
    in enterprise environments, the open source engines available for scientific
    applications lack some functionalities or are too difficult to use for
    non-specialists.

  243. A historical perspective on developing foundations for privacy-friendly client cloud computing: The Paradigm Shift from "Inconsistency Denial" to "Practical Semantic Integration(TM)".

    Authors: Carl Hewitt
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Arguably, the original paradigm for computation was Logic Programming broadly
    conceived as "deducing computational steps from existing information."

    The idea has a long development that went through many twists in which
    important questions turned out to have surprising answers, including the
    following:

    * How much of concurrent computation is reducible to deduction?

    * Are the laws of thought consistent?

    * Is "rapid recovery" a more viable policy than "inconsistency denial"?

  244. Supporting Lock-Free Composition of Concurrent Data Objects.

    Authors: Daniel Cederman, Philippas Tsigas
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Lock-free data objects offer several advantages over their blocking
    counterparts, such as being immune to deadlocks and convoying and, more
    importantly, being highly concurrent. But they share a common disadvantage in
    that the operations they provide are difficult to compose into larger atomic
    operations while still guaranteeing lock-freedom. We present a lock-free
    methodology for composing highly concurrent linearizable objects together by
    unifying their linearization points.

  245. A New Fuzzy Approach for Dynamic Load Balancing Algorithm.

    Authors: Abbas Karimi, Faraneh Zarafshan, Adznan.b. Jantan, A.R Ramli, M.Iqbal b.Saripan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Load balancing is the process of improving the Performance of a parallel and
    distributed system through is distribution of load among the processors [1-2].
    Most of the previous work in load balancing and distributed decision making in
    general, do not effectively take into account the uncertainty and inconsistency
    in state information but in fuzzy logic, we have advantage of using crisps
    inputs.

  246. Web-enabling Cache Daemon for Complex Data.

    Authors: Ivan Voras, Mario Zagar
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    One of the most common basic techniques for improving the performance of web
    applications is caching frequently accessed data in fast data stores,
    colloquially known as cache daemons. In this paper we present a cache daemon
    suitable for storing complex data while maintaining fine-grained control over
    data storage, retrieval and expiry. Data manipulation in this cache daemon is
    performed via standard SQL statements so we call it SQLcached. It is a
    practical, usable solution already implemented in several large web sites.

  247. Transform-based Distributed Data Gathering.

    Authors: Godwin Shen, Antonio Ortega
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A general class of unidirectional transforms is presented that can be
    computed in a distributed manner along an arbitrary routing tree. Additionally,
    we provide a set of conditions under which these transforms are invertible.
    These transforms can be computed as data is routed towards the collection (or
    sink) node in the tree and exploit data correlation between nodes in the tree.
    Moreover, when used in wireless sensor networks, these transforms can also
    leverage data received at nodes via broadcast wireless communications.

  248. Business in the Grid.

    Authors: Erich Schikuta, Thomas Weishaeupl, Flavia Donno, Heinz Stockinger, Elisabeth Vinek, Helmut Wanek, Christoph Witzany, Irfan Ul Haq
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    From 2004 to 2007 the Business In the Grid (BIG) project took place and was
    driven by the following goals: Firstly, make business aware of Grid technology
    and, secondly, try to explore new business models. We disseminated Grid
    computing by mainly concentrating on the central European market and
    interviewed several companies in order to gain insights into the Grid
    acceptance in industrial environments.

  249. Greedy Gossip with Eavesdropping.

    Authors: Deniz Ustebay, Boris Oreshkin, Mark Coates, Michael Rabbat
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents greedy gossip with eavesdropping (GGE), a novel
    randomized gossip algorithm for distributed computation of the average
    consensus problem. In gossip algorithms, nodes in the network randomly
    communicate with their neighbors and exchange information iteratively. The
    algorithms are simple and decentralized, making them attractive for wireless
    network applications. In general, gossip algorithms are robust to unreliable
    wireless conditions and time varying network topologies. In this paper we
    introduce GGE and demonstrate that greedy updates lead to rapid convergence.

  250. Building on Quicksand.

    Authors: Pat Helland, David Campbell
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Reliable systems have always been built out of unreliable components. Early
    on, the reliable components were small such as mirrored disks or ECC (Error
    Correcting Codes) in core memory. These systems were designed such that
    failures of these small components were transparent to the application. Later,
    the size of the unreliable components grew larger and semantic challenges crept
    into the application when failures occurred.

  251. Resource Matchmaking Algorithm using Dynamic Rough Set in Grid Environment.

    Authors: Iraj Ataollahi, Mortza Analoui
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Grid environment is a service oriented infrastructure in which many
    heterogeneous resources participate to provide the high performance
    computation. One of the bug issues in the grid environment is the vagueness and
    uncertainty between advertised resources and requested resources. Furthermore,
    in an environment such as grid dynamicity is considered as a crucial issue
    which must be dealt with. Classical rough set have been used to deal with the
    uncertainty and vagueness. But it can just be used on the static systems and
    can not support dynamicity in a system.

  252. FastFlow: Efficient Parallel Streaming Applications on Multi-core.

    Authors: Marco Aldinucci, Massimo Torquati, Massimiliano Meneghin
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Shared memory multiprocessors come back to popularity thanks to rapid
    spreading of commodity multi-core architectures. As ever, shared memory
    programs are fairly easy to write and quite hard to optimise; providing
    multi-core programmers with optimising tools and programming frameworks is a
    nowadays challenge. Few efforts have been done to support effective streaming
    applications on these architectures. In this paper we introduce FastFlow, a
    low-level programming framework based on lock-free queues explicitly designed
    to support high-level languages for streaming applications.

  253. Energy-Efficient Scheduling of HPC Applications in Cloud Computing Environments.

    Authors: Saurabh Kumar Garg, Chee Shin Yeo, Arun Anandasivam, Rajkumar Buyya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The use of High Performance Computing (HPC) in commercial and consumer IT
    applications is becoming popular. They need the ability to gain rapid and
    scalable access to high-end computing capabilities. Cloud computing promises to
    deliver such a computing infrastructure using data centers so that HPC users
    can access applications and data from a Cloud anywhere in the world on demand
    and pay based on what they use. However, the growing demand drastically
    increases the energy consumption of data centers, which has become a critical
    issue.

Syndicate content