Parallel

  1. Component Specification in the Cactus Framework: The Cactus Configuration Language.

    Authors: Gabrielle Allen, Tom Goodale, Frank Löffler, David Rideout, Erik Schnetter, Eric L. Seidel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Component frameworks are complex systems that rely on many layers of
    abstraction to function properly. One essential requirement is a consistent
    means of describing each individual component and how it relates to both other
    components and the whole framework. As component frameworks are designed to be
    flexible by nature, the description method should be simultaneously powerful,
    lead to efficient code, and be easy to use, so that new users can quickly adapt
    their own code to work with the framework.

  2. A Competitive Analysis for Balanced Transactional Memory Workloads.

    Authors: Gokarna Sharma, Costas Busch
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider transactional memory contention management in the context of
    balanced workloads, where if a transaction is writing, the number of write
    operations it performs is a constant fraction of its total reads and writes. We
    explore the theoretical performance boundaries of contention management in
    balanced workloads from the worst-case perspective by presenting and analyzing
    two new contention management algorithms. The first algorithm Clairvoyant is
    O(\surd s)-competitive, where s is the number of shared resources. This
    algorithm depends on explicitly knowing the conflict graph.

  3. Managing Clouds in Cloud Platforms.

    Authors: Hassan Gobjuka, Kamal A. Ahmat
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Managing cloud services is a fundamental challenge in todays virtualized
    environments. These challenges equally face both providers and consumers of
    cloud services. The issue becomes even more challenging in virtualized
    environments that support mobile clouds. Cloud computing platforms such as
    Amazon EC2 provide customers with flexible, on demand resources at low cost.
    However, they fail to provide seamless infrastructure management and monitoring
    capabilities that many customers may need.

  4. Deterministic Consensus Algorithm with Linear Per-Bit Complexity.

    Authors: Guanfeng Liang, Nitin Vaidya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this report, building on the deterministic multi-valued one-to-many
    Byzantine agreement (broadcast) algorithm in our recent technical report [2],
    we introduce a deterministic multi-valued all-to-all Byzantine agreement
    algorithm (consensus), with linear complexity per bit agreed upon. The
    discussion in this note is not self-contained, and relies heavily on the
    material in [2] - please refer to [2] for the necessary background.

  5. The Two Quadrillionth Bit of Pi is 0! Distributed Computation of Pi with Apache Hadoop.

    Authors: Tsz-Wo Sze
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a new record on computing specific bits of Pi, the mathematical
    constant, and discuss performing such computations on Apache Hadoop clusters.
    The specific bits represented in hexadecimal are 0E6C1294 AED40403 F56D2D76
    4026265B CA98511D 0FCFFAA1 0F4D28B1 BB5392B8. These 256 bits end at the
    2,000,000,000,000,252nd bit position, which doubles the previous known record.
    The position of the first bit is 1,999,999,999,999,997 and the value of the two
    quadrillionth bit is 0. The computation is carried out by a MapReduce program
    called DistBbp.

  6. Optimal Gradient Clock Synchronization in Dynamic Networks.

    Authors: Fabian Kuhn, Christoph Lenzen, Thomas Locher, Rotem Oshman
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We study the problem of clock synchronization in highly dynamic networks,
    where communication links can appear or disappear at any time. The nodes in the
    network are equipped with hardware clocks, but the rate of the hardware clocks
    can vary arbitrarily within specific bounds, and the estimates that nodes can
    obtain about the clock values of other nodes are inherently inaccurate.

  7. A Light-Weight Communication Library for Distributed Computing.

    Authors: Derek Groen, Steven Rieder, Paola Grosso, Cees de Laat, Simon Portegies Zwart
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present MPWide, a platform independent communication library for
    performing message passing between computers. Our library allows coupling of
    several local MPI applications through a long distance network and is
    specifically optimized for such communications. The implementation is
    deliberately kept light-weight, platform independent and the library can be
    installed and used without administrative privileges.

  8. Immune System Inspired Strategies for Distributed Systems.

    Authors: Soumya Banerjee, Melanie Moses
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Many components of the IS are constructed as modular units which do not need
    to communicate with each other such that the number of components increases but
    the size remains constant. However, a sub-modular IS architecture in which
    lymph node number and size both increase sublinearly with body size is shown to
    efficiently balance the requirements of communication and migration, consistent
    with experimental data. We hypothesize that the IS architecture optimizes the
    tradeoff between local search for pathogens and global response using
    antibodies.

  9. The Cloud Adoption Toolkit: Supporting Cloud Adoption Decisions in the Enterprise.

    Authors: Ali Khajeh-Hosseini, Ian Sommerville, David Greenwood, James W. Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing promises a radical shift in the provisioning of computing
    resource within the enterprise. This paper describes the challenges that
    decision makers face when assessing the feasibility of the adoption of cloud
    computing in their organisations, and describes our Cloud Adoption Toolkit,
    which has been developed to support this process. The toolkit provides a
    framework to support decision makers in identifying their concerns, and
    matching these concerns to appropriate tools/techniques that can be used to
    address them.

  10. Self-repairing Homomorphic Codes for Distributed Storage Systems.

    Authors: Frederique Oggier, Anwitaman Datta
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Erasure codes provide a storage efficient alternative to replication based
    redundancy in (networked) storage systems. They however entail high
    communication overhead for maintenance, when some of the encoded fragments are
    lost and need to be replenished. Such overheads arise from the fundamental need
    to recreate (or keep separately) first a copy of the whole object before any
    individual encoded fragment can be generated and replenished. There has been
    recently intense interest to explore alternatives, most prominent ones being
    regenerating codes (RGC) and hierarchical codes (HC).

  11. Parallel and distributed Gr\"obner bases computation in JAS.

    Authors: Heinz Kredel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper considers parallel Gr\"obner bases algorithms on distributed
    memory parallel computers with multi-core compute nodes. We summarize three
    different Gr\"obner bases implementations: shared memory parallel, pure
    distributed memory parallel and distributed memory combined with shared memory
    parallelism. The last algorithm, called distributed hybrid, uses only one
    control communication channel between the master node and the worker nodes and
    keeps polynomials in shared memory on a node.

  12. Self-Recovering Sensor-Actor Networks.

    Authors: Maryam Kamali, Linas Laibinis, Luigia Petre, Kaisa Sere
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Wireless sensor-actor networks are a recent development of wireless networks
    where both ordinary sensor nodes and more sophisticated and powerful nodes,
    called actors, are present. In this paper we formalize a recently introduced
    algorithm that recovers failed actor communication links via the existing
    sensor infrastructure. We prove via refinement that the recovery is terminating
    in a finite number of steps and is distributed, thus self-performed by the
    actors.

  13. Implementing Distributed Controllers for Systems with Priorities.

    Authors: Imene Ben-Hafaiedh, Susanne Graf, Hammadi Khairallah
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Implementing a component-based system in a distributed way so that it ensures
    some global constraints is a challenging problem. We consider here abstract
    specifications consisting of a composition of components and a controller given
    in the form of a set of interactions and a priority order amongst them. In the
    context of distributed systems, such a controller must be executed in a
    distributed fashion while still respecting the global constraints imposed by
    interactions and priorities.

  14. Simplified Distributed Programming with Micro Objects.

    Authors: Jan-Mark S. Wams, Maarten van Steen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Developing large-scale distributed applications can be a daunting task.
    object-based environments have attempted to alleviate problems by providing
    distributed objects that look like local objects. We advocate that this
    approach has actually only made matters worse, as the developer needs to be
    aware of many intricate internal details in order to adequately handle partial
    failures. The result is an increase of application complexity. We present an
    alternative in which distribution transparency is lessened in favor of clearer
    semantics.

  15. PowerTracer: Tracing requests in multi-tier services to save cluster power consumption.

    Authors: Lei Wang, Lin Yuan, Jianfeng Zhan, Bo Sang, Haining Wang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As energy proportional computing has extended the success of DVFS (Dynamic
    voltage and frequency scaling) to the entire system, DVFS control algorithms
    will play a key role in reducing server clusters' power consumption. The focus
    of this paper is to provide accurate cluster-level DVFS control for power
    saving in a server cluster. To achieve this goal, we propose a request tracing
    approach that online classifies the major causal path patterns and monitors
    their performance data as a guide for accurate DVFS control.

  16. Short Note on Complexity of Multi-Value Byzantine Agreement.

    Authors: Guanfeng Liang, Nitin Vaidya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Randomized algorithm that achieves multi-valued Byzantine agreement with high
    probability, and achieves optimal complexity.

  17. A Fault-Resistant Asynchronous Clock Function.

    Authors: Danny Dolev, Ezra N. Hoch, Michael Ben-Or
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Consider an asynchronous network in a shared-memory environment consisting of
    n nodes. Assume that up to f of the nodes might be Byzantine (n > 12f), where
    the adversary is full-information and dynamic (sometimes called adaptive). In
    addition, the non-Byzantine nodes may undergo transient failures. Nodes advance
    in atomic steps, which consist of reading all registers, performing some
    calculation and writing to all registers.

  18. Double Circulant Minimum Storage Regenerating Codes.

    Authors: Bernat Gaston Braso, Jaume Pujol Capdevila
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Storage optimization in distributed environments is a major concern when
    talking about reliability in this kind of schemes. Although replication is the
    most used option, erasure coding is a more optimized one.

    However, erasure coding uses a lot of bandwidth to replace one node. In a
    dynamic scheme, where nodes enter and leave the system frequently, bandwidth
    use could be an important drawback.

  19. A Random Search Framework for Convergence Analysis of Distributed Beamforming with Feedback.

    Authors: C. Lin, V. V. Veeravalli, S. Meyn
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The focus of this work is on the analysis of transmit beamforming schemes
    with a low-rate feedback link in wireless sensor/relay networks, where nodes in
    the network need to implement beamforming in a distributed manner.
    Specifically, the problem of distributed phase alignment is considered, where
    neither the transmitters nor the receiver has perfect channel state
    information, but there is a low-rate feedback link from the receiver to the
    transmitters. In this setting, a framework is proposed for systematically
    analyzing the performance of distributed beamforming schemes.

  20. MalStone: Towards A Benchmark for Analytics on Large Data Clouds.

    Authors: Collin Bennett, Robert L. Grossman, David Locke, Jonathan Seidman, Steve Vejcik
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Developing data mining algorithms that are suitable for cloud computing
    platforms is currently an active area of research, as is developing cloud
    computing platforms appropriate for data mining. Currently, the most common
    benchmark for cloud computing is the Terasort (and related) benchmarks.
    Although the Terasort Benchmark is quite useful, it was not designed for data
    mining per se.

  21. A Jamming-Resistant MAC Protocol for Multi-Hop Wireless Networks.

    Authors: Andrea Richa, Christian Scheideler, Stefan Schmid, Jin Zhang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents a simple local medium access control protocol, called
    \textsc{Jade}, for multi-hop wireless networks with a single channel that is
    provably robust against adaptive adversarial jamming. The wireless network is
    modeled as a unit disk graph on a set of nodes distributed arbitrarily in the
    plane. In addition to these nodes, there are adversarial jammers that know the
    protocol and its entire history and that are allowed to jam the wireless
    channel at any node for an arbitrary $(1-\epsilon)$-fraction of the time steps,
    where $0<\epsilon<1$ is an arbitrary constant.

  22. On the Power of Impersonation Attacks.

    Authors: Michael Okun
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we consider a synchronous message passing system in which in
    every round an external adversary is able to send each processor up to k
    messages with falsified sender identities and arbitrary content. It is formally
    shown that this impersonation model is slightly stronger than the asynchronous
    message passing model with crash failures. In particular, we prove that
    (k+1)-set agreement can be solved in this model, while k-set agreement is
    impossible, for any k>=1.

  23. Simple Gradecast Based Algorithms.

    Authors: Danny Dolev, Ezra N. Hoch, Michael Ben-Or
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Gradecast is a simple three-round algorithm presented by Feldman and Micali.
    The current work presents a very simple algorithm that utilized Gradecast to
    achieve Byzantine agreement. Two small variations of the presented algorithm
    lead to improved algorithms for solving the Approximate agreement problem and
    the Multi-consensus problem.

    An optimal approximate agreement algorithm was presented by Fekete, which
    supports up to 1/4 n Byzantine nodes and has message complexity of O(n^k),
    where n is the number of nodes and k is the number of rounds.

  24. End-Host Distribution in Application-Layer Multicast: Main Issues and Solutions.

    Authors: Bela Genge, Piroska Haller
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Application-layer multicast implements the multicast functionality at the
    application layer. The main goal of application-layer multicast is to construct
    and maintain efficient distribution structures between end-hosts. In this paper
    we focus on the implementation of an application-layer multicast distribution
    algorithm. We observe that the total time required to measure network latency
    over TCP is influenced dramatically by the TCP connection time.

  25. Transmission Line Inspires A New Distributed Algorithm to Solve the Nonlinear Dynamical System of Physical Circuit.

    Authors: Fei Wei, Huazhong Yang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As known, physical circuits, e.g. integrated circuits or power system, work
    in a distributed manner, but these circuits could not be easily simulated in a
    distributed way. This is mainly because that the dynamical system of physical
    circuits is nonlinear and the linearized system of physical circuits is
    non-symmetrical. This paper proposes a simple and natural strategy to simulate
    the physical circuit in parallel, by mimicking the internal wires or
    interconnects inside the circuits by distributed numerical algorithm.

  26. Second Set of Spaces.

    Authors: Graham Kirby, Alan Dearle, Ron Morrison, Andrew McCarthy, Paddy Nixon, Evangelos Zirintsis, Ben Allen, Rob MacInnis, Andrew Jamieson, Chris Nicholson, Steven Harris
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This document describes the Gloss infrastructure supporting implementation of
    location-aware services. The document is in two parts. The first part describes
    software architecture for the smart space. As described in D8, a local
    architecture provides a framework for constructing Gloss applications, termed
    assemblies, that run on individual physical nodes, whereas a global
    architecture defines an overlay network for linking individual assemblies. The
    second part outlines the hardware installation for local sensing.

  27. An Information Flow Architecture for Global Smart Spaces.

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy, Juan-Carlos Diaz y Carballo
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we describe an architecture which: Permits the deployment and
    execution of components in appropriate geographical locations. Provides
    security mechanisms that prevent misuse of the architecture. Supports a
    programming model that is familiar to application programmers. Permits
    installed components to share data. Permits the deployed components to
    communicate via communication channels. Provides evolution mechanisms
    permitting the dynamic rearrangement of inter-connection topologies the
    components that they connect.

  28. RoboCast: Asynchronous Communication in Robot Networks.

    Authors: S&#xe9;bastien Tixeuil, Shlomi Dolev, Maria Potop-Butucaru, Zohir Bouzid
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper introduces the \emph{RoboCast} communication abstraction. The
    RoboCast allows a swarm of non oblivious, anonymous robots that are only
    endowed with visibility sensors but do not share a common coordinate system, to
    asynchronously exchange information. We propose a generic framework that covers
    a large class of asynchronous broadcast-based algorithms and show how our
    framework can be used to implement fundamental building blocks in robot
    networks such as gathering or stigmergy.

  29. Greedy Routing on Augmented Ring Graphs.

    Authors: R. Seth Terashima, James D. Fix
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Random ring-based overlay networks have been used to study the small world
    phenomenon and model fault-tolerant peer-to-peer systems (Kleinberg, 2006). It
    has been shown that when each of $n$ nodes has $\ell = O(\log n)$ links,
    assigning contacts according to an inverse power-law distance distribution
    allows greedy routing to perform in $O(\log^2 n / \ell)$ steps (Aspnes et al.
    2002). In this paper, we generalize this result by showing the same upper bound
    holds when nodes are assigned a random number of links according to an
    arbitrary distribution with mean $\ell$.

  30. Architectural Support for Global Smart Spaces.

    Authors: Graham Kirby, Alan Dearle, Ron Morrison, Andrew McCarthy, Richard Connor, Kevin Mullen, Yanyan Yang, Paula Welen, Andy Wilson
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A GLObal Smart Space (GLOSS) provides support for interaction amongst people,
    artefacts and places while taking account of both context and movement on a
    global scale. Crucial to the definition of a GLOSS is the provision of a set of
    location-aware services that detect, convey, store and exploit location
    information. We use one of these services, hearsay, to illustrate the
    implementation dimensions of a GLOSS. The focus of the paper is on both local
    and global software architecture to support the implementation of such
    services.

  31. Active Architecture for Pervasive Contextual Services.

    Authors: Graham Kirby, Alan Dearle, Ron Morrison, Mark Dunlop, Richard Connor, Paddy Nixon
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Pervasive services may be defined as services that are available "to any
    client (anytime, anywhere)". Here we focus on the software and network
    infrastructure required to support pervasive contextual services operating over
    a wide area. One of the key requirements is a matching service capable of
    as-similating and filtering information from various sources and determining
    matches relevant to those services.

  32. A Middleware Framework for Constraint-Based Deployment and Autonomic Management of Distributed Applications.

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a middleware framework for deployment and subsequent autonomic
    management of component-based distributed applications. An initial deployment
    goal is specified using a declarative constraint language, expressing
    constraints over aspects such as component-host mappings and component
    interconnection topology. A constraint solver is used to find a configuration
    that satisfies the goal, and the configuration is deployed automatically.

  33. A Framework for Constraint-Based Deployment and Autonomic Management of Distributed Applications (Extended Abstract).

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a framework for the deployment and subsequent autonomic management
    of component-based distributed applications. An initial deployment goal is
    specified using a declarative constraint language, expressing constraints over
    aspects such as component-host mappings and component interconnection topology.
    A constraint solver is used to find a configuration that satisfies the goal,
    and the configuration is deployed automatically. The deployed application is
    instrumented to allow subsequent autonomic management.

  34. A Survey on Peer-to-Peer and DHT.

    Authors: Siamak Sarmady
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Distributed systems with different levels of dependence to central services
    have been designed and used during recent years. Pure peer-to-peer systems
    among distributed systems have no dependence on a central resource. DHT is one
    of the main techniques behind these systems resulting into failure tolerant
    systems which are also able to isolate continuous changes to the network to a
    small section of it and therefore making it possible to scale up such networks
    to millions of nodes. This survey takes a look at P2P in general and DHT
    algorithms and implementations in more detail.

  35. Formal Derivation of Concurrent Garbage Collectors.

    Authors: Dusko Pavlovic, Peter Pepper, Douglas R. Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Concurrent garbage collectors are notoriously difficult to implement
    correctly. Previous approaches to the issue of producing correct collectors
    have mainly been based on posit-and-prove verification or on the application of
    domain-specific templates and transformations. We show how to derive the upper
    reaches of a family of concurrent garbage collectors by refinement from a
    formal specification, emphasizing the application of domain-independent design
    theories and transformations.

  36. A Peer-to-Peer Middleware Framework for Resilient Persistent Programming.

    Authors: Graham Kirby, Alan Dearle, Stuart Norcross, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The persistent programming systems of the 1980s offered a programming model
    that integrated computation and long-term storage. In these systems, reliable
    applications could be engineered without requiring the programmer to write
    translation code to manage the transfer of data to and from non-volatile
    storage. More importantly, it simplified the programmer's conceptual model of
    an application, and avoided the many coherency problems that result from
    multiple cached copies of the same information.

  37. Applying Constraint Solving to the Management of Distributed Applications.

    Authors: Graham Kirby, Alan Dearle, Andrew McCarthy
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present our approach for deploying and managing distributed
    component-based applications. A Desired State Description (DSD), written in a
    high-level declarative language, specifies requirements for a distributed
    application. Our infrastructure accepts a DSD as input, and from it
    automatically configures and deploys the distributed application. Subsequent
    violations of the original requirements are detected and, where possible,
    automatically rectified by reconfiguration and redeployment of the necessary
    application components.

  38. Snap-Stabilizing Linear Message Forwarding.

    Authors: Franck Petit, Swan Dubois, Anissa Lamani, Alain Cournier, Vincent Villain
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we present the first snap-stabilizing message forwarding
    protocol that uses a number of buffers per node being inde- pendent of any
    global parameter, that is 4 buffers per link. The protocol works on a linear
    chain of nodes, that is possibly an overlay on a large- scale and dynamic
    system, e.g., Peer-to-Peer systems, Grids. . . Provided that the topology
    remains a linear chain and that nodes join and leave "neatly", the protocol
    tolerates topology changes. We expect that this protocol will be the base to
    get similar results on more general topologies.

  39. Generating a Family of Byzantine Tolerant Protocol Implementations Using a Meta-Model Architecture.

    Authors: Graham Kirby, Alan Dearle, Stuart Norcross
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe an approach to modelling a Byzantine tolerant distributed
    algorithm as a family of related finite state machines, generated from a single
    meta-model. Various artefacts are generated from each state machine, including
    diagrams and source-level protocol implementations. The approach allows a state
    machine formulation to be applied to problems for which it would not otherwise
    be suitable, increasing confidence in correctness.

  40. Fast Self-Stabilizing Minimum Spanning Tree Construction.

    Authors: L&#xe9;lia Blin, Shlomi Dolev, Maria Potop-Butucaru, Stephane Rovedakis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a novel self-stabilizing algorithm for minimum spanning tree (MST)
    construction. The space complexity of our solution is $O(\log^2n)$ bits and it
    converges in $O(n^2)$ rounds. Thus, this algorithm improves the convergence
    time of all previously known self-stabilizing asynchronous MST algorithms by a
    multiplicative factor $\Theta(n)$, to the price of increasing the best known
    space complexity by a factor $O(\log n)$.

  41. Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters.

    Authors: Jan Treibig, Gerhard Wellein, Georg Hager, Markus Wittmann
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Bandwidth-starved multicore chips have become ubiquitous. It is well known
    that the performance of stencil codes can be improved by temporal blocking,
    lessening the pressure on the memory interface. We introduce a new pipelined
    approach that makes explicit use of shared caches in multicore environments and
    minimizes synchronization and boundary overhead. Benchmark results are
    presented for three current x86-based microprocessors, showing clearly that our
    optimization works best on designs with high-speed shared caches and low memory
    bandwidth per core.

  42. Proceedings First International Workshop on Decentralized Coordination of Distributed Processes.

    Authors: Tom Van Cutsem, Mark Miller
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This volume contains the papers presented at the 1st International Workshop
    on "Decentralized Coordination of Distributed Processes", DCDP 2010, held in
    Amsterdam, The Netherlands on June 10th, 2010 in conjunction with the 5th
    International Federated Conferences on Distributed Computing Techniques,
    DisCoTec 2010. The central theme of the workshop is the decentralized
    coordination of distributed processes. Decentralized: there is no single
    authority in the network that everything is vulnerable to.

  43. Efficient Resource Matching in Heterogeneous Grid Using Resource Vector.

    Authors: Srirangam V Addepallil, Per Andersen, George L Barnes
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, a method for efficient scheduling to obtain optimum job
    throughput in a distributed campus grid environment is presented; Traditional
    job schedulers determine job scheduling using user and job resource attributes.
    User attributes are related to current usage, historical usage, user priority
    and project access. Job resource attributes mainly comprise of soft
    requirements (compilers, libraries) and hard requirements like memory, storage
    and interconnect. A job scheduler dispatches jobs to a resource if a job's hard
    and soft requirements are met by a resource.

  44. Survey on the Event Orderings Semantics Used for Distributed System.

    Authors: Yaser Miaji Osman Gazali, Suhaidi Hassan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Event ordering in distributed system (DS) is disputable and proactive subject
    in DS particularly with the emergence of multimedia synchronization. According
    to the literature, different type of event ordering is used for different DS
    mode such as asynchronous or synchronous. Recently, there are several novel
    implementation of these types introduced to fulfill the demand for establishing
    a certain order according to a specific criterion in DS with lighter
    complexity.

  45. Implementation of a Cloud Data Server (CDS) for Providing Secure Service in E-Business.

    Authors: D. Kesavaraja, R. Balasubramanian, D. Sasireka
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud Data Servers is the novel approach for providing secure service to
    e-business .Millions of users are surfing the Cloud for various purposes,
    therefore they need highly safe and persistent services. Usually hackers target
    particular Operating Systems or a Particular Controller. Inspiteof several
    ongoing researches Conventional Web Servers and its Intrusion Detection System
    might not be able to detect such attacks. So we implement a Cloud Data Server
    with Session Controller Architecture using Redundancy and Disconnected Data
    Access Mechanism.

  46. A Low Overhead Minimum Process Global Snapshop Collection Algorithm for Mobile Distributed System.

    Authors: Parveen Kumar, Surender Kumar, R.K.Chauhan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Coordinated checkpointing is an effective fault tolerant technique in
    distributed system as it avoids the domino effect and require minimum storage
    requirement. Most of the earlier coordinated checkpoint algorithms block their
    computation during checkpointing and forces minimum-process or non-blocking but
    forces all nodes to takes checkpoint even though many of them may not be
    necessary or non-blocking minimum-process but takes useless checkpoints or
    reduced useless checkpoint but has higher synchronization message overhead or
    has high checkpoint request propagation time.

  47. Improvement Cache Efficiency of Explicit Finite Element Procedure and its Application to Parallel Casting Solidification Simulation.

    Authors: Ruhollah Tavakoli
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A simple method for improving cache efficiency of serial and parallel
    explicit finite procedure with application to casting solidification simulation
    over three-dimensional complex geometries is presented. The method is based on
    division of the global data to smaller blocks and treating each block
    independently from others at each time step. A novel parallel finite element
    algorithm for non-overlapped element-base decomposed domain is presented for
    implementation of serial and parallel version of the presented method. Effect
    of mesh reordering on the efficiency is also investigated.

  48. An Integrated Framework for Performance Analysis and Tuning in Grid Environment.

    Authors: Sarbani Roy, Nandini Mukherjee, Ajanta De Sarkar, Sudipto Biswas
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In a heterogeneous, dynamic environment, like Grid, post-mortem analysis is
    of no use and data needs to be collected and analysed in real time. Novel
    techniques are also required for dynamically tuning the application's
    performance and resource brokering in order to maintain the desired QoS. The
    objective of this paper is to propose an integrated framework for performance
    analysis and tuning of the application, and rescheduling the application, if
    necessary, to some other resources in order to adapt to the changing resource
    usage scenario in a dynamic environment.

  49. A Multi-agent Framework for Performance Tuning in Distributed Environment.

    Authors: Sarbani Roy, Saikat Halder, Nandini Mukherjee
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents the overall design of a multi-agent framework for tuning
    the performance of an application executing in a distributed environment. The
    multi-agent framework provides services like resource brokering, analyzing
    performance monitoring data, local tuning and also rescheduling in case of any
    performance problem on a specific resource provider. The paper also briefly
    describes the implementation of some part of the framework. In particular, job
    migration on the basis of performance monitoring data is particularly
    highlighted in this paper.

  50. Cloud Computing: Exploring the scope.

    Authors: Abhinav Pandey, Akash Pandey, Ankit Tandon, Brajesh Kr Maurya, Upendra Kushwaha
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing refers a paradigm shift to overall IT solutions while raising
    the accessibility, scalability and effectiveness through its enabling
    technologies. However, migrated cloud platforms and services cost benefits as
    well as performances are neither clear nor summarized. Globalization and the
    recessionary economic times have not only raised the bar of a better IT
    delivery models but also have given access to technology enabled services via
    internet.

  51. An Efficient and Secure Routing Protocol for Mobile Ad-Hoc Networks.

    Authors: N. Ch. Sriman Narayana Iyengar, Syed Mohammad Ansar Sachin kumar, Piyush Nagar, Siddharth Sharma, Akshay Atrey
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Efficiency and simplicity of random algorithms have made them a lucrative
    alternative for solving complex problems in the domain of communication
    networks. This paper presents a random algorithm for handling the routing
    problem in Mobile Ad hoc Networks [MANETS].The performance of most existing
    routing protocols for MANETS degrades in terms of packet delay and congestion
    caused as the number of mobile nodes increases beyond a certain level or their
    speed passes a certain level.

  52. A Real Time Optimistic Strategy to achieve Concurrency Control in Mobile Environments Using On-demand Multicasting.

    Authors: Salman Abdul Moiz, Dr. Lakshmi Rajamani
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In mobile database environments, multiple users may access similar data items
    irrespective of their physical location leading to concurrent access anomalies.
    As disconnections and mobility are the common characteristics in mobile
    environment, performing concurrent access to a particular data item leads to
    inconsistency. Most of the approaches use locking mechanisms to achieve
    concurrency control.

  53. L-Resilient Adversaries and Hitting Sets.

    Authors: Eli Gafni, Petr Kuznetsov
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The condition of $t$-resilience stipulates that an $n$-process program is
    only obliged to make progress when at least $n-t$ processes are correct. Put
    another way, the \emph{live sets}, the collection of process sets such that
    progress is required if all the processes in one of these sets are correct, are
    all sets with at least $n-t$ processes. In this paper we study what happens
    when the live sets are any arbitrary collection of sets $\L$.

  54. The Impact of Topology on Byzantine Containment in Stabilization.

    Authors: Swan Dubois, S&#xe9;bastien Tixeuil, Toshimitsu Masuzawa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Self-stabilization is an versatile approach to fault-tolerance since it
    permits a distributed system to recover from any transient fault that
    arbitrarily corrupts the contents of all memories in the system. Byzantine
    tolerance is an attractive feature of distributed system that permits to cope
    with arbitrary malicious behaviors. We consider the well known problem of
    constructing a maximum metric tree in this context.

  55. Improving Overhead Computation and pre-processing Time for Grid Scheduling System.

    Authors: Asgarali Bouyer, Mohammad Javad hoseyni, Abdul Hanan Abdullah
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Computational Grid is enormous environments with heterogeneous resources and
    stable infrastructures among other Internet-based computing systems. However,
    the managing of resources in such systems has its special problems. Scheduler
    systems need to get last information about participant nodes from information
    centers for the purpose of firmly job scheduling. In this paper, we focus on
    online updating resource information centers with processed and provided data
    based on the assumed hierarchical model.

  56. Construction auto-stabilisante d'arbre couvrant en d\'epit d'actions malicieuses.

    Authors: Swan Dubois, S&#xe9;bastien Tixeuil, Toshimitsu Masuzawa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A self-stabilizing protocol provides by definition a tolerance to transient
    failures. Recently, a new class of self-stabilizing protocols appears. These
    protocols provides also a tolerance to a given number of permanent failures. In
    this article, we are interested in self-stabilizing protocols that deal with
    Byzantines failures. We prove that, for some problems which not allow strict
    stabilization (see [Nesterenko,Arora,2002]), there exist solutions that
    tolerates Byzantine faults if we define a new criteria of tolerance.

  57. The Accuracy of Tree-based Counting in Dynamic Networks.

    Authors: Erik Aurell, Supriya Krishnamurthy, John Ardelius, Mads Dam, Rolf Stadler, Fetahi Wuhib
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Tree-based protocols are ubiquitous in distributed systems. They are
    flexible, they perform generally well, and, in static conditions, their
    analysis is mostly simple. Under churn, however, node joins and failures can
    have complex global effects on the tree overlays, making analysis surprisingly
    subtle. To our knowledge, few prior analytic results for performance estimation
    of tree based protocols under churn are currently known. We study a simple
    Bellman-Ford-like protocol which performs network size estimation over a
    tree-shaped overlay.

  58. On the Impact of the Migration Topology on the Island Model.

    Authors: Francesco Biscani, Dario Izzo, Marek Ruci&#x144;ski
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Parallel Global Optimization Algorithms (PGOA) provide an efficient way of
    dealing with hard optimization problems. One method of parallelization of GOAs
    that is frequently applied and commonly found in the contemporary literature is
    the so-called Island Model (IM). In this paper we analyze the impact of the
    migration topology on the performance of a PGOA which uses the Island Model. In
    particular we consider parallel Differential Evolution and Simulated Annealing
    with Adaptive Neighborhood and draw first conclusions that emerge from the
    conducted experiments.

  59. LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments.

    Authors: Jan Treibig, Gerhard Wellein, Georg Hager
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Exploiting the performance of today's processors requires, apart from an
    intimate knowledge of the microarchitecture, taking into account the influence
    of an ever-growing complexity in thread and cache topology. LIKWID is a
    collection of small command line applications that support inexperienced as
    well as seasoned programmers in developing and running software in an efficient
    way. The development of LIKWID is targeted on providing access to
    performance-oriented tooling in a transparent and easy manner.

  60. A Global Optimisation Toolbox for Massively Parallel Engineering Optimisation.

    Authors: Francesco Biscani, Dario Izzo, Chit Hong Yam
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A software platform for global optimisation, called PaGMO, has been developed
    within the Advanced Concepts Team (ACT) at the European Space Agency, and was
    recently released as an open-source project.

  61. Exact Sparse Matrix-Vector Multiplication on GPU's and Multicore Architectures.

    Authors: Jean-Guillaume Dumas, Brice Boyer, Pascal Giorgi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose different implementations of the sparse matrix--dense vector
    multiplication (\spmv{}) for finite fields and rings $\Zb/m\Zb$. We take
    advantage of graphic card processors (GPU) and multi-core architectures. Our
    aim is to improve the speed of \spmv{} in the \linbox library, and henceforth
    the speed of its black box algorithms. Besides, we use this and a new
    parallelization of the sigma-basis algorithm in a parallel block Wiedemann rank
    implementation over finite fields.

  62. Passively Mobile Communicating Logarithmic Space Machines.

    Authors: Paul G. Spirakis, Ioannis Chatzigiannakis, Othon Michail, Stavros Nikolaou, Andreas Pavlogiannis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a new theoretical model for passively mobile Wireless Sensor
    Networks. We call it the PALOMA model, standing for PAssively mobile
    LOgarithmic space MAchines. The main modification w.r.t. the Population
    Protocol model is that agents now, instead of being automata, are Turing
    Machines whose memory is logarithmic in the population size n. Note that the
    new model is still easily implementable with current technology. We focus on
    complete communication graphs.

  63. Boosting Multi-Core Reachability Performance with Shared Hash Tables.

    Authors: Jaco van de Pol, Alfons Laarman, Michael Weber
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper focuses on data structures for multi-core reachability, which is a
    key component in model checking algorithms and other verification methods. A
    cornerstone of an efficient solution is the storage of visited states. In
    related work, static partitioning of the state space was combined with
    thread-local storage and resulted in reasonable speedups, but left open whether
    improvements are possible.

  64. Formal Relationships Between Geometrical and Classical Models for Concurrency.

    Authors: Samuel Mimram, Eric Goubault
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A wide variety of models for concurrent programs has been proposed during the
    past decades, each one focusing on various aspects of computations: trace
    equivalence, causality between events, conflicts and schedules due to resource
    accesses, etc. More recently, models with a geometrical flavor have been
    introduced, based on the notion of cubical set. These models are very rich and
    expressive since they can represent commutation between any bunch of events,
    thus generalizing the principle of true concurrency.

  65. Addressing the P2P Bootstrap Problem for Small Networks.

    Authors: David Isaac Wolinsky, P. Oscar Boykin, Renato Figueiredo, Pierre St. Juste
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    P2P overlays provide a framework for building distributed applications
    consisting of few to many resources with features including self-configuration,
    scalability, and resilience to node failures. Such systems have been
    successfully adopted in large-scale services for content delivery networks,
    file sharing, and data storage. In small-scale systems, they can be useful to
    address privacy concerns and for network applications that lack dedicated
    servers. The bootstrap problem, finding an existing peer in the overlay,
    remains a challenge to enabling these services for small-scale P2P systems.

  66. The Low Latency Fault Tolerance System.

    Authors: Wenbing Zhao, P. M. Melliar-Smith, L. E. Moser
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The Low Latency Fault Tolerance (LLFT) system provides fault tolerance for
    distributed applications, using the leader-follower replication technique. The
    LLFT system provides application-transparent replication, with strong replica
    consistency, for applications that involve multiple interacting processes or
    threads.

  67. Internet ware cloud computing :Challenges.

    Authors: S Qamar, Niranjan Lal, Mrityunjay Singh
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    After decades of engineering development and infrastructural investment,
    Internet connections have become commodity product in many countries, and
    Internet scale "cloud computing" has started to compete with traditional
    software business through its technological advantages and economy of scale.
    Cloud computing is a promising enabling technology of Internet ware Cloud
    Computing is termed as the next big thing in the modern corporate world.

  68. In Cloud, Do MTC or HTC Service Providers Benefit from the Economies of Scale?.

    Authors: Lei Wang, Lin Yuan, Jianfeng Zhan, Weisong Shi, Yi Liang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we intend to answer one key question to the success of cloud
    computing: in cloud, do many task computing (MTC) or high throughput computing
    (HTC) service providers, which offer the corresponding computing service to end
    users, benefit from the economies of scale? Our research contributions are
    three-fold: first, we propose an innovative usage model, called dynamic service
    provision (DSP) model, for MTC or HTC service providers.

  69. In Cloud, Can Scientific Communities Benefit from the Economies of Scale?.

    Authors: Lei Wang, Jianfeng Zhan, Weisong Shi, Yi Liang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The basic idea behind Cloud computing is that resource providers offer
    elastic resources to end users. In this paper, we intend to answer one key
    question to the success of Cloud computing: in Cloud, can small or medium-scale
    scientific computing communities benefit from the economies of scale?

  70. Engineering a Scalable High Quality Graph Partitioner.

    Authors: Peter Sanders, Manuel Holtgrewe, Christian Schulz
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe an approach to parallel graph partitioning that scales to
    hundreds of processors and produces a high solution quality. For example, for
    many instances from Walshaw's benchmark collection we improve the best known
    partitioning. We use the well known framework of multi-level graph
    partitioning.

  71. Effects of component-subscription network topology on large-scale data centre performance scaling.

    Authors: Ilango Sriram, Dave Cliff
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Modern large-scale date centres, such as those used for cloud computing
    service provision, are becoming ever-larger as the operators of those data
    centres seek to maximise the benefits from economies of scale. With these
    increases in size comes a growth in system complexity, which is usually
    problematic. There is an increased desire for automated "self-star"
    configuration, management, and failure-recovery of the data-centre
    infrastructure, but many traditional techniques scale much worse than linearly
    as the number of nodes to be managed increases.

  72. Importance of Explicit Vectorization for CPU and GPU Software Performance.

    Authors: Kamran Karimi, Neil G. Dickson, Firas Hamze
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Much of the current focus in high-performance computing is on
    multi-threading, multi-computing, and graphics processing unit (GPU) computing.
    However, vectorization and non-parallel optimization techniques, which can
    often be employed additionally, are less frequently discussed. In this paper,
    we present an analysis of several optimizations done on both central processing
    unit (CPU) and GPU implementations of a particular computationally intensive
    Metropolis Monte Carlo algorithm.

  73. High-Performance Physics Simulations Using Multi-Core CPUs and GPGPUs in a Volunteer Computing Context.

    Authors: Kamran Karimi, Neil G. Dickson, Firas Hamze
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents two conceptually simple methods for parallelizing a
    Parallel Tempering Monte Carlo simulation in a distributed volunteer computing
    context, where computers belonging to the general public are used. The first
    method uses conventional multi-threading. The second method uses CUDA, a
    graphics card computing system. Parallel Tempering is described, and challenges
    such as parallel random number generation and mapping of Monte Carlo chains to
    different threads are explained.

  74. Precoded Turbo Equalizer for Power Line Communication Systems.

    Authors: Kai Xie, Jing
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Power line communication continues to draw increasing interest by promising a
    wide range of applications including cost-free last-mile communication
    solution. However, signal transmitted through the power lines deteriorates
    badly due to the presence of severe inter-symbol interference (ISI) and harsh
    random pulse noise. This work proposes a new precoded turbo equalization scheme
    specifically designed for the PLC channels.

  75. A Characterization of Combined Traces Using Labeled Stratified Order Structures.

    Authors: Dai Tri Man Le
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper defines a class of labeled stratified order structures that
    characterizes exactly the notion of combined traces (i.e., comtraces) proposed
    by Janicki and Koutny in 1995. Our main technical contributions are the
    representation theorems showing that comtrace quotient monoid, combined
    dependency graph (Kleijn and Koutny 2008) and our labeled stratified order
    structure characterization are three different and yet equivalent ways to
    represent comtraces.

  76. Scalable Group Management in Large-Scale Virtualized Clusters.

    Authors: Lei Wang, Lin Yuan, Jianfeng Zhan, Dan Meng, Wei Zhou
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    To save cost, recently more and more users choose to provision virtual
    machine resources in cluster systems, especially in data centres. Maintaining a
    consistent member view is the foundation of reliable cluster managements, and
    it also raises several challenge issues for large scale cluster systems
    deployed with virtual machines (which we call virtualized clusters). In this
    paper, we introduce our experiences in design and implementation of scalable
    member view management on large-scale virtual clusters.

  77. An efficient algorithm for the parallel solution of high-dimensional differential equations.

    Authors: Tuhin Sahai, Stefan Klus, Cong Liu, Michael Dellnitz
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The study of high-dimensional differential equations is challenging and
    difficult due to the analytical and computational intractability. Here, we
    significantly improve the speed of waveform relaxation (WR), a method to
    simulate high-dimensional differential-algebraic equations. This new method
    termed adaptive waveform relaxation (AWR) is tested on a communication network
    example. Further we analyze different heuristics for computing graph partitions
    tailored to adaptive waveform relaxation.

  78. Gossip Algorithms for Distributed Signal Processing.

    Authors: Alexandros G. Dimakis, Soummya Kar, Michael G. Rabbat, Jose M.F. Moura, Anna Scaglione
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Gossip algorithms are attractive for in-network processing in sensor networks
    because they do not require any specialized routing, there is no bottleneck or
    single point of failure, and they are robust to unreliable wireless network
    conditions. Recently, there has been a surge of activity in the computer
    science, control, signal processing, and information theory communities,
    developing faster and more robust gossip algorithms and deriving theoretical
    performance guarantees. This article presents an overview of recent work in the
    area.

  79. Improving Waiting Time of Tasks Scheduled Under Preemptive Round Robin Using Changeable Time Quantum.

    Authors: Samih Mohemmed Mostafa
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Minimizing waiting time for tasks waiting in the queue for execution is one
    of the important scheduling cri-teria which took a wide area in scheduling
    preemptive tasks. In this paper we present Changeable Time Quan-tum (CTQ)
    approach combined with the round-robin algorithm, we try to adjust the time
    quantum according to the burst times of the tasks in the ready queue.

  80. InterCloud: Utility-Oriented Federation of Cloud Computing Environments for Scaling of Application Services.

    Authors: Rajkumar Buyya, Rajiv Ranjan, Rodrigo N. Calheiros
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing providers have setup several data centers at different
    geographical locations over the Internet in order to optimally serve needs of
    their customers around the world. However, existing systems do not support
    mechanisms and policies for dynamically coordinating load distribution among
    different Cloud-based data centers in order to determine optimal location for
    hosting application services to achieve reasonable QoS levels.

  81. Cloud Computing.

    Authors: N. V. Kalyankar, Shivaji P. Mirashe
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Computing as you know it is about to change, your applications and documents
    are going to move from the desktop into the cloud. I'm talking about cloud
    computing, where applications and files are hosted on a "cloud" consisting of
    thousands of computers and servers, all linked together and accessible via the
    Internet. With cloud computing, everything you do is now web based instead of
    being desktop based. You can access all your programs and documents from any
    computer that's connected to the Internet. How will cloud computing change the
    way you work?

  82. A Security Based Data Mining Approach in Data Grid.

    Authors: S. Vidhya, S. Karthikeyan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Grid computing is the next logical step to distributed computing. Main
    objective of grid computing is an innovative approach to share resources such
    as CPU usage; memory sharing and software sharing. Data Grids provide
    transparent access to semantically related data resources in a heterogeneous
    system. The system incorporates both data mining and grid computing techniques
    where Grid application reduces the time for sending results to several clients
    at the same time and Data mining application on computational grids gives fast
    and sophisticated results to users.

  83. The Cloud Adoption Toolkit: Addressing the Challenges of Cloud Adoption in Enterprise.

    Authors: Ali Khajeh-Hosseini, Ian Sommerville, David Greenwood, James Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing promises a radical shift in the provisioning of computing
    resource within enterprise.

  84. Parallel Generation of Massive Scale-Free Graphs.

    Authors: Andy Yoo, Keith Henderson
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    One of the biggest huddles faced by researchers studying algorithms for
    massive graphs is the lack of large input graphs that are essential for the
    development and test of the graph algorithms. This paper proposes two efficient
    and highly scalable parallel graph generation algorithms that can produce
    massive realistic graphs to address this issue. The algorithms, designed to
    achieve high degree of parallelism by minimizing inter-processor
    communications, are two of the fastest graph generators which are capable of
    generating scale-free graphs with billions of vertices and edges.

  85. Towards trusted volunteer grid environments.

    Authors: Maher Khemakhem, Abdelfettah Belghith, Sousse University, Tunisia, Manouba University, Tunisia)
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Intensive experiences show and confirm that grid environments can be
    considered as the most promising way to solve several kinds of problems
    relating either to cooperative work especially where involved collaborators are
    dispersed geographically or to some very greedy applications which require
    enough power of computing or/and storage. Such environments can be classified
    into two categories; first, dedicated grids where the federated computers are
    solely devoted to a specific work through its end.

  86. Asynchronous Bounded Expected Delay Networks.

    Authors: Jun Pang, Wan Fokkink, Rena Bakhshi, J&#xf6;rg Endrullis
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The commonly used asynchronous bounded delay (ABD) network models assume a
    fixed bound on message delay. We propose a probabilistic network model, called
    asynchronous bounded expected delay (ABE) model. Instead of a strict bound, the
    ABE model requires only a bound on the expected message delay. While the
    conditions of ABD networks restrict the set of possible executions, in ABE
    networks all asynchronous executions are possible, but executions with
    extremely long delays are less probable. In contrast to ABD networks, ABE
    networks cannot be synchronised efficiently.

  87. Parallel structurally-symmetric sparse matrix-vector products on multi-core processors.

    Authors: Vicente H. F. Batista, George O. Ainsworth Jr., Fernando L. B. Ribeiro
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider the problem of developing an efficient multi-threaded
    implementation of the matrix-vector multiplication algorithm for sparse
    matrices with structural symmetry. Matrices are stored using the compressed
    sparse row-column format (CSRC), designed for profiting from the symmetric
    non-zero pattern observed in global finite element matrices. Unlike classical
    compressed storage formats, performing the sparse matrix-vector product using
    the CSRC requires thread-safe access to the destination vector. To avoid race
    conditions, we have implemented two partitioning strategies.

  88. Algorithms For Extracting Timeliness Graphs.

    Authors: Carole Delporte-Gallet, St&#xe9;phane Devismes, Hugues Fauconnier, Mikel Larrea
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider asynchronous message-passing systems in which some links are
    timely and processes may crash. Each run defines a timeliness graph among
    correct processes: (p; q) is an edge of the timeliness graph if the link from p
    to q is timely (that is, there is bound on communication delays from p to q).
    The main goal of this paper is to approximate this timeliness graph by graphs
    having some properties (such as being trees, rings,...).

  89. PhoenixCloud: Provisioning Runtime Environments for Heterogeneous Cloud Workloads.

    Authors: Lei Wang, Jianfeng Zhan, Weisong Shi, Shimin Gong, Xiutao Zang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As more and more service providers choose Cloud platforms, a resource
    provider needs to provision runtime environments (REs) for heterogeneous
    workloads in different scenarios. Previous work fails to resolve this issue in
    several ways: (1) it fails to pay attention to diverse RE requirements, and
    does not enable creating coordinated REs on demand; (2) few work investigates
    coordinated resource provisioning for heterogeneous workloads.

  90. Decreasing log data of multi-tier services for effective request tracing.

    Authors: Jianfeng Zhan, Bo Sang, Guanhua Tian
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Previous work shows request tracing systems help understand and debug the
    performance problems of multi-tier services. However, for large-scale data
    centers, more than hundreds of thousands of service instances provide online
    service at the same time. Previous work such as white-box or black box tracing
    systems will produce large amount of log data, which would be correlated into
    large quantities of causal paths for performance debugging. In this paper, we
    propose an innovative algorithm to eliminate valueless logs of multitiers
    services.

  91. Precise Request Tracing and Performance Debugging for Multi-tier Services of Black Boxes.

    Authors: Lei Wang, Jianfeng Zhan, Dan Meng, Yong Li, Zhihong Zhang, Bo Sang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    As more and more multi-tier services are developed from commercial components
    or heterogeneous middleware without the source code available, both developers
    and administrators need a precise request tracing tool to help understand and
    debug performance problems of large concurrent services of black boxes.
    Previous work fails to resolve this issue in several ways: they either accept
    the imprecision of probabilistic correlation methods, or rely on knowledge of
    protocols to isolate requests in pursuit of tracing accuracy.

  92. Accelerating sequential programs using FastFlow and self-offloading.

    Authors: Marco Aldinucci, Massimo Torquati, Massimiliano Meneghin, Marco Danelutto, Peter Kilpatrick
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    FastFlow is a programming environment specifically targeting cache-coherent
    shared-memory multi-cores. FastFlow is implemented as a stack of C++ template
    libraries built on top of lock-free (fence-free) synchronization mechanisms. In
    this paper we present a further evolution of FastFlow enabling programmers to
    offload part of their workload on a dynamically created software accelerator
    running on unused CPUs. The offloaded function can be easily derived from
    pre-existing sequential code.

  93. An Approach to Ad hoc Cloud Computing.

    Authors: Graham Kirby, Alan Dearle, Angus Macdonald, Alvaro Fernandes
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider how underused computing resources within an enterprise may be
    harnessed to improve utilization and create an elastic computing
    infrastructure. Most current cloud provision involves a data center model, in
    which clusters of machines are dedicated to running cloud infrastructure
    software. We propose an additional model, the ad hoc cloud, in which
    infrastructure software is distributed over resources harvested from machines
    already in existence within an enterprise.

  94. Breaking the O(n^2) Bit Barrier: Scalable Byzantine agreement with an Adaptive Adversary.

    Authors: Valerie King, Jared Saia
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe an algorithm for Byzantine agreement that is scalable in the
    sense that each processor sends only $\tilde{O}(\sqrt{n})$ bits, where $n$ is
    the total number of processors. Our algorithm succeeds with high probability
    against an \emph{adaptive adversary}, which can take over processors at any
    time during the protocol, up to the point of taking over arbitrarily close to a
    1/3 fraction. We assume synchronous communication but a \emph{rushing}
    adversary.

  95. Exploring the Limits of GPUs With Parallel Graph Algorithms.

    Authors: Frank Dehne, Kumanan Yogaratnam
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we explore the limits of graphics processors (GPUs) for
    general purpose parallel computing by studying problems that require highly
    irregular data access patterns: parallel graph algorithms for list ranking and
    connected components. Such graph problems represent a worst case scenario for
    coalescing parallel memory accesses on GPUs which is critical for good GPU
    performance. Our experimental study indicates that PRAM algorithms are a good
    starting point for developing efficient parallel GPU methods but require
    non-trivial modifications to ensure good GPU performance.

  96. Deterministic Sample Sort For GPUs.

    Authors: Frank Dehne, Hamidreza Zaboli
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present and evaluate GPU Bucket Sort, a parallel deterministic sample sort
    algorithm for many-core GPUs. Our method is considerably faster than Thrust
    Merge (Satish et.al., Proc. IPDPS 2009), the best comparison-based sorting
    algorithm for GPUs, and it is as fast as the new randomized sample sort for
    GPUs by Leischner et.al. (to appear in Proc. IPDPS 2010). Our deterministic
    sample sort has the advantage that bucket sizes are guaranteed and therefore
    its running time does not have the input data dependent fluctuations that can
    occur for randomized sample sort.

  97. Phoenix Cloud : Consolidating Heterogeneous Workloads of Large Organizations on Cloud Computing Platforms.

    Authors: Lei Wang, Jianfeng Zhan, Bibo Tu, Dan Meng, Yong Li, Peng Wang, Wei Zhou
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    For a large organization, different departments often maintain dedicated
    cluster systems for different workloads, for example parallel batch jobs or Web
    services. In this paper, we design and implement an innovative cloud computing
    system software, Phoenix Cloud, to consolidate heterogeneous workloads of the
    same organization on cloud computing platforms. For Phoenix Cloud, we propose
    cooperative resource provision and management polices for the affiliated
    departments of a large organization to share cluster systems.

  98. Automatic Performance Debugging of SPMD Parallel Programs.

    Authors: Xu Liu, Lin Yuan, Jianfeng Zhan, Bibo Tu, Dan Meng
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Automatic performance debugging of parallel applications usually involves two
    steps: automatic detection of performance bottlenecks and uncovering their root
    causes for performance optimization. Previous work fails to resolve this
    challenging issue in several ways: first, several previous efforts automate
    analysis processes, but present the results in a confined way that only
    identifies performance problems with apriori knowledge; second, several tools
    take exploratory or confirmatory data analysis to automatically discover
    relevant performance data relationships.

  99. A Cluster-based Approach for Outlier Detection in Dynamic Data Streams (KORM: k-median OutlieR Miner).

    Authors: Parneeta Dhaliwal, M.P.S. Bhatia, Priti Bansal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Outlier detection in data streams has gained wide importance presently due to
    the increasing cases of fraud in various applications of data streams. The
    techniques for outlier detection have been divided into either statistics
    based, distance based, density based or deviation based. Till now, most of the
    work in the field of fraud detection was distance based but it is incompetent
    from computational point of view. In this paper we introduced a new clustering
    based approach, which divides the stream in chunks and clusters each chunk
    using kmedian into variable number of clusters.

  100. Window-Based Greedy Contention Management for Transactional Memory.

    Authors: Gokarna Sharma, Brett Estrade, Costas Busch
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider greedy contention managers for transactional memory for M x N
    execution windows of transactions with M threads and N transactions per thread.
    Assuming that each transaction conflicts with at most C other transactions
    inside the window, a trivial greedy contention manager can schedule them within
    CN time. In this paper, we show that there are much better schedules.

  101. Fast Flooding over Manhattan.

    Authors: Andrea Clementi, Angelo Monti, Riccardo Silvestri
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We consider a Mobile Ad-hoc NETwork (MANET) formed by n agents that move at
    speed V according to the Manhattan Random-Way Point model over a square region
    of side length L. The resulting stationary (agent) spatial probability
    distribution is far to be uniform: the average density over the "central zone"
    is asymptotically higher than that over the "suburb". Agents exchange data iff
    they are at distance at most R within each other.

  102. Generalized Adaptive Network Coded Cooperation (GANCC): A Unified Framework for Network Coding and Channel Coding.

    Authors: Xingkai Bao, Jing Li
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper considers distributed coding for multi-source single-sink data
    collection wireless networks. A unified framework for network coding and
    channel coding, termed "generalized adaptive network coded cooperation"
    (GANCC), is proposed. Key ingredients of GANCC include: matching code graphs
    with the dynamic network graphs on-the-fly, and integrating channel coding with
    network coding through circulant low-density parity-check codes. Several code
    constructing methods and several families of sparse-graph codes are proposed,
    and information theoretical analysis is performed.

  103. COTAR: An Accurate, Cost-Effective Cooperative Wireless Localization Strategy for Mobile Nodes.

    Authors: Xingkai Bao, Jing Li
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper considers N mobile nodes that move together in the vicinity of
    each other, whose initial poses as well as subsequent movements must be
    accurately tracked in real time with the assist of M(>=3) reference nodes. By
    engaging the neighboring mobile nodes in a simple but effective cooperation,
    and by exploiting both the time-of-arrival (TOA) information (between mobile
    nodes and reference nodes) and the received-signal-strength (RSS) information
    (between mobile nodes), an effective new localization strategy, termed
    cooperative TOA and RSS (COTAR), is developed.

  104. Cloud Migration: A Case Study of Migrating an Enterprise IT System to IaaS.

    Authors: Ali Khajeh-Hosseini, Ian Sommerville, David Greenwood
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This case study illustrates the potential benefits and risks associated with
    the migration of an IT system in the oil & gas industry from an in-house data
    center to Amazon EC2 from a broad variety of stakeholder perspectives across
    the enterprise, thus transcending the typical, yet narrow, financial and
    technical analysis offered by providers. Our results show that the system
    infrastructure in the case study would have cost 37% less over 5 years on EC2,
    and using cloud computing could have potentially eliminated 21% of the support
    calls for this system.

  105. Optimization and Analysis of Distributed Averaging with Short Node Memory.

    Authors: Boris N. Oreshkin, Mark J. Coates, Michael G. Rabbat
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we demonstrate, both theoretically and by numerical examples,
    that adding a local prediction component to the update rule can significantly
    improve the convergence rate of distributed averaging algorithms. We focus on
    the case where the local predictor is a linear combination of the node's two
    previous values (i.e., two memory taps), and our update rule computes a
    combination of the predictor and the usual weighted linear combination of
    values received from neighbouring nodes.

  106. Virtual Machine Support for Many-Core Architectures: Decoupling Abstract from Concrete Concurrency Models.

    Authors: Stefan Marr, Michael Haupt, Stijn Timbermont, Bram Adams, Theo D&#x27;Hondt, Pascal Costanza, Wolfgang De Meuter
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The upcoming many-core architectures require software developers to exploit
    concurrency to utilize available computational power. Today's high-level
    language virtual machines (VMs), which are a cornerstone of software
    development, do not provide sufficient abstraction for concurrency concepts. We
    analyze concrete and abstract concurrency models and identify the challenges
    they impose for VMs. To provide sufficient concurrency support in VMs, we
    propose to integrate concurrency operations into VM instruction sets.

  107. Simplifying Parallelization of Scientific Codes by a Function-Centric Approach in Python.

    Authors: Jon K. Nilsen, Xing Cai, Bjorn Hoyland, Hans Petter Langtangen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The purpose of this paper is to show how existing scientific software can be
    parallelized using a separate thin layer of Python code where all parallel
    communication is implemented. We provide specific examples on such layers of
    code, and these examples may act as templates for parallelizing a wide set of
    serial scientific codes. The use of Python for parallelization is motivated by
    the fact that the language is well suited for reusing existing serial codes
    programmed in other languages.

  108. Performance and Stability of the Chelonia Storage Cloud.

    Authors: Jon K. Nilsen, Salman Toor, Zsombor Nagy, Bjarte Mohn, Alex L. Read
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we present the Chelonia storage cloud middleware. It was
    designed to fill the requirements gap between those of large, sophisticated
    scientific collaborations which have adopted the grid paradigm for their
    distributed storage needs, and of corporate business communities which are
    gravitating towards the cloud paradigm. The similarities to and differences
    between Chelonia and several well-known grid- and cloud-based storage solutions
    are commented.

  109. Local algorithms in (weakly) coloured graphs.

    Authors: Matti &#xc5;strand, Valentin Polishchuk, Joel Rybicki, Jukka Suomela, Jara Uitto
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A local algorithm is a distributed algorithm that completes after a constant
    number of synchronous communication rounds. We present local approximation
    algorithms for the minimum dominating set problem and the maximum matching
    problem in 2-coloured and weakly 2-coloured graphs. In a weakly 2-coloured
    graph, both problems admit a local algorithm with the approximation factor
    $(\Delta+1)/2$, where $\Delta$ is the maximum degree of the graph. We also give
    a matching lower bound proving that there is no local algorithm with a better
    approximation factor for either of these problems.

  110. A Multi-Stage CUDA Kernel for Floyd-Warshall.

    Authors: Ben Lund, Justin W Smith
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We present a new implementation of the Floyd-Warshall All-Pairs Shortest
    Paths algorithm on CUDA. Our algorithm runs approximately 5 times faster than
    the previously best reported algorithm. In order to achieve this speedup, we
    applied a new technique to reduce usage of on-chip shared memory and allow the
    CUDA scheduler to more effectively hide instruction latency.

  111. Optimization of Multiple Vehicle Routing Problems using Approximation Algorithms.

    Authors: R. Nallusamy, K. Duraiswamy, R. Dhanalaksmi, P. Parthiban
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper deals with generating of an optimized route for multiple Vehicle
    routing Problems (mVRP). We used a methodology of clustering the given cities
    depending upon the number of vehicles and each cluster is allotted to a
    vehicle. k- Means clustering algorithm has been used for easy clustering of the
    cities. In this way the mVRP has been converted into VRP which is simple in
    computation compared to mVRP. After clustering, an optimized route is generated
    for each vehicle in its allotted cluster.

  112. Severity Prediction of Drought in A Large Geographical Area Using Distributed Wireless Sensor Networks.

    Authors: M. Vaidehi, T.R. Gopalakrsihnan Nair, Satish.G. Dappin, G. Nithya Nair
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, the severity prediction of drought through the implementation
    of modern sensor networks is discussed. We describe how to design a drought
    prediction system using wireless sensor networks. This paper will describe a
    terrestrial interconnected wireless sensor network paradigm for the prediction
    of severity of drought over a vast area of 10,000 sq km. The communication
    architecture for sensor network is outlined and the protocols developed for
    each layer is explored. The data integration model and sensor data analysis at
    the central computer is explained.

  113. Performance and Fault Tolerance in the StoreTorrent Parallel Filesystem.

    Authors: Federico D. Sacerdoti
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    With a goal of supporting the timely and cost-effective analysis of Terabyte
    datasets on commodity components, we present and evaluate StoreTorrent, a
    simple distributed filesystem with integrated fault tolerance for efficient
    handling of small data records. Our contributions include an application-OS
    pipelining technique and metadata structure to increase small write and read
    performance by a factor of 1-10, and the use of peer-to-peer communication of
    replica-location indexes to avoid transferring data during parallel analysis
    even in a degraded state.

  114. Termination Detection of Local Computations.

    Authors: Emmanuel Godard, Yves M&#xe9;tivier, Gerard Tel
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Contrary to the sequential world, the processes involved in a distributed
    system do not necessarily know when a computation is globally finished. This
    paper investigates the problem of the detection of the termination of local
    computations. We define four types of termination detection: no detection,
    detection of the local termination, detection by a distributed observer,
    detection of the global termination.

  115. Practical Parallel External Memory Algorithms via Simulation of Parallel Algorithms.

    Authors: David E. Robillard
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This thesis introduces PEMS2, an improvement to PEMS (Parallel External
    Memory System). PEMS executes Bulk-Synchronous Parallel (BSP) algorithms in an
    External Memory (EM) context, enabling computation with very large data sets
    which exceed the size of main memory. Many parallel algorithms have been
    designed and implemented for Bulk-Synchronous Parallel models of computation.
    Such algorithms generally assume that the entire data set is stored in main
    memory at once.

  116. Research Challenges for Enterprise Cloud Computing.

    Authors: Ilango Sriram, Ali Khajeh-Hosseini, Ian Sommerville
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing represents a shift away from computing as a product that is
    purchased, to computing as a service that is delivered to consumers over the
    internet from large-scale data centers - or "clouds". This paper discusses some
    of the research challenges for cloud computing from an enterprise or
    organizational perspective, and puts them in context by reviewing the existing
    body of literature in cloud computing.

  117. Research Agenda in Cloud Technologies.

    Authors: Ilango Sriram, Ali Khajeh-Hosseini
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Cloud computing is the latest effort in delivering computing resources as a
    service. It represents a shift away from computing as a product that is
    purchased, to computing as a service that is delivered to consumers over the
    internet from large-scale data centres - or "clouds". Whilst cloud computing is
    gaining growing popularity in the IT industry, academia appeared to be lagging
    behind the rapid developments in this field.

  118. Towards Transactional Load over XtreemFS.

    Authors: Roman Talyansky, Adolf Hohl, Bernd Scheuermann, Bjorn Kolbeck, Erich Focht
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose using trace-based assessment of the performance of distributed
    file systems (DFS) under transactional IO load. The assessment includes
    simulations and experiments using the IO traces. Our experiments suggest that
    DFS, and specifically XtreemFS have a good potential to support transactional
    IO load in distributed environments: they demonstrate good performance, high
    availability and scalability, while at the same time opening the way to TCO
    reduction.

  119. Classifying Application Phases in Asymmetric Chip Multiprocessors.

    Authors: A. Z. Jooya, M. Analoui
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In present study, in order to improve the performance and reduce the amount
    of power which is dissipated in heterogeneous multicore processors, the ability
    of detecting the program execution phases is investigated. The programs
    execution intervals have been classified in different phases based on their
    throughput and the utilization of the cores. The results of implementing the
    phase detection technique are investigated on a single core processor and also
    on a multicore processor.

  120. Multiprocessor Scheduling For Tasks With Priority Using GA.

    Authors: Dr.G.Padmavathi, Mrs.S.R.Vijayalakshmi
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Multiprocessors have emerged as a powerful computing means for running
    realtime applications, especially where a uniprocessor system would not be
    sufficient enough to execute all the tasks. The high performance and
    reliability of multiprocessors have made them a powerful computing resource.
    Such computing environment requires an efficient algorithm to determine when
    and on which processor a given task should execute.

  121. MapReduce for Integer Factorization.

    Authors: Javier Tordable
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Integer factorization is a very hard computational problem. Currently no
    efficient algorithm for integer factorization is publicly known. However, this
    is an important problem on which it relies the security of many real world
    cryptographic systems.

    I present an implementation of a fast factorization algorithm on MapReduce.
    MapReduce is a programming model for high performance applications developed
    originally at Google. The quadratic sieve algorithm is split into the different
    MapReduce phases and compared against a standard implementation.

  122. A distributed file system for a wide-area high performance computing infrastructure.

    Authors: Edward Walker
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe our work in implementing a wide-area distributed file system for
    the NSF TeraGrid. The system, called XUFS, allows private distributed name
    spaces to be created for transparent access to personal files across over 9000
    computer nodes. XUFS builds on many principles from prior distributed file
    systems research, but extends key design goals to support the workflow of
    computational science researchers.

  123. QR Factorization of Tall and Skinny Matrices in a Grid Computing Environment.

    Authors: Emmanuel Agullo, Camille Coti, Jack Dongarra, Thomas Herault, Julien Langou
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Previous studies have reported that common dense linear algebra operations do
    not achieve speed up by using multiple geographical sites of a computational
    grid. Because such operations are the building blocks of most scientific
    applications, conventional supercomputers are still strongly predominant in
    high-performance computing and the use of grids for speeding up large-scale
    scientific problems is limited to applications exhibiting parallelism at a
    higher level.

  124. DiVinE-CUDA - A Tool for GPU Accelerated LTL Model Checking.

    Authors: Ji&#x159;&#xed; Barnat, Lubo&#x161; Brim, Milan &#x10c;e&#x161;ka
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we present a tool that performs CUDA accelerated LTL Model
    Checking. The tool exploits parallel algorithm MAP adjusted to the NVIDIA CUDA
    architecture in order to efficiently detect the presence of accepting cycles in
    a directed graph. Accepting cycle detection is the core algorithmic procedure
    in automata-based LTL Model Checking. We demonstrate that the tool outperforms
    non-accelerated version of the algorithm and we discuss where the limits of the
    tool are and what we intend to do in the future to avoid them.

  125. Formal Aspects of Grid Brokering.

    Authors: Attila Kert&#xe9;sz, Zsolt N&#xe9;meth
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Coordination in distributed environments, like Grids, involves selecting the
    most appropriate services, resources or compositions to carry out the planned
    activities. Such functionalities appear at various levels of the infrastructure
    and in various means forming a blurry domain, where it is hard to see how the
    participating components are related and what their relevant properties are. In
    this paper we focus on a subset of these problems: resource brokering in Grid
    middleware.

  126. Parallelizing Deadlock Resolution in Symbolic Synthesis of Distributed Programs.

    Authors: Fuad Abujarad, Borzoo Bonakdarpour, Sandeep S. Kulkarni
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Previous work has shown that there are two major complexity barriers in the
    synthesis of fault-tolerant distributed programs: (1) generation of fault-span,
    the set of states reachable in the presence of faults, and (2) resolving
    deadlock states, from where the program has no outgoing transitions. Of these,
    the former closely resembles with model checking and, hence, techniques for
    efficient verification are directly applicable to it. Hence, we focus on
    expediting the latter with the use of multi-core technology.

  127. Survey of clustering algorithms for MANET.

    Authors: Dr. Mahesh Motwani, Ratish Agarwal
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Many clustering schemes have been proposed for ad hoc networks. A systematic
    classification of these clustering schemes enables one to better understand and
    make improvements. In mobile ad hoc networks, the movement of the network nodes
    may quickly change the topology resulting in the increase of the overhead
    message in topology maintenance. Protocols try to keep the number of nodes in a
    cluster around a pre-defined threshold to facilitate the optimal operation of
    the medium access control protocol.

  128. Efficient Gaussian Elimination on a 2D SIMD Array of Processors without Column Broadcasts.

    Authors: Mugurel Ionut Andreica
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents an efficient method for implementing the Gaussian
    elimination technique for an nxm (m>=n) matrix, using a 2D SIMD array of nxm
    processors. The described algorithm consists of 2xn-1=O(n) iterations, which
    provides an optimal speed-up over the serial version. A particularity of the
    algorithm is that it only requires broadcasts on the rows of the processor
    matrix and not on its columns. The paper also presents several extensions and
    applications of the Gaussian elimination algorithm.

  129. Peer-to-Peer Cloud Provisioning: Service Discovery and Load-Balancing.

    Authors: Anna Liu, Rajiv Ranjan, Liang Zhao, Xiaomin Wu
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This chapter presents: (i) a layered peer-to-peer Cloud provisioning
    architecture; (ii) a summary of the current state-of-the-art in Cloud
    provisioning with particular emphasis on service discovery and load-balancing;
    (iii) a classification of the existing peer-to-peer network management model
    with focus on extending the DHTs for indexing and managing complex provisioning
    information; and (iv) the design and implementation of novel, extensible
    software fabric (Cloud peer) that combines public/private clouds, overlay
    networking and structured peer-to-peer indexing techniques for supporting
    s

  130. Self-Stabilizing Byzantine Asynchronous Unison.

    Authors: Swan Dubois, Maria Gradinariu Potop-Butucaru, Mikhail Nesterenko, S&#xe9;bastien Tixeuil
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We explore asynchronous unison in the presence of systemic transient and
    permanent Byzantine faults in shared memory. We observe that the problem is not
    solvable under less than strongly fair scheduler or for system topologies with
    maximum node degree greater than two. We present a self-stabilizing
    Byzantine-tolerant solution to asynchronous unison for chain and ring
    topologies. Our algorithm has minimum possible containment radius and optimal
    stabilization time.

  131. Building and Installing a Hadoop/MapReduce Cluster from Commodity Components.

    Authors: Jochen L. Leidner, Gary Berosik
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This tutorial presents a recipe for the construction of a compute cluster for
    processing large volumes of data, using cheap, easily available personal
    computer hardware (Intel/AMD based PCs) and freely available open source
    software (Ubuntu Linux, Apache Hadoop).

  132. Checkpointing vs. Migration for Post-Petascale Machines.

    Authors: Franck Cappello, Henri Casanova, Yves Robert
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We craft a few scenarios for the execution of sequential and parallel jobs on
    future generation machines. Checkpointing or migration, which technique to
    choose?

  133. Lattice QCD Thermodynamics on the Grid.

    Authors: Jakub T. Mo&#x15b;cicki, Maciej Wo&#x15b;, Massimo Lamanna, Philippe de Forcrand, Owe Philipsen
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We describe how we have used simultaneously ${\cal O}(10^3)$ nodes of the
    EGEE Grid, accumulating ca. 300 CPU-years in 2-3 months, to determine an
    important property of Quantum Chromodynamics. We explain how Grid resources
    were exploited efficiently and with ease, using user-level overlay based on
    Ganga and DIANE tools above standard Grid software stack. Application-specific
    scheduling and resource selection based on simple but powerful heuristics
    allowed to improve efficiency of the processing to obtain desired scientific
    results by a specified deadline.

  134. Best-effort Group Service in Dynamic Networks.

    Authors: Bertrand Ducourthial, Sofiane Khalfallah, Franck Petit
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We propose a group membership service for dynamic ad hoc networks. It
    maintains as long as possible the existing groups and ensures that each group
    diameter is always smaller than a constant, ?xed according to the application
    using the groups. The proposed protocol is self-stabilizing and works in
    dynamic distributed systems. Moreover, it ensures a kind of continuity in the
    service o?er to the application while the system is converging, except if too
    strong topology changes happen.

  135. A Semantic Grid Oriented to E-Tourism.

    Authors: Xiao Ming Zhang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    With increasing complexity of tourism business models and tasks, there is a
    clear need of the next generation e-Tourism infrastructure to support flexible
    automation, integration, computation, storage, and collaboration. Currently
    several enabling technologies such as semantic Web, Web service, agent and grid
    computing have been applied in the different e-Tourism applications, however
    there is no a unified framework to be able to integrate all of them.

  136. PyCUDA: GPU Run-Time Code Generation for High-Performance Computing.

    Authors: Andreas Kl&#xf6;ckner, Nicolas Pinto, Yunsup Lee, Bryan Catanzaro, Paul Ivanov, Ahmed Fasih
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    High-performance scientific computing has recently seen a surge of interest
    in heterogeneous systems, with an emphasis on modern Graphics Processing Units
    (GPUs). These devices offer tremendous potential for performance and efficiency
    in important large-scale applications of computational science. However,
    exploiting this potential can be challenging, as one must adapt to the
    specialized and rapidly evolving computing environment currently exhibited by
    GPUs. One way of addressing this challenge is to embrace better techniques and
    develop tools tailored to their needs.

  137. Near-Optimal Sublinear Time Bounds for Distributed Random Walks.

    Authors: Prasad Tetali, Danupon Nanongkai, Atish Das Sarma, Gopal Pandurangan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    We focus on the problem of performing random walks efficiently in a
    distributed network. Given bandwidth constraints, the goal is to minimize the
    number of rounds required to obtain a random walk sample on an undirected
    network. Despite the widespread use of random walks in distributed computing,
    most algorithms that compute a random walk sample of length $\ell$ naively,
    i.e., in $O(\ell)$ rounds.

  138. Fault-Tolerance through Message-logging and Check-pointing: Disaster Recovery for CORBA-based Distributed Bank Servers.

    Authors: Emil Vassev, Que Thu Dung Nguyen, Heng Kuang
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This report presents results of our endeavor towards developing a
    failure-recovery variant of a CORBA-based bank server that provides fault
    tolerance features through message logging and checkpoint logging.

  139. Global communications in multiprocessor simulations of flames.

    Authors: V. Karlin
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper we investigate performance of global communications in a
    particular parallel code. The code simulates dynamics of expansion of premixed
    spherical flames using an asymptotic model of Sivashinsky type and a spectral
    numerical algorithm. As a result, the code heavily relies on global all-to-all
    interprocessor communications implementing transposition of the distributed
    data array in which numerical solution to the problem is stored.

  140. Domain Decomposition Based High Performance Parallel Computing.

    Authors: Mandhapati P. Raju, Siddhartha Khaitan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The study deals with the parallelization of finite element based
    Navier-Stokes codes using domain decomposition and state-ofart sparse direct
    solvers. There has been significant improvement in the performance of sparse
    direct solvers. Parallel sparse direct solvers are not found to exhibit good
    scalability. Hence, the parallelization of sparse direct solvers is done using
    domain decomposition techniques. A highly efficient sparse direct solver
    PARDISO is used in this study. The scalability of both Newton and modified
    Newton algorithms are tested.

  141. Distributed Abstract Optimization via Constraints Consensus: Theory and Applications.

    Authors: Francesco Bullo, Giuseppe Notarstefano
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Distributed abstract programs are a novel class of distributed optimization
    problems where (i) the number of variables is much smaller than the number of
    constraints and (ii) each constraint is associated to a network node. Abstract
    optimization programs are a generalization of linear programs that captures
    numerous geometric optimization problems.

  142. SPECI, a simulation tool exploring cloud-scale data centres.

    Authors: Ilango Sriram
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    There is a rapid increase in the size of data centres (DCs) used to provide
    cloud computing services. It is commonly agreed that not all properties in the
    middleware that manages DCs will scale linearly with the number of components.
    Further, "normal failure" complicates the assessment of the per-formance of a
    DC. However, unlike in other engineering domains, there are no well established
    tools that allow the prediction of the performance and behav-iour of future
    generations of DCs.

  143. Optimised access to user analysis data using the gLite DPM.

    Authors: Sam Skipsey, Greig Cowan, Mike Kenyon, Graeme Stewart, Stuart Purdie
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The ScotGrid distributed Tier-2 now provides more that 4MSI2K and 500TB for
    LHC computing, which is spread across three sites at Durham, Edinburgh and
    Glasgow. Tier-2 sites have a dual role to play in the computing models of the
    LHC VOs. Firstly, their CPU resources are used for the generation of Monte
    Carlo event data. Secondly, the end user analysis data is distributed across
    the grid to the site's storage system and held on disk ready for processing by
    physicists' analysis jobs.

  144. ScotGrid: Providing an Effective Distributed Tier-2 in the LHC Era.

    Authors: Sam Skipsey, David Ambrose-Griffith, Greig Cowan, Mike Kenyon, Orlando Richards, Phil Roffe, Graeme Stewart
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    ScotGrid is a distributed Tier-2 centre in the UK with sites in Durham,
    Edinburgh and Glasgow. ScotGrid has undergone a huge expansion in hardware in
    anticipation of the LHC and now provides more than 4MSI2K and 500TB to the LHC
    VOs. Scaling up to this level of provision has brought many challenges to the
    Tier-2 and we show in this paper how we have adopted new methods of organising
    the centres, from fabric management and monitoring to remote management of
    sites to management and operational procedures, to meet these challenges.

  145. Modeling and Verification for Timing Satisfaction of Fault-Tolerant Systems with Finiteness.

    Authors: Chih-Hong Cheng, Christian Buckl, Javier Esparza, Alois Knoll
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The increasing use of model-based tools enables further use of formal
    verification techniques in the context of distributed real-time systems. To
    avoid state explosion, it is necessary to construct verification models that
    focus on the aspects under consideration.

  146. Critical Analysis of Middleware Architectures for Large Scale Distributed Systems.

    Authors: Alexandru Costan, Corina Stratan, Eliana-Dina Tirsa, Mugurel Ionut Andreica, Valentin Cristea, Ciprian Mihai Dobre, Florin Pop
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Distributed computing is increasingly being viewed as the next phase of Large
    Scale Distributed Systems (LSDSs). However, the vision of large scale resource
    sharing is not yet a reality in many areas - Grid computing is an evolving area
    of computing, where standards and technology are still being developed to
    enable this new paradigm. Hence, in this paper we analyze the current
    development of middleware tools for LSDS, from multiple perspectives:
    architecture, applications and market research.

  147. DILAND: An Algorithm for Distributed Sensor Localization with Noisy Distance Measurements.

    Authors: Usman A. Khan, Soummya Kar, Jose M. F. Moura
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this correspondence, we present an algorithm for distributed sensor
    localization with noisy distance measurements (DILAND) that extends and makes
    the DLRE more robust. DLRE is a distributed sensor localization algorithm in
    $\mathbb{R}^m$ $(m\geq1)$ introduced in \cite{usman_loctsp:08}. DILAND operates
    when (i) the communication among the sensors is noisy; (ii) the communication
    links in the network may fail with a non-zero probability; and (iii) the
    measurements performed to compute distances among the sensors are corrupted
    with noise.

  148. DILAND: An Algorithm for Distributed Sensor Localization with Noisy Distance Measurements.

    Authors: Usman A. Khan, Soummya Kar, Jose M. F. Moura
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this correspondence, we present an algorithm for distributed sensor
    localization with noisy distance measurements (DILAND) that extends and makes
    the DLRE more robust. DLRE is a distributed sensor localization algorithm in
    $\mathbb{R}^m$ $(m\geq1)$ introduced in \cite{usman_loctsp:08}. DILAND operates
    when (i) the communication among the sensors is noisy; (ii) the communication
    links in the network may fail with a non-zero probability; and (iii) the
    measurements performed to compute distances among the sensors are corrupted
    with noise.

  149. Towards a Unified Approach to (In)Decision: Routing, Games, Circuits, Consensus, and Beyond.

    Authors: Aaron D. Jaggard, Michael Schapira, Rebecca N. Wright
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    In this paper, we explore a unified treatment of the difficulty of reaching a
    decision in constrained distributed computing environments in which there is a
    lack of global coordination or knowledge. We show a general impossibility
    result for a broad class of decision protocols. Importantly, our impossibility
    result holds, in particular, for "asynchronous, distributed, historyless
    computation", in which each computational node's selection of actions only
    depends on the current actions of other nodes, even under the assumption that
    no node can be faulty.

  150. High availability using virtualization.

    Authors: Federico Calzolari
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    High availability has always been one of the main problems for a data center.
    Till now high availability was achieved by host per host redundancy, a highly
    expensive method in terms of hardware and human costs. A new approach to the
    problem can be offered by virtualization.

  151. Robust Failure Detection Architecture for Large Scale Distributed Systems.

    Authors: Alexandru Costan, Mugurel Ionut Andreica, Valentin Cristea, Ciprian Mihai Dobre, Florin Pop
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Failure detection is a fundamental building block for ensuring fault
    tolerance in large scale distributed systems. There are lots of approaches and
    implementations in failure detectors. Providing flexible failure detection in
    off-the-shelf distributed systems is difficult. In this paper we present an
    innovative solution to this problem. Our approach is based on adaptive,
    decentralized failure detectors, capable of working asynchronous and
    independent on the application flow.

  152. Towards a Grid Platform for Scientific Workflows Management.

    Authors: Alexandru Costan, Corina Stratan, Eliana-Dina Tirsa, Mugurel Ionut Andreica, Valentin Cristea
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Workflow management systems allow the users to develop complex applications
    at a higher level, by orchestrating functional components without handling the
    implementation details. Although a wide range of workflow engines are developed
    in enterprise environments, the open source engines available for scientific
    applications lack some functionalities or are too difficult to use for
    non-specialists.

  153. A historical perspective on developing foundations for privacy-friendly client cloud computing: The Paradigm Shift from "Inconsistency Denial" to "Practical Semantic Integration(TM)".

    Authors: Carl Hewitt
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Arguably, the original paradigm for computation was Logic Programming broadly
    conceived as "deducing computational steps from existing information."

    The idea has a long development that went through many twists in which
    important questions turned out to have surprising answers, including the
    following:

    * How much of concurrent computation is reducible to deduction?

    * Are the laws of thought consistent?

    * Is "rapid recovery" a more viable policy than "inconsistency denial"?

  154. Supporting Lock-Free Composition of Concurrent Data Objects.

    Authors: Daniel Cederman, Philippas Tsigas
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Lock-free data objects offer several advantages over their blocking
    counterparts, such as being immune to deadlocks and convoying and, more
    importantly, being highly concurrent. But they share a common disadvantage in
    that the operations they provide are difficult to compose into larger atomic
    operations while still guaranteeing lock-freedom. We present a lock-free
    methodology for composing highly concurrent linearizable objects together by
    unifying their linearization points.

  155. A New Fuzzy Approach for Dynamic Load Balancing Algorithm.

    Authors: Abbas Karimi, Faraneh Zarafshan, Adznan.b. Jantan, A.R Ramli, M.Iqbal b.Saripan
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Load balancing is the process of improving the Performance of a parallel and
    distributed system through is distribution of load among the processors [1-2].
    Most of the previous work in load balancing and distributed decision making in
    general, do not effectively take into account the uncertainty and inconsistency
    in state information but in fuzzy logic, we have advantage of using crisps
    inputs.

  156. Web-enabling Cache Daemon for Complex Data.

    Authors: Ivan Voras, Mario Zagar
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    One of the most common basic techniques for improving the performance of web
    applications is caching frequently accessed data in fast data stores,
    colloquially known as cache daemons. In this paper we present a cache daemon
    suitable for storing complex data while maintaining fine-grained control over
    data storage, retrieval and expiry. Data manipulation in this cache daemon is
    performed via standard SQL statements so we call it SQLcached. It is a
    practical, usable solution already implemented in several large web sites.

  157. Transform-based Distributed Data Gathering.

    Authors: Godwin Shen, Antonio Ortega
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    A general class of unidirectional transforms is presented that can be
    computed in a distributed manner along an arbitrary routing tree. Additionally,
    we provide a set of conditions under which these transforms are invertible.
    These transforms can be computed as data is routed towards the collection (or
    sink) node in the tree and exploit data correlation between nodes in the tree.
    Moreover, when used in wireless sensor networks, these transforms can also
    leverage data received at nodes via broadcast wireless communications.

  158. Business in the Grid.

    Authors: Erich Schikuta, Thomas Weishaeupl, Flavia Donno, Heinz Stockinger, Elisabeth Vinek, Helmut Wanek, Christoph Witzany, Irfan Ul Haq
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    From 2004 to 2007 the Business In the Grid (BIG) project took place and was
    driven by the following goals: Firstly, make business aware of Grid technology
    and, secondly, try to explore new business models. We disseminated Grid
    computing by mainly concentrating on the central European market and
    interviewed several companies in order to gain insights into the Grid
    acceptance in industrial environments.

  159. Greedy Gossip with Eavesdropping.

    Authors: Deniz Ustebay, Boris Oreshkin, Mark Coates, Michael Rabbat
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    This paper presents greedy gossip with eavesdropping (GGE), a novel
    randomized gossip algorithm for distributed computation of the average
    consensus problem. In gossip algorithms, nodes in the network randomly
    communicate with their neighbors and exchange information iteratively. The
    algorithms are simple and decentralized, making them attractive for wireless
    network applications. In general, gossip algorithms are robust to unreliable
    wireless conditions and time varying network topologies. In this paper we
    introduce GGE and demonstrate that greedy updates lead to rapid convergence.

  160. Building on Quicksand.

    Authors: Pat Helland, David Campbell
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Reliable systems have always been built out of unreliable components. Early
    on, the reliable components were small such as mirrored disks or ECC (Error
    Correcting Codes) in core memory. These systems were designed such that
    failures of these small components were transparent to the application. Later,
    the size of the unreliable components grew larger and semantic challenges crept
    into the application when failures occurred.

  161. Resource Matchmaking Algorithm using Dynamic Rough Set in Grid Environment.

    Authors: Iraj Ataollahi, Mortza Analoui
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Grid environment is a service oriented infrastructure in which many
    heterogeneous resources participate to provide the high performance
    computation. One of the bug issues in the grid environment is the vagueness and
    uncertainty between advertised resources and requested resources. Furthermore,
    in an environment such as grid dynamicity is considered as a crucial issue
    which must be dealt with. Classical rough set have been used to deal with the
    uncertainty and vagueness. But it can just be used on the static systems and
    can not support dynamicity in a system.

  162. FastFlow: Efficient Parallel Streaming Applications on Multi-core.

    Authors: Marco Aldinucci, Massimo Torquati, Massimiliano Meneghin
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    Shared memory multiprocessors come back to popularity thanks to rapid
    spreading of commodity multi-core architectures. As ever, shared memory
    programs are fairly easy to write and quite hard to optimise; providing
    multi-core programmers with optimising tools and programming frameworks is a
    nowadays challenge. Few efforts have been done to support effective streaming
    applications on these architectures. In this paper we introduce FastFlow, a
    low-level programming framework based on lock-free queues explicitly designed
    to support high-level languages for streaming applications.

  163. Energy-Efficient Scheduling of HPC Applications in Cloud Computing Environments.

    Authors: Saurabh Kumar Garg, Chee Shin Yeo, Arun Anandasivam, Rajkumar Buyya
    Subjects: and Cluster Computing, Distributed, Parallel
    Abstract

    The use of High Performance Computing (HPC) in commercial and consumer IT
    applications is becoming popular. They need the ability to gain rapid and
    scalable access to high-end computing capabilities. Cloud computing promises to
    deliver such a computing infrastructure using data centers so that HPC users
    can access applications and data from a Cloud anywhere in the world on demand
    and pay based on what they use. However, the growing demand drastically
    increases the energy consumption of data centers, which has become a critical
    issue.

RSS-материал