Search CORE

1,721,117 research outputs found

Load Balancing in Arbitrary Network Topologies with Stochastic Adversarial Input

Author: Eli Upfal
ANAGNOSTOPOULOS ARISTIDIS
Adam Kirsch
Aris Anagnostopoulos
Publication venue
Publication date: 01/01/2005
Field of study

We study the long-term (steady state) performance of a simple, randomized, local load balancing technique under a broad range of input conditions. We assume a system of n processors connected by an arbitrary network topology. Jobs are placed in the processors by a deterministic or randomized adversary. The adversary knows the current and past load distribution in the network and can use this information to place the new tasks in the processors. A node can execute one job per step, and can also participate in one load balancing operation in which it can move tasks to a direct neighbor in the network. In the protocol we analyze here, a node equalizes its load with a random neighbor in the graph

CiteSeerX

Archivio della ricerca- Università di Roma La Sapienza

Steady State Analysis of Balanced-Allocation Routing

Author: Eli Upfal
ANAGNOSTOPOULOS ARISTIDIS
Ioannis Kontoyiannis
Aris Anagnostopoulos
Publication venue
Publication date: 01/01/2005
Field of study

We compare the long-term, steady-state performance of a variant of the standard Dynamic Alternative Routing (DAR) technique commonly used in telephone and ATM networks, to the performance of a path-selection algorithm based on the "balanced-allocation" principle [Y. Azer, A. Z. Broder, A. R. Karlin, and E. Upfal, SIAM J Comput 29(1) (2000), 180--200; M. Mitzenmacher, Ph.D. Thesis, University of California, Berkeley, August 1996]; we refer to this new algorithm as the Balanced Dynamic Alternative Routing (BDAR) algorithm. While DAR checks alternative routes sequentially until available bandwidth is found, the BDAR algorithm compares and chooses the best among a small number of alternatives. We show that, at the expense of a minor increase in routing overhead, the BDAR algorithm gives a substantial improvement in network performance, in terms both of network congestion and of bandwidth requirement

CiteSeerX

Archivio della ricerca- Università di Roma La Sapienza

Sort me if you can: How to sort dynamic data

Author: Eli Upfal
Ravi Kumar
ANAGNOSTOPOULOS ARISTIDIS
Mohammad Mahdian
Aris Anagnostopoulos
Publication venue
Publication date: 01/01/2009
Field of study

We formulate and study a new computational model for dynamic data. In this model the data changes gradually and the goal of an algorithm is to compute the solution to some problem on the data at each time step, under the constraint that it only has a limited access to the data each time. As the data is constantly changing and the algorithm might be unaware of these changes, it cannot be expected to always output the exact right solution; we are interested in algorithms that guarantee to output an approximate solution. In particular, we focus on the fundamental problems of sorting and selection, where the true ordering of the elements changes slowly. We provide algorithms with performance close to the optimal in expectation and with high probability. © 2009 Springer Berlin Heidelberg

CiteSeerX

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Sorting and selection on dynamic data

Author: Kumar Ravi
Eli Upfal
Ravi Kumar
Anagnostopoulos Aris
Upfal Eli
ANAGNOSTOPOULOS ARISTIDIS
Mahdian Mohammad
Mohammad Mahdian
Aris Anagnostopoulos
Publication venue
Publication date: 01/01/2011
Field of study

We formulate and study a new computational model for dynamic data. In this model, the data changes gradually and the goal of an algorithm is to compute the solution to some problem on the data at each time step, under the constraint that it only has limited access to the data each time. As the data is constantly changing and the algorithm might be unaware of these changes, it cannot be expected to always output the exact right solution; we are interested in algorithms that guarantee to output an approximate solution. In particular, we focus on the fundamental problems of sorting and selection, where the true ordering of the elements changes slowly. We provide algorithms with performance close to the optimal in expectation and with high probability. (C) 2010 Elsevier B.V. All rights reserved

Elsevier - Publisher Connector

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Differentially Mutated Subnetworks Discovery

Author: Vandin Fabio
Eli Upfal
Upfal Eli
Fabio Vandin
Hajkarim Morteza Chalabi
Morteza Chalabi Hajkarim
Publication venue
Publication date: 01/01/2018
Field of study

We study the problem of identifying differentially mutated subnetworks of a large gene-gene interaction network, that is, subnetworks that display a significant difference in mutation frequency in two sets of cancer samples. We formally define the associated computational problem and show that the problem is NP-hard. We propose a novel and efficient algorithm, called DAMOKLE to identify differentially mutated subnetworks given genome-wide mutation data for two sets of cancer samples. We prove that DAMOKLE identifies subnetworks with a statistically significant difference in mutation frequency when the data comes from a reasonable generative model, provided enough samples are available. We test DAMOKLE on simulated and real data, showing that DAMOKLE does indeed find subnetworks with significant differences in mutation frequency and that it provides novel insights not obtained by standard methods

Directory of Open Access Journals

DROPS Dagstuhl Research Online Publication Server

Archivio istituzionale della ricerca - Università di Padova

An Efficient Rigorous Approach for Identifying Statistically Significant Frequent Itemsets

Author: PIETRACAPRINA ANDREA ALBERTO
Michael Mitzenmacher
Eli Upfal
VANDIN FABIO
Adam Kirsch
PUCCI GEPPINO
Publication venue
Publication date: 01/01/2012
Field of study

As advances in technology allow for the collection, storage, and analysis of vast amounts of data, the task of screening and assessing the significance of discovered patterns is becoming a major challenge in data mining applications. In this work, we address significance in the context of frequent itemset mining. Specifically, we develop a novel methodology to identify a meaningful support threshold s* for a dataset, such that the number of itemsets with support at least s* represents a substantial deviation from what would be expected in a random dataset with the same number of transactions and the same individual item frequencies. These itemsets can then be flagged as statistically significant with a small false discovery rate. We present extensive experimental results to substantiate the effectiveness of our methodology

Crossref

Archivio istituzionale della ricerca - Università di Padova

Finding near neighbors through cluster pruning

Author: Prabhakar Raghavan
PANCONESI Alessandro
Eli Upfal
CHIERICHETTI FLAVIO
Flavio Chierichetti
Alessandro Tiberi
Mauro Sozio
Alessandro Panconesi
Publication venue
Publication date: 01/01/2007
Field of study

Finding near(est) neighbors is a classic, difficult problem in data management and retrieval, with applications in text and image search,in finding similar objects and matching patterns. Here we study cluster pruning, an extremely simple randomized technique. During preprocessing we randomly choose a subset of data points to be leaders the remaining data points are partitioned by which leader is the closest. For query processing, we find the leader(s) closest to the query point. We then seek the nearest neighbors for the query point among only the points in the clusters of the closest leader(s). Recursion may be used in both preprocessing and in search. Such schemes seek approximate nearest neighbors that are "almost as good" as the nearest neighbors. How good are these approximations and how much do they save in computation. Our contributions are: (1) we quantify metrics that allow us to study the tradeoff between processing and the quality of the approximate nearest neighbors; (2) we give rigorous theoretical analysis of our schemes, under natural generative processes (generalizing Gaussian mixtures) for the data points; (3) experiments on both synthetic data from such generative processes, as well as on from a document corpus, confirming that we save orders of magnitude in query processing cost at modest compromises in the quality of retrieved points. In particular, we show that p-spheres, a state-of-the-art solution, is outperformed by our simple scheme whether the data points are stored in main or in external memory. Copyright 2007 ACM

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Space-round tradeoffs for MapReduce computations

Author: PIETRACAPRINA ANDREA ALBERTO
Eli Upfal
Francesco Silvestri
PUCCI GEPPINO
Matteo Riondato
Geppino Pucci
Andrea Pietracaprina
SILVESTRI FRANCESCO
Publication venue
Publication date: 01/01/2012
Field of study

This work explores fundamental modeling and algorithmic issues arising in the well-established MapReduce framework. First, we formally specify a computational model for MapReduce which captures the functional flavor of the paradigm by allowing for a flexible use of parallelism. Indeed, the model diverges from a traditional processor-centric view by featuring parameters which embody only global and local memory constraints, thus favoring a more data-centric view. Second, we apply the model to the fundamental computation task of matrix multiplication presenting upper and lower bounds for both dense and sparse matrix multiplication, which highlight interesting tradeoffs between space and round complexity. Finally, building on the matrix multiplication results, we derive further space-round tradeoffs on matrix inversion and matching

Crossref

Archivio istituzionale della ricerca - Università di Padova

A Simple and Deterministic Competitive Algorithm for Online Facility Location

Author: Eli Upfal
Hentenryck Pascal Van
Anagnostopoulos Aris
Bent Russell
Upfal Eli
ANAGNOSTOPOULOS ARISTIDIS
Russell Bent
Aris Anagnostopoulos
Pascal Van Hentenryck
P. Van Hentenryck
Publication venue
Publication date: 01/01/2003
Field of study

This paper presents a deterministic and e#cient algorithm for online facility location. The algorithm is based on a simple hierarchical partitioning and is extremely simple to implement. It also applies to a variety of models, i.e., models where the facilities can be placed anywhere in the region, or only at customer sites, or only at fixed locations. The paper shows that the algorithm is O(log n)-competitive under these various models. It also shows that the algorithm is O(1)-competitive with high probability and for any arrival order when customers are uniformly distributed or when they follow a distribution satisfying a smoothness property. Experimental results for a variety of scenarios indicate that the algorithm behaves extremely well in practice

CiteSeerX

Elsevier - Publisher Connector

Archivio della ricerca- Università di Roma La Sapienza

Algorithms on evolving graphs

Author: Kumar Ravi
Eli Upfal
Anagnostopoulos Aris
Upfal Eli
VANDIN FABIO
Ravi Kumar
Fabio Vandin
ANAGNOSTOPOULOS ARISTIDIS
Mahdian Mohammad
Mohammad Mahdian
Aris Anagnostopoulos
Publication venue
Publication date: 01/01/2012
Field of study

Motivated by applications that concern graphs that are evolving and massive in nature, we define a new general framework for computing with such graphs. In our framework, the graph changes over time and an algorithm can only track these changes by explicitly probing the graph. This framework captures the inherent tradeoff between the complexity of maintaining an up-to-date view of the graph and the quality of results computed with the available view. We apply this framework to two classical graph connectivity problems, namely, path connectivity and minimum spanning trees, and obtain efficient algorithms. © 2012 ACM

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Archivio istituzionale della ricerca - Università di Padova