1,721,102 research outputs found

FastFlow parallel programming framework

Author: Massimo Torquati
Marco Aldinucci
Publication date: 01/01/2010

FastFlow is a C++ parallel programming framework advocating high-level, pattern-based parallel programming. It chiefly supports streaming and data parallelism, targeting heterogeneous platforms composed of clusters of shared-memory platforms, possibly equipped with computing accelerators such as GPGPUs, Xeon Phi, and Tilera TILE64. The main design philosophy of FastFlow is to provide application designers with key features for parallel programming (e.g. time-to-market, efficiency, functional and performance portability) via suitable parallel programming abstractions and a carefully designed run-time support.

Archivio della Ricerca - Università di Pisa

Guest Editorial: High-Level Parallel Programming and Applications

Author: Massimo Torquati
Susanna Pelagatti
Marco Danelutto
Publication date: 01/01/2016

Guest editorial for the International Journal of Parallel Programming special issue on High-Level Parallel Programming and Applications.

Archivio della Ricerca - Università di Pisa

Structured parallel programming with “core” FastFlow

Author: Massimo Torquati
Marco Danelutto
Publication date: 01/01/2015

FastFlow is an open source, structured parallel programming framework originally conceived to support highly efficient stream parallel computation while targeting shared memory multi-cores. Its efficiency mainly comes from the optimized implementation of the base communication mechanisms and from its layered design. FastFlow ultimately provides parallel application programmers with a set of ready-to-use, parametric algorithmic skeletons modeling the most common parallelism exploitation patterns. The algorithmic skeletons provided by FastFlow may be freely nested to model increasingly complex parallelism exploitation patterns. This tutorial describes the “core” FastFlow, that is, the set of skeletons supported since version 1.0 of FastFlow, and outlines the recent advances aimed at (i) introducing new, higher level skeletons and (ii) targeting networked multi-cores, possibly equipped with GPUs, in addition to single multi/many-core processing elements.

Crossref

Archivio della Ricerca - Università di Pisa

A RISC building block set for structured parallel programming

Author: Massimo Torquati
Marco Danelutto
Publication date: 01/01/2013

We propose a set of building blocks (RISC-pb2l) suitable for building high-level structured parallel programming frameworks. The set is designed following a RISC approach. RISC-pb2l is architecture independent, but the implementation of the different blocks may be specialized to make the best use of the peculiarities of the target architecture. A number of optimizations may be designed that transform compositions of basic building blocks into more efficient compositions, such that parallel application efficiency may be derived by construction rather than by debugging.

Crossref

Archivio della Ricerca - Università di Pisa

Increasing Efficiency in Parallel Programming Teaching

Author: Massimo Torquati
Marco Danelutto
Publication date: 01/01/2018

The ability to teach parallel programming principles and techniques is becoming fundamental to preparing a new generation of programmers able to master the pervasive parallelism made available by hardware vendors. Classical parallel programming courses leverage either low-level programming frameworks (e.g. those based on Pthreads) or higher level frameworks such as OpenMP or MPI. We discuss our teaching experience within the Master in 'Computer Science and Networking', where parallel programming is taught leveraging structured parallel programming principles and frameworks. The paper summarizes the results achieved in eight years of experience and shows how the adoption of a structured parallel programming approach improves the efficiency of the teaching process.

Crossref

Archivio della Ricerca - Università di Pisa

The VirtuaLinux Storage Abstraction Layer for Efficient Virtual Clustering

Author: Massimo Torquati
Pierfrancesco Zuccato
Marco Vanneschi
Publication date: 01/01/2008

Archivio della Ricerca - Università di Pisa

Accelerating Apache Farms Through Ad-HOC Distributed scalable object repository

Author: Marco Aldinucci
Massimo Torquati
Publication date: 01/01/2004

We present HOC: a fast, scalable object repository providing programmers with a general storage module. HOC may be used to implement DSMs as well as distributed cache subsystems. HOC is composed of a set of hot-pluggable cooperating processes that may sustain a close to optimal network traffic rate. We designed an HOC-based Web cache that extends the Apache Web server and remarkably improves the performance of Apache farms with no modification to the Apache core code.

CiteSeerX

Crossref

Archivio della Ricerca - Università di Pisa

UnipiEprints

Porting Decision Tree Algorithms to Multicore using FastFlow

Author: Marco Aldinucci
Massimo Torquati
Salvatore Ruggieri
Publication date: 01/01/2010

The whole computer hardware industry has embraced multicores. For these machines, the extreme optimisation of sequential algorithms is no longer sufficient to squeeze out the real machine power, which can only be exploited via thread-level parallelism. Decision tree algorithms exhibit natural concurrency that makes them suitable for parallelisation. This paper presents an approach for easy-yet-efficient porting of an implementation of the C4.5 algorithm to multicores. The parallel porting requires minimal changes to the original sequential code, and it is able to achieve up to 7X speedup on an Intel dual-quad core machine.

Crossref

Archivio della Ricerca - Università di Pisa

UnipiEprints

Elastic-PPQ: A two-level autonomic system for spatial preference query processing over dynamic data streams

Author: Massimo Torquati
Gabriele Mencagli
Marco Danelutto
Publication date: 01/01/2018

Paradigms like the Internet of Things and the more recent Internet of Everything are shifting attention towards systems able to process unbounded sequences of items in the form of data streams. In the real world, data streams may be highly variable, exhibiting burstiness in the arrival rate and non-stationarities such as trends and cyclic behaviors. Furthermore, input items may not be ordered according to timestamps. This raises the complexity of stream processing systems, which must support elastic resource management and autonomic QoS control through sophisticated strategies and run-time mechanisms. In this paper we present Elastic-PPQ, a system for processing spatial preference queries over dynamic data streams. The key aspect of the system design is the existence of two adaptation levels handling workload variations at different time-scales. To address fast time-scale variations we design a fine regulatory mechanism of load balancing supported by a control-theoretic approach. The logic of the second adaptation level, targeting slower time-scale variations, is incorporated in a Fuzzy Logic Controller that makes scale in/out decisions on the system parallelism degree. The approach has been successfully evaluated on synthetic and real-world datasets.

Crossref

ZENODO

Archivio della Ricerca - Università di Pisa

A DSL based toolchain for design space exploration in structured parallel programming

Author: Peter Kilpatrick
Marco Danelutto
Massimo Torquati
Publication date: 01/01/2016

We introduce a DSL based toolchain supporting the design of parallel applications where parallelism is structured after parallel design pattern compositions. The DSL makes it possible to write high level parallel design pattern expressions representing the structure of parallel applications, to refactor the pattern expressions, to evaluate their non-functional properties (e.g. ideal performance, total parallelism degree, etc.) and finally to generate parallel code ready to be compiled and run on different target architectures. We discuss a proof-of-concept prototype implementation of the proposed toolchain generating FastFlow code and show some preliminary results achieved using the prototype implementation.

Queen's University Belfast Research Portal

Elsevier - Publisher Connector

Crossref

Archivio della Ricerca - Università di Pisa