1,720,972 research outputs found

Detecting Dangerous Behaviors of Mobile Objects in Parking Areas

Authors: Foresti GL, Giacinto G, Roli F
Publication date: 01/01/2003

In the last decade, video-surveillance systems have been developed for monitoring remote environments in order to detect and prevent dangerous situations. Until a few years ago, surveillance was performed entirely by human operators, who interpreted the visual information presented to them on one or more monitors.

Archivio istituzionale della ricerca - Università di Genova

Detecting Dangerous Behaviors of Mobile Objects in Parking Areas

Authors: Foresti GL, Roli Fabio, Giacinto Giorgio
Publication date: 01/01/2003

Archivio istituzionale della ricerca - Università di Cagliari

3D object recognition by integration of associative and symbolic techniques

Authors: Murino V, Regazzoni Carlo, Foresti GL, Zunino Rodolfo
Publication date: 01/01/1992

Archivio istituzionale della ricerca - Università di Genova

Exploiting data diversity in multi-domain federated learning

Authors: Foresti GL, Ahmad Madni H, Muhammad Umer R
Publication date: 01/01/2024

Federated learning (FL) is an evolving machine learning technique that allows collaborative model training without sharing the original data among participants. In real-world scenarios, data residing at multiple clients are often heterogeneous in terms of resolution, magnification, scanner, or imaging protocol, which makes convergence of the global FL model challenging in collaborative training. Most existing FL methods consider data heterogeneity within one domain by assuming the same data variation at each client site. In this paper, we consider data heterogeneity in FL across different domains of heterogeneous data, raising the problems of domain shift, class imbalance, and missing data. We propose a method, multi-domain FL, as a solution to heterogeneous training data from multiple domains by training a robust vision transformer model. We use two loss functions, one for correctly predicting class labels and the other for encouraging similarity and dissimilarity over latent features, to optimize the global FL model. We perform various experiments using different convolution-based networks and non-convolutional Transformer architectures on multi-domain datasets. We evaluate the proposed approach on benchmark datasets and compare it with existing FL methods. Our results show that the proposed approach yields a more robust global FL model than the existing methods.

Archivio istituzionale della ricerca - Università degli Studi di Udine

Special issue on nonlinear signal and image processing - Part I

Authors: Vernazza G, Sicuranza GL, Ramponi G, Foresti GL, Regazzoni C
Publication date: 01/01/2004

Archivio istituzionale della ricerca - Università di Genova

Map-driven image interpretation by associative model indexing

Authors: Murino V, Regazzoni Carlo, Foresti GL, Zunino Rodolfo
Publication date: 01/01/1991

Archivio istituzionale della ricerca - Università di Genova

A late fusion deep neural network for robust speaker identification using raw waveforms and gammatone cepstral coefficients

Authors: Carlo Drioli, Gian Luca Foresti, Daniele Salvati
Publication date: 01/01/2023

Speaker identification aims at determining the speaker identity by analyzing voice characteristics, and typically relies on statistical models or machine learning techniques. Frequency-domain features are by far the most used choice to encode the audio input in sound recognition. Recently, some studies have also analyzed the use of the time-domain raw waveform (RW) with deep neural network (DNN) architectures. In this paper, we hypothesize that both time-domain and frequency-domain features can be used to increase the robustness of the speaker identification task in adverse noise and reverberation conditions, and we present a method based on a late fusion DNN using RWs and gammatone cepstral coefficients (GTCCs). We analyze the characteristics of RW and spectrum-based short-time features, reporting advantages and limitations, and we show that their joint use can increase the identification accuracy. The proposed late fusion DNN model consists of two independent DNN branches made primarily of convolutional neural network (CNN) and fully connected neural network (NN) layers. The two DNN branches take as input short-time RW audio fragments and GTCCs, respectively. The late fusion is computed on the predicted scores of the DNN branches. Since the method is based on short segments, it has the advantage of being independent of the size of the input audio signal, and the identification task can be computed by summing the predicted scores over several short-time frames. Analysis of speaker identification performance computed with simulations shows that the late fusion DNN model improves the accuracy rate in adverse noise and reverberation conditions in comparison to the RW, GTCC, and mel-frequency cepstral coefficient (MFCC) features. Experiments with real-world speech datasets confirm the efficiency of the proposed method, especially with small-size audio samples.

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine
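
The score-level fusion described in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name late_fusion_identify is hypothetical, the per-frame scores are assumed to be already produced by the two branches (RW and GTCC), and combining the branches by simple addition is an assumption, since the abstract only states that fusion is computed on the predicted scores and that scores are summed over short-time frames.

```python
def late_fusion_identify(rw_scores, gtcc_scores):
    """Pick a speaker from per-frame scores of two branches.

    rw_scores, gtcc_scores: lists of frames, each frame a list with one
    score per candidate speaker (hypothetical inputs standing in for the
    outputs of the raw-waveform and GTCC branches).
    Returns (index of best speaker, accumulated score per speaker).
    """
    if not rw_scores or len(rw_scores) != len(gtcc_scores):
        raise ValueError("both branches need the same non-zero number of frames")

    n_speakers = len(rw_scores[0])
    totals = [0.0] * n_speakers

    for rw_frame, gtcc_frame in zip(rw_scores, gtcc_scores):
        if len(rw_frame) != n_speakers or len(gtcc_frame) != n_speakers:
            raise ValueError("every frame needs one score per speaker")
        for k in range(n_speakers):
            # Assumption: late fusion as the sum of the two branch scores,
            # accumulated over all short-time frames.
            totals[k] += rw_frame[k] + gtcc_frame[k]

    best = max(range(n_speakers), key=totals.__getitem__)
    return best, totals


if __name__ == "__main__":
    # Two frames, two candidate speakers (made-up scores for illustration).
    rw = [[0.5, 0.5], [0.25, 0.75]]
    gtcc = [[0.25, 0.75], [0.5, 0.5]]
    speaker, totals = late_fusion_identify(rw, gtcc)
    print(speaker, totals)
```

Because the decision is an accumulation over frames, the same function works for any number of frames, which mirrors the abstract's point that the method does not depend on the length of the input signal.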

New Error Measures To Evaluate Features on Three Dimensional Scenes

Authors: Fabio Bellavia, Domenico Tegolo
Publication date: 01/01/2011

In this paper, new error measures to evaluate image features in three-dimensional scenes are proposed and reviewed. The proposed error measures are designed to take feature shapes into account, and ground truth data can be easily estimated. Like other approaches, they are not error-free, and a quantitative evaluation is given according to the number of wrong matches and mismatches in order to assess their validity.

Crossref

Archivio istituzionale della ricerca - Università di Palermo

Dissimilarity representation in multi-feature spaces for image retrieval

Authors: Giorgio Giacinto, Luca Piras
Publication date: 01/01/2011

Crossref

Archivio istituzionale della ricerca - Università di Cagliari

Impairments in Decoding Facial and Vocal Emotional Expressions in High Functioning Autistic Adults and Adolescents.

Authors: Fortunati L, Bourbakis N, Esposito A, Foresti GL, Cirillo I, Esposito AM, Escalera S
Publication date: 01/01/2020

Archivio Istituzionale della Ricerca - Università degli Studi della Campania "Luigi Vanvitelli"