1,720,979 research outputs found
Automatic Music Synchronization Using Partial Score Representation Based on IEEE 1599
This article proposes the new algorithm ParSi, for Partial Synchronization, which aligns different PCM executions with a partial music score representation. While in general most synchronization algorithms require a complete score representation, this algorithm is able to synchronize different PCM audio executions with a partial music score codified in MIDI. Therefore, only one MIDI instrument - chosen by user - is needed for audio-score alignment. The MIDI file is selected by
taking into account the related IEEE 1599 file. Audio analysis is performed using a notch filter. A set of convenient parameters is used during the decisional phase. Analysis of synchronization results is provided, showing that it possible to have a good automatic synchronization, even if just a partial score is available
Automatic Alignment between Partial Music Score Representation Using IEEE 1599 and Audio Executions
Digital Audio Processing in MP3 Compressed Domain and Evaluation of Perceived Audio Quality
The state of the art provides several digital audio signal processing in uncompressed domain
(PCM Pulse Code Modulation). We can found several works in the literature which
explain different methods to modify an audio signal both in the time and in the frequency
domain, in order to normalize the intensity, or to apply filters, special effects and so on.
Currently the MP3 format has not been deeply considered by literature. The most meaningfully
works are related to MP1 and MP2 formats. There is not a exhaustive formalization
of the digital audio signal processing in the MP3 compressed domain. Furthermore, there
is not a software framework which allow to develop and implement every kind of processing
algorithm in MP3 compressed domain. There are only some simple software which
directly split and wrap MP3 files and process the volume in a very simple way.
In this dissertation we define different approaches to develop every kind of algorithm for
digital signal processing in the MP3 compressed domain. The contributions of this dissertation
are the formalization the problem of MP3 direct processing defining different approaches
(or levels), with respect to the various steps of the decoding/encoding phases, the
development of algorithms for the MP3 format working as nearest as possible to the MP3
domain, and the improvement and the customization of methods and protocol described
in the recommendation of the International Telecommunication Union (ITU-R) to evaluate
the objective and subjective perceived audio quality.
We define three different domain where it is possible to manage MP3-coded audio information.
We develope algorithms to moving the frame, control the gain by RMS, filter and the channel selection. Filters and channel selection have been developed to downgrade
MP3 files. For each algorithm we have chosen the best approach, finding the best trade-off
among time consumption, perceived audio quality and problems related to unmasking and
aliasing. This formalization represents the base concepts for the development of a software
framework which allows the implementation of every kind of algorithm from the PCMdomain
to the MP3-domain.
Finally we improve and customize the methods and the protocol to evaluate the objective
and subjective perceived audio quality, described in the recommendation of the International
Telecommunication Union (ITU-R). We evaluate the objective performance of modern
MP3 codec with respect to tandem coding. We study the level of reliable of objective
tests, comparing them with the subjective. We compare the MP3-coded audio processed
both following the traditional and the direct approach to editing
Tempo Induction Algorithm in MP3 Compressed Domain
In this paper we propose a template matching algorithm to address tempo tracking problem in MP3 domain. The algorithm is based on MP3 Window-Switching Pattern (WSP) only. This means that no frequency analysis is performed by the program itself. Because the WSP is structured coherently with the drums line it is possible to compare this pattern with a simple metronome template. Experimental results are presented for a range of different musical styles, including rock, jazz, and popular songs with a variety of BPM and time signature. A part of the experimentation is dedicated to analyze the train set proposed for MIREX 2006. A computational cost analysis is presented too
Automatic synchronisation between audio and score musical description layers
This work describes algorithms dedicated to score and audio alignment using the MX / IEEE P1599 format. The format allows description of the score, and management of synchronisation points, linking them with different versions of the performed music. An algorithm is proposed here that allows alignment of an MX score and its execution, coded in PCM format, which produces an output for the MX Spine that contains synchronisation between notes and audio signal. The proposed architecture is based on two different steps: the first deals with the audio level and the extraction of features like pitch, onset and the like, while the second determines the alignment between the features extracted in the first step and the notes present in the MX score. For each step, different algorithms are proposed and discussed, and analysis and comparison of synchronisation capabilities are provided
MP3 window-switching pattern preliminary analysis for general purposes beat tracking on Music with Drums
This work analyzes the dependency of the window-switching pattern versus: different encoders, bit rates and encoder quality features. We propose a simple template-matching algorithm to solve beat tracking contest in music with drums. This algorithm uses windows-switching pattern information only. Commonly in a beat-tracking system the window-switching pattern is used to refine the results of a frequency evaluation. Furthermore, this paper wants to demonstrates the reliability of the window-switching pattern to solve beat-tracking problem in music with drums independently from encoders, bit rates, encoders quality features and frequency analysis. This paper confirms the window-switching pattern is adequate information in a beat-tracking contest at every bit rate and for every encoder
- …
