1,721,022 research outputs found
Beyond Transformers: fault type detection in maintenance tickets with Kernel Methods, Boost Decision Trees and Neural Networks
The proper handling of customer tickets and maintenance requests is pivotal for enterprises as it directly impacts customer satisfaction.
The ability to rapidly and efficiently react and solve reported issues is in fact a key factor from the customers' perspective, resulting in positive feedback for the company, leading to higher economic and brand-image revenues.
The automatic detection of failures from maintenance tickets and support requests can grant faster and more efficient reactions to customers' equipment failures as well as reduced maintenance costs.
The analysis of support and maintenance requests is a well-known problem in Natural Language Processing (NLP).
State-of-the-art solutions in this field rely on Transformers models, pre-trained on large text corpora, and then fine-tuned on the specific downstream task.
However, due to their intrinsic nature, support requests are highly domain-specific and usually similar to short telegraph messages, where the focus is typically encapsulated in short sequences rather than in long dependencies.
Hence, ad-hoc methods for pattern recognition might provide comparable performances with respect to Transformers.
In this work, two alternative approaches are proposed, based on: Kernel methods in conjunction with Boost Decision Trees (SpectrumBoost), and Neural Networks for Multiple Representation Learning (DeepMRL).
These models have been tested and compared against state-of-the-art models on a real-world set of 131305 maintenance tickets in the Italian language, suggesting that the proposed models outperform Transformers both in the prediction accuracy and in the time and computational resources required for their training
Discovery of protein-RNA networks
Coding and non-coding RNAs associate with proteins to perform important functions in the cell. Protein-RNA complexes are essential components of the ribosomal and spliceosomal machinery; they are involved in epigenetic regulation and form non-membrane-bound aggregates known as granules. Despite the functional importance of ribonucleoprotein interactions, the precise mechanisms of macromolecular recognition are still poorly understood. Here, we present the latest developments in experimental and computational investigation of protein-RNA interactions. We compare performances of different algorithms and discuss how predictive models allow the large-scale investigation of ribonucleoprotein associations. Specifically, we focus on approaches to decipher mechanisms regulating the activity of transcripts in protein networks. Finally, the catRAPID omics express method is introduced for the analysis of protein-RNA expression networks
Acoustic emission onset time detection for structural monitoring with U-Net neural network architecture
Acoustic Emission (AE) is a non-destructive structural health monitoring technique, which studies elastic waves emitted during crack formation. Utilizing piezoelectric sensors, these waves are converted into electrical signals for subsequent analysis, offering insights into crack propagation and structural durability. This study focuses on the identification of AE signal onset times, crucial for determining crack locations. Conventional methods often encounter challenges with background noise, prompting the need for innovative approaches. Leveraging a U-Net neural network, specialized in segmentation tasks, onset time identification is approached as a one-dimensional segmentation challenge. Through training and testing on Pencil Lead Break (PLB) test data, commonly used in AE evaluations, the effectiveness of the method is demonstrated even with continuous signals, suggesting potential applicability in real-time monitoring
SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences
BACKGROUND: The large amount of data produced by high-throughput sequencing poses new computational challenges. In the last decade, several tools have been developed for the identification of transcription and splicing factor binding sites. RESULTS: Here, we introduce the SeAMotE (Sequence Analysis of Motifs Enrichment) algorithm for discovery of regulatory regions in nucleic acid sequences. SeAMotE provides (i) a robust analysis of high-throughput sequence sets, (ii) a motif search based on pattern occurrences and (iii) an easy-to-use web-server interface. We applied our method to recently published data including 351 chromatin immunoprecipitation (ChIP) and 13 crosslinking immunoprecipitation (CLIP) experiments and compared our results with those of other well-established motif discovery tools. SeAMotE shows an average accuracy of 80% in finding discriminative motifs and outperforms other methods available in literature. CONCLUSIONS: /nSeAMotE is a fast, accurate and flexible algorithm for the identification of sequence patterns involved in protein-DNA and protein-RNA recognition. The server can be freely accessed at http://s.tartaglialab.com/new_submission/seamote.The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013), through the European Research Council, under grant agreement RIBOMYLOME_309545, and from the Spanish Ministry of Economy and Competitiveness/n(SAF2011-26211). We also acknowledge support from the Spanish Ministry of Economy and Competitiveness, ‘Centro de Excelencia Severo Ochoa 2013–2017’ (SEV-2012-0208
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation
counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings
are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that
only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into
account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
Modeling of the consensus in the allocation of resources in distributed systems
Fil: Agostini, Federico. Universidad Nacional del Nordeste. Facultad de Ciencias Exactas y Naturales y Agrimensura; Argentina.Fil: La Red Martínez, David Luis. Universidad Nacional del Nordeste. Facultad de Ciencias Exactas y Naturales y Agrimensura; Argentina.Fil: Acosta, Julio César. Universidad Nacional del Nordeste. Facultad de Ciencias Exactas y Naturales y Agrimensura; Argentina.When it comes to processes distributed in process nodes that access critical resources shared in the modality of distributed mutual exclusion, it is important to know how these are managed and the order in which the demand for resources is resolved by the processes. Being in a shared environment, it is necessary to comply with certain rules, for instance, access to resources must be achieved through mutual exclusion. In this
work, through an aggregation operator, a consensus mechanism is proposed to establish the order of allocation of resources to the
processes. The consensus is understood as the agreement that must be achieved for the allocation of all the resources requested by each process. To model this consensus, it must be taken into account that the processes can form group of processes or be independent, the state of the nodes where each of them is located,
the computational load, the number of processes, the priorities of the processes, CPU usage, use of main memory, virtual memory,
etc. These characteristics allow the evaluation of the conditions to agree on the order in which allocations of resources to processes will be made
Assignment of Resources in Distributed Systems
In distributed processing systems it is often necessary to coordinate the allocation of shared resources that should be assigned in the processes in the modality of mutual exclusion; in such cases, the order in which the shared resources will be assigned in the processes that require them must be decided; in this paper we propose an aggregation operator (which could be used by a shared resources manager module) that will decide the order of allocation of the resources to the processes considering the requirements of the processes (shared resources) and the state of the distributed nodes where the processes operate (their computational load).Fil: la Red Martínez, David Luis. Universidad Nacional del Nordeste. Facultad de Ciencias Exactas y Naturales y Agrimensura; ArgentinaFil: Acosta, Julio César. Universidad Nacional del Nordeste. Facultad de Ciencias Exactas y Naturales y Agrimensura; ArgentinaFil: Agostini, Federico. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Nordeste. Instituto de Botánica del Nordeste. Universidad Nacional del Nordeste. Facultad de Ciencias Agrarias. Instituto de Botánica del Nordeste; Argentina. Universidad Nacional del Nordeste. Facultad de Ciencias Exactas y Naturales y Agrimensura; Argentin
Variations on the Author
“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
Appropriate Similarity Measures for Author Cocitation Analysis
We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis
- …
