Search CORE

311 research outputs found

WPI: Markov Logic Network-Based Statistical Predicate Invention

Author: Fatiguso G.
Giuseppe Fatiguso
FERILLI Stefano
Stefano Ferilli
Publication venue
Publication date: 01/01/2015
Field of study

Crossref

Archivio istituzionale della ricerca - Università di Bari

A Sentence Structure-based Approach to Unsupervised Author Identification

Author: FERILLI Stefano
Publication venue
Publication date: 01/01/2016
Field of study

Assessing whether two documents were written by the same author is a crucial task, especially in the Internet age, with possible applications to philology and forensics. The problem has been tackled in the literature by exploiting frequency-based approaches, numeric techniques or writing style analysis. Focusing on this last perspective, this paper proposes a novel technique that takes into account the structure of sentences, assuming that it is strictly related to the author's writing style. Specifically, a (collection of) text(s) in natural language written by a given author is translated into a set of First-Order Logic descriptions, and a model of the author's writing habits is obtained as the result of clustering these descriptions. Then, if an overlapping exists between the models of a known author and of an unknown one, the conclusion can be drawn that they are the same person. Among the advantages of this approach, it does not need a training phase, and performs well also on short texts and/or small collections

Archivio istituzionale della ricerca - Università di Bari

Unsupervised author identification and characterization

Author: Redavid D.
REDAVID DOMENICO
ESPOSITO Floriana
Esposito F
FERILLI Stefano
Stefano Ferilli
Ferilli S.
Domenico Redavid
Floriana Esposito
Publication venue
Publication date: 01/01/2016
Field of study

Author identification is a hot topic, especially in the Internet age. Following our previous work in which we proposed a novel approach to this problem, based on relational representations that take into account the structure of sentences, here we present a tool that computes and visualizes a numerical and graphical characterization of the authors/texts based on several linguistic features. This tool, that extends a previous language analysis tool, is the ideal complement to the author identification technique, that is based on a clustering procedure whose outcomes (i.e., the authors’ models) are not human-readable. Both approaches are unsupervised, which allows them to tackle problems to which other state-of-the-art systems are not applicable

Crossref

Archivio istituzionale della ricerca - Università di Bari

Preface

Author: Ferro N.
FERILLI Stefano
Publication venue
Publication date: 01/01/2015
Field of study

Archivio istituzionale della ricerca - Università di Bari

A study on the Classification of Layout Components for Newspapers

Author: D. Redavid
F. Esposito
S. Ferilli
Stefano Ferilli
Floriana Esposito
Domenico Redavid
Publication venue
Publication date: 01/01/2017
Field of study

Crossref

Archivio istituzionale della ricerca - Università di Bari

A Domain Based Approach to Information Retrieval in Digital Libraries

Author: Rotella F
S. Ferilli
FERILLI Stefano
F. Rotella
Stefano Ferilli
Fabio Leuzzi
Leuzzi F.
Fulvio Rotella
F. Leuzzi
Publication venue
Publication date: 01/01/2012
Field of study

The current abundance of electronic documents requires automatic techniques that support the users in understanding their content and extracting useful information. To this aim, improving the retrieval performance must necessarily go beyond simple lexical interpretation of the user queries, and pass through an understanding of their semantic content and aims. It goes without saying that any digital library would take enormous advantage from the availability of eective Information Retrieval techniques to provide to their users. This paper proposes an approach to Information Retrieval based on a correspondence of the domain of discourse between the query and the documents in the repository. Such an association is based on standard general-purpose linguistic resources (WordNet and WordNet Domains) and on a novel similarity assessment technique. Although the work is at a preliminary stage, interesting initial results suggest to go on extending and improving the approach

CiteSeerX

Crossref

Archivio istituzionale della ricerca - Università di Bari

Improving Speech-Based Human Robot Interaction with Emotion Recognition.

Author: Berardina De Carolis
PALESTRA GIUSEPPE CARLO
FERILLI Stefano
DE CAROLIS Berardina
Stefano Ferilli
Giuseppe Palestra
Publication venue
Publication date: 01/01/2015
Field of study

Crossref

Archivio istituzionale della ricerca - Università di Bari

Integration Strategy and Tool between Formal Ontology and Graph Database Technology

Author: Stefano Ferilli
Ferilli S.
Publication venue
Publication date: 01/01/2021
Field of study

Ontologies, and especially formal ones, have traditionally been investigated as a means to formalize an application domain so as to carry out automated reasoning on it. The union of the terminological part of an ontology and the corresponding assertional part is known as a Knowledge Graph. On the other hand, database technology has often focused on the optimal organization of data so as to boost efficiency in their storage, management and retrieval. Graph databases are a recent technology specifically focusing on element-driven data browsing rather than on batch processing. While the complementarity and connections between these technologies are patent and intuitive, little exists to bring them to full integration and cooperation. This paper aims at bridging this gap, by proposing an intermediate format that can be easily mapped onto the formal ontology on one hand, so as to allow complex reasoning, and onto the graph database on the other, so as to benefit from efficient data handling

Multidisciplinary Digital Publishing Institute

Archivio istituzionale della ricerca - Università di Bari

Learning to Recognize Critical Cells in Document Tables

Author: ESPOSITO Floriana
Nicola Di Mauro
DI MAURO NICOLA
FERILLI Stefano
Stefano Ferilli
Floriana Esposito
Publication venue
Publication date: 01/01/2012
Field of study

Tables are among the most informative components of documents, because they are exploited to compactly and intuitively represent data, typically for understandability purposes. The needs are to identify and extract tables from documents, and, on the other hand, to be able to extract the data they contain. The latter task involves the understanding of a table structure. Due to the variability in style, size, and aims of tables, algorithmic approaches to this task can be insufficient, and the exploitation of machine learning systems may represent an effective solution. This paper proposes the exploitation of a first-order logic representation, that is able to capture the complex spatial relationships involved in a table structure, and of a learning system that can mix the power of this representation with the flexibility of statistical approaches. The obtained encouraging results suggest further investigation and refinement of the proposal

Crossref

Archivio istituzionale della ricerca - Università di Bari

Foundations of Intelligent Systems

Author: F. Esposito O. Pivert, M.-S. Hacid, Z.W. Rás & S. Ferilli
Esposito Floriana
Pivert O.
Ferilli Stefano
Hacid M. S.
Rás Z. W.
Publication venue
Publication date: 01/01/2015
Field of study

Archivio istituzionale della ricerca - Università di Bari