Search CORE

1,721,062 research outputs found

Train-O-Matic: Supervised Word Sense Disambiguation with no (manual) effort

Author: Navigli R.
Pasini T.
Publication venue
Publication date: 01/01/2020
Field of study

Word Sense Disambiguation (WSD) is the task of associating the correct meaning with a word in a given context. WSD provides explicit semantic information that is beneficial to several downstream applications, such as question answering, semantic parsing and hypernym extraction. Unfortunately, WSD suffers from the well-known knowledge acquisition bottleneck problem: it is very expensive, in terms of both time and money, to acquire semantic annotations for a large number of sentences. To address this blocking issue we present Train-O-Matic, a knowledge-based and language-independent approach that is able to provide millions of training instances annotated automatically with word meanings. The approach is fully automatic, i.e., no human intervention is required, and the only type of human knowledge used is a task-independent WordNet-like resource. Moreover, as the sense distribution in the training set is pivotal to boosting the performance of WSD systems, we also present two unsupervised and language-independent methods that automatically induce a sense distribution when given a simple corpus of sentences. We show that, when the learned distributions are taken into account for generating the training sets, the performance of supervised methods is further enhanced. Experiments have proven that Train-O-Matic on its own, and also coupled with word sense distribution learning methods, lead a supervised system to achieve state-of-the-art performance consistently across gold standard datasets and languages. Importantly, we show how our sense distribution learning techniques aid Train-O-Matic to scale well over domains, without any extra human effort. To encourage future research, we release all the training sets in 5 different languages and the sense distributions for each domain of SemEval-13 and SemEval-15 at http://trainomatic.org

Archivio della ricerca- Università di Roma La Sapienza

CLEF 2013: Information Access Evaluation meets Multilinguality, Multimodality, and Visualization

Author: Müller H.
Paredes Palacios Roberto
Müller Henning
Paredes R.
Stein B.
Stein Benno
Rosso Paolo
Tufis Dan
Navigli R.
Navigli Roberto
FERRO NICOLA
Rosso P.
Forner Pamela
Forner P.
Tufis D.
Publication venue
Publication date: 01/01/2013
Field of study

Ferro, N.; Forner, P.; Müller, H.; Navigli, R.; Paredes Palacios, R.; Rosso, P.; Stein, B.... (2013). CLEF 2013: Information Access Evaluation meets Multilinguality, Multimodality, and Visualization. ACM SIGIR Forum. 47(2):15-20. doi:10.1145/2568388.2568392S152047

Crossref

RiuNet

Archivio istituzionale della ricerca - Università di Padova

FENICE: Factuality Evaluation of Summarization Based on Natural Language Inference and Claim Extraction

Author: Navigli R.
Ghonim K.
SCIRE'
Publication venue
Publication date: 01/01/2024
Field of study

Recent advancements in text summarization, particularly with the advent of Large Language Models (LLMs), have shown remarkable performance. However, a notable challenge persists as a substantial number of automatically-generated summaries exhibit factual inconsistencies, such as hallucinations. In response to this issue, various approaches for the evaluation of consistency for summarization have emerged. Yet, these newly-introduced metrics face several limitations, including lack of interpretability, focus on short document summaries (e.g., news articles), and computational impracticality, especially for LLM-based metrics. To address these shortcomings, we propose Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction (FENICE), a more interpretable and efficient factuality-oriented metric. FENICE leverages an NLI-based alignment between information in the source document and a set of atomic facts, referred to as claims, extracted from the summary. Our metric sets a new state of the art on AGGREFACT, the de-facto benchmark for factuality evaluation. Moreover, we extend our evaluation to a more challenging setting by conducting a human annotation process of long-form summarization. In the hope of fostering research in summarization factuality evaluation, we release the code of our metric and our factuality annotations of long-form summarization at ity annotations of long-form summarization at https://github.com/Babelscape/FENICE

Archivio della ricerca- Università di Roma La Sapienza

Natural language understanding: instructions for (Present and Future) use

Author: Navigli Roberto
NAVIGLI R.
Roberto Navigli
Publication venue
Publication date: 01/01/2018
Field of study

In this paper I look at Natural Language Understanding, an area of Natural Language Processing aimed at making sense of text, through the lens of a visionary future: what do we expect a machine should be able to understand? and what are the key dimensions that require the attention of researchers to make this dream come true

Crossref

ZENODO

Archivio della ricerca- Università di Roma La Sapienza

Ontology Learning and Its Application to Automated Terminology Translation

Author: Navigli R.
Gangemi A.
Velardi P.
Publication venue
Publication date: 01/01/2003
Field of study

Our OntoLearn system is an infrastructure for automated ontology learning from domain text. It is the only system, as far as we know, that uses natural language processing and machine learning techniques, and is part of a more general ontology engineering architecture. We describe the system and an experiment in which we used a machine-learned tourism ontology to automatically translate multiword terms from English to Italian. The method can apply to other domains without manual adaptation

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

An overview of word and sense similarity

Author: Martelli Federico
Navigli R.
Navigli Roberto
Martelli F.
Publication venue
Publication date: 01/01/2019
Field of study

Over the last two decades, determining the similarity between words as well as between their meanings, that is, word senses, has been proven to be of vital importance in the field of Natural Language Processing. This paper provides the reader with an introduction to the tasks of computing word and sense similarity. These consist in computing the degree of semantic likeness between words and senses, respectively. First, we distinguish between two major approaches: the knowledge-based approaches and the distributional approaches. Second, we detail the representations and measures employed for computing similarity. We then illustrate the evaluation settings available in the literature and, finally, discuss suggestions for future research

ZENODO

Archivio della ricerca- Università di Roma La Sapienza

AAA: Fair evaluation for abuse detection systems wanted

Author: Navigli R.
Calabrese A.
Ross B.
Bevilacqua M.
Tripodi R.
Publication venue
Publication date: 01/01/2021
Field of study

User-generated web content is rife with abusive language that can harm others and discourage participation. Thus, a primary research aim is to develop abuse detection systems that can be used to alert and support human moderators of online communities. Such systems are notoriously hard to develop and evaluate. Even when they appear to achieve satisfactory performance on current evaluation metrics, they may fail in practice on new data. This is partly because datasets commonly used in this field suffer from selection bias, and consequently, existing supervised models overrely on cue words such as group identifiers (e.g., gay and black) which are not inherently abusive. Although there are attempts to mitigate this bias, current evaluation metrics do not adequately quantify their progress. In this work, we introduce Adversarial Attacks against Abuse (AAA), a new evaluation strategy and associated metric that better captures a model's performance on certain classes of hard-to-classify microposts, and for example penalises systems which are biased on low-level lexical features. It does so by adversarially modifying the model developer's training and test data to generate plausible test samples dynamically. We make AAA available as an easy-to-use tool, and show its effectiveness in error analysis by comparing the AAA performance of several state-of-the-art models on multiple datasets. This work will inform the development of detection systems and contribute to the fight against abusive language online

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio della ricerca- Università di Roma La Sapienza

Automatic Ontology Learning: Supporting a Per-Concept Evaluation by Domain Experts

Author: Navigli R.
CUCCHIARELLI ALESSANDRO
Velardi P.
Neri F.
Publication venue
Publication date: 01/01/2004
Field of study

IRIS UniversitÃ Politecnica delle Marche

Universal Semantic Annotator

Author: Navigli R.
Campagnano C.
Conia S.
Orlando R.
Publication venue
Publication date: 01/01/2023
Field of study

Explicit semantic knowledge has often been considered a necessary ingredient to enable the development of intelligent systems. However, current stateof- the-art tools for the automatic extraction of such knowledge often require expert understanding of the complex techniques used in lexical and sentence-level semantics and their linguistic theories. To overcome this limitation and lower the barrier to entry, we present the Universal Semantic Annotator (USeA) ELG pilot project, which offers a transparent way to automatically provide high-quality semantic annotations in 100 languages through state-of-the-art models, making it easy to exploit semantic knowledge in real-world applications

Archivio della ricerca- Università di Roma La Sapienza

Quantitative and Qualitative Evaluation of the Ontolearn Ontology Learning System

Author: NERI F.
CUCCHIARELLI ALESSANDRO
VELARDI P
NAVIGLI R
Publication venue
Publication date: 01/01/2004
Field of study

Ontology evaluation is a critical task, even more so when the ontology is the output of an automatic system, rather than the result of a conceptualisation effort produced by a team of domain specialists and knowledge engineers. This paper provides an evaluation of the OntoLearn ontology learning system. The proposed evaluation strategy is twofold: first, we provide a detailed quantitative analysis of the ontology learning algorithms, in order to compute the accuracy of OntoLearn under different learning circumstances. Second, we automatically generate natural language descriptions of formal concept specifications, in order to facilitate per-concept qualitative analysis by domain specialists

IRIS UniversitÃ Politecnica delle Marche