1,721,087 research outputs found
Providing tool support for unit testing in eXtreme Design
The evaluation of an ontology is a crucial and occasionally overlooked step of the ontology development process. The evaluation can happen in different stages of the development, on different layers, such as lexical, taxonomic, semantic relations, context/application, syntactic, and structural layers, and by different people’s roles, depending on the ontology development methodology. A widespread issue regarding the evaluation of ontologies is the lack of tools to support these methodologies. In the eXtreme Design methodology, the ontology evaluation is a central part of the process, which focuses on assessing whether the requirements, either data-driven or story-driven, have been fulfilled by the ontology module. Momentarily, there is only a jar that provides support for the execution of test cases, but there is no support for other aspects of the evaluation process. This paper presents the progress of the development of tool support which provides semi-automated management for unit testing of owl ontologies on GitHub by means of developed-from-scratch actions and based on the Continuous Integration practice. The fundamental features that are currently developed in the tool are: 1) Setup of the testing environment, 2) Crosscheck and parsing of the ontology tester input, 3) Construction and automatic execution of the unit test, and 4) Documentation of the unit test. The evaluation of the tool itself has been prepared and we expect to have a list of new functional and non-functional requirements as well as identification of bugs or refinement of the existing features
Identity of Resources and Entities on the Web
One of the main strengths of the Web is that it allows any party of its global community to share information with any other party. This goal has been achieved by making use of a unique and uniform mechanism of identification, the uniform resource identifiers (URI). Although URIs succeed when used for retrieving resources on the Web, their suitability for identifying any kind of thing, for example, resources that are not on the Web, is not guaranteed. In this article we investigate the meaning of the identity of a Web resource, and how the current situation, as well as existing and possible future improvements, can be modeled and implemented on the Web. In particular, we propose an ontology, IRE, that provides a formal way to model both the problem and the solution spaces. IRE describes the concept of resource from the viewpoint of the Web, by reusing an ontology of information objects, built on top of DOLCE+ and its extensions. In particular, we formalize the concept of Web resource, as distinguished from the concept of a generic entity, and how those and other concepts are related, for example, by different proxy for relations. Based on the analysis formalized in IRE, we propose a formal pattern for modeling and comparing different solutions to the problems of the identity of resources
The identity of resources on the Web: An ontology for Web architecture
One of the major events that has caused a resurgence in the use of formal ontologies is the advent of the Semantic Web, which seeks to do for knowledge representation what the Web did for hypertext. Yet while the field of formal ontologies is well-understood, the nature of the Web is rather surprisingly cloaked in mystery. Unlike formal computer science, the Web is constructed mostly out of informally and operationally defined terms built from various specifications, in particular IETF RFCs and W3C Recommendations. In order to better understand the nature of the 'Web' in the Semantic Web, we created a formal ontology called the 'Identity of Resources on the Web' (IRW) ontology. The primary goal of the Semantic Web is to use URIs as a universal space to name anything, expanding from using URIs for web-pages to URIs for real objects and imaginary concepts, as phrased by Berners-Lee. This distinction has often been tied to the distinction between information resources, such as web-pages and multimedia files, and other kinds of Semantic Web 'non-information' resources used in Linked Data. This issue of defining the relationship between URIs and resources is not a mandarin metaphysical matter, but has technical repercussions: the W3C has recommended not to use the same URI for information resources and the resources needed to denote 'non-information resources' for the Semantic Web. Basing our work on the normative specifications of the W3C and IETF, we model the relationship between resources and representations formally in an ontology called IRW (Identity and Reference on the Web). From our point of view, IRW is a beautiful ontology. In this paper we motivate why we consider it as such through the identification of a number of criteria on which we based our evaluation. © 2011-IOS Press and the authors. All rights reserved
Latent vs Explicit Knowledge Representation: How ChatGPT Answers Questions about Low-Frequency Entities
In this paper, we present an evaluation of two different approaches to the free-form Question Answering (QA) task. The main difference between the two approaches is that one is based on latent representations of knowledge, and the other uses explicit knowledge representation. For the evaluation, we developed DynaKnowledge, a new benchmark composed of questions concerning Wikipedia low-frequency entities. We wanted to ensure, on the one hand, that the questions are answerable and, on the other, that the models can provide information about very specific facts. The evaluation that we conducted highlights that the proposed benchmark is particularly challenging. The best model answers correctly only on 50% of the questions. Analysing the results, we also found that ChatGPT shows low reliance on low-frequency entity questions, manifesting a popularity bias. On the other hand, a simpler model based on explicit knowledge is less affected by this bias. With this paper, we want to provide a living benchmark for open-form QA to test knowledge and latent representation models on a dynamic benchmark
SHELDON: Semantic Holistic framEwork for LinkeD Ontology data
SHELDON is a framework that builds upon a machine reader for extracting RDF graphs from text so that the output is compliant to Semantic Web and Linked Data patterns. It extends the current humanreadable web by using semantic best practices and technologies in a machine-processable form. Given a sentence in any language, it provides different semantic tasks as well as nice visualization tools which make use of the JavaScript infoVis Toolkit and a knowledge enrichment component on top of RelFinder. The system can be freely used at http://wit.istc.cnr.it/stlab-tools/sheldo
Semantic role labeling for knowledge graph extraction from text
This paper introduces TakeFive, a new semantic role labeling method that transforms a text into a frame-oriented knowledge graph. It performs dependency parsing, identifies the words that evoke lexical frames, locates the roles and fillers for each frame, runs coercion techniques, and formalizes the results as a knowledge graph. This formal representation complies with the frame semantics used in Framester, a factual-linguistic linked data resource. We tested our method on the WSJ section of the Peen Treebank annotated with VerbNet and PropBank labels and on the Brown corpus. The evaluation has been performed according to the CoNLL Shared Task on Joint Parsing of Syntactic and Semantic Dependencies. The obtained precision, recall, and F1 values indicate that TakeFive is competitive with other existing methods such as SEMAFOR, Pikes, PathLSTM, and FRED. We finally discuss how to combine TakeFive and FRED, obtaining higher values of precision, recall, and F1 measure
The practice of self-citations: a longitudinal study
In this article, we discuss the outcomes of an experiment where we analysed whether and to what extent the introduction, in 2012, of the new research assessment exercise in Italy (a.k.a. Italian Scientific Habilitation) affected self-citation behaviours in the Italian research community. The Italian Scientific Habilitation attests to the scientific maturity of researchers and in Italy, as in many other countries, is a requirement for accessing to a professorship. To this end, we obtained from ScienceDirect 35,673 articles published from 1957 to 2016 by the participants to the 2012 Italian Scientific Habilitation, that resulted in the extraction of 1,379,050 citations retrieved through Semantic Publishing technologies. Our analysis showed an overall increment in author self-citations (i.e. where the citing article and the cited article share at least one author) in several of the 24 academic disciplines considered. However, we depicted a stronger causal relation between such increment and the rules introduced by the 2012 Italian Scientific Habilitation in 10 out of 24 disciplines analysed
Ontology Design Patterns
Computational ontologies in the context of information systems are artifacts that encode a description of some world, for some purpose. Under the assumption that there exist classes of problems that can be solved by applying common solutions (as it has been experienced in software engineering), we envision small, task-oriented ontologies with explicit documentation of design rationales. In this chapter, we describe components called Ontology Design Patterns (OP), and methods that support pattern-based ontology design.
We present a typology of OPs, and then focus on Content Ontology Design Patterns in terms of their background, definition, communication means, related work beyond ontology engineering, exemplification, creation, and usage principles. At the time of chapter’s final version, recently performed experiments of patternbased ontology design show remarkable quality improvement within some sample ontology design projects, specially in terms of compliance to tasks expressed as competency questions or scenarios
The Computational Ontology Perspective: Design Patterns for Web Ontologies
Ontologies in the computational world, and especially in the semantic web, are artifacts that are designed in order to make application requirements achievable. Coupling requirements to ontology design solutions is key: a design-oriented view of ontologies is presented, by introducing the Ontology Design Patterns initiative, and the eXtreme ontology Design methods. The application to a legal use case is exemplified. Several references to related work and state-of-art languages and practices are provided
An ontology design pattern for representing recurrent events
In this paper we describe an Ontology Design Pattern for modeling events that recur regularly over time and share some invariant factors, which unify them conceptually. The proposed pattern appears to be foundational, since it models the top-level domain-independent concept of recurrence, as applied to a series of events: we refer to this type of events as recurrent event series. The pattern relies on existing patterns, i.e. Collection, Situation, Classification, Sequence. Indeed, a recurrent event is represented as both a collection of events and a situation in which these events are contextualized and unified according to one or more properties that are peculiar to each event, and occur at regular intervals. We show how this pattern has been used in the context of ArCo, the Knowledge Graph of Italian cultural heritage, in order to model recurrent cultural events, festivals, ceremonies
- …
