Search CORE

1,721,064 research outputs found

Annotation and More Annotation: Some Problems Posed by (and to) Val Tannen

Author: Buneman Peter
Vansummeren Stijn
Publication venue
Publication date: 01/01/2024
Field of study

Among the many research accomplishments of Val Tannen, his work on provenance and semirings is probably the most widely known. In this paper, we discuss questions that arise when applying this general framework to the setting of curated databases, and in particular the setting where we can have multiple annotations on the same data, as well as annotations on annotations

DROPS Dagstuhl Research Online Publication Server

Document Server@UHasselt (Universiteit Hasselt)

Document Server@UHasselt

Provenance Composition in PROV

Author: Buneman Peter
Buneman Peter; id_orcid
Murray-Rust Dave
Moreau Luc
Gascon Caro Adrian
Publication venue
Publication date: 01/01/2017
Field of study

When two communicating processes each record their own provenance,what extra information needs to be recorded in order that a satisfactoryaccount can be given, of the combined process? We propose a setof requirements on (i) the kind of information that the processes can shareand (ii) the kind of queries that should be answerable from the combinedprovenance graph. We describe a solution using PROV

Southampton (e-Prints Soton)

Edinburgh Research Explorer

Provenance Management in Curated Databases

Author: Buneman Peter
Adriane P. Chapman
Buneman Peter; id_orcid
Chapman Adriane
Cheney James
Cheney James; id_orcid
Adriane Chapman
James Cheney
Peter Buneman
Publication venue
Publication date: 01/01/2006
Field of study

Curated databases in bioinformatics and other disciplines are the result of a great deal of manual annotation, correction and transfer of data from other sources. Provenance information concerning the creation, attribution, or version history of such data is crucial for assessing its integrity and scientific value. General purpose database systems provide little support for tracking provenance, especially when data moves among databases. This paper investigates general-purpose techniques for recording provenance for data that is copied among databases. We describe an approach in which we track the user's actions while browsing source databases and copying data into a curated database, in order to record the user's actions in a convenient, queryable form. We present an implementation of this technique and use it to evaluate the feasibility of database support for provenance management. Our experiments show that although the overhead of a na ve approach is fairly high, it can be decreased to an acceptable level using simple optimizations

CiteSeerX

Southampton (e-Prints Soton)

Crossref

Edinburgh Research Explorer

Data citation and the citation graph

Author: Buneman Peter
Dosso Dennis
Silvello Gianmaria
Matteo Lissandrini
Gianmaria Silvello
Dennis Dosso
Lissandrini Matteo; id_orcid
Peter Buneman
Publication venue
Publication date: 01/01/2022
Field of study

The citation graph is a computational artifact that is widely used to represent the domain of published literature. It represents connections between published works, such as citations and authorship. Among other things, the graph supports the computation of bibliometric measures such as h-indexes and impact factors. There is now an increasing demand that we should treat the publication of data in the same way that we treat conventional publications. In particular, we should cite data for the same reasons that we cite other publications.In this paper we discuss what is needed for the citation graph to represent data citation. We identify two challenges: (i) to model the evolution of credit appropriately (through references) over time and (ii) to model data citation not only to a dataset treated as a single object but also to parts of it. We describe an extension of the current citation graph model that addresses these challenges. It is built on two central concepts: citable units and reference subsumption. We discuss how this extension would enable data citation to be represented within the citation graph and how it allows for improvements in current practices for bibliometric computations both for scientific publications and for data.<br/

Directory of Open Access Journals

Catalogo dei prodotti della ricerca Università degli Studi di Verona

VBN (Videnbasen) Aalborg Universitets forskningsportal

A Provenance Model for Manually Curated Data

Author: Buneman Peter
Cheney J
Vansummeren Stijn
Chapman Adriane
Cheney James
Stijn Vansummeren
Buneman P
Adriane Chapman
James Cheney
Chapman A
Peter Buneman
Publication venue
Publication date: 01/01/2006
Field of study

Many curated databases are constructed by scientists integrating various existing data sources. Most current approaches to provenance in databases are based on views and fail to take account of the added value of the work done by scientists in manually creating and modifying data. Capturing provenance in such an environment is a challenging problem, requiring changes in practice, changes to existing software, and crucially, a good model of the process of curation.info:eu-repo/semantics/publishe

Southampton (e-Prints Soton)

Crossref

DI-fusion

Document Server@UHasselt (Universiteit Hasselt)

Document Server@UHasselt

Author Instructions

Author: Instructions Author
Publication venue
Publication date: 04/11/2013
Field of study

Crossref

Cartographic Perspectives (E-Journal - North American Cartographic Information Society, NACIS)

Going Beyond Counting First Authors in Author Co-citation Analysis

Author: Zhao Dangzhi
Publication venue
Publication date: 01/01/2005
Field of study

The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

E-LIS

Variations on the Author

Author: Sayad Cecilia
Publication venue
Publication date: 01/01/2016
Field of study

“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship

Crossref

Kent Academic Repository

Data and Knowledge model:A proposal

Author: Apers Peter M.G.
Houtsma Maurice A.W.
Publication venue
Publication date: 1987
Field of study

University of Twente Research Information

Appropriate Similarity Measures for Author Cocitation Analysis

Author: Waltman L.R.
Eck N.J.P. van
Publication venue
Publication date
Field of study

We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authorsâ€™ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis

Research Papers in Economics