1,720,988 research outputs found
PROV-XML: The PROV XML Schema
Provenance is information about entities, activities, and people involved in producing a piece of data or thing, which can be used to form assessments about its quality, reliability or trustworthiness. PROV-DM is the conceptual data model that forms a basis for the W3C provenance (PROV) family of specifications. It defines a concepts for expressing provenance information enabling interchange. This document introduces an XML schema for the PROV data model (PROV-DM), allowing instances of the PROV data model to be serialized in XML
U.S. Global Change Research Program National Climate Assessment Global Change Information System
The program: a) Coordinates Federal research to better understand and prepare the nation for global change. b) Priori4zes and supports cutting edge scientific work in global change. c) Assesses the state of scientific knowledge and the Nation s readiness to respond to global change. d) Communicates research findings to inform, educate, and engage the global community
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation
counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings
are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that
only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into
account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
Enabling Reproducibility of Scientific Data Flows Through Tracking and Representation of Provenance
Reproducibility of results is a key tenet of science. Some modern scientific domains, such as Earth Science, have become computationally complicated and, particularly with the advent of higher resolution space based remote sensing platforms, tremendously data intensive. Over the last few decades, these complexities along with the the rapid advancement of the state of the art confound the goal of scientific transparency. This thesis explores concepts of data identification, organization, equivalence and reproducibility for such data intensive scientific processing. It presents a conceptual model useful for describing and representing data provenance suitable for very precise data and processing identification. It presents algorithms for creating and maintaining precise dataset membership and provenance equivalence at various degrees of granularity and data aggregation. This model will be described and demonstrated first with a simple example, then in a more complicated example based on the real-world operational scenario of NASA Ozone Monitoring Instrument data processing system. Application of the model will allow more specific data citations in scientific literature based on large datasets and the data provenance equivalence. Our provenance representations will enable independent reproducibility required by scientific transparency. Increasing transparency will contribute to understanding, and ultimately, credibility of scientific results
Formal Provenance Representation of the Data and Information Supporting the National Climate Assessment
The Global Change Information System (GCIS) provides a framework for the formal representation of structured metadata about data and information about global change. The pilot deployment of the system supports the National Climate Assessment (NCA), a major report of the U.S. Global Change Research Program (USGCRP). A consumer of that report can use the system to browse and explore that supporting information. Additionally, capturing that information into a structured data model and presenting it in standard formats through well defined open inter- faces, including query interfaces suitable for data mining and linking with other databases, the information becomes valuable for other analytic uses as well
- …
