1,720,988 research outputs found

    PROV-XML: The PROV XML Schema

    No full text
    Provenance is information about entities, activities, and people involved in producing a piece of data or thing, which can be used to form assessments about its quality, reliability or trustworthiness. PROV-DM is the conceptual data model that forms a basis for the W3C provenance (PROV) family of specifications. It defines a concepts for expressing provenance information enabling interchange. This document introduces an XML schema for the PROV data model (PROV-DM), allowing instances of the PROV data model to be serialized in XML

    U.S. Global Change Research Program National Climate Assessment Global Change Information System

    Full text link
    The program: a) Coordinates Federal research to better understand and prepare the nation for global change. b) Priori4zes and supports cutting edge scientific work in global change. c) Assesses the state of scientific knowledge and the Nation s readiness to respond to global change. d) Communicates research findings to inform, educate, and engage the global community

    Going Beyond Counting First Authors in Author Co-citation Analysis

    Full text link
    The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

    Provenance Challenges for Earth Science Dataset Publication

    Full text link
    No abstract availabl

    Preservation Strategies: Intro to the OAIS Reference Model

    Full text link
    No abstract availabl

    Enabling Reproducibility of Scientific Data Flows Through Tracking and Representation of Provenance

    No full text
    Reproducibility of results is a key tenet of science. Some modern scientific domains, such as Earth Science, have become computationally complicated and, particularly with the advent of higher resolution space based remote sensing platforms, tremendously data intensive. Over the last few decades, these complexities along with the the rapid advancement of the state of the art confound the goal of scientific transparency. This thesis explores concepts of data identification, organization, equivalence and reproducibility for such data intensive scientific processing. It presents a conceptual model useful for describing and representing data provenance suitable for very precise data and processing identification. It presents algorithms for creating and maintaining precise dataset membership and provenance equivalence at various degrees of granularity and data aggregation. This model will be described and demonstrated first with a simple example, then in a more complicated example based on the real-world operational scenario of NASA Ozone Monitoring Instrument data processing system. Application of the model will allow more specific data citations in scientific literature based on large datasets and the data provenance equivalence. Our provenance representations will enable independent reproducibility required by scientific transparency. Increasing transparency will contribute to understanding, and ultimately, credibility of scientific results

    Formal Provenance Representation of the Data and Information Supporting the National Climate Assessment

    Full text link
    The Global Change Information System (GCIS) provides a framework for the formal representation of structured metadata about data and information about global change. The pilot deployment of the system supports the National Climate Assessment (NCA), a major report of the U.S. Global Change Research Program (USGCRP). A consumer of that report can use the system to browse and explore that supporting information. Additionally, capturing that information into a structured data model and presenting it in standard formats through well defined open inter- faces, including query interfaces suitable for data mining and linking with other databases, the information becomes valuable for other analytic uses as well

    Data Provenance for Distributed Data Sets

    Full text link
    No abstract availabl
    corecore