1,721,110 research outputs found

    XML Data Mining

    No full text
    XML is the standard language for representing semi-structured data. With the spreading of XML sources, mining XML data can be an important objective in the near future. This paper presents a project focussed on designing a general-purpose query language in support of mining XML data. In our framework, raw data, mining models and domain knowledge are represented by way of XML documents and stored inside XML native databases. Data mining tasks are expressed in an extension of XQuery. Special attention is given to the frequent pattern discovery problem, and a way of exploiting domain-dependent optimizations and efficient data structures as deeper as possible in the extraction process is presented. We report the results of a first bunch of experiments, showing that a good trade-off between expressiveness and efficiency in XML data mining is not a chimera

    Knowledge Discovery from spatial transactions

    No full text
    We propose a general mechanism to represent the spatial transactions in a way that allows the use of the existing data mining methods. Our proposal allows the analyst to exploit the layered structure of geographical information systems in order to define the layers of interest and the relevant spatial relations among them. Given a reference object, it is possible to describe its neighborhood by considering the attribute of the object itself and the objects related by the chosen relations. The resulting spatial transactions may be either considered like “traditional” transactions, by considering only the qualitative spatial relations, or their spatial extension can be exploited during the data mining process. We explore both these cases. First we tackle the problem of classifying a spatial dataset, by taking into account the spatial component of the data to compute the statistical measure (i.e., the entropy) necessary to learn the model. Then, we consider the task of extracting spatial association rules, by focusing on the qualitative representation of the spatial relations. The feasibility of the process has been tested by implementing the proposed method on top of a GIS tool and by analyzing real world data

    Extracting Spatial Assosiation Rules from Spatial Transactions

    No full text
    Georeferenced information is growing every day, and geographical information systems are becoming crucial in many decision processes. As a consequence, extracting knowledge from GIS's may have an important impact. The paper presents a general approach for extracting sets of spatial transactions from GIS's, and for applying data mining algorithms to them. As a basic example of the process we present the extraction of spatial association rules from georeferenced data

    MQL: An Algebraic Query Language for Knowledge Discovery

    No full text
    MQL is a system supporting the process of Knowledge Discovery. The central step of knowledge discovery, i.e. the application and combination of data mining steps, is expressed via queries written in an algebraic query language. The query processing engine exploits an XML based representation of queries and data mining models to favor the interoperability of different data mining tools and the expandibility of the system
    corecore