1,721,110 research outputs found
XML Data Mining
XML is the standard language for representing semi-structured data. With the spreading of XML sources, mining XML data can be an important objective in the near future. This paper presents a project focussed on designing a general-purpose query language in support of mining XML data. In our framework, raw data, mining models and domain knowledge are represented by way of XML documents and stored inside XML native databases. Data mining tasks are expressed in an extension of XQuery. Special attention is given to the frequent pattern discovery problem, and a way of exploiting domain-dependent optimizations and efficient data structures as deeper as possible in the extraction process is presented. We report the results of a first bunch of experiments, showing that a good trade-off between expressiveness and efficiency in XML data mining is not a chimera
Knowledge Discovery from spatial transactions
We propose a general mechanism to represent the spatial transactions in a way that allows the use of the existing data mining methods. Our proposal allows the analyst to exploit the layered structure of geographical information systems in order to define the layers of interest and the relevant spatial relations among them. Given a reference object, it is possible to describe its neighborhood by considering the attribute of the object itself and the objects related by the chosen relations. The resulting spatial transactions may be either considered like “traditional” transactions, by considering only the qualitative spatial relations, or their spatial extension can be exploited during the data mining process. We explore both these cases. First we tackle the problem of classifying a spatial dataset, by taking into account the spatial component of the data to compute the statistical measure (i.e., the entropy) necessary to learn the model. Then, we consider the task of extracting spatial association rules, by focusing on the qualitative representation of the spatial relations. The feasibility of the process has been tested by implementing the proposed method on top of a GIS tool and by analyzing real world data
Extracting Spatial Assosiation Rules from Spatial Transactions
Georeferenced information is growing every day, and geographical information systems are becoming crucial in many decision processes. As a consequence, extracting knowledge from GIS's may have an important impact. The paper presents a general approach for extracting sets of spatial transactions from GIS's, and for applying data mining algorithms to them. As a basic example of the process we present the extraction of spatial association rules from georeferenced data
MQL: An Algebraic Query Language for Knowledge Discovery
MQL is a system supporting the process of Knowledge Discovery. The central step of knowledge discovery, i.e. the application and combination of data mining steps, is expressed via queries written in an algebraic query language. The query processing engine exploits an XML based representation of queries and data mining models to favor the interoperability of different data mining tools and the expandibility of the system
- …
