Search CORE

1,721,181 research outputs found

Frequent set mining

Author: GOETHALS Bart
Publication venue
Publication date: 01/01/2005
Field of study

Memory issues in frequent itemset mining

Author: Goethals Bart
Bart Goethals
Publication venue
Publication date: 01/01/2004
Field of study

During the past decade, many algorithms have been proposed to solve the frequent itemset mining problem, i.e. find all sets of items that frequently occur together in a given database of transactions. Although very efficient techniques have been presented, they still suffer from the same problem. That is, they are all inherently dependent on the amount of main memory available. Moreover, if this amount is not enough, the presented techniques are simply not applicable anymore, or significantly need to pay in performance. In this paper, we give a rigorous comparison between current state of the art techniques and present a new and simple technique, based on sorting the transaction database, resulting in a sometimes more efficient algorithm for frequent itemset mining using less memor

Crossref

Institutional Repository Universiteit Antwerpen

Document Server@UHasselt (Universiteit Hasselt)

Document Server@UHasselt

Minimal k-free representations

Author: GOETHALS Bart
Calders Toon
Publication venue
Publication date: 01/01/2003
Field of study

Document Server@UHasselt

8th Pacific-Asia Conference, PAKDD 2004, Sydney, Australia, May 26-28, 2004. Proceedings

Author: GOETHALS Bart
Bonchi F.
Publication venue
Publication date: 01/01/2004
Field of study

In the context of mining frequent itemsets, numerous strategies have been proposed to push several types of constraints within the most well known algorithms. In this paper, we integrate the recently proposed ExAnte data reduction technique within the FP-growth algorithm. Together, they result in a very efficient frequent itemset mining algorithm that effectively exploits monotone constraints

Document Server@UHasselt (Universiteit Hasselt)

Document Server@UHasselt

Quick inclusion-exclusion

Author: GOETHALS Bart
Calders T
Publication venue
Publication date: 01/01/2006
Field of study

Many data mining algorithms make use of the well-known Inclusion-Exclusion principle. As a consequence, using this principle efficiently is crucial for the success of all these algorithms. Especially in the context of condensed representations, such as NDI, and in computing interesting measures, a quick inclusion-exclusion algorithm can be crucial for the performance. In this paper, we give an overview of several algorithms that depend on the inclusion-exclusion principle and propose an efficient algorithm to use it and evaluate its complexity. The theoretically obtained results axe supported by experimental evaluation of the quick IE technique in isolation, and of an example application

Document Server@UHasselt (Universiteit Hasselt)