Planning Complex Updates to Satisfy Constraint Rules Using a Constraint Logic Search Engine
Twister Tries: Approximate Hierarchical Agglomerative Clustering for Average Distance in Linear Time
Many commonly used data-mining techniques utilized across
research fields perform poorly when used for large data sets.
Sequential agglomerative hierarchical non-overlapping clustering
is one technique for which the algorithms’ scaling
properties prohibit clustering of a large number of items.
Besides the unfavorable time complexity of O(n^2), these
algorithms have a space complexity of O(n^2), which can
be reduced to O(n) if the time complexity is allowed to
rise to O(n^2 log2 n). In this paper, we propose the use of
locality-sensitive hashing combined with a novel data structure
called twister tries to provide an approximate clustering
for average linkage. Our approach requires only linear space.
Furthermore, its time complexity is linear in the number of
items to be clustered, making it feasible to apply it on a
larger scale. We evaluate the approach both analytically
and by applying it to several data sets.
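
The abstract names locality-sensitive hashing as one ingredient of the approach. As a rough illustration of that ingredient only, the sketch below uses random-hyperplane hashing (a standard LSH family for cosine similarity) to group vectors into buckets in a single pass. It is not the twister trie structure and not the authors' implementation; the function names `make_hyperplanes`, `lsh_signature`, and `bucket_items`, and the choice of hash family, are assumptions made for this example.

```python
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random hyperplane normals in dim dimensions (hypothetical helper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def lsh_signature(vector, hyperplanes):
    """Return one bit per hyperplane: 1 if the vector lies on its non-negative side."""
    return tuple(
        1 if sum(w * x for w, x in zip(plane, vector)) >= 0 else 0
        for plane in hyperplanes
    )


def bucket_items(items, hyperplanes):
    """Group item indices by signature; each item is hashed exactly once."""
    buckets = {}
    for index, vector in enumerate(items):
        buckets.setdefault(lsh_signature(vector, hyperplanes), []).append(index)
    return buckets


if __name__ == "__main__":
    planes = make_hyperplanes(num_bits=8, dim=3)
    data = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0], [-1.0, 0.5, -2.0]]
    for signature, members in bucket_items(data, planes).items():
        print(signature, members)
```

For a fixed number of hash bits and a fixed dimension, the bucketing pass touches every item once, so its cost grows linearly with the number of items. Vectors pointing in the same direction always share a signature, while merely similar vectors share one only with high probability, which is why any clustering built on such buckets is approximate.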
On the Move to Meaningful Internet Systems: OTM 2014 Conferences - Confederated International Conferences: CoopIS, and ODBASE 2014, Amantea, Italy, October 27-31, 2014, Proceedings
Towards Multimedia Fragmentation
Database fragmentation is a process for reducing irrelevant data accesses by grouping data that are frequently accessed together into dedicated segments. In this paper, we address multimedia database fragmentation by extending existing fragmentation algorithms to take into account key characteristics of multimedia objects. In particular, we discuss multimedia primary horizontal fragmentation and provide a partitioning strategy based on low-level multimedia features. Our approach emphasizes the importance of multimedia predicate implications in optimizing multimedia fragments. To validate our approach, we have implemented a prototype that computes multimedia predicate implications. Experimental results are satisfactory.
