1,721,032 research outputs found
Recommendations Biases and Beyond-Accuracy Objectives in Collaborative Filtering
Recommender systems research, traditionally focused on accuracy, is currently paying more and more attention to additional factors for evaluating the perceived quality and usefulness of recommendation lists. In this paper, we present a survey of the most important dimensions, other than accuracy, usually taken into account in the collaborative filtering literature. We survey beyond-accuracy objectives, i.e. novelty, diversity and serendipity, and the main techniques for increasing them. Moreover, we discuss possible undesired biases occurring in collaborative filtering algorithms, and how to effectively deal with them
Full-Text Search Engines for Databases
Current databases are able to store several Tbytes of free-text documents. The main purpose of a database from the user’s viewpoint is the efficient information retrieval. In the case of textual data, information retrieval mostly concerns the selection and the ranking of documents. The selection criteria can contain elements that apply to the content or the grammar of the language. In the traditional database management systems (DBMS), text manipulation is restricted to the usual string manipulation facilities, i.e. the exact matching of substrings. Although the new SQL1999 standard enables the usage of more powerful regular expressions, this traditional approach has some major drawbacks. The traditional string-level operations are very costly for large documents as they work without task-oriented index structures. The required full-text management operations belong to text mining, an interdisciplinary field of natural language processing and data mining. As the traditional DBMS engine is inefficient for these operations, database management systems are usually extended with a special full-text search (FTS) engine module. We present here the particular solution of Oracle; there for making the full-text querying more efficient, a special engine was developed that performs the preparation of full-text queries and provides a set of language and semantic specific query operators. </jats:p
Exact trade-off between approximation accuracy and interpretability: solving the saturation problem for certain FRBSs
Text categorization on a multi-lingual corpus
This paper presents experiments with a hierarchical text categorizer on a multi-lingual (English, French) corpus. The results obtained are very similar for both languages. The results allow us to apply in the near future cross-language text categorization that can be used to support automatic translation to create multi-lingual topic glossary.
- …
