Information Retrieval Techniques for Pattern Matching - Managing and Searching Textual and XML Information in 21st Century Applications
Information is the main value of the Information Society. Recent developments in computing power and telecommunications, along with the constant drop in the costs of Internet access, data management and storage, have created the right conditions for the global diffusion of the Web and, more generally, of new research tools able to analyze information and its content. Depending on the particular application scenario and on the type of information that has to be managed and searched, different techniques need to be devised. In this book, the author deals with the two most common types of information: plain text, discussed in the first part, and semi-structured data, in particular XML documents, discussed in depth in the second part. The detailed analysis of approximate matching, duplicate document detection, exact, approximate and semantic query answering, multi-version document management and personalized access techniques offered in this book will guide Information Technology professionals and users in effectively and efficiently managing information and knowledge, thus answering the increasingly complex information needs of most 21st century applications.
EXTRA: Example Based Machine Translation
Developed in collaboration with LOGOS Group, the software belongs to the EBMT (Example Based Machine Translation) field, in which approximate matching techniques are applied to sentences while also addressing additional problems related to multilingualism. It also extends syntactic similarity to the semantic field through the study of disambiguation techniques based on WordNet.
XML-S3MART: Similarity Search on Semi-Structured Data
The software solves the problem of approximate search on XML data coming from heterogeneous sources. In particular, it targets advanced search engines for digital libraries containing semi-structured information that describes the same reality but comes from different sources and therefore satisfies different structural requirements. In this context, it includes techniques for automatically rewriting the queries submitted by users (query rewriting) with respect to every document of the digital library that can be useful to satisfy their information need.
Shaping Tomorrow Information Management, Today
Recent developments in computing power and telecommunications and, in general, the advanced ICT (Information and Communication Technology) of the 20th century have accelerated the use and value of Information in our society. Indeed, Information is the main value of the Information Society. In this respect, the World Wide Web, Peer-to-Peer networks, mobile devices, and ubiquitous computing systems and sensors offer more and more interesting possibilities today; however, current research on the relevant technologies, structures and services is not yet mature enough. Research at the Information Systems Group (ISGroup), inside the Information Engineering Department (DII) of the Modena and Reggio Emilia University, is focused on the design and development of new systems, algorithms and data structures for the access and management of Information. The group constantly devises and puts into practice, also by means of national and international research projects and collaborations, innovative solutions able to answer, both effectively and efficiently, increasingly complex Information needs in several 21st century applications.
X-SITER: Efficient XML Query Processing
The software includes twig query processing techniques that allow flexible structural querying, for both ordered matching (the order of the sibling nodes of a query is important) and unordered matching (the order of the sibling nodes is not relevant). Further, it includes algorithms and data structures that allow efficient execution even on large amounts of data.
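The difference between ordered and unordered matching can be illustrated with a naive sketch. This is not X-SITER's algorithm: it is a brute-force matcher under simplifying assumptions (only parent-child edges, exact label equality, and each query child mapped to a distinct data child). The names `Node` and `matches` are hypothetical and chosen only for this illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Node:
    """A labelled tree node, used here for both query twigs and data trees."""
    label: str
    children: list[Node] = field(default_factory=list)


def matches(query: Node, data: Node, ordered: bool) -> bool:
    """Return True if the query twig matches the data tree rooted at `data`.

    Assumption: every query child must map to a distinct child of the
    matched data node (parent-child edges only, no descendant edges).
    """
    if query.label != data.label:
        return False
    if ordered:
        return _match_ordered(query.children, data.children)
    return _match_unordered(query.children, data.children, frozenset())


def _match_ordered(qs: list[Node], ds: list[Node]) -> bool:
    # Query siblings must appear among the data siblings in the same
    # left-to-right order, so a greedy subsequence scan is sufficient.
    i = 0
    for d in ds:
        if i < len(qs) and matches(qs[i], d, ordered=True):
            i += 1
    return i == len(qs)


def _match_unordered(qs: list[Node], ds: list[Node], used: frozenset) -> bool:
    # Sibling order is ignored: backtrack over assignments of query
    # children to data children that have not been used yet.
    if not qs:
        return True
    first, rest = qs[0], qs[1:]
    for idx, d in enumerate(ds):
        if idx not in used and matches(first, d, ordered=False):
            if _match_unordered(rest, ds, used | {idx}):
                return True
    return False


# Data tree: book(title, author). Query twig: book(author, title).
data = Node("book", [Node("title"), Node("author")])
query = Node("book", [Node("author"), Node("title")])

ordered_result = matches(query, data, ordered=True)
unordered_result = matches(query, data, ordered=False)
```

In this example the query lists its children in the opposite order from the data, so only the unordered variant matches. The backtracking search is exponential in the worst case, which is why dedicated algorithms and index structures are needed at scale.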
DANCER: Similarity Search on Plain Text
The software includes techniques for accessing textual data, based on both syntactic and semantic analysis. In particular, it exploits similarity search techniques that go beyond simple exact search, and metrics for syntactic similarity suitable for textual sequences of any type, i.e. sequences of words (phrases) or generic sequences of symbols (like genetic codes).
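As an illustration of a syntactic similarity metric that works on sequences of any type, the sketch below uses the classic Levenshtein edit distance over generic sequences. This is an assumption made for the example: the description above does not say which metric DANCER uses, and the function names `edit_distance` and `similarity` are hypothetical.

```python
from typing import Hashable, Sequence


def edit_distance(a: Sequence[Hashable], b: Sequence[Hashable]) -> int:
    """Levenshtein distance between two sequences of comparable items.

    Works on strings (sequences of characters), lists of words, or any
    other sequence of symbols, using unit cost for insertion, deletion
    and substitution.
    """
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def similarity(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Normalize the distance to a score in [0, 1]; 1.0 means identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


# Word-level comparison of two phrases (one substituted word).
phrase_distance = edit_distance("the cat sat".split(), "the dog sat".split())

# Symbol-level comparison of two short genetic sequences.
dna_similarity = similarity("ACGT", "ACGA")
```

The same function handles phrases and symbol strings because it only relies on item equality. An approximate search would then return every stored sequence whose similarity to the query exceeds a chosen threshold, rather than only exact matches.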
Facilitate IT-Providing SMEs in Software Development: a Semantic Helper for Filtering and Searching Knowledge
Software development is still considered a bottleneck in the advance of the Information Society. The recently started FACIT-SME European FP-7 project aims to facilitate the use and sharing of Software Engineering methods and best practices among software-developing SMEs. On top of an Open Reference Model (ORM) serving as an underlying knowledge backbone, specific filtering/search mechanisms will support the identification of adequate processes and practices for specific enterprise needs. In this paper, we focus on the proposal of knowledge-based text analysis and retrieval techniques which will form a key component of the advanced filtering mechanisms of the project. The proposed solution is designed to be more powerful and flexible than standard syntactic search techniques, but also to be easily applicable by any SME. The experimental evaluation of the preliminary implementation shows promising results.
STRIDER: Structural Disambiguation
The software implements versatile disambiguation approaches which can be used to make explicit the meaning of structure-based information such as XML schemas, XML document structures, web directories, and ontologies.
SUNRISE: P2P Networks for Data and Service Sharing
The software implements techniques for creating, maintaining and accessing Peer-to-Peer networks for data and service sharing
SocialGQ: Towards semantically approximated and user-Aware querying of social-graph data
The proliferation of social and collaborative sites makes users increasingly active in the generation of social-graph data; however, such a sea of data often hinders them from finding the information they need. In this paper, we present SocialGQ ("Social-Graph Querying"), a novel approach for the effective and efficient querying of social-graph data that overcomes the limitations of typical search approaches proposed in the literature. SocialGQ allows users to compose complex queries in a simple way, and is able to retrieve useful knowledge (top-k answers) by jointly exploiting: (a) the structure of the graph, semantically approximating the user's requests with meaningful answers; (b) the unstructured textual resources of the graph; (c) its social and user-aware dimension. An experimental evaluation comparing SocialGQ to leading approaches shows strong gains on a real social-graph data scenario.
