On Designing Semantic Lexicon-Based Architectures for Web Information Retrieval
In this work, a novel framework for designing Web Information Retrieval systems, with particular reference to semantic search engines, is presented. The key idea is to add a semantic dimension to the classical Term-Document Matrix, yielding a three-dimensional dataset. This enhancement makes it possible to define a lexico-semantic user interface in which queries are processed at the conceptual level through a Semantic Lexicon. The WordNet Semantic Lexicon is used here as a golden ontology for handling polysemy and synonymy, so it serves to disambiguate user queries at the semantic level. A layered multi-agent system supports the design process. Particular emphasis is given to formal system knowledge representation, to the interface layer managing user-system interaction, and to the markup layer performing the semantic tagging process.
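The three-dimensional dataset described in this abstract can be pictured with a minimal sketch. It assumes a sparse mapping keyed by (term, document, sense); the function names, document ids and sense labels below are hypothetical illustrations, not real WordNet synset identifiers and not the authors' implementation.

```python
from collections import defaultdict

def build_index(tagged_docs):
    """Build a sparse term x document x sense count structure.

    tagged_docs maps a document id to a list of (term, sense) pairs,
    i.e. tokens that have already been semantically tagged.
    """
    counts = defaultdict(int)
    for doc_id, tokens in tagged_docs.items():
        for term, sense in tokens:
            counts[(term, doc_id, sense)] += 1
    return counts

def docs_for_sense(counts, sense):
    """Return the sorted ids of documents containing any term tagged with `sense`."""
    return sorted({doc for (_term, doc, s) in counts if s == sense})

# Hypothetical toy corpus; sense labels are made up for illustration.
tagged_docs = {
    "d1": [("bank", "bank.financial"), ("deposit", "deposit.money")],
    "d2": [("bank", "bank.river"), ("shore", "bank.river")],
    "d3": [("depository", "bank.financial")],
}

index = build_index(tagged_docs)
financial_docs = docs_for_sense(index, "bank.financial")
river_docs = docs_for_sense(index, "bank.river")
```

Querying by sense rather than by term retrieves d3, which never uses the word "bank" (synonymy), and keeps d2 out of the financial results even though it does (polysemy).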
Fingerprinting lexical contexts over the Web
In this paper a novel technique for identifying lexical contexts in web resources is presented. The basic idea is to consider web site anchor texts as lexicalized descriptions of an individual ontology organized as a graph of concept words. In the search for peculiar semantic patterns, the concept of web minutia (transposed from the forensic domain) is introduced. The proposed technique consists of searching for web minutiae in the analyzed web sites by means of a golden ontology. Web minutiae act as fingerprints for context-specific web resources; in this sense they are a powerful computational tool for identifying and categorizing the Web. The WordNet database has been used as the golden ontology for our experiments on English web documents. WordNet allows for indexing and retrieving word senses and interword taxonomical relations such as hyponymy and hypernymy. It has proven to be an efficient mediator between web ontologies and context-dependent taxonomies. Our experiments have been carried out on a preliminary data set of several tens of thousands of links taken from the web sites of thirteen UK universities. Preliminary results seem to confirm the ability of web minutiae to identify lexical contexts across the Web.
M-DUST: an Innovative Low-Cost Smart PM Sensor
In this work, we present M-DUST, a novel low-cost, real-time smart monitoring sensor for Particulate Matter (PM) emission measurement. It uses the Tyndall scattering process to measure particle concentration. Different methods for evaluating particle concentration are compared. A mechanical filter selects the particulate matter with the appropriate cut-off aerodynamic diameter. The device qualifies as an intelligent sensor thanks to features such as self-diagnosis, self-adaptation and transparency to the communication interface. Tests were carried out in the Italian city of Taranto using non-toxic substances and an analysis chamber.
Systems for the Semantic Management of Support Material in E-Learning Platforms
This work describes a semantics-based system for e-learning platforms with a twofold aim: to help the teacher organize the support material for the Learning Object being taught, and to guide the learner in acquiring the domain-specific knowledge structure. The knowledge base is inspired by the Semantic Web paradigm and uses WordNet as its reference ontology. It is characterized by lexico-semantic entities that allow knowledge to be indexed at different levels of abstraction. Content delivery is implemented as a chatbot that guides the student through a structured understanding of contexts at different levels of detail (book, chapter, section, paragraph, sentence) and of their mutual logical organization. On the knowledge-base production side, the proposed system contributes to the automatic writing of support material, since it only requires the instructor to indicate digital resources (e-books, documents, Web pages) that cover the taught topic in a structured way. On the learner side, chat interaction with a virtual assistant makes searching and exploring the content more engaging. A prototype of the system is currently being tested at the AeFLab Laboratory of the Politecnico di Bari.
On Designing Task-Oriented Intelligent Interfaces: An E-Mail Based Design Framework
This paper presents a design framework for building intelligent interfaces that use e-mail to dialogue with human users in task-oriented settings. In particular, the proposed approach is pursued from the pattern matching standpoint. Human-computer interaction (HCI) is treated as a classification process in which the input is the user query written in natural language and the output is the most likely classes of system services, each with a certain degree of match. In case of partial matching, the system starts a dialogue with the human user, attempting to disambiguate the meaning of the written text in the context of system services. A case study is reported and preliminary results are discussed.
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA): the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting: the traditional type that only counts the first one among a cited work's authors on the one hand, and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed.
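The two counting schemes compared in this abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not spell out: two authors count as co-cited once per citing paper when they appear on different cited works in that paper's reference list, and co-authors of the same cited work are not paired with each other. The function name and the toy data are hypothetical.

```python
from collections import Counter
from itertools import combinations

def cocitation_counts(reference_lists, max_authors=1):
    """Count author co-citations over a collection of reference lists.

    reference_lists: one entry per citing paper; each entry is a list of
    cited works, and each cited work is an ordered list of author names.
    max_authors: how many leading authors of each cited work to consider
    (1 = first-author counting, 5 = first-five-author counting).
    """
    counts = Counter()
    for cited_works in reference_lists:
        pairs = set()  # each pair counts at most once per citing paper
        for work_a, work_b in combinations(cited_works, 2):
            for a in work_a[:max_authors]:
                for b in work_b[:max_authors]:
                    if a != b:
                        pairs.add(tuple(sorted((a, b))))
        counts.update(pairs)
    return counts

# Hypothetical toy data: two citing papers, authors named A to D.
reference_lists = [
    [["A", "B"], ["C"]],
    [["A"], ["C", "D"]],
]

first_author = cocitation_counts(reference_lists, max_authors=1)
first_five = cocitation_counts(reference_lists, max_authors=5)
```

On this toy input, first-author counting sees only the pair (A, C), while first-five counting also picks up (B, C) and (A, D), which is the kind of extra signal the non-traditional scheme adds.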
Variations on the Author
“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer who, rather than expressing opinions, propels discourse; an interviewer who is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent from the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship.
Appropriate Similarity Measures for Author Cocitation Analysis
We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.
Keywords: information science; Pearson correlation; cosine; similarity measure; author cocitation analysis
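To make the Pearson-versus-cosine comparison concrete, the sketch below computes both measures on two hypothetical cocitation profiles and shows one mathematical difference between them: appending zero entries to both profiles (authors with whom neither is cocited) leaves the cosine unchanged but shifts the Pearson correlation. This illustrates a general property of the two measures; the abstract does not say whether it is the argument the authors make, and the profile values are invented for illustration.

```python
import math

def cosine(x, y):
    """Cosine similarity between two equal-length numeric profiles."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

def pearson(x, y):
    """Pearson correlation, computed as the cosine of the mean-centered profiles."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine([a - mean_x for a in x], [b - mean_y for b in y])

# Hypothetical cocitation profiles of two authors with three other authors.
x = [3, 1, 0]
y = [1, 3, 0]

# The same profiles after adding five authors with whom neither is cocited.
x_padded = x + [0] * 5
y_padded = y + [0] * 5

cos_before, cos_after = cosine(x, y), cosine(x_padded, y_padded)
r_before, r_after = pearson(x, y), pearson(x_padded, y_padded)
```

Here `cos_before` and `cos_after` coincide, whereas `r_after` is noticeably larger than `r_before`, so under Pearson the measured similarity depends on how many unrelated authors happen to be included in the analysis.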
