1,721,223 research outputs found

    Contextualized BERT Sentence Embeddings for Author Profiling: The Cost of Performances

    No full text
    The necessity to know information about the real identity of an online subject is a highly relevant issue in User Profiling, especially for analysis from digital sources such as social media. The digital identity of a user does not always present explicit data about her offline life such as age, gender, work, and more. This problem makes the task of user profiling complex and incomplete. For many years this issue has received a considerable amount of attention from the whole community, which has developed several solutions, also based on machine learning, to estimate user characteristics. The increasing diffusion of deep learning approaches has allowed, on the one hand, to obtain a considerable increase in predictive performance, but on the other hand, to have available models that cannot be interpreted and that require very high computational power. Considering the validity of new pre-trained language models on extensive data for resolving many natural language processing and classification tasks, we decided to propose a BERT-based approach (BERT-DNN) also for the author profiling task. In a first analysis, we compared the results obtained by our model with them of more classical approaches. As a follow, a critical analysis was carried out. We analyze the advantages and disadvantages of these approaches also in terms of resources needed to run them. The results obtained by our model are encouraging in terms of reliability but very disappointing if we consider the computational power required for running it

    Harnessing distributional semantics to build context-aware justifications for recommender systems

    Full text link
    This paper introduces a methodology to generate review-based natural language justifications supporting personalized suggestions returned by a recommender system. The hallmark of our strategy lies in the fact that natural language justifications are adapted to the different contextual situations in which the items will be consumed. In particular, our strategy relies on the following intuition: Just like the selection of the most suitable item is influenced by the contexts of usage, a justification that supports a recommendation should vary as well. As an example, depending on whether a person is going out with her friends or her family, a justification that supports a restaurant recommendation should include different concepts and aspects. Accordingly, we designed a pipeline based on distributional semantics models to generate a vector space representation of each context. Such a representation, which relies on a term-context matrix, is used to identify the most suitable review excerpts that discuss aspects that are particularly relevant for a certain context. The methodology was validated by means of two user studies, carried out in two different domains (i.e., movies and restaurants). Moreover, we also analyzed whether and how our justifications impact on the perceived transparency of the recommendation process and allow the user to make more informed choices. As shown by the results, our intuitions were supported by the user studies

    Cross-language Linking of eGov Services to the LOD Cloud

    No full text
    CroSer (Cross-language Semantic Retrieval) is an IR system able to discover links between eGov services described in different languages. CroSeR supports public administrators to link their own source catalogs of eGov services described in any language to a target catalog whose services are described in English and are available in the Linked Open Data (LOD) cloud. Our system is based on a cross-language semantic matching method that i) translates service labels in English using a machine translation tool, ii) extracts a Wikipedia-based semantic representation from the translated service labels using Explicit Semantic Analysis (ESA), iii) evaluates the similarity between two services using their Wikipedia-based representations. The user selects a service in a source catalog and exploits the ranked list of matches suggested by CroSeR to establish a relation (of type narrower, equivalent, or broader match) with other services in the English catalog. The method is independent from the language adopted in the source catalog and it does not assume the availability of information about the services other than very short text descriptions used as service labels. CroSeR is a web application accessible via http://siti-rack.siti.disco.unimib.it:8080/croser/3. The work has been partially supported by the Italian PON project PON01 00861 SMART-Services and Meta-services for smART eGovernment

    GPR Prospecting at the Castle of Alceste in San Vito dei Normanni (Brindisi, Italy)

    Full text link
    In this contribution we will show the results of a non-invasive GPR prospecting performed in the archaeological site of San Vito dei Normanni, in the outskirts of Brindisi, Apulia region, southern Italy, where the ancient population of the Messapians had an important settlement. The present work was performed in the framework of a school in the field financed by the Basilicata Region. The interpretation of the data has been based on the depth slices, on the processed Bscans and on the a-priori available information is provided. Future excavations will be driven also by the results achieved from this GPR prospecting

    Going Beyond Counting First Authors in Author Co-citation Analysis

    Full text link
    The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

    Variations on the Author

    Full text link
    “Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
    corecore