1,721,035 research outputs found

    How to Annotate Emotions in Historical Italian Novels: a Case Study on I Promessi Sposi

    No full text
    This paper describes the annotation of a chapter taken from I Promessi Sposi, the most famous Italian novel of the 19th century written by Alessandro Manzoni, following 3 emotion classifications. The aim of this methodological paper is to understand: i) how the annotation procedure changes depending on the granularity of the classification, ii) how the different granularities impact the inter-annotator agreement, iii) which granularity allows good coverage of emotions, iv) if the chosen classifications are missing emotions that are important for historical literary texts. The opinion of non-experts is integrated in the present study through an online questionnaire. In addition, preliminary experiments are carried out using the new dataset as a test set to evaluate the performances of different approaches for emotion polarity detection and emotion classification respectively. Annotated data are released both as aggregated gold standard and with non-aggregated labels (that is labels before reconciliation between annotators) so to align with the perspectivist approach, that is an established practice in the Humanities and, more recently, also in NLP

    Creating a platform for navigating verbo-visual art collections

    No full text
    The “Verbo-Visual Virtual” (V.V.V.) project aims at developing a new web portal to unify and make available the Verbo-Visual collections of two museums, namely the portion of “Archivio Nuova Scrittura” (ANS) owned by the Museum of Modern and Contemporary Art of Trento and Rovereto (MART) and the part owned by the Museum for Modern and Contemporary Art of Bolzano (Museion)

    That branch of the Lake of Como...: Developing a New Resource for the Analysis of I Promessi Sposi and its Historical Translations

    No full text
    This paper presents a directional parallel corpus of the Ventisettana, that is the version of I Promessi Sposi published by Manzoni in 1827, aligned at sentence level with the anonymous English translation published in London in 1834 by Richard Bentley. After describing the procedure followed for creating the resource and analyzing the results of the manual alignment, the corpus is used as a gold standard to evaluate Bertalign automatic aligner. This new linguistic resource can benefit the research community, in particular in the fields of the history of literature and translation studies, and be useful for developing new automatic tools specific for handling the peculiarities of historical literary texts

    Sentiment Analysis for the Humanities: the Case of Historical Texts

    No full text
    In this paper we investigate the possibility to adapt existing lexical resources and Natural Language Processing (NLP) methodologies related to Sentiment Analysis (SA) to the historical domain. The corpus used for our research is the complete corpus of writings of Alcide De Gasperi (1881-1954), one of the founders of the Italian Republic, made of about 3,000 documents and 3,000,000 words. Using this corpus as a case study, two experiments have been carried out and are described in this paper. The aim of these experiments were the evaluation of i) how existing lexical resources for SA perform in the historical domain and ii) the feasibility of a sentiment annotation task for historical texts either for expert annotators and for crowdsourcing workers

    Linguistica computazionale. Introduzione all'analisi automatica dei testi.

    No full text
    Dopo aver fornito i cenni storici sulla disciplina, il libro introduce le nozioni di linguistica e statistica necessarie per il trattamento automatico del linguaggio. Sono affrontati poi nell’ordine: i metodi e le tecniche di apprendimento automatico (machine learning) tradizionale e con reti neurali, le rappresentazioni vettoriali del significato delle parole, l’annotazione di dati linguistici a scopi computazionali e i diversi task di Natural Language Processing: da quelli di pre-processing a quelli di classificazione e di generazione di testi. Conclude il volume un capitolo che fornisce informazioni su dove trovare gli strumenti utili per effettuare l’analisi automatica dei testi

    The “Verbo-Visual Virtual” Platform for Digitizing and Navigating Cultural Heritage Collections

    No full text
    Linked collections and archives in the field of contemporary art are usually less common than in other fields of the humanities such as in historical and literary research. The “Verbo-Visual Virtual” (Vvv) project aims at developing a new web portal which unifies the Verbo-Visual collections of two museums, namely the “Archivio Nuova Scrittura” (Ans) owned in part by the Museum of Modern and Contemporary Art of Trento and Rovereto (Mart) and in part by the Museum for Modern and Contemporary Art of Bolzano (Museion). Through the Vvv web portal, it will be possible for users to search for pieces of art and the corresponding information records now stored in two different museums and accessible using two different catalogues. It will also be possible to discover new information about the collection by performing complex queries over the underlying database and the links manually set among the records

    Going Beyond Counting First Authors in Author Co-citation Analysis

    Full text link
    The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
    corecore