1,721,564 research outputs found

    One Origin of Digital Humanities. Fr. Roberto Busa in His Own Words

    No full text
    This book gathers, and makes available in English, with new introductions, previously out of print or otherwise difficult to access articles by Fr Roberto Busa S.J. (1913 - 2011). Also included is a comprehensive bibliography of Busa, an oral history interview with Busa's translator, and a substantial new chapter that evaluates Busa's contributions and intellectual legacies. The result is a groundbreaking book that is of interest to digital humanists and computational linguists as well as historians of science, technology and the humanities. As the application of computing to cultural heritage becomes ever more ubiquitous, new possibilities for transmitting, shaping, understanding, questioning and even imagining the human record are opening up. Busa is considered by many to be among the pioneers in this field, and his research on projects like the Index Thomisticus is one of the earliest known examples of a humanities project that incorporated automation; it continues to be widely cited and used today. Busa published more than 350 academic articles and shorter pieces in numerous languages, but despite the unquestionable importance of his early work for understanding the history and development of fields like humanities computing and computational linguistics, a large part of his canon and thinking remained inaccessible or difficult to access until this book

    Preface

    No full text
    Preface of the Proceedings of the LT4HALA 2020 Worksho

    Preface

    No full text
    Preface of the volum

    Introduction. The added value of diachronic treebanks for historical linguistics

    No full text
    Over the last few decades, the widespread diffusion of digital technology has increased availability of primary textual sources, radically changing the everyday life of scholars in the humanities, who are now able to access, query and process a wealth of empirical evidence in ways not possible before. Also for ancient languages, corpora enhanced with increasingly complex layers of metalinguistic information, such as part-of-speech tagging and syntactic annotation (called ‘treebanks’) are now available. In particular, diachronic treebanks, which provide data for a language across several historical stages of a given language, allow for a new approach to diachronic studies of syntactic phenomena where scholars previously had to content themselves with empirical work on a much smaller scale. This is the introduction of a volume that brings together a set of papers that report research on various diachronic matters supported by evidence from diachronic treebanks. The contents of the papers cover a wide range of languages, including English, French, Russian, Old Church Slavonic, Latin and Ancient Greek. Originally published as special issue of Diachronica 35:3 (2018)

    Preface

    No full text
    Preface of the volum

    Overview of the EvaLatin 2022 Evaluation Campaign

    No full text
    This paper describes the organization and the results of the second edition of EvaLatin, the campaign for the evaluation of Natural Language Processing tools for Latin. The three shared tasks proposed in EvaLatin 2022, i. e. Lemmatization, Part-of-Speech Tagging and Features Identification, are aimed to foster research in the field of language technologies for Classical languages. The shared dataset consists of texts mainly taken from the LASLA corpus. More specifically, the training set includes only prose texts of the Classical period, whereas the test set is organized in three sub-tasks: a Classical sub-task on a prose text of an author not included in the training data, a Cross-genre sub-task on poetic and scientific texts, and a Cross-time sub-task on a text of the 15th century. The results obtained by the participants for each task and sub-task are presented and discussed

    Overview of the EvaLatin 2024 Evaluation Campaign

    No full text
    This paper describes the organization and the results of the third edition of EvaLatin, the campaign for the evaluation of Natural Language Processing tools for Latin. The two shared tasks proposed in EvaLatin 2024, i.,e., Dependency Parsing and Emotion Polarity Detection, are aimed to foster research in the field of language technologies for Classical languages. The shared datasets are described and the results obtained by the participants for each task are presented and discussed

    The Rise and Fall of Dependency Parsing in Dante Alighieri's Divine Comedy

    No full text
    In this paper, we conduct parsing experiments on Dante Alighieri{'}s Divine Comedy, an Old Italian poem composed between 1306-1321 and organized into three Cantiche {---}Inferno, Purgatorio, and Paradiso. We perform parsing on subsets of the poem using both a Modern Italian training set and sections of the Divine Comedy itself to evaluate under which scenarios parsers achieve higher scores. We find that employing in-domain training data supports better results, leading to an increase of approximately +17{\%} in Unlabeled Attachment Score (UAS) and +25-30{\%} in Labeled Attachment Score (LAS). Subsequently, we provide brief commentary on the differences in scores achieved among subsections of Cantiche, and we conduct experimental parsing on a text from the same period and style as the Divine Comedy

    Preface

    No full text
    Preface to the Proceedings of the Third Workshop on Annotation of Corpora for Research in the Humanities (ACRH-3
    corecore