Search CORE

1,721,564 research outputs found

One Origin of Digital Humanities. Fr. Roberto Busa in His Own Words

Author: Nyhan Julianne
Passarotti Marco
Passarotti Marco
Publication venue
Publication date: 01/01/2019
Field of study

This book gathers, and makes available in English, with new introductions, previously out of print or otherwise difficult to access articles by Fr Roberto Busa S.J. (1913 - 2011). Also included is a comprehensive bibliography of Busa, an oral history interview with Busa's translator, and a substantial new chapter that evaluates Busa's contributions and intellectual legacies. The result is a groundbreaking book that is of interest to digital humanists and computational linguists as well as historians of science, technology and the humanities. As the application of computing to cultural heritage becomes ever more ubiquitous, new possibilities for transmitting, shaping, understanding, questioning and even imagining the human record are opening up. Busa is considered by many to be among the pioneers in this field, and his research on projects like the Index Thomisticus is one of the earliest known examples of a humanities project that incorporated automation; it continues to be widely cited and used today. Busa published more than 350 academic articles and shorter pieces in numerous languages, but despite the unquestionable importance of his early work for understanding the history and development of fields like humanities computing and computational linguistics, a large part of his canon and thinking remained inaccessible or difficult to access until this book

PubliCatt

Preface

Author: Passarotti Marco
Sprugnoli Rachele
Publication venue
Publication date: 01/01/2020
Field of study

Preface of the Proceedings of the LT4HALA 2020 Worksho

PubliCatt

Preface

Author: Passarotti Marco
Publication venue
Publication date: 01/01/2021
Field of study

Preface of the volum

PubliCatt

Introduction. The added value of diachronic treebanks for historical linguistics

Author: Passarotti Marco
Publication venue
Publication date: 01/01/2020
Field of study

Over the last few decades, the widespread diffusion of digital technology has increased availability of primary textual sources, radically changing the everyday life of scholars in the humanities, who are now able to access, query and process a wealth of empirical evidence in ways not possible before. Also for ancient languages, corpora enhanced with increasingly complex layers of metalinguistic information, such as part-of-speech tagging and syntactic annotation (called ‘treebanks’) are now available. In particular, diachronic treebanks, which provide data for a language across several historical stages of a given language, allow for a new approach to diachronic studies of syntactic phenomena where scholars previously had to content themselves with empirical work on a much smaller scale. This is the introduction of a volume that brings together a set of papers that report research on various diachronic matters supported by evidence from diachronic treebanks. The contents of the papers cover a wide range of languages, including English, French, Russian, Old Church Slavonic, Latin and Ancient Greek. Originally published as special issue of Diachronica 35:3 (2018)

PubliCatt

Preface

Author: Passarotti Marco
Sprugnoli Rachele
Publication venue
Publication date: 01/01/2024
Field of study

Preface of the volum

PubliCatt

Overview of the EvaLatin 2022 Evaluation Campaign

Author: Passarotti Marco
Sprugnoli Rachele
Cecchini Flavio Massimiliano
Publication venue
Publication date: 01/01/2022
Field of study

This paper describes the organization and the results of the second edition of EvaLatin, the campaign for the evaluation of Natural Language Processing tools for Latin. The three shared tasks proposed in EvaLatin 2022, i. e. Lemmatization, Part-of-Speech Tagging and Features Identification, are aimed to foster research in the field of language technologies for Classical languages. The shared dataset consists of texts mainly taken from the LASLA corpus. More specifically, the training set includes only prose texts of the Classical period, whereas the test set is organized in three sub-tasks: a Classical sub-task on a prose text of an author not included in the training data, a Cross-genre sub-task on poetic and scientific texts, and a Cross-time sub-task on a text of the 15th century. The results obtained by the participants for each task and sub-task are presented and discussed

PubliCatt

Overview of the EvaLatin 2024 Evaluation Campaign

Author: Iurescia Federica
Passarotti Marco Carlo
Sprugnoli Rachele
Publication venue
Publication date: 01/01/2024
Field of study

This paper describes the organization and the results of the third edition of EvaLatin, the campaign for the evaluation of Natural Language Processing tools for Latin. The two shared tasks proposed in EvaLatin 2024, i.,e., Dependency Parsing and Emotion Polarity Detection, are aimed to foster research in the field of language technologies for Classical languages. The shared datasets are described and the results obtained by the participants for each task are presented and discussed

Archivio istituzionale della Ricerca - Università degli Studi di Parma

The Rise and Fall of Dependency Parsing in Dante Alighieri's Divine Comedy

Author: Passarotti Marco
Publication venue
Publication date: 01/01/2024
Field of study

In this paper, we conduct parsing experiments on Dante Alighieri{'}s Divine Comedy, an Old Italian poem composed between 1306-1321 and organized into three Cantiche {---}Inferno, Purgatorio, and Paradiso. We perform parsing on subsets of the poem using both a Modern Italian training set and sections of the Divine Comedy itself to evaluate under which scenarios parsers achieve higher scores. We find that employing in-domain training data supports better results, leading to an increase of approximately +17{\%} in Unlabeled Attachment Score (UAS) and +25-30{\%} in Labeled Attachment Score (LAS). Subsequently, we provide brief commentary on the differences in scores achieved among subsections of Cantiche, and we conduct experimental parsing on a text from the same period and style as the Divine Comedy

PubliCatt

Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024

Author: Passarotti Marco Carlo
Sprugnoli Rachele
Publication venue
Publication date: 01/01/2024
Field of study

Archivio istituzionale della Ricerca - Università degli Studi di Parma

Preface

Author: Mambrini Francesco
Passarotti Marco Carlo
Publication venue
Publication date: 01/01/2013
Field of study

Preface to the Proceedings of the Third Workshop on Annotation of Corpora for Research in the Humanities (ACRH-3

PubliCatt