1,720,980 research outputs found
Early Vedic compounds. A typological reappraisal
Despite a longstanding tradition of studies in Sanskrit compounds, a description that enables comparisons in cross-linguistic perspective has not yet been worked out. The present article follows classificatory criteria introduced by Bisetto & Scalise ( 2005 , 2009 ) and sketches a typology of compounds in the most archaic variety of Sanskrit, Early Vedic, as transmitted by the RigVeda. Analyzing compounds on the basis of the grammatical relations holding between their constituents provides a classification into coordinate, subordinate, and attributive compounds, with the endocentric/exocentric divide cutting across all classes. In order to identify the position of Early Vedic compounds on the syntax-morphology continuum, the article investigates the degree of cohesiveness at the level of morphology, prosody, syntax, and semantics. With this respect, the RigVeda offers a varied picture where relics of archaic, less cohesive forms occur side-to-side with more productive and word-like ones
Annotating “Absolute” Preverbs in the Homeric and Vedic Treebanks
Indo-European preverbs are uninflected morphemes attaching to verbs and modifying their meaning. In Early Vedic and Homeric Greek, these morphemes held ambiguous morphosyntactic status raising issues for syntactic annotation. This paper focuses on the annotation of preverbs in so-called “absolute” position in two Universal Dependencies treebanks. This issue is related to the broader topic of how to annotate ellipsis in Universal Dependencies. After discussing some of the current annotations, we propose a new scheme that better accounts for the variety of absolute constructions
Linking the Sanskrit WordNet to the Vedic Dependency Treebank: a pilot study
The Sanskrit WordNet is a resource currently under development, whose core was induced from a Vedic text sample semantically annotated by means of an ontology mapped on the Princeton WordNet synsets. Building on a previous case study on Ancient Greek (Zanchi et al. 2021), we show how sentence frames can be extracted from morphosyntactically parsed corpora by linking an existing dependency treebank of Vedic Sanskrit to verbal synsets in the Sanskrit WordNet. Our case study focuses on two verbs of asking, y{\=a}c- and prach-, featuring a high degree of variability in sentence frames. Treebanks enhanced with WordNet-based semantic information revealed to be of crucial help in motivating sentence frame alternations
Building new resources for historical linguistics
This volume collects the papers originally presented at the workshop Building New Resources for Historical Linguistics, held at the University of Pavia in November 2020. The purpose of this workshop was to provide an opportunity for researchers engaged in the development of linguistic resources for historical linguistics to share their experience and knowledge. Sharing is crucial in computational linguistics in order to avoid multiplying efforts and encourage the use of compatible tools, formats, and formalisms to increase the interoperability. Reflecting the purpose of the original workshop, this book introduces the reader to different projects aimed at creating, developing and linking linguistic resources for historical linguistics. While some of the papers in the volume describe mature resources and discuss their possible application, others introduce resources that are still in progress, presenting their aims, the challenges faced in their construction, and the methodologies employed to tackle them. The different types of resources described in the volume include syntactically annotated corpora (treebanks), dependency lexica, as well as lexical and typological databases. Furthermore, some of the papers are concerned with the thriving field of Linguistic Linked Open Data, the current up-to-date standard to link linguistic resources
Linking the Ancient Greek WordNet to the Homeric Dependency Lexicon
The Ancient Greek WordNet is a new resource that is being developed at the Universities of Pavia and Exeter, based on the Princeton WordNet. The Princeton WordNet provides sentence frames for verb senses, but this type of information is lacking in most WordNets of other languages. In fact, exporting sentence frames from English to other languages is not a trivial task, as sentence frames depend on the syntax of individual languages. In addition, the information provided by the Princeton WordNet is not corpus-based but relies on native speakers’ knowledge. This type of information is not available for dead languages, which are by definition corpus languages. In this paper, we show how sentence frames can be extracted from morpho-syntactically parsed corpora by linking an existing depend- ency lexicon of Homeric verbs (HoDeL) to verbs in the Ancient Greek WordNet. Given its features, HoDeL allows automatically extracting all subcategorization frames available for each verb along with information concerning their frequency as well as semantic information regarding the possible arguments occurring in specific frames. In the paper, we show our method to automatically link the two resources and compare some of the resulting sentence frames with the English sentence frames in the Princeton WordNet
Toward the creation of WordNets for ancient IE languages
This paper presents the work in progress toward the creation of a family of WordNets for Sanskrit, Ancient Greek, and Latin. Building on previous attempts in the field, we elaborate these efforts bridging together WordNet relational semantics with theories of meaning from Cognitive Linguistics. We discuss some of the innovations we have introduced to the WordNet architecture, to better capture the polysemy of words, as well as Indo-European language family-specific features. We conclude the paper framing our work within the larger picture of resources available for ancient languages and showing that WordNet-backed search tools have the potential to re-define the kinds of questions that can be asked of ancient language corpora
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation
counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings
are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that
only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into
account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
Variations on the Author
“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
- …
