1,721,014 research outputs found

    Negotiating corpus identity: from body to web

    No full text
    The paper explores some of the issues raised by the notion of the web-as-corpus (Kilgarriff-Greffenstette 2003; Baroni – Bernardini 2006) on both theoretical and applicative grounds. The basic assumption is that the very existence of a trend within corpus linguistics that can be freely labelled as web-as-corpus does not simply question the way we conceive of a corpus in modern linguistics, but rather “serves as a magnifying glass for the methodological issues that corpus linguists have discussed all along” (Hundt et al. 2007), and possibly mirrors also changes taking place in society at large. The idea of considering the web as a corpus presupposes a definition of what a corpus is and possibly entails a renegotiation of what a corpus can be. The paper thus revises some key issues in corpus linguistics such as “authenticity”, “representativeness”, “size” “content” (Sinclair 1991; McEnery-Wilson 2001: Tognini Bonelli 2001; Hunston 2002) in the light of the peculiarities of the web as a ‘spontaneous’, ‘self-generating’ collection of texts, and explores some of the new issues which the emerging notion of the web-as-corpus seems to raise, such as “dynamism”, “reproducibility”, “relevance and reliability”, “distributed architecture” (Fletcher 2004; Baroni 2006; Lüdeling 2007). These theoretical issues are then tested on applicative grounds through case studies based on the most common tools and methods devised to exploit the enormous potential of the web as a linguistic resource, either via ordinary search engines or through linguistically-oriented tools such as WebCorp or BootCat (Kehoe – Renouf 2002; Baroni - Bernardini 2004 and 2006). This ongoing process finally leaves room for further investigations. While the reasons for turning to the web as a corpus may have in fact been pre-eminently opportunistic at the outset (size, low cost, ease of access...), it is also self-evident that the web has also imposed itself to the linguists’ attention as an object of scientific enquiry thanks to its intrinsic peculiarities as “a social phenomenon ... whose chief stock-in-trade is language” (Crystal 2006). Indeed, the ‘changing face’ (Renouf 2006) of corpus linguistics clearly signifies the “convergence of technologies and standards in several related fields which have in common the goal of delivering linguistic content through electronic means” (Wynne 2002), and can be seen as the outcome of a wider process of redefinition in terms of flexibility, multiplicity and complexity which corpus linguistics is undergoing along with other fields of human activity

    Going Beyond Counting First Authors in Author Co-citation Analysis

    Full text link
    The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

    Are they still warriors? Reflections on the use of Maori lexis in New Zealand English

    No full text
    New Zealand English (NZE) is a language whose vocabulary has been strongly influenced by British, Scottish and Irish but what makes it uniquely different from any other English variety is the presence of words in te reo maori, i.e. the Maori language (Bell & Kuiper, 2000). The first borrowings from Maori came into NZE when the country was colonized (at the end of the 18th century): most of them were words for plants and animals, some were cultural terms. Since then, a rather large number of Maori words have found their way both in spoken and written NZE (Kennedy, 2001). This paper concerns present-day written NZE and is motivated by two general observations: 1) Maori terms can be found in New Zealand books and newspapers without any translation into English, indicating that most New Zealanders know – or allegedly know – what they mean; 2) most of them are used exclusively in connection with Maori culture, which points at their no more than partial integration into NZE (Trudgill & Hannah, 2002). For the present study five Maori words have been selected, namely aroha (‘love’), mana (‘power’), marae (‘meeting ground’), tapu (‘taboo’), and whakapapa (‘genealogy’), on the basis of their frequency of occurrence in NZE and their significance in terms of cultural identity. The analysis (both quantitative and qualitative) will be carried out working on a corpus of on-line NZ newspapers and magazines published in the last three years in order to investigate the five Maori words in their contexts of use. The study aims at pointing out: a)what has been lost of the original meanings, b) which new meanings have been added, c) the extent to which the use of these words contributes to the maintenance and/or creation of cultural stereotypes and d)the extent to which the use of these words can be taken as a reflection of assimilationist policies

    Variations on the Author

    Full text link
    “Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship

    Appropriate Similarity Measures for Author Cocitation Analysis

    Full text link
    We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis
    corecore