1,720,964 research outputs found

PAROLE reference corpus

Author: Biagini Lisa
Picchi Eugenio
Calzolari Nicoletta
Rossi Sergio
Zampolli Antonio
Orsolini Paola
Monachini Monica
Goggi Sara
Marinelli Rita
Bindi Remo
Publication date: 20/03/2024

The PAROLE project (Preparatory Action for Linguistic Resources Organization for Language Engineering) has produced a set of harmonized corpora and lexicons for a large number of European languages. Each corpus, made up of 20 million words, was built as a reference corpus for Human Language Technology applications: to provide full information about a large variety of text types in the language considered, to represent the use of contemporary language, and to become the first nucleus of an electronic text library. The texts have been stored in a common format following the standards recommended in the CES (Corpus Encoding Standard), according to flexibility and multifunctionality criteria. They belong to a wide range of media and genres, selected in proportions intended to reflect their prominence within society, and are classified according to medium, genre, topic and time of production.

For more information, see also:

Goggi, Sara, Lisa Biagini, Remo Bindi, and Sergio Rossi. 1997. ‘Italian Corpus Documentation - LE-PAROLE WP2.11’, October. https://zenodo.org/records/8167985.

Marinelli, Rita, Lisa Biagini, Remo Bindi, Sara Goggi, Monica Monachini, Paola Orsolini, Eugenio Picchi, Sergio Rossi, Nicoletta Calzolari, and A. Zampolli. 1996. ‘The Italian “Parole” Corpus: An Overview’. Linguistica Computazionale Computational Linguistics in Pisa-Special Issue I (XVI/XVII, 1996/1997): 401–21. https://doi.org/10.1400/18167. https://www.ilc.cnr.it/wp-content/uploads/2022/05/Z224.pdf

The corpus is annotated at textual level, with some Named Entity annotation. A portion of this corpus was annotated with morpho-syntactic information and is available here: Sara Goggi, Remo Bindi, Lisa Biagini and Sergio Rossi, 1997, Corpus Parole (3 milions words), ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa, http://hdl.handle.net/20.500.11752/ILC-1001

ILC4CLARIN: Linguistic Data and NLP Tool

A Geographical Visualization of GL Community: a Snapshot

Author: Monachini Monica (ILC-CNR)
GreyNet Grey Literature Network Service
Bartolini Roberto (ILC-CNR)
Russo Irene (ILC-CNR)
Goggi Sara (ILC-CNR)
Pardelli Gabriella (ILC-CNR)
Publication date: 2017

Includes: Conference preprint, PowerPoint presentation, abstract and biographical notes.

OpenGrey Repository

Grey Literature Between Tradition and Innovation: Is There a Continuum?

Author: Sassi Manuela (ILC-CNR)
GreyNet Grey Literature Network Service
Goggi Sara (ILC-CNR)
Pardelli Gabriella (ILC-CNR)
Publication date: 2011

This study explores new ways of using social media to communicate Grey Literature. In particular, it describes the role of social media in relation to traditional channels and how social media applications can be used for Grey Literature. Includes: Conference preprint, PowerPoint presentation, abstract and biographical notes.

OpenGrey Repository

Open Grey for Natural Language Processing: a ride on the network

Author: Sassi Manuela (ILC-CNR)
GreyNet Grey Literature Network Service
Goggi Sara (ILC-CNR)
Pardelli Gabriella (ILC-CNR)
Publication date: 2013

The aim of this paper is to introduce the Open Access movement for Natural Language Processing (NLP) by means of a wide range of open access Grey Literature documentation available on the web. In 2008 Robert Dale, in the last issue of volume 35 of Computational Linguistics, said: "There are a number of definitions of the term 'open access' in circulation, but almost all share the key principle that scientific literature should be freely available for all to read, download, copy, distribute, and use (with appropriate attribution) without restriction". At first glance it might seem that the Open Access movement has gradually become more influential in the field of language technology by building repositories accessible through the network. Today's digital archives are niches of intellectual production spread by means of a wide range of documents (such as journal articles and proceedings) which, paradoxically, search engines do not always reach. The use of inappropriate terms in the formulation of queries and the fragmentation of repositories in this area of investigation prevent information from being retrieved on a large scale. The full paper, after a first introductory section, will be organized in two sections: 1) the first dedicated to the methodology for searching and tracing open access resources and to the criteria for analyzing and selecting the online documentation; 2) the second devoted to a description of the state of the art of Open Access Grey Literature material in a statistical and thematic scenario. As things stand, it would be essential to standardize the computational systems, interconnected by links and tools of various kinds, that allow Internet users to easily retrieve the information the web naturally makes available. Includes: Conference preprint, PowerPoint presentation, abstract and biographical notes.

OpenGrey Repository

A terminological “journey” in the Grey Literature domain

Author: Silvia Giannini (ISTI-CNR)
Biagioni Stefania (ISTI-CNR)
GreyNet Grey Literature Network Service
Bartolini Roberto (ILC-CNR)
Goggi Sara (ILC-CNR)
Pardelli Gabriella (ILC-CNR)
Publication date: 2017

Includes: Conference preprint, PowerPoint presentation, abstract and biographical notes.

OpenGrey Repository

A semantic engine for grey literature retrieval in the oceanography domain

Author: Monachini Monica (ILC-CNR)
Bustaffa Franco (DP2000)
GreyNet Grey Literature Network Service
Bartolini Roberto (ILC-CNR)
Goggi Sara (ILC-CNR)
Manzella Giuseppe (ETTsolutions)
De Mattei Maurizio (DP2000)
Pardelli Gabriella (ILC-CNR)
Frontini Francesca (ILC-CNR)
Publication date: 2016

Here we present the final results of the MAPS (Marine Planning and Service Platform) project, an environment designed for gathering, classifying, managing and accessing marine scientific literature and data, making them searchable by Operative Oceanography researchers from various institutions by means of standard protocols. The system takes as input non-textual data (measurements) and text, both published papers and documentation, and provides an advanced search facility thanks to its rich set of metadata and, above all, to refined, domain-targeted keyword indexing of texts using Natural Language Processing (NLP) techniques. The paper describes the system in detail and also provides evidence from its evaluation. Includes: Conference preprint, PowerPoint presentation, abstract and biographical notes.

OpenGrey Repository

Marine Planning and Service Platform (MAPS): An Advanced Research Engine for Grey Literature in Marine Science

Author: Monachini Monica (ILC-CNR)
Bustaffa Franco (DP2000)
GreyNet Grey Literature Network Service
Bartolini Roberto (ILC-CNR)
Goggi Sara (ILC-CNR)
Manzella Giuseppe (ETTsolutions)
De Mattei Maurizio (DP2000)
Pardelli Gabriella (ILC-CNR)
Frontini Francesca (ILC-CNR)
Publication date: 2015

The MAPS (Marine Planning and Service Platform) project is a development of the Marine project (Ricerca Industriale e Sviluppo Sperimentale Regione Liguria 2007-2013) that aims to build a computer platform supporting a Marine Information and Knowledge System, as part of the data management activities. One of the main objectives of the project is to develop a repository that gathers, classifies and structures marine scientific literature and data, thus guaranteeing their accessibility to researchers and institutions by means of standard protocols. We will present the scenario of Operative Oceanography together with the technologies used to develop an advanced search engine that aims to provide rapid and efficient access to a Digital Library of oceanographic data. The case study also highlights how the retrieval of grey literature from this specific marine community could be reproduced for similar communities, thus revealing the great impact that the processing, re-use and application of grey data have on societal needs and problems and on the answers to them. Includes: Conference preprint, PowerPoint presentation, abstract and biographical notes.

OpenGrey Repository

Short Report of "The European Language Resources and Technologies Forum: Shaping the Future of the Multilingual Digital Europe" (Vienna, 12-13 February 2009)

Author: Calzolari Nicoletta
Calzolari N.
Baroni Paola
Monachini Monica
Bel N.
Monachini M.
van Bel N.
Goggi Sara
Toral Antonio
Toral A.
Choukri Khalid
Baroni A.
Budin Gerhard
Piperidis S
Goggi S
Odijk J.E.J.M.
Overkoepelend onderzoeksprogramma UiL-OTS
Quochi Valeria
Soria Claudia
Quochi V.
Odijk Jan
Mariani J.
Choukri K.
Mariani Joseph
LS OZ Taal en spraaktechnologie
Bel Núria
Soria C.
Budin G.
Piperidis Stelios
Publication date: 01/01/2009

The Forum combined the FLaReNet themes with the i2010 objectives to address some of the technological, market and policy challenges to be faced in a multilingual digital Europe. The Forum was an occasion to identify the grounds for future directions and strategies in the area of Language Resources and Language Technologies.

PUblication MAnagement

Utrecht University Repository

Author Instructions

Author: Instructions Author
Publication date: 04/11/2013

Crossref

Cartographic Perspectives (E-Journal - North American Cartographic Information Society, NACIS)

Corpus Parole (3 milions words)

Author: Sara Goggi, Remo Bindi, Lisa Biagini and Sergio Rossi
Publication date: 26/10/1997

ILC4CLARIN: Linguistic Data and NLP Tool