Speech & Language Data Repository (SLDR)

OpenProDat - Italian

Author: Bigi Brigitte
Hirst Daniel
Publication venue: http://lpl-aix.fr
Publication date: 06/03/2013
Field of study

A corpus of read texts in Italian

RATP-DECODA

Author: BECHET FREDERIC
Publication venue: http://www.lif.univ-mrs.fr
Publication date: 20/11/2013
Field of study

This corpus contains about 2,000 dialogues collected at the RATP call centre in Paris as part of the ANR DECODA project (CONTINT 2009). The dialogues are anonymized, manually transcribed (Transcriber), and syntactically annotated (POS, disfluencies, named entities, dependency parsing). The resource includes the audio files and the various annotations produced.

Ngbugu digital wordlist: Archival form

Author: OLSON Kenneth
Publication venue: http://www.sil.org
Publication date: 16/07/2013
Field of study

A recording of a 204-item wordlist of Ngbugu elicited in French. The original recording was on analogue cassette tape. The responses are transcribed in IPA (converted from the original Ngbugu orthographic transcription) and aligned to the recording. The wordlist instrument is based on Moñino's (1988) list. Ngbugu is an Ubangian language spoken by some 95,000 people in the Central African Republic.

OpenProDat - French

Author: Bigi Brigitte
Hirst Daniel
Publication venue: http://lpl-aix.fr
Publication date: 06/03/2013
Field of study

A corpus of read texts in French

apero nextgen

Author: MATTEI Marc
Publication venue: http://gsite.univ-provence.fr/document.php?pagendx=5712&project=up
Publication date: 14/01/2013
Field of study

Reconstruction of the apero corpus using Python to produce an XML file so that the data can be exploited more effectively.

Corpus oral, lecture, parole préparée, parole spontanée

Author: AMRAOUI Soad
Publication venue: http://gsite.univ-provence.fr/document.php?pagendx=5712&project=up
Publication date: 14/01/2013
Field of study

This audio corpus was recorded with a Zoom recorder in a nursery-school class of 32 children. The recording has two parts: in the first, the teacher reads a story aloud; in the second, she tells the story without written support. The transcriptions were made in Praat as TextGrid files.

Cyberbase Gradignan

Author: Praxiling - UMR 5267 (Montpellier FR)
Telem - EA 4195 (Bordeaux FR)
Publication venue: http://www.u-bordeaux3.fr
Publication date: 27/03/2013
Field of study

The 'Cyberbase Gradignan' corpus was collected from July 2010 to June 2012 as part of the Cyber-base®Justice experiment, implemented at the Maison d'Arrêt de Gradignan and aimed at access to information, computer training, and teaching. It consists of audiovisual recordings covering, on the one hand, activities in the computer room of the Maison d'Arrêt and, on the other, interviews with the various stakeholders. The whole corpus was first segmented and indexed (the indexing, both contextual and thematic, is reflected in the file names). Part of the sequences was then transcribed and annotated.

OpenProDat - Thai

Author: Bigi Brigitte
Hirst Daniel
Publication venue: http://lpl-aix.fr
Publication date: 19/03/2013
Field of study

A corpus of read texts in Thai

Labial vibrants in Mangbetu: Archival form

Author: OLSON Kenneth
Publication venue: http://www.sil.org
Publication date: 10/05/2013
Field of study

A recording of lexical items in the Meegye variety of Mangbetu, elicited in French. The lexical items exemplify occurrences of bilabial trills and the labiodental flap in the language. The original recording was made in March 2004 on analog cassette tape. The responses are transcribed in orthography and IPA and aligned to the recording. The recording was digitized in March 2005. Mangbetu is a Central Sudanic language spoken by about 620,000 people in the Democratic Republic of the Congo

Impact de l'amorçage rythmique sur la production de la parole chez l'enfant sourd prélingual

Author: HIDALGO Céline
Publication venue: http://gsite.univ-provence.fr/document.php?project=up&locale=fr&pagendx=142&noempty=1&engine_open=337
Publication date: 30/12/2013
Field of study

Corpus samples recorded during a study (submitted for publication) conducted jointly by the Institut des Neurosciences des Systèmes and the CAMSP Déficiences Auditives de la Timone. Fourteen prelingually deaf children aged 5 to 13 repeated sentences first without (baseline) and then with repetition of a rhythmic prime (experiment); the prime was either congruent (match) or incongruent (mismatch) with the metrical structure of the target sentence.

24 full texts, 268 metadata records. Updated in last 30 days.
