CRDO


Visible documents : 141
Downloadings : 308
Members : 159 (27 countries)
Publications : 64
Spoken languages : 148

[Valid RSS]   [Valid Atom 1.0]

CRDO - Resource Center for the Description of Oral

http://crdo.fr

TGE-Adonis

CLARIN OAI
Open archives


-   [Sign up]   /   [Login]   - 
--- --- --- --- --- --- --- --- --- --- --- --- 
/ 中文 /  English / español / français / 

Sharing linguistic data for speech research
(CRDO, Aix-en-Provence)

CRDO (a Resource Center for the Description of Oral) is offering speech research labs and scholars a free-of-charge service for sharing their data and archiving it with the help of procedures compliant with the OAIS model for long-term preservation. Its entire storage is referenced in international repositories such as OLAC (Open Language Archives Community) and Virtual Language World. Items of four kinds are available on this site:

  • Primary data: sound/video corpora and any speech-related signal ;
  • Resources: annotations of corpora, lexicons, reference databases, systems of representation, grammars etc. ;
  • Tools for linguistic research ;
  • Collections of items as defined above.

Signing up on the CRDO website will grant you access to :

  • the downloading of items available on this server (if granted to your group) ;
  • the uploading of your own data ;
  • contributing to the CrdoWiki pages describing material, teams and projects related to data shared or documented on this site.

The latest deposits >> more
Primary data (corpus) Aborigènes de Taiwan : locuteurs amis/chinois en milieu urbain, entretiens type sociolinguistique (Francois DE SULAUZE)
Wenzao Ursuline College of Languages (WTUC)
25 cassettes audio d'enregistrements de locuteurs amis vivant en milieu urbain interrogés dans des entretiens semi directifs entre janvier 2002 et août 2003, plus des enregistrements en public réalisés dans les villes de Hualien et Taipei.
Corpus en cours de numérisation. [More]
(sociolinguistics)
Chinese -> Amis (阿美)
pictopicto2
2010-03-12
[ARK] Primary data (corpus) Mơ Piu (Geneviève CAELEN-HAUMONT)
Multimedia, Information, Communication and Applications (MICA)
Data collected from the Mơ Piu community during an initial field trip in Nậm Tu Thuʼợng betwween 8 and 12 June 2009. Recordings of speech, songs and music are indexed on day, theme, speaker, question, (song or musical piece). Each question/answer pair is assigned a WAV file. Recordings are in stereo format. On the first track the day, theme, speaker and question number are indicated, followed by a question in Vietnamese (asked by a Vietnamese speaker). On the second track the same question is asked in Mơ Piu and followed by its answer.
4 male speakers, 3 female speakers, 7 female singers et 2 male singers have been recorded. In the whole, 7 hours of speech and 1 hour of singing. [More]
(anthropological_linguistics)
Vietnamese -> Mơ Piu

Rapport.pdf
2010-03-02
[ARK] Primary data (corpus) Patois du Valjouffrey (Clément GIRARD)
Personal contribution
A detailed lexical and morphological description of Valjouffrey patois completed with recordings of 4 speakers. This comprises a 215-page memoir and approx. 10 hours of recordings of native speakers.
These data have been collected in 1969-1970 as an academic work under the direction of Prof. Gaston TUAILLON (Univ. Stendhal, Grenoble). [More]
(language_documentation, lexicography, sociolinguistics, anthropological_linguistics, text_and_corpus_linguistics)
Occitan (post 1500) -> Provençal (provençau)

>> Collection Valjouffrey crdo000007
picto


MemoireDeClementGirard-diffusion.pdf
2010-02-12
Primary data (corpus) Valjouffrey - corpus 2010 (Médéric GASQUET-CYRUS)
Laboratoire parole et langage (LPL)
1) The analytical documentation of an almost extinct language: Valfouffrey's dialect/patois; 2) The construction of a multi-speaker corpus of spontaneous speech in response to specific requirements of research on prosody, gestures, language/communication interactions, and comparison of languages; 3) An ethnolinguistic, cultural and historical enquiry on the Valjouffrey valley. This project has been granted support by the TUL-ILF federations (CNRS) and the French delegation of French language and languages spoken in France (DGLFLF, Ministry of Culture). [More]
(language_documentation, lexicography, sociolinguistics, anthropological_linguistics)
Occitan (post 1500) -> Provençal (provençau)

>> Collection Valjouffrey crdo000007
pictopicto2
1-TexteDuProjet.pdf

Steps on the snow and the music of chairs...

“Tous des Roumains” (J. Gaillard)

“Petit Papa Noël” (R. Bois)

“Both of us will speak patois… Well, people will say we're crazy!” (H. Balmet & J. Gaillard)
2009-10-24
Primary data (corpus) Gangubai (Bernard BEL, Hema RAIRKAR)
Personal contribution
An interview with Gangu Ambore, a leper woman in Tadkalas, Parbhani district, Maharashtra (India), recorded on February 5, 1997. Gangubai expresses her intimate thoughts with the support of grindmill and devotional songs borrowed to a popular bhakti tradition.
गंगुबाई अंबोरे या ताडकळस, जिल्हा परभणी, महाराष्ट्र, भारत, इथे रहाणार्या महारोगी बाईंची मुलाखत १९९६-९७ ला ध्वनीमुद्रित केली. त्या आपल्या मनातले अगदी जवळचे विचार जात्या वरील गणी व भक्तिपरंपरेतील अभंग, गौळणींच्या आधारावर अभिव्यक्त करतात. [More]
(sociolinguistics, anthropological_linguistics, linguistics_and_literature, pragmatics)
Marathi (मराठी)

>> Collection Popular cultural productions in Marathi language crdo000749
picto
Le champ du dire et le soi de la parole (Guy Poitevin)
Bhakti, a Faith for Rehabilitation (G. Poitevin, H. Rairkar)

See preview




2009-06-29
[ARK] Primary data (corpus) PSH/DISPE - Parole subaquatique et/ou hyperbare (Alain MARCHAL)
Laboratoire parole et langage (LPL)
Issu d'une collaboration entre le Laboratoire Parole et Langage (LPL) et l'Institut de Plongée Professionnelle de Marseille (INPP) en 1991, le corpus PSH/DISPE répond à la demande d'une base de sons pour le développement de nouveaux procédés de « décodage » de la parole hyperbare, et d'un outil pour l'évaluation des systèmes de communication vocale.
Les fichiers d'annotations sont conformes au format standard SAM Europec du projet CEE-ESPRIT n°2589. (Une conversion au format TextGrid de Praat est à l'étude.)
Dans cette distribution, tous les fichiers texte sont encodés en UTF8 et les fichiers signal en WAVE. [More]
(phonetics, phonology)
English; French





2009-06-12
>> more

The 8 most popular items
Primary data (corpus) CID (Roxane BERTRAND)Downloaded 35 time(s) (see members)
Resource VfrLPL (Stéphane RAUZY)Downloaded 27 time(s) (see members)
Resource Annotations of CID corpus (Roxane BERTRAND)Downloaded 25 time(s) (see members)
Resource Grammar of French language (GP) (Marie-Laure GUéNOT)Downloaded 18 time(s) (see members)
Primary data (corpus) Aix-MARSEC database (Daniel HIRST)Downloaded 14 time(s) (see members)
Primary data (corpus) EUROM1_fr (Daniel HIRST)Downloaded 11 time(s) (see members)
Tool MOMEL (Daniel HIRST)Downloaded 9 time(s) (see members)
Tool MELISM (Geneviève CAELEN-HAUMONT)Downloaded 7 time(s) (see members)

This site is optimized for FireFox or any browser with the 'tabs' option set.