CEDEL2: Corpus Escrito del Español L2 (version 2)

CEDEL2 (v2)

v2.0
Sept. 2020

CEDEL2 team

CEDEL2 direction, corpus design and compilation:

Cristóbal Lozano (Universidad de Granada)

CEDEL2 (version 1) linguistic team:

Cristóbal Lozano (Universidad de Granada)

Amaya Mendikoetxea (Universidad Autónoma de Madrid)

CEDEL2 (version 2=this version) linguistic team:

Cristóbal Lozano (Universidad de Granada)

Nobuo Ignacio López-Sako (Universidad de Granada)

Ana Díaz-Negrillo (Universidad de Granada)

CEDEL2 research assistants:

Athanasios Georgopoulos (Universidad de Granada & University of Athens)

Teresa Quesada (Universidad de Granada)

Fernando Martín-Villena (Universidad de Granada)

Elena García Guerrero (Universidad de Granada)

Aphrodite Amanatidou (Universidad de Granada & University of Ioannina)

Bethany Gardner (Rochester University - IES Abroad Granada)

Lourdes Medina (Universidad de Granada)

Mónica Bautista (Universidad de Granada)

CEDEL2 spoken data transcribers:

Teresa Quesada (Universidad de Granada)

Aphrodite Amanatidou (Universidad de Granada & University of Ioannina)

CEDEL2 software development:

CEDEL2 web interface (version 1):Athanasios Georgopoulos (Universidad de Granada)

CEDEL2 web interface (version 2 = this version): Mario Barcala, NLPgo.

CEDEL2 participants:

CEDEL2 is the result of thousands of anonymous volunteers who generously participated online from different countries. A small group of them participated in situ at the Universidad de Granada during the oral recordings. Participants read an informed consent form first and participation was entirely voluntary. Ethics approval was granted by the Human Research Ethics Committee at the Universidad de Granada.We thank all participants for their help!

CEDEL2 data-collection collaborators:

Many Spanish teachers from all over the world (mainly the US, UK and Spain) helped in the data collection process. An international team of main collaborators participated in the coordination of the data collection of several subcorpora and/or translated the online forms into the participant’s mother tongue. We thank them all for their generous help and support!

Table: Main data collectors and collaborators (ranked in order of words and documents collected)

Subcorpus # words # docs Main data collector(s)
L1 English-L2 Spanish 558731 1931 Cristóbal Lozano (Uni Granada)
L1 Greek-L2 Spanish 64105 216 Athanasios Georgopoulos (Uni Granada, Uni Athens) & Aphrodite Amanatidou (Uni Ioannina-Uni Granada)
L1 Japanese-L2 Spanish 23049 243 Nobuo Ignacio López-Sako (Uni Granada)
L1 Portuguese-L2 Spanish 21662 164 Joana Teixeira & Ana Madeira (Uni Nova Lisboa UNL)
L1 German-L2 Spanish 16164 82 Jacopo Torregrossa (Uni Frankfurt)
L1 Russian-L2 Spanish 16117 101 Tatiana Portnova & Benamí Barros (Uni Granada)
L1 Italian-L2 Spanish 14426 83 Pau Montserrat (Uni Florence)
L1 Arabic-L2 Spanish 9118 74 Amal Haddad (Uni Granada)
L1 Dutch-L2 Spanish 9069 60 Kim Collewaert (Vrije Uni Brussel & Uni Granada), An Vande Casteele (Vrije Uni Brussel) & Mª Carmen Parafita (Uni Leiden)
L1 French-L2 Spanish 8213 59 Hugues Lacroix (Uni Montréal) & Ismael Ramos Ruiz (Uni Paris)
L1 Chinese-L2 Spanish 4373 22 Juan José Ciruela (Uni Granada) & Aida García (Uni Autónoma Madrid) & Cristóbal Lozano (Uni Granada)
SUBTOTAL7450273035
Spanish natives 304211 1112 Cristóbal Lozano (Uni Granada)
English natives* 40805 172 Ana Díaz Negrillo & Cristóbal Lozano (Uni Granada) & Verónica Montero (Central College)
Japanese natives 9126 47 Nobuo Ignacio López-Sako (Uni Granada)
Portuguese natives 3348 16 Joana Teixeira & Ana Madeira (Uni Nova Lisboa UNL)
Greek natives 2031 12 Aphrodite Amanatidou (Uni Ioannina-Uni Granada)
Arabic natives 1465 6 Amal Haddad (Uni Granada)
SUBTOTAL3609861365
GRAND TOTAL CEDEL2 v211060134400

*The L1 English native subcorpus is part of COREFL (Corpus of English as a Foreign Language): Lozano, C., Díaz-Negrillo, A., & Callies, M. (2020). Designing and compiling a learner corpus of written and spoken narratives: COREFL. In C. Bongartz & J. Torregrossa (Eds.), What’s in a Narrative? Variation in Story-Telling at the Interface between Language and Literacy (pp. 9-32). Bern: Peter Lang.

This web site uses own and third party cookies to allow it to work fine and to allow us to know how it is being used. If you click on ACCEPT these both types of cookies will be enabled. If you want more information, you can read the COOKIES POLICY document of our web site. Cookie settings

Technical cookies So that our website can work. Activated by default.

Technical cookies are strictly necessary for our website to work and for you to navigate through it. These types of cookies are those that, for example, allow us to identify you, give you access to certain restricted parts of the website if necessary, or remember different options or services already selected by you, such as your privacy preferences. Therefore, they are activated by default and your authorization is not necessary.

Through the configuration of your browser, you can block or alert the presence of this type of cookies, although such blocking will affect the proper functioning of the different functionalities of our website.

Analysis cookies To allow us to know how our web is being used. You can enable or disable them.

Analysis cookies allow us to study the navigation of the users of our website in general (for example, which sections of the site are the most visited, which services are used most and if they work correctly, etc.). From this statistical information about navigation on our website, we can improve both the operation of the site itself and the different services it offers. Therefore, these cookies do not have an advertising purpose, but only serve to make our website work better, adapting to our users in general. By activating them you will contribute to this continuous improvement.

You can activate or deactivate these cookies by changing the corresponding sliders.