CEDEL2: Corpus Escrito del Español L2 (version 3)

CEDEL2 (v2)

v3.0
Oct. 2025

ISSN: 2695-9550

What is CEDEL2?

CEDEL2 is a large linguistic corpus (database) that contains the written and spoken language produced by learners of Spanish as a second/foreign language (L2). It also contains data from native speakers of different languages.

This is the new version of CEDEL2 (version 3), Oct. 2025. It already incorporates the data from earlier versions (CEDEL2 v.1 and v.2). CEDEL2 (v3.0) currently amounts to a total of 6,560 participants and 1,504,430 words, which makes it one of the largest corpora of its kind.

CEDEL2 holds data from 15 subcorpora of learners of Spanish with different L1 backgrounds (where ‘L1’ means the learners’ mother tongue and ‘L2’ their foreign language). For comparative purposes, CEDEL2 also contains 13 subcorpora from native speakers of different L1s.

Learner subcorpora Native control subcorpora
  • L1 English - L2 Spanish
  • L1 Japanese - L2 Spanish
  • L1 Greek - L2 Spanish
  • L1 Italian - L2 Spanish
  • L1 Russian - L2 Spanish
  • L1 Portuguese - L2 Spanish
  • L1 French - L2 Spanish
  • L1 Arabic - L2 Spanish
  • L1 German - L2 Spanish
  • L1 Dutch - L2 Spanish
  • L1 Estonian - L2 Spanish
  • L1 Polish - L2 Spanish
  • L1 Chinese - L2 Spanish
  • L1 Turkish - L2 Spanish
  • L1 Vietnamese - L2 Spanish
  • Spanish
  • English
  • Greek
  • German
  • Japanese
  • Portuguese
  • Turkish
  • Russian
  • Arabic
  • Italian
  • Chinese
  • French
  • Vietnamese
Note: subcorpora are listed decreasingly according to number of words.

Can I use/download CEDEL2?

Can I participate in CEDEL2?

You can participate online in CEDEL2 at: learnercorpora.com

Can I collaborate in CEDEL2?

If you are a teacher/researcher of Spanish, you can collaborate in the data collection. There are many ways of doing this and your learners can benefit from it. Please get in touch with the CEDEL2 director (Cristóbal Lozano, Universidad de Granada) by clicking on the tab ‘Contact’.

Open Data Science

CEDEL2 follows the Open Data Science philosophy. CEDEL2 is publicly available, fully searchable and freely downloadable. It is licensed under a Creative Commons license (CC BY-NC-ND 3.0 ES) . You can use CEDEL2 data for your research/teaching purposes provided you cite the corpus appropriately.

Further info

Funding

CEDEL2 has been publicly funded over the past 20 years by several research project grants from the Spanish Government, which we gratefully acknowledge: PID2020-113818GB-I00 (Ministerio de Ciencia e Innovación); FFI2016-75106-P (Ministerio de Economía y Competitividad); FFI2012-30755 (Ministerio de Economía y Competitividad); FFI2008-01584/FILO (Ministerio de Ciencia e Innovación); HUM2005-01728/FILO (Ministerio de Educación y Ciencia).

This web site uses own and third party cookies to allow it to work fine and to allow us to know how it is being used. If you click on ACCEPT these both types of cookies will be enabled. If you want more information, you can read the COOKIES POLICY document of our web site. Cookie settings

Technical cookies So that our website can work. Activated by default.

Technical cookies are strictly necessary for our website to work and for you to navigate through it. These types of cookies are those that, for example, allow us to identify you, give you access to certain restricted parts of the website if necessary, or remember different options or services already selected by you, such as your privacy preferences. Therefore, they are activated by default and your authorization is not necessary.

Through the configuration of your browser, you can block or alert the presence of this type of cookies, although such blocking will affect the proper functioning of the different functionalities of our website.

Analysis cookies To allow us to know how our web is being used. You can enable or disable them.

Analysis cookies allow us to study the navigation of the users of our website in general (for example, which sections of the site are the most visited, which services are used most and if they work correctly, etc.). From this statistical information about navigation on our website, we can improve both the operation of the site itself and the different services it offers. Therefore, these cookies do not have an advertising purpose, but only serve to make our website work better, adapting to our users in general. By activating them you will contribute to this continuous improvement.

You can activate or deactivate these cookies by changing the corresponding sliders.