Title | Quelques problèmes observés dans l'élaboration de dictionnaires à partir de corpus | |
---|---|---|
Author | Alexander Geyken | |
Journal | Langages | |
Issue | No. 171, September 2008: Construction des faits en linguistique : la place des corpus | |
Pages | 77-94 | |
English abstract | This work investigates the quantitative and qualitative criteria that govern the construction of electronic corpora in the context of compiling or updating dictionaries. In particular, the concepts of balanced and opportunistic corpora are addressed. It is shown that there are interesting linguistic phenomena that are not present in the largest balanced corpora currently available. Opportunistic corpora are many times bigger due to the availability of large quantities of electronic newspaper text. However, different studies conducted, e.g., on gender distribution or on archaisms show that the results vary considerably depending on the size and the sampling of the corpora. Hence, frequency is no longer a reliable criterion, which poses a problem for opportunistic corpora with regard to their objectivity. Source: Publisher (via Cairn.info) | |
Online article | http://www.cairn.info/article.php?ID_ARTICLE=LANG_171_0077 |