Conference Information

Full Title: Workshop: (Learner) Corpora and their Application in Language Testing and Assessment

Location: Santiago de Compostela, Spain
Date: 22-May-2013
Contact: Marcus Callies & Sandra Götz
Meeting Description: (Learner) Corpora and their Application in Language Testing and Assessment
Pre-conference workshop to be held at ICAME 34
‘English Corpus Linguistics on the Move: Applications and Implications’
Santiago de Compostela, Spain
Wednesday, 22 May 2013

Corpora and corpus linguistic tools and methods are frequently used in the study of second language (L2) learning, most notably in Learner Corpus Research (LCR). LCR has contributed significantly to the description of interlanguages and many of its findings have resulted in useful applications for foreign language teaching and learning. Learner- and native-speaker corpora have also received increasing attention in the area of language testing and assessment (LTA; Barker 2010; Taylor & Barker 2008). Practical applications of corpora in LTA can range from corpus-informed to corpus-based and corpus-driven approaches, depending on how corpus data are actually put into practice, the aims and outcomes for LTA, and the degree of involvement of the researcher in the process of data retrieval, analysis and interpretation (Barker 2010; Callies, Zaytseva & Diez-Bedmar to appear).

More recently, researchers have also turned to corpora to inform, validate, and develop the way proficiency is operationalized in the Common European Framework of Reference for Languages (CEFR; Council of Europe 2001, 2009). While the CEFR has been highly influential in language testing and assessment, its definition of proficiency levels by means of ‘can-do statements’ has been criticized because these statements are often too impressionistic. For example, a learner at the C2 level is expected to maintain ‘consistent grammatical control of complex language’, whereas at C1 he/she should ‘consistently maintain a high degree of grammatical accuracy’ (Council of Europe 2001, 2009). Such global, vague and underspecified descriptions are of limited practical value for distinguishing between proficiency levels, and they fail to give in-depth linguistic details regarding individual languages or learners’ skills in specific registers. These shortcomings have led to an increasing awareness among researchers of the need to identify more specific linguistic descriptors or ‘criterial features’ which can be quantified using learner data. The aim of such corpus-based approaches is to add ‘grammatical and lexical details of English to CEFR’s functional characterisation of the different levels’ (Hawkins & Filipovic 2012: 5).

While (learner) corpora have the potential to increase transparency, consistency and comparability in the assessment of L2 proficiency, they also raise several problems and challenges. One major difficulty is that ‘proficiency level’ has often been a fuzzy variable in learner corpus compilation and analysis (Carlsen 2012): due to practical constraints, proficiency has mostly been operationalized and assessed globally by means of external criteria, typically learner-centred methods such as learners’ institutional status. However, recent studies show that global proficiency measures based on external criteria alone are not reliable indicators of proficiency for corpus compilation (Mukherjee 2009; Callies to appear 2013), and ‘hidden’ differences in proficiency (e.g. Pendar & Chapelle 2008) often go undetected or tend to be disregarded in learner corpus analysis (e.g. Götz 2013). Thus, the field still seems to need a corpus-based description of language proficiency that accounts for inter-learner variability and supports greater homogeneity in learner corpus compilation and L2 assessment. Another intensively debated issue is the appropriate basis of comparison for learner corpus data, i.e. the yardstick against which learner performance should be compared and evaluated.
Linguistic Subfield: Applied Linguistics; Text/Corpus Linguistics
Subject Language: None
LL Issue: 23.5100

This is a session of the following meeting:
English Corpus Linguistics on the Move: Applications and Implications
