LINGUIST List 23.3784
|
Mon Sep 10 2012
Confs: Text/Corpus Linguistics/France
Editor for this issue: Xiyan Wang
<xiyan linguistlist.org>
|
Date: 10-Sep-2012
From: Marine Damiani <marinedamiani gmail.com>
Subject: COLDOC: Processing Linguistic Corpus. Tools and Methods
E-mail this message to a friend
COLDOC: Processing Linguistic Corpus. Tools and Methods
Date: 04-Oct-2012 - 05-Oct-2012
Location: Paris, France
Contact: ColDoc 2012
Contact Email: < click here to access email >
Meeting URL: http://www.modyco.fr/index.php?view=article&id=1774
Linguistic Field(s): Text/Corpus Linguistics
Meeting Description:
COLDOC is a conference organized every year by postgraduate students and young researchers of the MoDyCo laboratory (UMR 7114 - CNRS/Université Paris Ouest Nanterre/Université Paris Descartes). This year our aim is to explore tools and methods which have emerged around corpus-based studies. Over the last decades, linguistics has undergone a considerable evolution in its object of study: it tends to focus less on language itself (as an a priori unlimited and introspective object) and more on corpus (as an attested sample of language). Today, the central position of corpus in linguistic research has an important effect on the majority of linguistic studies made by both linguistic experts and postgraduate students. This rise of corpus-related issues fuels a latent informal debate: the new perspective is often presented either in an exaggeratedly negative light (as a simplistic ‘fashion’ that inhibits theoretical studies), or in a too positive one (as a revolution that makes linguistics more ‘scientific’ and ‘real’). We would like to go over this reductive conflict and invite all willing postgraduate students and young researchers to examine the range of methods and tools that has emerged with this ‘new age’ of corpus studies. It is our hope to highlight the connection between observation and analysis, attempting to follow the idea of a complementarity of the empirical and theoretical ways, as was already emphasized in his time by Francis Bacon: Those who have handled sciences have been either men of experiment or men of dogmas. The men of experiment are like the ant, they only collect and use; the reasoners resemble spiders, who make cobwebs out of their own substance. But the bee takes a middle course: it gathers its material from the flowers of the garden and of the field, but transforms and digests it by a power of its own. Novum Organum (1620), Livre I, 95 The heart of our discussion will be this metaphorical ‘art of the bee’ in working with corpus. From the point of collecting the utterances or texts to the final theoretical interpretation and its applications, ‘processing’ the corpus work does indeed resemble a phase of ‘digestion’ of empirical data. More precisely, this evolution seems to have an intrinsic link with the development of tools in informatics and computer sciences (text navigation, online corpora, transcription tools, analyzer tools), which have dramatically changed our access to sources and affected the procedures of linguistic study. We assume that these technological evolutions have had an influence not only on our field of linguistics but also in an interdisciplinary way in other social sciences. It seems that in these fields, a similar trend of ‘experimental’ and ‘data processing’ approaches has soared over the last period. The development of internet and computers has introduced a whole range of possibilities in corpus exploration. Part of the linguistic community is working on corpora as such, providing an always more detailed analysis, whereas others investigate the development of instruments through NLP. In both cases, central problem is how to pool the findings. The situation is rather complex because of the great variety of approaches which depend on topics and orientations chosen, and on tendencies to accompany them (constitution of ‘big’ corpora, annotation workshop).
Jeudi 4 Octobre 8.30 - 9.00 Accueil des participant-e-s 9.00 - 9.15 Présentation du colloque par Jean-Luc Minel, directeur du labo 9.15 - 10.15 Conférence plénière: Anne Condamines Méthodes de linguistique outillée pour l'analyse de corpus spécialisés 10.15 - 10.45 Mériem Zlitni Outils, méthodes et problèmes de traitement d'un corpus multilingue: le cas du journal Simpaticuni (Tunis, 1911-1933) 10.45 - 11.15 Mona Zegai Les usages des techniques lexicométriques en sociologie au sein d'un dispositif méthodologique: outil exploratoire ou méthode explicative ? 11.15 - 11.45 Pause café 11.45 - 12.15 Naïma Ben Bourenane Analyse de la communication chez un sujet infirme moteur cérébral (IMC): l'apport des techniques augmentatives et alternatives (exemple du synthé 4) 12.15 - 12.45 Caroline Atallah Exploitation d'un corpus annoté pour l'analyse des relations causales 12.45 - 14.30 Déjeuner 14.30 - 15.00 Isabel Colón de Carvajal Choix méthodologiques pour analyse de conversation en situation de jeu vidéo 15.00 - 15.30 Ciara Wigham et Aurélie Bayle Enjeux, outils et méthodologie de constitution de corpus d'apprentissage 15.30 - 16.15 Pause posters et café 16.15 - 16.45 Cécile Viollain Perspectives sur la rhoticité et le 'r' de sandhi dans le corpus PAC Nouvelle-Zélande 16.45 - 17.15 Samira Moukrim La transcription des langues à tradition orale: un palier d'interaction entre écriture et formalisation 17.15 - 17.45 Discussion(S) de fin de journée Vendredi 5 Octobre 8.30 - 9.00 Re-accueil des participant-e-s 9.00 - 10.00 Conférence Plénière: Bernard Combettes Les grands corpus informatisés et la recherche en linguistique historique: apports et limites 10.00 - 10.30 Tony Onguene Ce que le corpus nous enseigne sur le lexique verbal des collégiens des classes de sixième à la troisième des lycées de Yaoundé 10.30 - 11.00 Tove Larsson La corpus-based study of anticipatory it patterns in university student writing: the role of nativeness and student levels 11.00 - 11.45 Pause posters et café 11.45 - 12.15 Ali Belghanem Sémantique du discours scientifique de Pierre Bourdieu: construction et classification d'un corpus de travail 12.15 - 12.45 Marine Espinat Corpus et phraséologie: un « catalogue » de « prêt-à-parler » ? 12.45 - 14.30 Déjeuner 14.30 - 15.00 Sondes Hamdi The spatialization of time in French and English: a corpus-based analysis 15.00 - 15.30 Maryna Lytvynova et Huy-Linh Dao Relatives narratives et descriptives: entre corpus et théorie 15.30 - 16.00 Pause café 16.00 - 16.30 Kamel Nebhi Annotation automatique de documents pour le web sémantique 16.30 - 17.30 Table ronde et conclusion(S) 17.30 Pot de clôture
Read more issues|LINGUIST home page|Top of issue
|
|
Page Updated: 10-Sep-2012
|
|
About LINGUIST
|
Contact Us
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.
|
|