LINGUIST List 18.248
|
Wed Jan 24 2007
Calls: Computational Linguistics,Text/Corpus Linguistics/Belgium
Editor for this issue: Ania Kubisz
<ania linguistlist.org>
|
As a matter of policy, LINGUIST discourages the use of abbreviations
or acronyms in conference announcements unless they are explained in
the text. To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
|
Directory
1. Isabelle
Lecroart,
Web as Corpus 2007
Message 1: Web as Corpus 2007
|
Date: 23-Jan-2007
From: Isabelle Lecroart <lecroart tedm.ucl.ac.be>
Subject: Web as Corpus 2007
Full Title: Web as Corpus 2007 Short Title: WAC3 Date: 15-Sep-2007 - 16-Sep-2007 Location: Louvain-La-Neuve, Belgium Contact Person: Bernadette Dehottay Meeting Email: dehottay tedm.ucl.ac.be Web Site: http://cental.fltr.ucl.ac.be/wac3 Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics Call Deadline: 01-May-2006 Meeting Description: 3rd Web as Corpus Workshop (WAC3) Incorporating Cleaneval An ACL-SIGWAC Event We invite submissions which - describe Web corpus collection projects, or modules for one part of the process (crawling, filtering, language-id, tokenising, lemmatising, POS-tagging or indexing) - explore characteristics of Web data, from a linguistics/NLP perspective including registers, domains, frequency distribution - use crawled Web data for NLP purposes (with emphasis on the data rather than the use) Call for papers 3rd Web as Corpus Workshop (WAC3) Incorporating Cleaneval An ACL-SIGWAC Event Sept. 15-16, 2007 University of Louvain, Louvain-la-Neuve, Belgium More and more people are using Web data for linguistic and NLP research. The workshop provides a venue for exploring how we can use it effectively and what we will find if we do. Cleaneval Anyone using Web data needs to clean it, to get rid of unwanted material including, for example, HTML markup, navigation bars, advertisements. To date there has been no sharing of resources or expertise and the cleaning has often been done minimally. Cleaneval is an exercise to promote sharing and to improve our understanding of the issues. It will take the now-familiar form of an open competition and shared task. More info at Cleaneval Previous WAC workshops. More info on WAC1 at Corpus Linguistics conference, Birmingham, UK, July 2005. More info on WAC2 at EACL, Trento, Italy, April 2006. Invited speaker: Kevin Scannell Kevin Scannell, of Saint Louis Univ., Missouri, USA, has been working with scholars of a range of smaller languages to develop Web corpora for those languages: website currently lists 135 corpora/languages. Submission For regular papers Papers (6-10 pages), demos (max. 2 pages) and posters (max. 2 pages) to be written in English and follow ACL formatting. Template files (.doc & Latex) available on the website. For Cleaneval submissions see Cleaneval website. Deadline: 1 May 2007 Venue Université catholique de Louvain, in the elegant new city of Louvain-la-Neuve (Belgium). Large computer rooms will be available for demo sessions. Points of contact Workshop Co-chairs Cédrick Fairon, UCLouvain, Cental, fairon tedm.ucl.ac.be Prof. Gilles-Maurice de Schryver, Universiteit Gent Cleaneval committee Marco Baroni, U Trento; Secretary, SIGWAC Tony Hartley, U Leeds Adam Kilgarriff, Lexical Computing Ltd; Chair, SIGWAC Serge Sharoff, U Leeds Local organisation team Bernadette Dehottay, UCLouvain, Cental, dehottay tedm.ucl.ac.be Julia Medori, CENTAL, UCLouvain Laurent Kevers, CENTAL, UCLouvain Hubert Naets, CENTAL, UCLouvain Isabelle Lecroart, CENTAL, UCLouvain Claude Devis, CENTAL, UCLouvain Contact us Bernadette Dehottay Université catholique de Louvain Centre for Natural Language Processing (CENTAL) Place Blaise Pascal, 1 1348 Louvain-la-Neuve Phone. +32 10 47 37 88 Fax. +32 10 47 26 06 dehottay tedm.ucl.ac.be
Respond to list|Read more issues|LINGUIST home page|Top of issue
|
|

Please report any bad links or misclassified data
LINGUIST Homepage | Read
LINGUIST | Contact us

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.
|
|