Editor for this issue: Marie Klopfenstein <marie@linguistlist.org>

Workshop on Characterisation of Internet Content

Date: 31-Jan-2004 - 31-Jan-2004
Location: Paris, France
Contact: Thomas Beauvisage
Contact Email: indices.internet@ml.free.fr
Meeting URL: http://www.atala.org/je
Linguistic Sub-field: Computational Linguistics, Semantics, Text/Corpus Linguistics

Meeting Description:

This workshop highlights the problems that NLP tools currently face in describing and using material available on the Internet (Web pages and sites, mail, fora, instant messaging, etc.). These problems concern in particular:

- methods for content collection, accessibility of content, formalisms for information storage, etc.;
- the semantics of Internet content: textual content vs. services offered, multimedia and interactive content, semiotics of Web pages;
- categorisation methods: topic detection, categorisation of sites and pages, Internet-specific genres.

Overall, these problems reveal new links between NLP and the Internet. At the same time, they require computational linguistics to examine more closely the benefits and limitations of NLP tools for the description and use of Internet content.

Characterisation of Internet content: beyond keywords. Semantic approach.

Workshop of the Atala, organized by: François Rastier (CNRS - UMR 7114, Paris X - MoDyCo), Natalia Grabar (CRIM/INaLCO, STIM / DSI / AP-HP, Paris 6) and Thomas Beauvisage (France Télécom R&D - DIH/UCE, Paris X - MoDyCo)

Date: Saturday 31 January 2004
Location: ENST, 49, rue Vergniaud, 75013 Paris, Amphithéâtre Emeraude
Métro: Corvisart
Free entry

Program:

9h15 Presentation of the Workshop
9h30 Thomas Beauvisage (France Télécom R&D): Utiliser les annuaires du Web pour décrire les parcours sur la Toile (Using Web directories to describe users' paths on the Web)
10h00 Kamel Smaïli and Armelle Brun (LORIA): Routage automatique de courriers électroniques (Automatic routing of emails)
10h30 Break
11h00 Antoine Marzin, Lionel Martin, Christel Vrain and Guillaume Cleuziou (LIFO, U. Orléans): Classification de pages Web en Genre (Genre-based classification of Web pages)
11h30 Martine Hurault-Plantet (LIMSI-CNRS): Sélection de traits et détection de thèmes pour l'analyse d'un corpus de pages personnelles Web (Feature selection and topic detection for the analysis of a corpus of personal Web pages)
12h00 Lunch
14h00 Aurélie Névéol, Lina Soualmia, Alexandrina Rogozan, Magaly Douyère, Benoît Thirion, Stéfan Darmoni (CISMeF, Rouen / PSI-CNRS / U. Rouen): Caractérisation des contenus de l'Internet en santé : l'exemple CISMeF (Characterisation of health-related Internet content: the CISMeF example)
14h30 Mathieu Valette (CRIM, Inalco): Projet Princip : application de règles sémantiques à la détection de documents racistes sur Internet (The Princip project: application of semantic rules to the detection of racist documents on the Internet)
15h00 Break
15h30 Monika Nicinski (CRIM, Inalco): Typologie et description sémantique des images utilisées dans les sites Internet racistes (Typology and semantic description of images used in racist Web sites)
16h00 François Rastier (CNRS - UMR 7114, Paris X - MoDyCo): La sémiotique du document numérique et son incidence sur les traitements sémantiques (The semiotics of the digital document and its impact on semantic processing)
16h30 Round table
17h00 End of the Workshop

Important: On Saturdays, access to ENST is via rue Vergniaud (on the other side of the block from rue Barrault). Remember to bring the day's program with you; you will be asked to show it at the security desk.