Academic Paper |
| Title: | Rewriting the orthography of SMS messages |
| Author: | Francois Yvon |
| Institution: | Université Paris Sud 11 |
| Linguistic Field: | Computational Linguistics; Writing Systems |
| Subject Language: | French |
| Abstract: | Electronic written texts used in computer-mediated interactions (emails, blogs, chats, and the like) contain significant deviations from the norm of the language. This paper presents the details of a system that aims to normalize the orthography of French SMS messages: after discussing the linguistic peculiarities of these messages and possible approaches to their automatic normalization, we present, compare, and evaluate various instantiations of a normalization device based on weighted finite-state transducers. These experiments show that, with an intermediate phonemic representation and training, our system outperforms an alternative normalization system based on phrase-based statistical machine translation techniques. |
This article appears in Natural Language Engineering Vol. 16, Issue 2, which you can read on Cambridge's site or on LINGUIST. |


