Academic Paper |
|
|
|
|
| Title: | Resolution of Unidentified words in Machine Translation |
| Author: | Sana Ullah |
| Email: | click here to access email |
| Homepage: | www.sana-ullah.com |
| Institution: | Inha University |
| Author: | Kyung Sup Kwak |
| Author: | Md. Asdaque Hussain |
| Email: | click here to access email |
| Linguistic Field: | Computational Linguistics |
| Subject Language: |
Pashto, Northern
|
| Abstract: | This paper presents a mechanism of resolving unidentified lexical units in text-based machine translation (TBMT). In machine translation system it is unlikely to have a complete MT lexicon and hence there is need of a mechanism to handle the problem of unidentified words. These unknown words could be abbreviations, names, acronyms and newly introduced terms. We have proposed an algorithm for the resolution of the unidentified words. This algorithm takes discourse unit (primitive discourse) as a unit of analysis and provides real time updates to the lexicon. We have manually applied the algorithm to news paper fragments. Along with anaphora and cataphora resolution, many unknown words especially names and abbreviations were updated to the lexicon. Moreover, flowchart of the proposed algorithm is also presented. |
| Type: | Individual Paper |
| Status: | Completed |
| Venue: | Seoul, Korea |
| Publication Info: | NEXT 07, pp. 603-606 |
|
|
|
|
Back
Add a new paper Return to Academic Papers main page Return to Directory of Linguists main page |
|


