LINGUIST List 18.1716
|
Tue Jun 05 2007
Diss: Computational Ling/Text&Corpus Ling: Pado: 'Cross-Lingual Ann...'
Editor for this issue: Hunter Lockwood
<hunter linguistlist.org>
|
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
|
Directory
1. Sebastian
Pado,
Cross-Lingual Annotation Projection Models for Role-Semantic Information
Message 1: Cross-Lingual Annotation Projection Models for Role-Semantic Information
|
Date: 05-Jun-2007
From: Sebastian Pado <pado coli.uni-saarland.de>
Subject: Cross-Lingual Annotation Projection Models for Role-Semantic Information
Institution: Saarland University
Program: Department of Computational Linguistics and Phonetics
Dissertation Status: Completed
Degree Date: 2007
Author: Sebastian Pado
Dissertation Title: Cross-Lingual Annotation Projection Models for Role-Semantic Information
Dissertation URL: http://www.coli.uni-saarland.de/~pado/pub/papers/phd.shtml
Linguistic Field(s):
Computational Linguistics
Text/Corpus Linguistics
Dissertation Director:
Mirella Lapata
Manfred Pinkal
Dissertation Abstract:
Due to the high cost of manual annotation, resources with role-semantic annotation exist only for a small number of languages, notably English. This thesis addresses the resulting resource scarcity problem by developing methods to induce role-semantic annotation for new languages automatically. We address the induction task with annotation projection, a general procedure to exchange linguistic information between aligned sentences in a parallel corpus. Annotation projection is a knowledge-lean approach, and thus applicable even to resource-poor languages. We evaluate our approach by using FrameNet, a large English resource for frame semantics, to induce frame-semantic annotation for two target languages, German and French. We project semantic classes and roles in two separate steps, since the two tasks have different profiles. The projection of semantic classes can be realised using simply by using correspondences between predicates, which are usually single words. Translational shifts, i.e., translations which change the semantic class (frame) of the original predicate, can be filtered out with knowledge-lean filtering mechanisms that rely on distributional properties. In contrast, the projection of semantic roles relies mainly on clean correspondences between sentential constituents (i.e.,role-bearing phrases). We show that such correspondences can be obtained by formalising the task as a graph matching problem that integrates knowledge about syntactic bracketings. The resulting correspondences show a high precision even for noisy input data from automatic shallow semantic parsing. In sum, the results of this thesis indicate that the semantic generalisations made by frame semantics carry over to a considerable degree from English to other languages not only on the type, but also on the token level. The projection methods we have developed can be applied to robustly and automatically create frame-semantic resources for new languages.
Respond to list|Read more issues|LINGUIST home page|Top of issue
|
|

Please report any bad links or misclassified data
LINGUIST Homepage | Read
LINGUIST | Contact us

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.
|
|