The LINGUIST List is dedicated to providing information on language and language analysis, and to providing the discipline of linguistics with the infrastructure necessary to function in the digital world. LINGUIST is a free resource, run by linguistics students and faculty, and supported primarily by your donations. Please support LINGUIST List during the 2016 Fund Drive.

Academic Paper


Title: Modeling Word Senses with Fuzzy Clustering
Paper URL: http://wo.uio.no/as/WebObjects/theses.woa/wo/2.3.9
Author: Erik Velldal
Institution: University of Oslo
Linguistic Field: Computational Linguistics; Language Acquisition; Semantics; Text/Corpus Linguistics
Abstract: This thesis describes a clustering approach to automatically inferring soft semantic classes and characterizing the senses of a set of Norwegian nouns. The words are represented by their distribution in text, where local contexts take the form of lexical-syntactic relations. In a shallow processing step, context features are extracted for lemmatized word forms in syntactically tagged corpora. The frequency counts of noun-context co-occurrences are weighted with a statistical association measure, and the distributional profile of each word is represented as a feature vector in a semantic space model. A hybrid approach is taken when clustering the word vectors: a bottom-up hierarchical method is used to initialize various types of fuzzy partitional clusterings. To capture the notion of typicality, the clusters are construed as fuzzy sets, and words are assigned graded degrees of membership in the various classes on the basis of their resemblance to a class prototype. The goal is to automatically uncover semantic classes, where the memberships of a given word in these fuzzy clusters can be used to characterize its various senses.
Type: Individual Paper
Status: Completed
URL: http://wo.uio.no/as/WebObjects/theses.woa/wo/2.3.9
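
The fuzzy clustering step described in the abstract can be illustrated with a small sketch. The abstract does not name the specific fuzzy algorithm or distance measure, so the code below uses standard fuzzy c-means with Euclidean distance as a stand-in; it is not the thesis's implementation. The word labels, the two-dimensional vectors, and the hand-picked initial prototypes (which here take the place of the hierarchical initialization) are all hypothetical.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def memberships(vec, prototypes, m=2.0, eps=1e-12):
    """Graded membership of one vector in each cluster (fuzzy c-means rule).

    m > 1 is the fuzzifier: larger values give softer memberships.
    """
    dists = [euclidean(vec, p) for p in prototypes]
    # A vector sitting exactly on a prototype gets full membership there.
    hits = [i for i, d in enumerate(dists) if d <= eps]
    if hits:
        return [1.0 / len(hits) if i in hits else 0.0
                for i in range(len(prototypes))]
    power = 2.0 / (m - 1.0)
    inv = [d ** (-power) for d in dists]
    total = sum(inv)
    return [x / total for x in inv]


def update_prototypes(vectors, U, m=2.0):
    """Recompute each prototype as the membership-weighted mean of the data."""
    dim = len(vectors[0])
    prototypes = []
    for j in range(len(U[0])):
        weights = [row[j] ** m for row in U]
        total = sum(weights)
        prototypes.append([
            sum(w * x[d] for w, x in zip(weights, vectors)) / total
            for d in range(dim)
        ])
    return prototypes


def fuzzy_c_means(vectors, init_prototypes, m=2.0, n_iter=50):
    """Alternate membership and prototype updates for a fixed number of rounds.

    init_prototypes is assumed to come from some earlier step, such as the
    centroids of a hierarchical clustering cut at the desired number of classes.
    """
    prototypes = [list(p) for p in init_prototypes]
    for _ in range(n_iter):
        U = [memberships(x, prototypes, m) for x in vectors]
        prototypes = update_prototypes(vectors, U, m)
    U = [memberships(x, prototypes, m) for x in vectors]
    return U, prototypes


# Hypothetical toy data: each word is a 2-dimensional weighted context vector.
words = ["fruit1", "fruit2", "vehicle1", "vehicle2", "mixed"]
vectors = [[5.0, 1.0], [4.0, 0.0], [1.0, 5.0], [0.0, 4.0], [3.0, 3.0]]
U, prototypes = fuzzy_c_means(vectors, [[4.5, 0.5], [0.5, 4.5]])
for word, row in zip(words, U):
    print(word, [round(u, 3) for u in row])
```

In this toy setting, a word whose vector lies between the two prototypes receives a split membership, which is the kind of graded assignment the abstract proposes for characterizing multiple senses of a word.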

