* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
 
E-mail this message to a friend
Title: Research and Implementation of A Domain-Unconstrained Chinese Automatic Abstracting System
Author: Junjie Li
Email: click here to access email
Homepage: http://world.kaist.ac.kr/~jklee/
Degree Awarded: Harbin Institute of Technology , Computer Science and Engineering
Degree Date: 1995
Linguistic Subfield(s): Computational Linguistics
Subject Language(s): Chinese, Mandarin
Director(s): Wang Kaizhu

Abstract:

In this dissertation, the author

(1) presented a new text representation, called Hierarchical Network, which is based on the natural hierarchical structure of text, i.e. Chapter, paragraph, sentence, sub-sentence, word and character to automatically index the text and corpus.

(2) designed and implemented a Non-dictionary Automatic Chinese Segmentation System which provides a new efficient method to segment new words and reach a 98% correctness rate of segmentation for open test.

(3) developed a new characteristic word weighting function which is based on word frequency and word length, where corpus-based co-occurrence computation is used in calculating word frequency.

(4) developed a new sentence importance weighting function which synthetically reflects the factors of sentence length, the number of sub-sentences, the sum of characteristic words and so on, instead of utilizing static information such as sentence location, connective expression words and phrases etc..

(5) designed and implemented a Unconstrained Chinese Automatic Abstracting System, which can generate abstracts of the texts in deferent field, style and length by any abstracting rate.
Add a dissertation
Update dissertation
Page Updated: 29-Nov-2009

Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.