Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Cambridge University Press!

ad

From Utterances to Speech Acts

By Mikhail Kissine

"Kissine offers a new theory of speech acts which is philosophically sophisticated and builds on work in cognitive science, formal semantics, and linguistic typology. This highly readable, brilliant essay is a major contribution to the field."

--François Recanati, Institut Jean-Nicod



Query Details


Query Subject:   Genre-Specific Corpora
Author:   Marina Santini
Submitter Email:  click here to access email

Linguistic LingField(s):  Computational Linguistics
Text/Corpus Linguistics

Query:   Hi,

I am doing some research in concept extraction from different types of
texts or genres.

I am looking for free research corpora (in English and in any other
language) belonging to the following genres:

1) FAQs (I have already downloaded some small collections, but I
would like to have a more comprehensive range of topics).
2) Chat logs transcripts (I have already downloaded the NPS
Collection, 3 Codiac datasets and several smallish Many Eyes
datasets)
3) Telephone conversation transcripts (missing)
4) Emails (I have already downloaded the Enron dataset and a couple
of junk mail collections)
5) Twitter posts corpora (missing, apparently the Edinburgh's Twitter
corpus is not available any more)
6) Corporate weblog corpora (missing)

I will be glad to share all the links and related documentation, once I got
all the genres in the list.

Thanks in advance for your suggestions.

Best Regards,

Marina Santini
Researcher at Artificial Solutions
LL Issue: 22.1852
Date posted: 26-Apr-2011



Back

Sums main page