Brown Corpus information


The Department of Cognitive Linguistic & Psychological Sciences at Brown University

The Brown University Standard Corpus of Present-Day American English, better known simply as the Brown Corpus, is an electronic collection of text samples of American English and the first major structured corpus of varied genres. It set the standard for the scientific study of the frequency and distribution of word categories in everyday language use. Compiled by Henry Kučera and W. Nelson Francis at Brown University in Rhode Island, it is a general language corpus containing 500 samples of English, totaling roughly one million words, drawn from works published in the United States in 1961.

23 related results for: Brown Corpus information

Brown Corpus

The Brown University Standard Corpus of Present-Day American English, better known as simply the Brown Corpus, is an electronic collection of text samples...

Word Count : 1056

Corpus linguistics

genres. The Brown Corpus was the first computerized corpus designed for linguistic research. Kučera and Francis subjected the Brown Corpus to a variety...

Word Count : 2576

Perplexity

about its accuracy. The lowest perplexity that had been published on the Brown Corpus (1 million words of American English of varying topics and genres) as...

Word Count : 1840

Most common words in English

Another English corpus that has been used to study word frequency is the Brown Corpus, which was compiled by researchers at Brown University in the...

Word Count : 858

Stop word

derived from the Brown Corpus: This paper reports an exercise in generating a stop list for general text based on the Brown corpus of 1,014,000 words...

Word Count : 1015

Quranic Arabic Corpus

The Quranic Arabic Corpus (Arabic: المدونة القرآنية العربية, romanized: al-modwana al-Qurʾāni al-ʿArabiyya) is an annotated linguistic resource consisting...

Word Count : 599

British National Corpus

British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. The corpus covers British...

Word Count : 3894

Corpus of Contemporary American English

English. American National Corpus British National Corpus Bank of English Brown Corpus Milana, Prior (2021). A Comparative Corpus Study on Intensifier Usage...

Word Count : 1135

Enron Corpus

The Enron Corpus is a database of over 600,000 emails generated by 158 employees of the Enron Corporation in the years leading up to the company's collapse...

Word Count : 728

Oxford English Corpus

The Oxford English Corpus (OEC) is a text corpus of 21st-century English, used by the makers of the Oxford English Dictionary and by Oxford University...

Word Count : 345

Hapax legomenon

legomena. Thus, in the Brown Corpus of American English, about half of the 50,000 distinct words are hapax legomena within that corpus. Hapax legomenon refers...

Word Count : 3548

A Comprehensive Grammar of the English Language

as three corpora: a corpus from the Survey of English Usage, the Lancaster-Oslo-Bergen Corpus (UK English), and the Brown Corpus (US English). In 1988...

Word Count : 292

Naval Air Station Corpus Christi

Naval Air Station Corpus Christi (IATA: NGP, ICAO: KNGP, FAA LID: NGP) is a United States Navy naval air base located six miles (10 km) southeast of the...

Word Count : 1180

Sketch Engine

corpora includes British National Corpus, Brown Corpus, Cambridge Academic English Corpus and Cambridge Learner Corpus, CHILDES corpora of child language...

Word Count : 1419

International Corpus of English

used for the Brown Corpus. Unlike Brown or the Lancaster-Oslo-Bergen (LOB) Corpus (or indeed mega-corpora such as the British National Corpus), however,...

Word Count : 1229

Plain text

Thus, early text projects such as Roberto Busa's Index Thomisticus, the Brown Corpus, and others had to resort to conventions such as keying an asterisk preceding...

Word Count : 1658

Linguistic categories

common nouns, NP for singular proper nouns (see the POS tags used in the Brown Corpus). Other tagging systems use a smaller number of tags and ignore fine...

Word Count : 2571

TenTen Corpus Family

sequences or a specific part of the corpus. First text corpora were created in the 1960s, such as the 1-million-word Brown Corpus of American English. Over time...

Word Count : 1201

Cambridge English Corpus

The Cambridge International Corpus (CIC) is a collection of over 800 million words of real spoken and written English. The texts are stored in a database...

Word Count : 1016

Switchboard Telephone Speech Corpus

The Switchboard Telephone Speech Corpus is a corpus of spoken English consisting of almost 260 hours of speech. It was created in 1990 by Texas...

Word Count : 453

BulPosCor

697 lexical items. BulPosCor has been compiled from the Structured "Brown" Corpus of Bulgarian by sampling 300+ word-excerpts (expanded to sentence boundary)...

Word Count : 264

Distinctive feature

vs. "NNS" for plural noun, vs. "NNS$" for plural possessive noun (see Brown Corpus). Others provide more explicit separation of features, even formalizing...

Word Count : 1785

PropBank

is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank". Although "PropBank" refers to a specific corpus produced...

Word Count : 377
