Corpus of Contemporary American English information
A more than 560-million-word corpus of American English
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)
This article is in list format but may read better as prose. You can help by converting this article, if appropriate. Editing help is available.(March 2022)
This article has an unclear citation style. The references used may be made clearer with a different or consistent style of citation and footnoting.(March 2022) (Learn how and when to remove this message)
(Learn how and when to remove this message)
The Corpus of Contemporary American English (COCA) is a one-billion-word corpus[1] of contemporary American English. It was created by Mark Davies, retired professor of corpus linguistics at Brigham Young University (BYU).[2][3]
^Cite error: The named reference :0 was invoked but never defined (see the help page).
^"Mark Davies, Professor of (Corpus) Linguistics, Brigham Young University (BYU)". www.mark-davies.org. Retrieved November 9, 2021.
^
Kauhanen, Henri (March 21, 2011). "The Corpus of Contemporary American English: Background and history". VARIENG. Retrieved October 13, 2011.
and 27 Related for: Corpus of Contemporary American English information
Standard Corpusof Present-Day AmericanEnglish, better known as simply the Brown Corpus, is an electronic collection of text samples ofAmericanEnglish, the...
British National CorpusCorpusofContemporaryAmericanEnglish (COCA) American National Corpus Frequency analysis "The Oxford EnglishCorpus". Sketch Engine...
corpus to corpus – for example splitting the prepositional use of "to" from the use as a particle. Also the CorpusofContemporaryAmericanEnglish (COCA)...
The American National Corpus (ANC) is a text corpusofAmericanEnglish containing 22 million words of written and spoken data produced since 1990. Currently...
The Enron Corpus is a database of over 600,000 emails generated by 158 employees of the Enron Corporation in the years leading up to the company's collapse...
British National Corpus (BNC) is a 100-million-word text corpusof samples of written and spoken English from a wide range of sources. The corpus covers British...
Simple English Wiktionary. The Academic Vocabulary List, based on the Academic Word List, drawing from the CorpusofContemporaryAmericanEnglish (COCA)...
German Reference Corpus (original: Deutsches Referenzkorpus; short: DeReKo) is an electronic archive of text corpora ofcontemporary written German. It...
The Cambridge International Corpus (CIC) is a collection of over 800 million words of real spoken and written English . The texts are stored in a database...
International CorpusofEnglish (ICE) is a set of text corpora representing varieties ofEnglish from around the world. Over twenty countries or groups of countries...
The Spoken EnglishCorpus (SEC) is a speech corpus collection of recordings of spoken British English compiled during 1984–1987. The corpus manual can...
Arabic Corpus (Arabic: المدونة القرآنية العربية, romanized: al-modwana al-Qurʾāni al-ʿArabiyya) is an annotated linguistic resource consisting of 77,430...
analysis of an electronic corpusofcontemporary text, the Collins Corpus, later leading to the development of the Bank ofEnglish, and the production of the...
three corpora: a corpus from the Survey ofEnglish Usage, the Lancaster-Oslo-Bergen Corpus (UK English), and the Brown Corpus (US English). In 1988, Rodney...
the CorpusofContemporaryAmericanEnglish, was first attested to in 1999, and does not appear in any of these three lists. The Teachers Word Book of 30...
Corpus, forming part of the "Brown Family" of corpora, together with LOB, Frown and F-LOB CorpusofContemporaryAmericanEnglish (COCA) 425 million words...
Criterion: A combined total of at least five occurrences on the British National Corpus and the CorpusofContemporaryAmericanEnglish, including both the singular...
The Wellington Corpusof Spoken New Zealand English is a one-million-word corpusof transcribed English compiled from materials collected between 1988...
Digraphia is an uncommon term in current English usage. For instance, the CorpusofContemporaryAmericanEnglish, which includes over 425,000,000 words...
TIMIT is a corpusof phonemically and lexically transcribed speech ofAmericanEnglish speakers of different sexes and dialects. Each transcribed element...
is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank". Although "PropBank" refers to a specific corpus produced...
National Corpus FidaPLUS is the 621 million words (tokens) corpusof the Slovenian language, gathered from selected texts written in Slovenian of different...