Global Information Lookup Global Information

Wellington Corpus of Spoken New Zealand English information


The Wellington Corpus of Spoken New Zealand English is a one-million-word corpus of transcribed English compiled from materials collected between 1988 and 1994, which is made up of excerpts from a range of speakers who have lived in New Zealand since before the age of 10. The corpus was collected under the direction of linguist Janet Holmes and includes broadcast transcripts as well as informal conversations, telephone conversations, lectures, and oral history interviews.[1]

The corpus, which was distributed as part of the 1999 ICAME CD-ROM, has been used for a number of academic studies including those looking at morphology,[2] pronoun use[3] and language contact studies, as of the influence of Māori on NZ English.[4][5]

  1. ^ Janet Holmes, Bernadette Vine and Gary Johnson, and Bernadette Vine (1998). "Wellington Corpus". Retrieved May 28, 2015.{{cite web}}: CS1 maint: multiple names: authors list (link)
  2. ^ Hundt, Marianne (1998). New Zealand English Grammar: Fact or Fiction. John Bengjamins.
  3. ^ Holmes, Janet (1998). "Generic pronouns in the Wellington Corpus of Spoken New Zealand English". Kōtare: New Zealand Notes & Queries.
  4. ^ Macalister, John (2006). "The Maori presence in the New Zealand English lexicon, 1850–2000: Evidence from a corpus-based study". English World-Wide.
  5. ^ Macalister, John (1999). "Trends in New Zealand English: Some Observations on the Presence of Maori Words in the Lexicon". New Zealand English Journal.

and 29 Related for: Wellington Corpus of Spoken New Zealand English information

Request time (Page generated in 1.1076 seconds.)

Wellington Corpus of Spoken New Zealand English

Last Update:

The Wellington Corpus of Spoken New Zealand English is a one-million-word corpus of transcribed English compiled from materials collected between 1988...

Word Count : 234

Oxford English Corpus

Last Update:

The Oxford English Corpus (OEC) is a text corpus of 21st-century English, used by the makers of the Oxford English Dictionary and by Oxford University...

Word Count : 345

Spoken English Corpus

Last Update:

The Spoken English Corpus (SEC) is a speech corpus collection of recordings of spoken British English compiled during 1984–1987. The corpus manual can...

Word Count : 1278

Brown Corpus

Last Update:

Standard Corpus of Present-Day American English, better known as simply the Brown Corpus, is an electronic collection of text samples of American English, the...

Word Count : 1056

Corpus of Contemporary American English

Last Update:

The Corpus of Contemporary American English (COCA) is a one-billion-word corpus of contemporary American English. It was created by Mark Davies, retired...

Word Count : 1135

Enron Corpus

Last Update:

The Enron Corpus is a database of over 600,000 emails generated by 158 employees of the Enron Corporation in the years leading up to the company's collapse...

Word Count : 712

Bank of English

Last Update:

of spoken data using material from radio, TV and informal conversations. The Bank of English totals 650 million running words. Copies of the corpus are...

Word Count : 147

British National Corpus

Last Update:

British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. The corpus covers British...

Word Count : 3894

Cambridge English Corpus

Last Update:

The Cambridge International Corpus (CIC) is a collection of over 800 million words of real spoken and written English . The texts are stored in a database...

Word Count : 1016

International Corpus of English

Last Update:

own national or regional variety of English. Each ICE corpus consists of one million words of spoken and written English produced after 1989. For most participating...

Word Count : 1229

Quranic Arabic Corpus

Last Update:

Arabic Corpus (Arabic: المدونة القرآنية العربية, romanized: al-modwana al-Qurʾāni al-ʿArabiyya) is an annotated linguistic resource consisting of 77,430...

Word Count : 599

Switchboard Telephone Speech Corpus

Last Update:

The Switchboard Telephone Speech Corpus is a corpus of spoken English language consisted of almost 260 hours of speech. It was created in 1990 by Texas...

Word Count : 453

American National Corpus

Last Update:

The American National Corpus (ANC) is a text corpus of American English containing 22 million words of written and spoken data produced since 1990. Currently...

Word Count : 605

TIMIT

Last Update:

TIMIT is a corpus of phonemically and lexically transcribed speech of American English speakers of different sexes and dialects. Each transcribed element...

Word Count : 561

COBUILD

Last Update:

analysis of an electronic corpus of contemporary text, the Collins Corpus, later leading to the development of the Bank of English, and the production of the...

Word Count : 175

Corpus linguistics

Last Update:

a number of similarly structured corpora: the LOB Corpus (1960s British English), Kolhapur (Indian English), Wellington (New Zealand English), Australian...

Word Count : 2338

Scottish Corpus of Texts and Speech

Last Update:

Scottish Corpus of Texts & Speech (SCOTS) is an ongoing project to build a corpus of modern-day (post-1940) written and spoken texts in Scottish English and...

Word Count : 349

Europarl Corpus

Last Update:

The Europarl Corpus is a corpus (set of documents) that consists of the proceedings of the European Parliament from 1996 to 2012. In its first release...

Word Count : 800

Czech National Corpus

Last Update:

National Corpus (CNC) (Czech : Český národní korpus) is a large electronic corpus of written and spoken Czech language, developed by the Institute of the Czech...

Word Count : 476

Arabic Speech Corpus

Last Update:

used as part of a larger corpus for training speech recognition systems. The package contains the following: 1813 .wav files containing spoken utterances...

Word Count : 388

Bergen Corpus of London Teenage Language

Last Update:

The Bergen Corpus of London Teenage Language (COLT) is a data set of samples of spoken English that was compiled in 1993 from tape recorded and transcribed...

Word Count : 361

PropBank

Last Update:

is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank". Although "PropBank" refers to a specific corpus produced...

Word Count : 377

Tatoeba

Last Update:

it as Tatoeba. In September 2007, about 150,000 English-Japanese sentence pairs from the Tanaka Corpus — a public-domain compilation released in 2001 by...

Word Count : 2056

Sketch Engine

Last Update:

National Corpus, Brown Corpus, Cambridge Academic English Corpus and Cambridge Learner Corpus, CHILDES corpora of child language, OpenSubtitles (a set of 60...

Word Count : 1419

Persian Speech Corpus

Last Update:

Speech Corpus is a Modern Persian speech corpus for speech synthesis. The corpus contains phonetic and orthographic transcriptions of about 2.5 hours of Persian...

Word Count : 355

Buckeye Corpus

Last Update:

The Buckeye Corpus of conversational speech is a speech corpus created by a team of linguists and psychologists at Ohio State University led by Prof....

Word Count : 315

TenTen Corpus Family

Last Update:

The TenTen Corpus Family (also called TenTen corpora) is a set of comparable web text corpora, i.e. collections of texts that have been crawled from the...

Word Count : 1201

International Computer Archive of Modern and Medieval English

Last Update:

of Spoken New Zealand English, the Kolhapur Corpus of Indian English, the Bergen Corpus of London Teenage Language (COLT), the Helsinki Corpus of Older...

Word Count : 616

National Corpus of Polish

Last Update:

The National Corpus of Polish (Polish : Narodowy Korpus Języka Polskiego NKJP) is the biggest and the most important corpus of the Polish language. A...

Word Count : 462

PDF Search Engine © AllGlobal.net