Encyclopedia > Corpus linguistics

  Article Content

Corpus linguistics

Corpus Linguistics is the study of language as expressed in samples (corpora) or "real world" text. The approach runs counter to Noam Chomsky's view that real language is riddled with performance-related errors, thus requiring careful analysis of small speech samples obtained in a highly controlled laboratory setting. Corpus Linguistics does away with Chomsky's competence/performance split, viewing that we can only ever reliably analyse language if the researcher does not interfere.

In some areas there is an overlap with computational linguistics, as the latter moves towards language processing applications. This means dealing with real input data, where descriptions based on a linguist's intuition are not usually helpful.

The COBUILD dictionaries, designed for users learning English as a foreign language, are based on corpus linguistics; definitions are based on how words are used rather than on historical definitions of their meaning.

Some keywords:

Some links:

  • The Centre for Corpus Linguistics at Birmingham University:
http://www.corpus.bham.ac.uk/



All Wikipedia text is available under the terms of the GNU Free Documentation License

 
  Search Encyclopedia

Search over one million articles, find something about almost anything!
 
 
  
  Featured Article
North Haven, New York

... 2.77. In the village the population is spread out with 17.4% under the age of 18, 3.5% from 18 to 24, 22.3% from 25 to 44, 28.7% from 45 to 64, and 28.1% who are 65 years ...

 
 
 
This page was created in 26 ms