When.com Web Search

Search results

  1. Results From The WOW.Com Content Network
  2. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    Text corpora (singular: text corpus) are large and structured sets of texts, which have been systematically collected. Text corpora are used by corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testing, finding patterns of language use, investigating language change and variation, and teaching ...

  3. Text corpus - Wikipedia

    en.wikipedia.org/wiki/Text_corpus

    Text corpora are also used in the study of historical documents, for example in attempts to decipher ancient scripts, or in Biblical scholarship. Some archaeological corpora can be of such short duration that they provide a snapshot in time. One of the shortest corpora in time may be the 15–30 year Amarna letters texts .

  4. TenTen Corpus Family - Wikipedia

    en.wikipedia.org/wiki/TenTen_Corpus_Family

    The TenTen Corpus Family (also called TenTen corpora) is a set of comparable web text corpora, i.e. collections of texts that have been crawled from the World Wide Web and processed to match the same standards. These corpora are made available through the Sketch Engine corpus manager. There are TenTen corpora for more than 35 languages.

  5. International Corpus of English - Wikipedia

    en.wikipedia.org/wiki/International_Corpus_of...

    Each corpus contains one million words in 500 texts of 2000 words, [7] following the sampling methodology used for the Brown Corpus. Unlike Brown or the Lancaster-Oslo-Bergen (LOB) Corpus (or indeed mega-corpora such as the British National Corpus), however, the majority of texts are derived from spoken data.

  6. Category:Corpora - Wikipedia

    en.wikipedia.org/wiki/Category:Corpora

    Pages in category "Corpora" The following 51 pages are in this category, out of 51 total. This list may not reflect recent changes. A. ... Corpus of Electronic Texts;

  7. Mark Davies (linguist) - Wikipedia

    en.wikipedia.org/wiki/Mark_Davies_(linguist)

    Mark E. Davies (born 1963) is an American linguist. He specializes in corpus linguistics and language variation and change.He is the creator of most of the text corpora from English-Corpora.org (including the Corpus of Contemporary American English/ COCA) as well as the Corpus del español and the Corpus do português.

  8. Ancient text corpora - Wikipedia

    en.wikipedia.org/wiki/Ancient_text_corpora

    Ancient text corpora are the entire collection of texts from the period of ancient history, defined in this article as the period from the beginning of writing up to 300 AD. These corpora are important for the study of literature , history , linguistics , and other fields, and are a fundamental component of the world's cultural heritage .

  9. List of YouTubers - Wikipedia

    en.wikipedia.org/wiki/List_of_YouTubers

    Comedy sketches. The 40th most subscribed YouTube channel. As of late 2020, he had taken a break from YouTube. Caitlin Hill: Australia S Facts Rapper Jaclyn Hill: United States Jaclynhill1 Known for her makeup tutorial videos Lewis Hilsenteger: Canada unboxtherapy Unboxing and technology YouTube channel produced by Lewis George Hilsenteger and ...