When.com Web Search

Search results

  1. Results From The WOW.Com Content Network
  2. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    Text corpora (singular: text corpus) are large and structured sets of texts, which have been systematically collected.Text corpora are used by corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testing, finding patterns of language use, investigating language change and variation, and teaching language proficiency.

  3. Ancient text corpora - Wikipedia

    en.wikipedia.org/wiki/Ancient_text_corpora

    Ancient text corpora are the entire collection of texts from the period of ancient history, defined in this article as the period from the beginning of writing up to 300 AD. These corpora are important for the study of literature , history , linguistics , and other fields, and are a fundamental component of the world's cultural heritage .

  4. Text corpus - Wikipedia

    en.wikipedia.org/wiki/Text_corpus

    There are two main types of parallel corpora which contain texts in two languages. In a translation corpus, the texts in one language are translations of texts in the other language. In a comparable corpus, the texts are of the same kind and cover the same content, but they are not translations of each other. [2]

  5. Open Richly Annotated Cuneiform Corpus - Wikipedia

    en.wikipedia.org/wiki/Open_Richly_Annotated...

    These usually incorporate full transliterations and translations of texts in a given corpus, and many offer supplementary material such as an introduction to the corpus, discussion of its historical context, and interpretive syntheses of its content. A few other projects serve as research tools for Assyriological studies (dictionary, sign list).

  6. TenTen Corpus Family - Wikipedia

    en.wikipedia.org/wiki/TenTen_Corpus_Family

    The TenTen Corpus Family (also called TenTen corpora) is a set of comparable web text corpora, i.e. collections of texts that have been crawled from the World Wide Web and processed to match the same standards. These corpora are made available through the Sketch Engine corpus manager. There are TenTen corpora for more than 35 languages.

  7. Lexical lists - Wikipedia

    en.wikipedia.org/wiki/Lexical_lists

    The most notable text is LU A, a list of professions which would be reproduced for the next thousand years until the end of the Old Babylonian period virtually unchanged. Later third millennium lists dating to around 2600 BC have been uncovered at Fara and Abū Ṣalābīkh , including the Fara God List , the earliest of this genre.

  8. Howard and Moore Complete Checklist of the Birds of the World

    en.wikipedia.org/wiki/Howard_and_Moore_Complete...

    The Howard and Moore Complete Checklist of the Birds of the World is a book by Richard Howard and Alick Moore which presents a list of the bird species of the world. It was the first single-volume world bird list to include subspecies names, and until the publication of the 5th edition of James Clements' Checklist of Birds of the World was the only one to do so.

  9. Brown Corpus - Wikipedia

    en.wikipedia.org/wiki/Brown_Corpus

    The Brown University Standard Corpus of Present-Day American English, better known as simply the Brown Corpus, is an electronic collection of text samples of American English, the first major structured corpus of varied genres. This corpus first set the bar for the scientific study of the frequency and distribution of word categories in ...