When.com Web Search

Search results

  1. Results From The WOW.Com Content Network
  2. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    Text corpora (singular: text corpus) are large and structured sets of texts, which have been systematically collected.Text corpora are used by corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testing, finding patterns of language use, investigating language change and variation, and teaching language proficiency.

  3. CLAN program - Wikipedia

    en.wikipedia.org/wiki/CLAN_program

    The CLAN (Computerized Language ANalysis) program is a cross-platform program designed by Brian MacWhinney and written by Leonid Spektor for the purpose of creating and analyzing transcripts in the Child Language Exchange System database. CLAN is open source software and can be freely downloaded.

  4. Text corpus - Wikipedia

    en.wikipedia.org/wiki/Text_corpus

    Language technology, natural language processing, computational linguistics. The analysis and processing of various types of corpora are also the subject of much work in computational linguistics, speech recognition and machine translation, where they are often used to create hidden Markov models for part

  5. Corpus linguistics - Wikipedia

    en.wikipedia.org/wiki/Corpus_linguistics

    Corpus linguistics is an empirical method for the study of language by way of a text corpus (plural corpora). [1] Corpora are balanced, often stratified collections of authentic, "real world", text of speech or writing that aim to represent a given linguistic variety. [1]

  6. Discourse analysis - Wikipedia

    en.wikipedia.org/wiki/Discourse_analysis

    Discourse analysis (DA), or discourse studies, is an approach to the analysis of written, spoken, or sign language, including any significant semiotic event. [ citation needed ] The objects of discourse analysis ( discourse , writing, conversation, communicative event ) are variously defined in terms of coherent sequences of sentences ...

  7. Linguistic performance - Wikipedia

    en.wikipedia.org/wiki/Linguistic_performance

    This is a commonly applied measurement of syntax for first and second language learners, with samples gathered from both elicited and spontaneous oral discourse. Methods for eliciting speech for these samples come in many forms, such having the participant answering questions or re-telling a story.

  8. Lexical analysis - Wikipedia

    en.wikipedia.org/wiki/Lexical_analysis

    Analysis generally occurs in one pass. Lexers and parsers are most often used for compilers, but can be used for other computer language tools, such as prettyprinters or linters . Lexing can be divided into two stages: the scanning , which segments the input string into syntactic units called lexemes and categorizes these into token classes ...

  9. Latent semantic analysis - Wikipedia

    en.wikipedia.org/wiki/Latent_semantic_analysis

    Latent semantic analysis (LSA) is a technique in natural language processing, in particular distributional semantics, of analyzing relationships between a set of documents and the terms they contain by producing a set of concepts related to the documents and terms.