When.com Web Search

Search results

  1. Results From The WOW.Com Content Network
  2. List of large language models - Wikipedia

    en.wikipedia.org/wiki/List_of_large_language_models

    A large language model (LLM) is a type of machine learning model designed for natural language processing tasks such as language generation. LLMs are language models with many parameters, and are trained with self-supervised learning on a vast amount of text. This page lists notable large language models.

  3. Llama (language model) - Wikipedia

    en.wikipedia.org/wiki/Llama_(language_model)

    Llama (Large Language Model Meta AI, formerly stylized as LLaMA) is a family of large language models (LLMs) released by Meta AI starting in February 2023. [2] [3] The latest version is Llama 3.3, released in December 2024. [4] Llama models are trained at different parameter sizes, ranging between 1B and 405B. [5]

  4. BLOOM (language model) - Wikipedia

    en.wikipedia.org/wiki/BLOOM_(language_model)

    BigScience Large Open-science Open-access Multilingual Language Model (BLOOM) [1] [2] is a 176-billion-parameter transformer-based autoregressive large language model (LLM). The model, as well as the code base and the data used to train it, are distributed under free licences. [3]

  5. Chinchilla (language model) - Wikipedia

    en.wikipedia.org/wiki/Chinchilla_(language_model)

    It is named "chinchilla" because it is a further development over a previous model family named Gopher. Both model families were trained in order to investigate the scaling laws of large language models. [2] It claimed to outperform GPT-3. It considerably simplifies downstream utilization because it requires much less computer power for ...

  6. PaLM - Wikipedia

    en.wikipedia.org/wiki/PaLM

    PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. [1] Researchers also trained smaller versions of PaLM (with 8 and 62 billion parameters) to test the effects of model scale. [2]

  7. Vicuna LLM - Wikipedia

    en.wikipedia.org/wiki/Vicuna_LLM

    Vicuna LLM is an omnibus Large Language Model used in AI research. [1] Its methodology is to enable the public at large to contrast and compare the accuracy of LLMs "in the wild" (an example of citizen science) and to vote on their output; a question-and-answer chat format is used.

  8. T5 (language model) - Wikipedia

    en.wikipedia.org/wiki/T5_(language_model)

    T5 (Text-to-Text Transfer Transformer) is a series of large language models developed by Google AI introduced in 2019. [ 1 ] [ 2 ] Like the original Transformer model, [ 3 ] T5 models are encoder-decoder Transformers , where the encoder processes the input text, and the decoder generates the output text.

  9. GPT-2 - Wikipedia

    en.wikipedia.org/wiki/GPT-2

    Generative Pre-trained Transformer 2 (GPT-2) is a large language model by OpenAI and the second in their foundational series of GPT models. GPT-2 was pre-trained on a dataset of 8 million web pages. [2] It was partially released in February 2019, followed by full release of the 1.5-billion-parameter model on November 5, 2019. [3] [4] [5]