When.com Web Search

Search results

  1. Results From The WOW.Com Content Network
  2. List of datasets in computer vision and image processing

    en.wikipedia.org/wiki/List_of_datasets_in...

    Large dataset of images for object classification. Images categorized and hand-sorted. 30,607 Images, Text Classification, object detection 2007 [29] [30] G. Griffin et al. COYO-700M Imagetext-pair dataset 10 billion pairs of alt-text and image sources in HTML documents in CommonCrawl 746,972,269 Images, Text Classification, Image-Language ...

  3. Contrastive Language-Image Pre-training - Wikipedia

    en.wikipedia.org/wiki/Contrastive_Language-Image...

    The dataset contains 500,000 text-queries, with up to 20,000 (image, text) pairs per query. The text-queries were generated by starting with all words occurring at least 100 times in English Wikipedia , then extended by bigrams with high mutual information , names of all Wikipedia articles above a certain search volume, and WordNet synsets .

  4. Text-to-image model - Wikipedia

    en.wikipedia.org/wiki/Text-to-image_model

    Training a text-to-image model requires a dataset of images paired with text captions. One dataset commonly used for this purpose is the COCO dataset. Released by Microsoft in 2014, COCO consists of around 123,000 images depicting a diversity of objects with five captions per image, generated by human annotators.

  5. List of datasets for machine-learning research - Wikipedia

    en.wikipedia.org/wiki/List_of_datasets_for...

    Images, text Classification, clustering 2015 [313] [314] T. Munisami et al. Oxford Flower Dataset 17 category dataset of flowers. Train/test splits, labeled images, 1360 Images, text Classification 2006 [315] [316] M-E Nilsback et al. Plant Seedlings Dataset 12 category dataset of plant seedlings. Labelled images, segmented images, 5544 Images

  6. Stable Diffusion - Wikipedia

    en.wikipedia.org/wiki/Stable_Diffusion

    Stable Diffusion was trained on pairs of images and captions taken from LAION-5B, a publicly available dataset derived from Common Crawl data scraped from the web, where 5 billion image-text pairs were classified based on language and filtered into separate datasets by resolution, a predicted likelihood of containing a watermark, and predicted ...

  7. LAION - Wikipedia

    en.wikipedia.org/wiki/LAION

    LAION (acronym for Large-scale Artificial Intelligence Open Network) is a German non-profit which makes open-sourced artificial intelligence models and datasets. [1] It is best known for releasing a number of large datasets of images and captions scraped from the web which have been used to train a number of high-profile text-to-image models, including Stable Diffusion and Imagen.

  8. MNIST database - Wikipedia

    en.wikipedia.org/wiki/MNIST_database

    Extended MNIST (EMNIST) is a newer dataset developed and released by NIST to be the (final) successor to MNIST. [15] [16] MNIST included images only of handwritten digits. EMNIST includes all the images from NIST Special Database 19 (SD 19), which is a large database of 814,255 handwritten uppercase and lower case letters and digits.

  9. Flux (text-to-image model) - Wikipedia

    en.wikipedia.org/wiki/Flux_(text-to-image_model)

    Flux (also known as FLUX.1) is a text-to-image model developed by Black Forest Labs, based in Freiburg im Breisgau, Germany. Black Forest Labs were founded by former employees of Stability AI. As with other text-to-image models, Flux generates images from natural language descriptions, called prompts.