Search results
Kaggle is a data science competition platform and online community for data scientists and machine learning practitioners under Google LLC. Kaggle enables users to find and publish datasets, explore and build models in a web-based data science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
Phishing Websites Dataset | Dataset of phishing websites. Many features of each site are given. | 2456 | Text | Classification | 2015 | [449] | R. Mustafa et al.
Online Retail Dataset | Online transactions for a UK online retailer. Details of each transaction given. | 541,909 | Text | Classification, clustering | 2015 | [450] | D. Chen
Freebase Simple Topic Dump
Includes Handwritten Numeral Dataset (10 classes) and Basic Character Dataset (50 classes); each dataset has three types of noise: white Gaussian, motion blur, and reduced contrast. All images are centered and of size 32x32. | Numeral Dataset: 23330, Character Dataset: 76000 | Images, text | Handwriting recognition, classification | 2017 | [152] [153]
Provides an RDF data set about scientific publications and related entities, such as authors, institutions, journals, and fields of study. The data set is based on the Microsoft Academic Graph. [105] [106] | Free | University of Freiburg
MyScienceWork | Science | Database includes more than 70 million scientific publications and 12 million patents. | Free
Google Dataset Search is a search engine from Google that helps researchers locate online data that is freely available for use. [1] The company launched the service on September 5, 2018, and stated that the product was targeted at scientists and data journalists.
Data.gov is a U.S. Government website launched in late May 2009 by the Federal Chief Information Officer (CIO) of the United States, Vivek Kundra. Data.gov aims to improve public access to high-value, machine-readable datasets generated by the Executive Branch of the Federal Government. [1]
re3data.org is a global registry of research data repositories from all academic disciplines. It provides an overview of existing research data repositories to help researchers identify a suitable repository for their data and thus comply with requirements set out in data policies.
The USGS Gap Analysis Program maintains four primary data sets: land cover, protected areas, species, and aquatic. The GAP Land Cover Data Set is the most complete map ever produced of vegetative associations for the US. It is classified into 551 ecological systems and 32 modified ecological systems (where human impacts have had an effect).