Search results
Kaggle is a data science competition platform and online community for data scientists and machine learning practitioners under Google LLC. Kaggle enables users to find and publish datasets, explore and build models in a web-based data science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
Phishing Websites Dataset: Dataset of phishing websites. Many features of each site are given. 2,456 instances; text; classification; 2015; [449]; R. Mustafa et al. Online Retail Dataset: Online transactions for a UK online retailer. Details of each transaction given. 541,909 instances; text; classification, clustering; 2015; [450]; D. Chen. Freebase Simple Topic Dump
This is a 21-class land use image dataset meant for research purposes. There are 100 images for each class. 2,100 instances; image chips of 256x256, 30 cm (1 foot) GSD; land cover classification; 2010; [171]; Yi Yang and Shawn Newsam. SAT-4 Airborne Dataset: Images were extracted from the National Agriculture Imagery Program (NAIP) dataset.
Data Commons is an open-source platform [1] created by Google [2] that provides an open knowledge graph, combining economic, scientific and other public datasets into a unified view. [3] Ramanathan V. Guha, a creator of web standards including RDF, [4] RSS, and Schema.org, [5] founded the project, [6] which is now led by Prem Ramaswami. [7]
Apache Superset is an open-source software application for data exploration and data visualization able to handle data at petabyte scale. The application started as a hackathon project by Maxime Beauchemin (creator of Apache Airflow) while working at Airbnb and entered the Apache Incubator program in 2017. [1]
scikit-learn (formerly scikits.learn and also known as sklearn) is a free and open-source machine learning library for the Python programming language. [3] It features various classification, regression and clustering algorithms including support-vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific ...
SciPy (pronounced /ˈsaɪpaɪ/ "sigh pie" [2]) is a free and open-source Python library used for scientific computing and technical computing. [3] SciPy contains modules for optimization, linear algebra, integration, interpolation, special functions, FFT, signal and image processing, ODE solvers and other tasks common in science and engineering.
The name pandas is derived from the term "panel data", an econometrics term for data sets that include observations over multiple time periods for the same individuals, [3] as well as a play on the phrase "Python data analysis". [4]: 5 Wes McKinney started building what would become pandas at AQR Capital while he was a researcher there from 2007 to ...