Search results
Common Crawl is a nonprofit 501(c)(3) organization that crawls the web and freely provides its archives and datasets to the public. [1] [2] Common Crawl's web archive consists of petabytes of data collected since 2008. [3] It completes crawls approximately once a month. [4] Common Crawl was founded by Gil Elbaz. [5]
Kaggle is a data science competition platform and online community for data scientists and machine learning practitioners under Google LLC. Kaggle enables users to find and publish datasets, explore and build models in a web-based data science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
The Java Media Framework (JMF) is a Java library that enables audio, video and other time-based media to be added to Java applications and applets. Java Topology Suite (JTS) is an open-source Java software library that provides an object model for Euclidean planar linear geometry together with a set of fundamental geometric ...
Matrix Toolkit Java is a linear algebra library based on BLAS and LAPACK. ojAlgo is an open source Java library for mathematics, linear algebra and optimisation. exp4j is a small Java library for evaluation of mathematical expressions. SuanShu is an open-source Java math library. It supports numerical analysis, statistics and optimization.
The Comprehensive Knowledge Archive Network (CKAN) is an open-source open data portal for the storage and distribution of open data. Initially inspired by the package management capabilities of Debian Linux, [2] CKAN has developed into a powerful data catalogue system that is mainly used by public institutions seeking to share their data with the general public.
C++, C#, Java, JavaScript, Node.js, Python, HTTP. Proprietary, GPU-accelerated, in-memory, distributed database for analytics. Functions like an RDBMS (structured data) for fast analytics on datasets ranging from hundreds of GBs to tens of TBs. Interact through SQL and a REST API. Supports geospatial objects and functions.
Colt was last updated in 2004 (when Java 1.4 was the current release) and its code base has been incorporated into the Parallel Colt code base, which has received more recent development. Colt provides an infrastructure for scalable scientific and technical computing in Java. It is particularly useful in the domain of High Energy Physics at CERN.
Apache Jena is an open source Semantic Web framework for Java. It provides an API to extract data from and write to RDF graphs. The graphs are represented as an abstract "model". A model can be sourced with data from files, databases, URLs or a combination of these. A model can also be queried through SPARQL 1.1.
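The idea of a graph held as an abstract "model" and queried by pattern can be sketched in a few lines of plain Java. This is only an illustration of the triple model behind RDF, not Apache Jena's API: the class TripleGraphSketch, its add and match methods, and the ex:/foaf: sample values are all hypothetical names chosen for the example, and the wildcard match is a much simpler stand-in for a real SPARQL query.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: plain Java, NOT Apache Jena's API.
// Models an RDF-style graph as a list of (subject, predicate, object) triples.
class TripleGraphSketch {

    // One statement in the graph.
    static final class Triple {
        final String subject;
        final String predicate;
        final String object;

        Triple(String subject, String predicate, String object) {
            this.subject = subject;
            this.predicate = predicate;
            this.object = object;
        }

        @Override
        public String toString() {
            return subject + " " + predicate + " " + object + " .";
        }
    }

    private final List<Triple> triples = new ArrayList<>();

    // Write a statement into the graph.
    void add(String subject, String predicate, String object) {
        triples.add(new Triple(subject, predicate, object));
    }

    // Extract statements matching a pattern; a null argument is a wildcard,
    // loosely analogous to a variable in a single SPARQL triple pattern.
    List<Triple> match(String subject, String predicate, String object) {
        List<Triple> result = new ArrayList<>();
        for (Triple t : triples) {
            boolean subjectOk = subject == null || subject.equals(t.subject);
            boolean predicateOk = predicate == null || predicate.equals(t.predicate);
            boolean objectOk = object == null || object.equals(t.object);
            if (subjectOk && predicateOk && objectOk) {
                result.add(t);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        TripleGraphSketch graph = new TripleGraphSketch();
        graph.add("ex:alice", "foaf:knows", "ex:bob");
        graph.add("ex:alice", "foaf:name", "Alice");
        graph.add("ex:bob", "foaf:name", "Bob");

        // Roughly: SELECT ?s ?o WHERE { ?s foaf:name ?o }
        for (Triple t : graph.match(null, "foaf:name", null)) {
            System.out.println(t);
        }
    }
}
```

A real RDF framework adds much more on top of this (URIs and typed literals, parsers for several serialisation formats, persistent stores, and a full query engine), so the sketch is meant only to make the "graph of statements" idea concrete.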