- ACM SIGKDD: Home Page
SIGKDD aims to provide the premier forum for advancement and adoption of the "science" of knowledge discovery and data mining.
in Science > Reference with data discovery knowledge mining
- CommonCrawl
Common Crawl Foundation is a California 501(c)3 non-profit founded by Gil Elbaz with the goal of democratizing access to web information by producing and maintaining an open repository of web crawl data that is universally accessible.
in Web > reference with crawl data open opensource web
- data.gov.uk - Unlocking innovation
Advised by Sir Tim Berners-Lee and Professor Nigel Shadbolt and others, government are opening up data for reuse. This site seeks to give a way into the wealth of government data and is under constant development.
in Web > reference with data semantic by 4 users
- data.gov.uk - Unlocking innovation
This site seeks to give a way into the wealth of government data. As highlighted by the Power of Information Taskforce, this means it needs to be: * easy to find; * easy to licence; and * easy to re-use. We are drawing on the expertise and wisdom of Sir Tim Berners-Lee and Professor Nigel Shadbolt to publish government data as RDF – enabling data to be linked together.
in Web > reference with data rdf semantic web by 2 users
- I N F I N I S P A N
Open Source Data Grids - JBoss Community
in Software with data grid java jboss opensource
- Many Eyes
View your data, ask questions, and share your discoveries. Harness the collective intelligence of the net for insight and analysis.
in Web with collective data intelligence visualization
- microformats
Designed for humans first and machines second, microformats are a set of simple, open data formats built upon existing and widely adopted standards.
in Web > reference with data format open standards by 27 users
- Needlebase
platform for acquiring, integrating, cleansing, analyzing and publishing data on the web.
in Web > tools with analysis data engine mining search by 2 users
- openQRM
openQRM is the next generation, open-source Data-center management platform. Its fully pluggable architecture focuses on automatic, rapid- and appliance-based deployment, monitoring, high-availability, cloud computing and especially on supporting and conforming multiple virtualization technologies. openQRM is a single-management console for the complete IT-infra structure and provides a well defined API which can be used to integrate third-party tools as additional plugins.
in Software > tools with center cloud computing console data management monitoring opensource platform virtualization
- Parchive - Wikipedia
Parchive (a contraction of parity archive volume set) is an open source software project that emerged in 2001 to develop a parity file format, as conceived by Tobias Rieper and Stefan Wehlus. These parity files use a forward error correction-style system that can be used to perform data verification, and allow recovery when data is lost or corrupted.
2008 archivers articles ascii august base64 comparison copy download edit extended file format from needing parchive wikipedia
in Software with archive correction data error recovery
- Recorded Future
Receive custom email updates when we uncover predictions about the future of topics you're interested in. We call these email alerts, Futures.
in Web with analysis data future mining prediction timeline
- sig.ma
Semantic Information MAshup http://Sig.mais a tool to explore and leverage the Web of Data. At any time, information in Sigma is likely to come from multiple, unrelated Web sites – potentially any web site that embeds information in RDF, RDFa or Microformats (standards for the Web of Data).
in Web with data engine information mashup search semantic web by 2 users
- StreamBase Streaming Platform
StreamBase Systems, the leader in high performance Complex Event Processing (CEP), provides software for rapidly building systems that analyze and act on real-time streaming data for instantaneous decision-making. Complex Event Processing (CEP) is a technology for low-latency filtering, correlating, aggregating, and computing on real- world event data.
in Software with complex data decision event making mining processing real-time streaming by 3 users
- Weka 3
Data Mining with Open Source Machine Learning Software in Java
in Software with data java learning mining open software source by 2 users
data from all users