Big Data
Big data processing, analysis, and visualization
262 source code projects in total
crossdata
DISCONTINUED - Easy access to big things. Library for Apache Spark extending and improving its capabilities
streamline
StreamLine - Streaming Analytics
tephra
Apache Tephra: Transactions for HBase.
infovore
RDF-Centric Map/Reduce Framework and Freebase data conversion tool
BTDB
Key Value Database in .Net with Object DB Layer, RPC, dynamic IL and much more
pycytominer
Python package for processing image-based profiling data
PivotalR-archive
A convenient R tool for manipulating tables in PostgreSQL-type databases and a wrapper for Apache MADlib.
mpich2-yarn
Running MPICH2 on Yarn
velox-modelserver
No description available
yurita
Anomaly detection framework @ PayPal
HiveSwarm
Helpful user-defined functions / table-generating functions for Hive
straw
A platform for real-time streaming search
samza-luwak
Integration of Samza and Luwak
kindmetrics
Kind metrics analytics for your website
rbhive
Ruby gem for querying Apache Hive
schedoscope
Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or whatever you choose to call your Hadoop data warehouse these days.
hailstorm
Haskell distributed stream processing with exactly-once semantics
app
Just a little analytics insight for your personal or indie project
streamingbandit
Python application to set up and run streaming (contextual) bandit experiments.
incubator-retired-slider
Mirror of Apache Slider
akela
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.
awesome-help-wanted
⚡️🗒 Awesome list of free and open source tools to use & help
EBImage
🎨 Image processing toolbox for R
Beetest
A super simple utility for testing Apache Hive scripts locally for non-Java developers.
