多码网
Big Data

Big data processing, analysis, and visualization

262 source code projects

PigPen

Map-Reduce for Clojure

565 · 51 · Clojure · Apache-2.0

elephantdb

Distributed database specialized in exporting key/value data from Hadoop

559 · 50 · Java · BSD-3-Clause

eventsim

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

542 · 142 · Scala

streamiz

.NET Stream Processing Library for Apache Kafka 🚀

533 · 80 · C# · MIT

edis

An Erlang implementation of Redis

519 · 37 · Erlang · Apache-2.0

kafkat

KafkaT-ool

502 · 80 · Ruby · Apache-2.0

streamDM

Stream Data Mining Library for Spark Streaming

497 · 141 · Scala · Apache-2.0

Lipstick

Pig Visualization framework

467 · 132 · JavaScript · Apache-2.0

hydra

No description available

436 · 85 · Java · Apache-2.0

AWStats

AWStats Log Analyzer project (official sources)

430 · 134 · Perl

schema-registry-ui

Web tool for Avro Schema Registry

425 · 112 · JavaScript

graviton

Graviton Database: ZFS for key-value stores.

424 · 22 · Go · GPL-3.0

zeppelin

DEPRECATED. Zeppelin has moved to Apache. Please make pull request there

405 · 111

substation

Substation is a toolkit for routing, normalizing, and enriching security event and audit logs.

400 · 32 · Go · MIT

timely

Accumulo backed time series database

390 · 110 · Java · Apache-2.0

kareldb

A Relational Database Backed by Apache Kafka

388 · 27 · Java · Apache-2.0

Decider

Flexible and Extensible Machine Learning in Ruby

383 · 54 · Ruby · MIT

zoie

realtime search/indexing system

371 · 120 · Java

Conjecture

Scalable Machine Learning in Scalding

360 · 56 · Java · MIT

spark-gotchas

Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks

360 · 78 · NOASSERTION

apex-core

Mirror of Apache Apex core

350 · 170 · Java · Apache-2.0

spindle

Next-generation web analytics processing with Scala, Spark, and Parquet.

330 · 58 · JavaScript · Apache-2.0

sparrow

Sparrow scheduling platform (U.C. Berkeley).

328 · 90 · Python · Apache-2.0

mist

Serverless proxy for Spark cluster

325 · 70 · Scala · Apache-2.0

Page 7 of 11, 262 projects in total