多码网

Big Data

Big data processing, analysis, and visualization

262 source code projects

PigPen

Map-Reduce for Clojure

565 · 51 · Clojure · Apache-2.0

elephantdb

Distributed database specialized in exporting key/value data from Hadoop

559 · 50 · Java · BSD-3-Clause

eventsim

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

538 · 142 · Scala

streamiz

.NET Stream Processing Library for Apache Kafka 🚀

532 · 80 · C# · MIT

edis

An Erlang implementation of Redis

524 · 37 · Erlang · Apache-2.0

kafkat

KafkaT-ool

502 · 80 · Ruby · Apache-2.0

streamDM

Stream Data Mining Library for Spark Streaming

498 · 141 · Scala · Apache-2.0

Lipstick

Pig Visualization framework

466 · 132 · JavaScript · Apache-2.0

hydra

No description available

437 · 85 · Java · Apache-2.0

AWStats

AWStats Log Analyzer project (official sources)

427 · 133 · Perl

schema-registry-ui

Web tool for Avro Schema Registry

425 · 112 · JavaScript

graviton

Graviton Database: ZFS for key-value stores.

424 · 22 · Go · GPL-3.0

zeppelin

DEPRECATED. Zeppelin has moved to Apache. Please make pull request there

406 · 111

substation

Substation is a toolkit for routing, normalizing, and enriching security event and audit logs.

394 · 32 · Go · MIT

timely

Accumulo backed time series database

389 · 111 · Java · Apache-2.0

kareldb

A Relational Database Backed by Apache Kafka

388 · 27 · Java · Apache-2.0

Decider

Flexible and Extensible Machine Learning in Ruby

383 · 54 · Ruby · MIT

zoie

realtime search/indexing system

372 · 121 · Java

Conjecture

Scalable Machine Learning in Scalding

360 · 56 · Java · MIT

spark-gotchas

Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks

360 · 78 · NOASSERTION

apex-core

Mirror of Apache Apex core

350 · 171 · Java · Apache-2.0

spindle

Next-generation web analytics processing with Scala, Spark, and Parquet.

330 · 58 · JavaScript · Apache-2.0

sparrow

Sparrow scheduling platform (U.C. Berkeley).

328 · 90 · Python · Apache-2.0

mist

Serverless proxy for Spark cluster

326 · 70 · Scala · Apache-2.0

Page 7 of 11, 262 projects in total