多码网
返回分类

大数据

大数据处理、分析与可视化

262 个源码项目

incubator-heron

Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter

3,637583JavaApache-2.0

atlas

In-memory dimensional time series database.

3,554341ScalaApache-2.0

smart_open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

3,444387PythonMIT

linkis

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

3,4101,165JavaApache-2.0

heka

DEPRECATED: Data collection and processing made easy.

3,402519GoNOASSERTION

flocker

Container data volume manager for your Dockerized application

3,383282PythonApache-2.0

DataSphereStudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

3,2571,039JavaApache-2.0

beringei

Beringei is a high performance, in-memory storage engine for time series data.

3,155288C++NOASSERTION

shynet

Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.

3,133204PythonApache-2.0

pytorch_geometric_temporal

PyTorch Geometric Temporal: Spatiotemporal Signal Processing with Neural Machine Learning Models (CIKM 2021)

2,980405PythonMIT

bfs

The Baidu File System.

2,850557C++BSD-3-Clause

elasticsearch-jdbc

JDBC importer for Elasticsearch

2,821699JavaApache-2.0

airpal

Web UI for PrestoDB.

2,750444JavaApache-2.0

Open-Web-Analytics

Official repository for Open Web Analytics which is an open source alternative to commercial tools such as Google Analytics. Stay in control of the data you collect about the use of your website or app. Please consider sponsoring this project.

2,663484PHPGPL-2.0

pipelinedb

High-performance time-series aggregation for PostgreSQL

2,660243CApache-2.0

kafka-node

Node.js client for Apache Kafka 0.8 and later.

2,655617JavaScriptMIT

awesome-network-embedding

A curated list of network embedding techniques.

2,625495

nats-streaming-server

NATS Streaming System Server

2,531289GoApache-2.0

featurebase

A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docker instance: https://hub.docker.com/r/featurebasedb/featurebase

2,526237GoApache-2.0

griddb

GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.

2,4744,996C++AGPL-3.0

fs2

Compositional, streaming I/O library for Scala

2,445629ScalaNOASSERTION

schema-registry

Confluent Schema Registry for Kafka

2,4311,157JavaNOASSERTION

kapacitor

Open source framework for processing, monitoring, and alerting on time series data

2,369482GoMIT

portaljs

🌀 Rapidly build feature-rich data portals using a modern frontend framework. Native CKAN support. OpenMetadata and Git compatible.

2,273331TypeScriptMIT

3 / 11 页,共 262 个项目