多码网
Big Data

Big data processing, analysis, and visualization

262 source code projects

incubator-heron

Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter

3,655 · 583 · Java · Apache-2.0

atlas

In-memory dimensional time series database.

3,554 · 337 · Scala · Apache-2.0

smart_open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

3,442 · 385 · Python · MIT

heka

DEPRECATED: Data collection and processing made easy.

3,424 · 521 · Go · NOASSERTION

linkis

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

3,409 · 1,165 · Java · Apache-2.0

flocker

Container data volume manager for your Dockerized application

3,384 · 282 · Python · Apache-2.0

DataSphereStudio

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

3,256 · 1,038 · Java · Apache-2.0

beringei

Beringei is a high performance, in-memory storage engine for time series data.

3,160 · 288 · C++ · NOASSERTION

shynet

Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.

3,135 · 205 · Python · Apache-2.0

pytorch_geometric_temporal

PyTorch Geometric Temporal: Spatiotemporal Signal Processing with Neural Machine Learning Models (CIKM 2021)

2,980 · 403 · Python · MIT

bfs

The Baidu File System.

2,850 · 558 · C++ · BSD-3-Clause

elasticsearch-jdbc

JDBC importer for Elasticsearch

2,821 · 699 · Java · Apache-2.0

airpal

Web UI for PrestoDB.

2,750 · 445 · Java · Apache-2.0

Open-Web-Analytics

Official repository for Open Web Analytics which is an open source alternative to commercial tools such as Google Analytics. Stay in control of the data you collect about the use of your website or app. Please consider sponsoring this project.

2,660 · 484 · PHP · GPL-2.0

pipelinedb

High-performance time-series aggregation for PostgreSQL

2,658 · 243 · C · Apache-2.0

kafka-node

Node.js client for Apache Kafka 0.8 and later.

2,656 · 617 · JavaScript · MIT

awesome-network-embedding

A curated list of network embedding techniques.

2,624 · 496

nats-streaming-server

NATS Streaming System Server

2,533 · 289 · Go · Apache-2.0

featurebase

A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docker instance: https://hub.docker.com/r/featurebasedb/featurebase

2,526 · 237 · Go · Apache-2.0

griddb

GridDB is a next-generation open source database that makes time series IoT and big data fast and easy.

2,474 · 4,999 · C++ · AGPL-3.0

fs2

Compositional, streaming I/O library for Scala

2,446 · 632 · Scala · NOASSERTION

schema-registry

Confluent Schema Registry for Kafka

2,424 · 1,157 · Java · NOASSERTION

kapacitor

Open source framework for processing, monitoring, and alerting on time series data

2,370 · 481 · Go · MIT

portaljs

🌀 Rapidly build feature-rich data portals using a modern frontend framework. Native CKAN support. OpenMetadata and Git compatible.

2,273 · 331 · TypeScript · MIT

Page 3 of 11, 262 projects in total