Big Data
Big data processing, analysis, and visualization
262 source code projects in total
incubator-heron
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
atlas
In-memory dimensional time series database.
smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
heka
DEPRECATED: Data collection and processing made easy.
linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
flocker
Container data volume manager for your Dockerized application
DataSphereStudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
beringei
Beringei is a high performance, in-memory storage engine for time series data.
shynet
Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.
pytorch_geometric_temporal
PyTorch Geometric Temporal: Spatiotemporal Signal Processing with Neural Machine Learning Models (CIKM 2021)
bfs
The Baidu File System.
elasticsearch-jdbc
JDBC importer for Elasticsearch
airpal
Web UI for PrestoDB.
Open-Web-Analytics
Official repository for Open Web Analytics which is an open source alternative to commercial tools such as Google Analytics. Stay in control of the data you collect about the use of your website or app. Please consider sponsoring this project.
pipelinedb
High-performance time-series aggregation for PostgreSQL
kafka-node
Node.js client for Apache Kafka 0.8 and later.
awesome-network-embedding
A curated list of network embedding techniques.
nats-streaming-server
NATS Streaming System Server
featurebase
A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docker instance: https://hub.docker.com/r/featurebasedb/featurebase
griddb
GridDB is a next-generation open source database that makes time series IoT and big data fast and easy.
fs2
Compositional, streaming I/O library for Scala
schema-registry
Confluent Schema Registry for Kafka
kapacitor
Open source framework for processing, monitoring, and alerting on time series data
portaljs
🌀 Rapidly build feature-rich data portals using a modern frontend framework. Native CKAN support. OpenMetadata and Git compatible.
