大数据
大数据处理、分析与可视化
共 262 个源码项目superset
Apache Superset is a Data Visualization and Data Exploration Platform
keras
Deep Learning for humans
pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
metabase
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
faiss
A library for efficient similarity search and clustering of dense vectors.
umami
Umami is a modern, privacy-focused analytics platform. An open-source alternative to Google Analytics, Mixpanel and Amplitude.
posthog
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.
cockroach
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
influxdb
Scalable datastore for metrics, events, and real-time analytics
redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
nsq
A realtime distributed messaging platform
analytics
Open source, privacy-first web analytics. Lightweight, cookie-free Google Analytics alternative. Self-hosted or cloud.
dgraph
high-performance graph database for real-time use cases
goaccess
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
rqlite
The lightweight, fault-tolerant database built on SQLite. Designed to keep your data highly available with minimal effort.
tikv
Distributed transactional key-value database, originally created to complement TiDB
weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
dagster
An orchestration platform for the development, production, and observation of data assets.
pulsar
Apache Pulsar - distributed pub-sub messaging system
cayley
An open-source graph database
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
thanos
Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.
第 1 / 11 页,共 262 个项目
