多码网
Big Data

Big data processing, analytics, and visualization

262 source code projects

superset

Apache Superset is a Data Visualization and Data Exploration Platform

72,768 · 17,237 · TypeScript · Apache-2.0

keras

Deep Learning for humans

64,058 · 19,765 · Python · Apache-2.0

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

63,304 · 1,662 · Python · NOASSERTION

metabase

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data

47,249 · 6,445 · Clojure · NOASSERTION

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

45,336 · 17,012 · Python · Apache-2.0

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

42,503 · 7,541 · Python · Apache-2.0

faiss

A library for efficient similarity search and clustering of dense vectors.

39,970 · 4,370 · C++ · MIT

umami

Umami is a modern, privacy-focused analytics platform. An open-source alternative to Google Analytics, Mixpanel and Amplitude.

36,570 · 7,073 · TypeScript · MIT

posthog

🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

34,395 · 2,703 · Python · NOASSERTION

cockroach

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

32,140 · 4,120 · Go · NOASSERTION

influxdb

Scalable datastore for metrics, events, and real-time analytics

31,490 · 3,704 · Rust · Apache-2.0

redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

28,566 · 4,585 · Python · BSD-2-Clause

nsq

A realtime distributed messaging platform

25,727 · 2,897 · Go · MIT

analytics

Open source, privacy-first web analytics. Lightweight, cookie-free Google Analytics alternative. Self-hosted or cloud.

24,802 · 1,396 · Elixir · AGPL-3.0

dgraph

High-performance graph database for real-time use cases

21,673 · 1,587 · Go · Apache-2.0

goaccess

GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.

20,520 · 1,177 · C · MIT

rqlite

The lightweight, fault-tolerant database built on SQLite. Designed to keep your data highly available with minimal effort.

17,487 · 780 · Go · MIT

tikv

Distributed transactional key-value database, originally created to complement TiDB

16,668 · 2,271 · Rust · Apache-2.0

weaviate

Weaviate is an open-source vector database that stores both objects and vectors, combining vector search and structured filtering with the fault tolerance and scalability of a cloud-native database.

16,158 · 1,274 · Go · BSD-3-Clause

dagster

An orchestration platform for the development, production, and observation of data assets.

15,470 · 2,116 · Python · Apache-2.0

pulsar

Apache Pulsar - distributed pub-sub messaging system

15,230 · 3,730 · Java · Apache-2.0

cayley

An open-source graph database

15,042 · 1,240 · Go · Apache-2.0

awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

14,384 · 2,582 · MIT

thanos

Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.

14,068 · 2,297 · Go · Apache-2.0