多码网
返回分类

大数据

大数据处理、分析与可视化

262 个源码项目

superset

Apache Superset is a Data Visualization and Data Exploration Platform

72,51317,075TypeScriptApache-2.0

keras

Deep Learning for humans

64,02519,749PythonApache-2.0

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

63,4401,624PythonNOASSERTION

metabase

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:

46,9626,390ClojureNOASSERTION

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

45,12016,896PythonApache-2.0

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

42,2447,474PythonApache-2.0

faiss

A library for efficient similarity search and clustering of dense vectors.

39,7924,341C++MIT

umami

Umami is a modern, privacy-focused analytics platform. An open-source alternative to Google Analytics, Mixpanel and Amplitude.

36,2446,948TypeScriptMIT

posthog

🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

32,6902,543PythonNOASSERTION

cockroach

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

32,0754,119GoNOASSERTION

influxdb

Scalable datastore for metrics, events, and real-time analytics

31,4443,705RustApache-2.0

redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

28,5224,583PythonBSD-2-Clause

nsq

A realtime distributed messaging platform

25,7892,898GoMIT

analytics

Open source, privacy-first web analytics. Lightweight, cookie-free Google Analytics alternative. Self-hosted or cloud.

24,6401,386ElixirAGPL-3.0

dgraph

high-performance graph database for real-time use cases

21,6681,587GoApache-2.0

goaccess

GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.

20,4391,173CMIT

rqlite

The lightweight, fault-tolerant database built on SQLite. Designed to keep your data highly available with minimal effort.

17,435773GoMIT

tikv

Distributed transactional key-value database, originally created to complement TiDB

16,6412,265RustApache-2.0

weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

16,0531,260GoBSD-3-Clause

dagster

An orchestration platform for the development, production, and observation of data assets.

15,3492,097PythonApache-2.0

pulsar

Apache Pulsar - distributed pub-sub messaging system

15,2063,722JavaApache-2.0

cayley

An open-source graph database

15,0411,243GoApache-2.0

awesome-bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

14,3522,584MIT

thanos

Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.

14,0232,286GoApache-2.0
2...11下一页

1 / 11 页,共 262 个项目