datahub

The Metadata Platform for the Modern Data Stack

dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

ddia

Chinese translation of "Designing Data-Intensive Applications" (DDIA)

debezium

Change data capture for a variety of databases. Please log issues at

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

delta-rs

A native Rust library for Delta Lake, with bindings into Python and Ruby.

dgraph

Native GraphQL Database with graph backend

duckdb

DuckDB is an in-process SQL OLAP Database Management System

envoy

Cloud-native high-performance edge/middle/service proxy

etcd

Distributed reliable key-value store for the most critical data of a distributed system

faiss

A library for efficient similarity search and clustering of dense vectors.

faster

Fast persistent recoverable log and key-value store + cache, in C# and C++.

feast

Feature Store for Machine Learning

feathr

Feathr – An Enterprise-Grade, High Performance Feature Store

fiber

Distributed Computing for AI Made Simple

flatbuffers

FlatBuffers: Memory Efficient Serialization Library

folly

An open-source C++ library developed and used at Facebook.

gazelle_plugin

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

gold-miner


gpdb

Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.

gporca

A modular query optimizer for big data

