Ziyi Tan's Projects
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Ceph is a distributed object, block, and file storage platform
ClickHouseยฎ is a free analytics DBMS for big data
Source code for "Multi-Source Deep Domain Adaptation with Weak Supervision for Time-Series Sensor Data" (KDD 2020)
ๅพ่งฃ่ฎก็ฎๆบ็ฝ็ปใๆไฝ็ณป็ปใ่ฎก็ฎๆบ็ปๆใๆฐๆฎๅบ๏ผๅ
ฑ 1000 ๅผ ๅพ + 50 ไธๅญ๏ผ็ ด้คๆฆๆถฉ้พๆ็่ฎก็ฎๆบๅบ็ก็ฅ่ฏ๏ผ่ฎฉๅคฉไธๆฒกๆ้พๆ็ๅ
ซ่กๆ๏ผ๐ ๅจ็บฟ้
่ฏป๏ผhttps://xiaolincoding.com
Dev container for ColumnStore ๐ฅ
CubeFS is a cloud native unstructured data storage
Curve is a high-performance, lightweight-operation, cloud-native open source distributed storage system. Curve can be applied to: 1) mainstream cloud-native infrastructure platforms OpenStack and Kubernetes; 2) high-performance storage for cloud-native databases; 3) cloud storage middleware using S3-compatible object storage as a data storage.
Source for Curve website.
Curve Storage Orchestration for Kubernetes
ไธๅ้็งๆ้จ็ฝฒzerotier-planetๆๅก
Apache Doris is an easy-to-use, high performance and unified analytics database.
My Linux / Mac dotfiles ยท powered by dotbot โก๏ธ
An open source, standard data file format for graph data storage and retrieval
๐จ ๐ ๐ป ๐ GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba ๆฅ่ช้ฟ้ๅทดๅทด็ไธ็ซๅผๅคง่งๆจกๅพ่ฎก็ฎ็ณป็ป ๅพๅๆ ๅพๆฅ่ฏข ๅพๆบๅจๅญฆไน
GSoC 2022 @MariaDB - MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry.
Fast and Lightweight Observability Data Collector
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
LeetCode Solutions C++ / Py โณ
Core storage engine - UM and PM Process code
mfsan for time series
Vector database for scalable similarity search and AI applications.
100 numpy exercises (with solutions)
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
ใMachine Learning Systems: Design and Implementationใ- Chinese Version
The OpenTelemetry C++ Client
Pretty, minimal and fast ZSH prompt