kkxiaotikk
4mc - splittable lz4 and zstd in hadoop/spark/flink
A port of Snappy, LZO, LZ4, and Zstandard to Java
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Alluxio, data orchestration for analytics and machine learning in the cloud
A Flexible and Powerful Parameter Server for large-scale machine learning
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Arctic is a streaming lake warehouse service open sourced by NetEase
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
A GPU-powered real-time analytics storage and query engine.
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Apache Arrow DataFusion and Ballista query engines
Official Rust implementation of Apache Arrow
Implementation connecting Arrow to Spark, effectively making all code related to reading in Spark redundant.
Arroyo is a distributed stream processing engine written in Rust
Mirror of Apache AsterixDB
Apache Avro is a data serialization system.
AI Native database for embedding vectors
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.
BaikalDB, A Distributed HTAP Database.
Apache Beam is a unified programming model for Batch and Streaming
A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow DataFusion at its core.
An industrial-grade C++ implementation of RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly-available distributed systems.
Build system, successor to Buck
A new way of working with Protocol Buffers.
Ceph is a distributed object, block, and file storage platform
CeresDB is a high-performance, distributed, cloud native time-series database.
ChubaoFS (abbrev. CBFS) is a cloud native distributed file system and object store.