liujinhui1994 Goto Github PK
Name: liujinhui
Type: User
Name: liujinhui
Type: User
alibabacloud-dla-demo
Amoro is a Lakehouse management system built on open data lake formats.
酷玩 Spark: Spark 源代码解析、Spark 类库等
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Apache Doris is an MPP-based interactive SQL data warehousing for reporting and analysis.
Flink Connector for Apache Doris
Cluster manager for Apache Doris
Spark Connector for Apache Doris
emr-hudi-example
Code for the paper: Detecting Photoshopped Faces by Scripting Photoshop
Apache Flink
Gluten: Plugin to Double SparkSQL's Performance
🌉 基于Go+Vue实现的openLDAP后台管理项目
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Apache HertzBeat(incubating) is a real-time monitoring system with agentless, performance cluster, prometheus-compatible, custom monitoring and status page building capabilities.
Upserts, Deletes And Incremental Processing on Big Data.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Uniffle is a high performance, general purpose Remote Shuffle Service.
Apache Kylin
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Monitoring and insights on your data lakehouse tables
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Scalable, redundant, and distributed object store for Apache Hadoop
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Apache Parquet
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.