Coder Social home page Coder Social logo

liujinhui's Projects

amoro icon amoro

Amoro is a Lakehouse management system built on open data lake formats.

dinky icon dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

doris icon doris

Apache Doris is an MPP-based interactive SQL data warehousing for reporting and analysis.

faldetector icon faldetector

Code for the paper: Detecting Photoshopped Faces by Scripting Photoshop

gluten icon gluten

Gluten: Plugin to Double SparkSQL's Performance

gravitino icon gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

hertzbeat icon hertzbeat

Apache HertzBeat(incubating) is a real-time monitoring system with agentless, performance cluster, prometheus-compatible, custom monitoring and status page building capabilities.

hudi icon hudi

Upserts, Deletes And Incremental Processing on Big Data.

incubator-celeborn icon incubator-celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

incubator-seatunnel icon incubator-seatunnel

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

kyuubi icon kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

lakeview icon lakeview

Monitoring and insights on your data lakehouse tables

linkis icon linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

nessie icon nessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

openmetadata icon openmetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

orc icon orc

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

ozone icon ozone

Scalable, redundant, and distributed object store for Apache Hadoop

paimon icon paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.