raavioli Goto Github PK
Name: raavioli
Type: User
Company: CultData
Bio: data etc
Location: Rabbit Hole
Name: raavioli
Type: User
Company: CultData
Bio: data etc
Location: Rabbit Hole
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
The control center for ML in the cloud
😎 Awesome lists about all kinds of interesting topics
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Master programming by recreating your favorite technologies from scratch.
A high-performance, zero-overhead, extensible Python compiler using LLVM
A place for programming language instructors to share educational materials
The Data Contract Specification Repository
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev
:books: Freely available programming books
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs
Upserts, Deletes And Incremental Processing on Big Data.
Apache Iceberg
Data Integration via Confluent Kafka
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Lenses, Folds, and Traversals - Join us on freenode #haskell-lens
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Open source platform for the machine learning lifecycle
Open standard for machine learning interoperability
The official home of the Presto distributed SQL query engine for big data
Advanced Python Mastery (course by @dabeaz)
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Apache Spark - A unified analytics engine for large-scale data processing
The AsyncAPI specification allows you to create machine-readable definitions of your asynchronous APIs.
Benchmark for Airflow with BigQuery as the Data Warehouse using TPC - DI
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.