tgluon
Name: XiuhongTang
Type: User
Bio: BigData Infra Architect
Location: Hangzhou
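💥🔥 To address the pain points enterprises face when building a big data platform, this project performs secondary development and maintenance on the many Apache big data components and delivers a general-purpose big data platform foundation. It focuses on the problems and challenges that arise in data collection, data storage, data computation, data development, and data operations. The original goal is to build an industry-leading, open-source, one-stop big data platform that helps thousands of small and medium-sized enterprises grow their business quickly and offers a range of solutions to developers who love big data.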
Alluxio, data orchestration for analytics and machine learning in the cloud
Arctic is a streaming lake warehouse service open sourced by NetEase
Arroyo is a distributed stream processing engine written in Rust
Auto-Editor: Efficient media analysis and rendering
An experimental open-source attempt to make GPT-4 fully autonomous.
Edit videos with a text editor
Tutorial on building your own proxy to bypass internet censorship and reach Google
⚠️(OBSOLETE) Curated applications for Kubernetes
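The Chitu real-time computing platform is an enterprise-grade, one-stop, high-performance, low-barrier real-time big data computing platform built on Apache Flink, broadly suited to developing streaming data applications.
Huya, Douyu, Douyin, BiliBili, TikTok, Twitch 🔥trending🔥 AI bot for intelligent live-stream video clipping and publishing: automated 🤖, fully intelligent ⚙ (generates clips, titles, covers, and descriptions), visual 👓, monitors platform trends 🌡, freely extensible with rich plugins 🕹, quick to deploy ⚡, automatic publishing to build video accounts 🌟, DIY-friendly 🎮
A lightweight solution for managing big data clusters (Hadoop, Hive, Doris, etc.) on Kubernetes: a cloud-native big data platform built on Kubernetes that aims to simplify the operation and maintenance of big data clusters on k8s.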
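Coolplay Spark: Spark source code analysis, Spark libraries, and more
High-performance crawler server for posts, comments, search, and user works on Douyin (latest a_bogus), Kuaishou, Bilibili, Xiaohongshu, Taobao, JD.com, and Weibo. One-click deployment with Docker.
Cloud-native one-stop machine learning platform: multi-tenancy, data assets, online notebook development, drag-and-drop task pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, multi-cluster scheduling, resource groups per project group, edge computing, real-time large-model training, and an AI app store
Data lineage for Hive, Sqoop, HBase, Spark, and others: events are sent to Kafka, then parsed and processed to build the lineage graph in Neo4j
Database comparison tool: compares Hive table data and MySQL data, driven by automated configuration so that comparisons do not require repeatedly hand-writing SQL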
The Metadata Platform for the Modern Data Stack
Repository of helm charts for deploying DataHub on a Kubernetes cluster
It is committed to rapidly implementing the deployment, management, monitoring and automatic operation and maintenance of the big data cloud native platform, helping you quickly build a stable, efficient, elastic and scalable big data cloud native platform.
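Know your data better! Datavines is a next-generation data observability platform that supports metadata management and data quality.
This platform aims to provide an efficient, convenient environment for data processing and analysis, suitable for data scientists, data engineers, and anyone else who needs to process data.
Chinese translation of Designing Data-Intensive Applications (DDIA)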
Deep Learning Book Chinese Translation
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
The first FD programming contest
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep learning training and inference on a Flink cluster.
🤖 A minimal and customizable Docker image running the Android emulator as a service.