gzdx-chenghui Goto Github PK
Name: [email protected]
Type: User
Name: [email protected]
Type: User
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
爬虫项目+简单数据分析
Data quality check tools by execute sql
The premier open source Data Quality solution
数据质量场景手册
DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
通用数据采集工具,源自 Alibaba DataX,增加了更多的读写插件,HDFS读写功能增强,支持 cassandra, clickhouse, dbf, hive, mysql, oracle, prestosql, postgresql, sqlserver, text 等数据源
datax数据同步elasticsearch的reader和writer插件,支持一对多的扁平数据转换成es的嵌套对象,也支持嵌套对象的读取和ognl表达式过滤,理论上可以无限嵌套。
DataX 3.0 平台上脱敏算法的集成与实现。
DataX集成可视化页面,选择数据源一键生成JSON并脱敏,集成定时任务,支持分布式,支持增量获取,实时查看运行日志,监控执行器资源,kill运行进程。
为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189
数据治理、数据质量检核/监控平台(Django+jQuery+MySQL)
The IK Analysis plugin integrates Lucene IK analyzer into elasticsearch, support customized dictionary.
This project is the basis for a BedCon talk and should make it possible for the listener to build an own recommender.
spark streaming 计算用户画像
Apache Flink
免费ss账号 免费shadowsocks账号 免费v2ray账号 (长期更新)
学习强国 懒人刷分工具 自动学习
基金投资策略分析,基金回测工具
A Cluster Computing System for Processing Large-Scale Spatial Data
Pivotal Greenplum Database
Mirror of Apache griffin
Config files for my GitHub profile.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.