tien-duong115 Goto Github PK
Name: Tien Duong
Type: User
Bio: “It’s not because things are difficult that we dare not venture. It’s because we dare not venture that they are difficult.”-- Seneca
Location: San Jose
Name: Tien Duong
Type: User
Bio: “It’s not because things are difficult that we dare not venture. It’s because we dare not venture that they are difficult.”-- Seneca
Location: San Jose
Testing Airflow Operators and configuration
Capstone project from Udacity with coinlore API
IBM_coursera_capstone_project. Using k-mean cluster analysis to explore the city of Toronto
Repo contained C/C++, Python, and project related materials
Project Overview Welcome to the exploratory data analysis of the relation between population census dataset and gun record information. The `gun census` file contain total of 27 columns and 12485 rows. The file is the record of information comes from the FBI's national instant Criminal Background check system or NICS. Whenever there is a firearm purchase, gunshop's owner will run a check through the NICS system to ensure that the buyer meet all of the qualification before their purchase. Accompanying the NICS dataset is the U.S. census dataset of which contain serveral variables at the state level. Most variables have only one data point per state (2016), but a few have data for more than one year (poverty). <a href="https://www.census.gov/">Census link</a>
Using the supervised classification method to predict loan credit default with 346 records of individuals
The goal of this project is to analyze and visualize the lyfy_bike_data.csv. This project is part of the Udacity's Data analysis program.
Project developing Python application OOP design
The purpose of this project is to perform an multivariate-testing of 5 version of a University cover page. The data was extracted from Young, Scott W.H. (2014) Improving Library User Experience with A/B Testing: Principles and Process. Weave: Journal of Library User Experience. University of Michigan Library. http://dx.doi.org/10.3998/weave.12535642.0001.101
Files for Udemy Course on Algorithms and Data Structures
Created with CodeSandbox
Creating ETL process using S3 as staging db and store into AWS redshift DWH for higher performance analysis
Sandbox for webscraping
ENGR195 capstone class Career chat bot, work in group with other Social Science Students
Project goal is to setup a database using postgresSQL to help analytics team perform user's behavioral preferences of the most trendy song
A very easy to read script that you can use for recognising phrases when creating a basic chat bot.
Config files for my GitHub profile.
Introduction In this data wrangling project, the goal is to clean up the data quality and tidiness issues using both visual and programmatic assessments
Use airflow to establish ETL scheduling processes (DAG). Execute SQL for creates table in user S3 bucket and inserted data from another S3 buckets for staging and relational modeling accord to time schedule and sensor.
PIPELINE data for analytic team to perform analysist, extract dataset from S3 bucket location, use Spark on top of HDFS to compute than load back into S3 bucket
Udacity course on Data architecture and engineering
Using Python to the Visualization area of interest in Data Science field of study. Use folium map to visualize the location of the district within map of the City of San Francisco.
As we all know that global temperature is getting warmer as ever. In this project,the data set was given to me as part of the Udacity program. The average temperature of global weather trends file is to be extracted from the SQL database. I extract the data and export its into two files global weather.csvand local weather csv. The global weather.csv consists of 2 columns, year and avg_temp which recorded the average temperage each years in Celsius. The local weather.csv contain 4 columns, year, city, country, avg_temp, which also recorded the average temperature in Celsius of San Jose which also known as Silicon Valley.
R-language using Statistical t-test to evaluate the differences between user clicks rate between the two dataset
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.