etl's Introduction

What

This repo contains a script that demonstrates a simple ETL data pipeline: extracting data from the source, transforming it into the desired format, and loading it into a SQLite file.
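The three stages can be sketched roughly as follows. This is a minimal illustration, not the code in pipeline.py: the function names (`extract`, `transform`, `load`, `run_pipeline`), the `population` table, the column names, and the sample rows are all made up for the example, and the extract step builds a small in-memory frame instead of reading the real source files.

```python
import sqlite3

import pandas as pd


def extract() -> pd.DataFrame:
    # Stand-in for reading the real source (e.g. an Excel file); values are invented.
    return pd.DataFrame(
        {
            "County": ["Alpha County", "Beta County"],
            "Year": [2010, 2010],
            "Population": ["1,000", "2,500"],
        }
    )


def transform(raw: pd.DataFrame) -> pd.DataFrame:
    # Lower-case the column names and turn "1,000"-style strings into integers.
    tidy = raw.rename(columns=str.lower)
    tidy["population"] = (
        tidy["population"].str.replace(",", "", regex=False).astype(int)
    )
    return tidy


def load(df: pd.DataFrame, conn: sqlite3.Connection, table: str = "population") -> int:
    # Write the frame to SQLite, replacing any earlier run, and return the row count.
    df.to_sql(table, conn, if_exists="replace", index=False)
    return conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]


def run_pipeline(conn: sqlite3.Connection) -> int:
    return load(transform(extract()), conn)


if __name__ == "__main__":
    # An in-memory database keeps the sketch free of side effects;
    # a file path would be used to produce an actual SQLite file.
    connection = sqlite3.connect(":memory:")
    print(run_pipeline(connection))
    connection.close()
```

Keeping each stage as its own function makes it possible to test the transform step without touching the source files or the database.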

Simple analysis queries are then run on the stored data. See: analysis notebook.
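As a rough idea of what such a query might look like, the sketch below joins population and unemployment figures by county and year. The table names, column names, and sample values are assumptions made for this example; the actual schema is whatever the pipeline writes, and the real queries live in the analysis notebook.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    # Hypothetical schema and invented rows, used only to make the query runnable.
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE population (county TEXT, year INTEGER, population INTEGER);
        CREATE TABLE unemployment (county TEXT, year INTEGER, rate REAL);
        INSERT INTO population VALUES
            ('Alpha County', 2010, 1000),
            ('Beta County', 2010, 2500);
        INSERT INTO unemployment VALUES
            ('Alpha County', 2010, 8.0),
            ('Beta County', 2010, 4.0);
        """
    )
    return conn


def counties_by_unemployment(conn: sqlite3.Connection, year: int) -> list:
    # Counties for one year, highest unemployment rate first, with their population.
    query = """
        SELECT p.county, p.population, u.rate
        FROM population AS p
        JOIN unemployment AS u
          ON u.county = p.county AND u.year = p.year
        WHERE p.year = ?
        ORDER BY u.rate DESC
    """
    return conn.execute(query, (year,)).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for row in counties_by_unemployment(demo, 2010):
        print(row)
    demo.close()
```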

Note: The data covers the US population and unemployment rate over the past decade.


Getting started

First, make sure to have Python 3.6 or later installed

Then, install the project dependencies

pip install -r requirements.txt

Note: preferably, run inside a virtual environment (on Windows, activate with venv\Scripts\activate instead)

$ python -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt

Second, run the main pipeline file

python pipeline.py

Requirements

  • pandas
  • xlrd (for reading Excel files)
  • Python >= 3.6

Data source and description

POPULATION BY METROPOLITAN AREA AND COUNTY

UNEMPLOYMENT BY COUNTY

etl's People

Contributors

danielgalvis98, scot3004, iamaziz
