Coder Social home page Coder Social logo

census-data's Introduction

Census Data Analysis for Affordable Housing Development

This project aims to develop a scalable and efficient data pipeline for analyzing publicly available Census data relevant to affordable housing development projects. The insights generated will assist real estate professionals, often with limited technical expertise, in making data-driven decisions.

Project Background

Report Purpose: This project will generate a report providing valuable information to evaluate potential areas for developing or preserving affordable housing.

Project Goals

  • Develop a Scalable Report: The report should be adaptable to different real estate development projects across the country and utilize any relevant Census dataset.
  • Census API Integration: Understand interaction with the Census API(s) and document the process for easy integration.
  • Local Development Environment: Create a user-friendly local development environment with a clear onboarding process.
  • Collaboration: Establish efficient communication patterns, work allocation strategies, and decision-tracking mechanisms for a collaborative workflow.

List Items

  • Report Design: Determine the desired format and content of the PDF report.
  • Data Exploration: Identify the specific data points most valuable for analysis.
  • Census Data Access: Obtain a Census Bureau API access token (avoid storing in plain text).
  • Dev Environment Setup: Create a local development environment with clear instructions.
  • Test Pipeline Construction:
    • Implement calls to the Census API using the census library (https://pypi.org/project/census/).
    • Store retrieved data in an S3 bucket.
    • Load data into a pandas DataFrame for further manipulation.
    • Perform any necessary data cleaning and transformations.
    • Visualize the data using appropriate charts.
    • Deliver a parameterized dashboard with 2-3 key visualizations.
  • Scalability Improvements: Optimize the pipeline to efficiently handle multiple Census datasets.
  • Cost Optimization: Identify cost-effective strategies for data pipelines.
  • Machine Learning Exploration: Investigate potential machine learning applications for future demographic forecasting.

Resources

  • Trello Board: Project management and task tracking
  • Google Drive Folder: Document storage and collaboration
  • Lucidchart Diagram: Workflow and data flow visualization
  • GitHub Repo: Source code repository
  • System Design Documentation: System architecture details

Data Sources

  • Census Tables:
    • DP05 – ACS Demographic and Housing Estimates
    • S1501 – Educational Attainment
    • S1901 – Income in the Past 12 Months (average per state, per month)
    • S1903 – Median Income in the Past 12 Months
    • S1701 – Poverty Status in the Past 12 Months
    • S2502 – Demographic Characteristics for Occupied Housing Units
    • B25106 – Tenure by Housing Cost as a Percent of Household Income in the Past 12 Months
    • DP04 – Selected Housing Characteristics
  • FFIEC:

Census API Learnings

Tables Required

Example Visualizations

Visualization 1

Screenshot 2024-04-11 at 9 05 15 AM

Visualization 2

Screenshot 2024-04-11 at 9 05 31 AM

census-data's People

Contributors

camcash17 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.