Coder Social home page Coder Social logo

daily-rates-dim-cross-account's Introduction

AWS Data Configuration and Processing Guide

image

This README provides a detailed guide on setting up AWS configurations for data handling and processing. The configurations include AWS IAM roles, Glue connections, S3 bucket creation, and a step function.

Important Notes

  • Manually add the following inline policy to role_from_step_function:

    {
        "Effect": "Allow",
        "Action": "sts:AssumeRole",
        "Resource": "arn:aws:iam::ACCOUNTID1:role/role_to_create"
    }
  • The role_to_create includes a trust relationship in the script execution (Line 80) as follows:

    {
        "Effect": "Allow",
        "Principal": {
            "AWS": "arn:aws:iam::ACCOUNTID2:role/role_from_step_function"
        },
        "Action": "sts:AssumeRole"
    }
  • Ensure role_from_step_function and role_to_create include all necessary permissions for executing or provisioning services configured with this script.

Configuration Details

  • Profiles, regions, connection names, IP addresses, ports, database names, and other local variables are defined.
  • AWS provider configurations for different profiles are set.
  • AWS Glue connection for database connectivity is defined.
  • IAM roles with necessary policies and trust relationships are created.
  • S3 buckets for storing code and data are configured.
  • Glue jobs for data processing are defined.
  • AWS Step Functions for orchestrating tasks are set up.

AWS Glue Script

The provided Python script (#%help section) is for an AWS Glue job. It handles:

  • Data extraction from a JDBC source.
  • Data transformation and checking for existing data paths.
  • Data loading to S3 in Parquet format.

Step Function Configuration

The JSON configuration ({...}) outlines the state machine for handling data processing tasks, including starting Glue jobs and crawlers, and making choices based on crawler states.


Note: Replace ACCOUNTID1, ACCOUNTID2, and other placeholders with actual account IDs and values as per your AWS setup.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.