dhvani-k / youtrend_insights_analyzing_youtube_video_landscape

An end-to-end solution for managing and analyzing YouTube video data from Kaggle, built on AWS services and visualized through QuickSight and Tableau.

Shell 17.50% Python 82.50%
aws aws-athena aws-glue aws-glue-crawler aws-iam aws-lambda aws-lambda-python aws-s3 content-strategy marketing quicksight tableau user-insights youtube youtube-api

YouTrend Insights: Analyzing YouTube Video Landscape

Project Overview

This project manages, streamlines, and analyzes structured and semi-structured YouTube video data. The pipeline begins by ingesting data from the Kaggle dataset. AWS Lambda then cleans the data, and a Glue ETL job processes it. Once processing is complete, a Glue Crawler runs over the dedicated S3 buckets so that the data, which remains in S3, can be queried through AWS Athena. The final analysis is carried out in AWS QuickSight, with additional reports built in Tableau.
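To make the last stage of this pipeline concrete, here is a minimal sketch of submitting a SQL query to Athena from Python. It is an illustration, not the project's own code: the table and column names (`channel_title`, `views`), the database name, and the output location are all assumptions. The client is passed in so the function can be exercised without AWS credentials; in practice it would be created with `boto3.client("athena")`.

```python
def top_channels_query(table: str, limit: int = 10) -> str:
    """Build a SQL statement that ranks channels by total views."""
    # Column names (channel_title, views) are assumptions about the cleaned table.
    return (
        f"SELECT channel_title, SUM(views) AS total_views "
        f"FROM {table} "
        f"GROUP BY channel_title "
        f"ORDER BY total_views DESC "
        f"LIMIT {int(limit)}"
    )


def submit_query(athena_client, sql: str, database: str, output_location: str) -> str:
    """Submit a query to Athena and return its execution id."""
    response = athena_client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        # Athena writes query results to this S3 location.
        ResultConfiguration={"OutputLocation": output_location},
    )
    return response["QueryExecutionId"]
```

The call is asynchronous: it returns an execution id immediately, and the caller polls `get_query_execution` with that id until the query finishes.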

Architecture Diagram

[Image: architecture diagram]

What's Inside?

  • Data Ingestion: Pulling rich datasets straight from Kaggle.
  • Data Cleaning: Ensuring data quality and integrity using AWS Lambda.
  • Data Transformation: Harnessing AWS Glue for a robust ETL pipeline.
  • Analysis & Reporting: Building visuals and reports with AWS QuickSight and Tableau.
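As an illustration of the data cleaning step, the sketch below flattens a semi-structured JSON document into flat rows that a later ETL stage could load as a table. The input shape (a top-level `items` list whose entries carry an `id` and a nested `snippet` object with `title` and `assignable`) is an assumption, and the function name is hypothetical; the project's actual Lambda code may differ.

```python
import json
from typing import Dict, List


def flatten_items(raw_json: str) -> List[Dict[str, object]]:
    """Turn a nested JSON document into flat, table-friendly rows."""
    document = json.loads(raw_json)
    rows = []
    for item in document.get("items", []):
        snippet = item.get("snippet", {})
        title = (snippet.get("title") or "").strip()
        if not item.get("id") or not title:
            # Skip records that lack the fields later stages rely on.
            continue
        rows.append({
            "id": str(item["id"]),
            "title": title,
            "assignable": bool(snippet.get("assignable", False)),
        })
    return rows
```

Dropping incomplete records at this stage keeps the downstream schema consistent, at the cost of silently losing rows; a real pipeline might log or count the skipped records instead.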

AWS Services Implemented:

  • Amazon S3: Provides the storage layer, holding the structured data as a data lake. The data is partitioned and stored securely within S3 buckets.
  • AWS IAM: Manages access to AWS resources. Custom IAM policies restrict data access to those with the required authorization.
  • QuickSight/Tableau: AWS QuickSight generates reports and insights from the data in the data lake and provides the interface for building those reports. Tableau supplements the reporting.
  • AWS Glue: Forms the backbone of the ETL (Extract, Transform, Load) system and moves data into the data lake.
  • AWS Lambda: An event-driven service. Lambda is triggered when new data arrives in the source S3 path and starts the ETL process; it also performs the data cleaning.
  • AWS Athena: Runs SQL queries on data in the data lake without any servers to manage.
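The Lambda trigger and the S3 partitioning described above can be sketched together. The handler below reads the bucket and object key from an S3 event notification and derives a Hive-style partitioned destination key (`region=xx/`), a layout that Glue Crawlers and Athena recognize as a partition column. The `cleaned` prefix and the rule that the first two characters of the file name are a region code are assumptions made for illustration, not details taken from the project.

```python
from urllib.parse import unquote_plus


def partitioned_key(source_key: str, prefix: str = "cleaned") -> str:
    """Map a raw object key to a Hive-style partitioned key."""
    filename = source_key.rsplit("/", 1)[-1]
    # Assumption: the first two characters of the file name are a region code.
    region = filename[:2].lower()
    return f"{prefix}/region={region}/{filename}"


def lambda_handler(event, context):
    """Collect the objects named in an S3 event notification."""
    targets = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        targets.append({
            "bucket": bucket,
            "source": key,
            "destination": partitioned_key(key),
        })
    return targets
```

This sketch only computes where each object would go. A full handler would go on to read the object, clean it, and write the result to the destination key.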

Importance of Trend Analysis

The digital age is governed by data. Analyzing YouTube's trending videos provides:

  • User Insights: Understand the pulse of the audience.
  • Content Strategy: Unearth what kind of content resonates.
  • Marketing Leverage: Optimize advertising and outreach efforts.
