
Super Slow Motion Video Creation Using Generative AI on AWS

Table of Contents

  1. Overview
  2. Prerequisites
  3. Deployment Steps
  4. Deployment Validation
  5. Running the Example
  6. Next Steps
  7. Cleanup

Overview

This guide demonstrates how to utilize generative AI models such as Google's Frame Interpolation for Large Motion (FILM) to produce super slow motion video from standard footage. FILM is a frame interpolation model that analyzes motion between input frames and synthesizes new transitional frames, creating seamless, ultra-high frame rate slow motion. We will host the FILM model at scale using Amazon SageMaker to process the video frames. The original frames and synthesized frames will then be assembled into a high frame rate slow motion video. Side-by-side comparison videos will showcase the contrast between the original footage and the 3x slow motion result.
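
To make the 3x figure concrete, here is a minimal conceptual sketch (not the sample's actual code) of how frame interpolation multiplies the frame count: inserting two synthesized frames between every adjacent pair of originals turns F frames into 3F - 2 frames, roughly tripling the length when played back at the same frame rate. The interpolate callable below is a stand-in for a call to the FILM model.

def build_slow_mo(frames, interpolate, in_between=2):
    """Interleave `in_between` synthesized frames between each original pair."""
    output = []
    for a, b in zip(frames, frames[1:]):
        output.append(a)
        output.extend(interpolate(a, b, in_between))  # frames synthesized by FILM
    output.append(frames[-1])
    return output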

Original video vs. slow-mo video (side-by-side comparison)

Architecture Diagram

The architecture diagram above provides an overview of the full end-to-end solution. However, this sample focuses only on the core component highlighted in bold below. Users can reference the complete solution architecture to understand how this example fits into the bigger picture, and build on top of it as needed.

  1. Invoke an Amazon API Gateway RESTful API endpoint authenticated with AWS Identity and Access Management (IAM).
  2. Amazon API Gateway invokes an AWS Lambda function to process the request.
  3. The AWS Lambda function uploads the model artifacts (FILM model) and endpoint configuration to an Amazon Simple Storage Service (Amazon S3) bucket and creates an Amazon SageMaker Asynchronous Inference endpoint (see the endpoint creation sketch after this list).
  4. The Amazon SageMaker Asynchronous Inference endpoint is used to run the FILM model.
  5. Upload a short video to an Amazon S3 bucket for processing.
  6. An Amazon S3 event triggers an AWS Step Functions state machine execution through Amazon EventBridge to process the request.
  7. An AWS Lambda function extracts frames from the video and stores them in an S3 bucket (see the frame extraction sketch below).
  8. An AWS Lambda function creates an inference job by invoking the SageMaker Asynchronous Inference endpoint, where the FILM model interpolates new frames (see the invocation sketch below). The state machine execution pauses and waits for a job completion status.
  9. The SageMaker Asynchronous Inference endpoint sends the job status to Amazon Simple Notification Service (Amazon SNS).
  10. The state machine execution resumes, and an AWS Lambda function encodes all frames into a slow motion video stored in the S3 bucket (see the encoding sketch below).
  11. Finally, once the slow motion video has been created and uploaded to the output S3 bucket, Amazon SNS notifies the operator or end user.
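
The following is a minimal boto3 sketch of steps 3-4, creating a SageMaker Asynchronous Inference endpoint for the FILM model. The model name, role ARN, container image, S3 URIs, and SNS topic ARNs are placeholders; the notebook in this repo builds the real configuration.

import boto3

sm = boto3.client("sagemaker")

sm.create_model(
    ModelName="film-interpolation",
    ExecutionRoleArn="arn:aws:iam::123456789012:role/SageMakerExecutionRole",  # placeholder role
    PrimaryContainer={
        "Image": "<inference-container-image-uri>",        # serving container image (placeholder)
        "ModelDataUrl": "s3://<bucket>/film/model.tar.gz",  # FILM model artifacts (placeholder)
    },
)

sm.create_endpoint_config(
    EndpointConfigName="film-async-config",
    ProductionVariants=[{
        "VariantName": "AllTraffic",
        "ModelName": "film-interpolation",
        "InstanceType": "ml.g5.4xlarge",
        "InitialInstanceCount": 1,
    }],
    AsyncInferenceConfig={
        "OutputConfig": {
            "S3OutputPath": "s3://<bucket>/async-output/",
            "NotificationConfig": {  # SNS topics used for the job status in step 9
                "SuccessTopic": "arn:aws:sns:us-east-1:123456789012:film-success",
                "ErrorTopic": "arn:aws:sns:us-east-1:123456789012:film-error",
            },
        },
    },
)

sm.create_endpoint(EndpointName="film-async", EndpointConfigName="film-async-config")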
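
A minimal sketch of step 7, extracting frames from the input video and uploading them to S3. OpenCV is used here purely for illustration; the sample's own Lambda code may use a different tool, and the bucket and key names are placeholders.

import os
import cv2
import boto3

s3 = boto3.client("s3")
bucket, prefix = "<frames-bucket>", "frames/my-video"   # placeholders

cap = cv2.VideoCapture("input.mp4")
idx = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    local_path = f"/tmp/frame_{idx:06d}.png"
    cv2.imwrite(local_path, frame)                                      # save the frame as PNG
    s3.upload_file(local_path, bucket, f"{prefix}/frame_{idx:06d}.png")  # stage it in S3
    os.remove(local_path)
    idx += 1
cap.release()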
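
A minimal sketch of step 8, submitting an asynchronous inference job. The input payload format expected by the FILM container is defined in source/slow_mo_generator; the S3 locations here are placeholders.

import boto3

smr = boto3.client("sagemaker-runtime")

response = smr.invoke_endpoint_async(
    EndpointName="film-async",
    InputLocation="s3://<bucket>/async-input/job-001.json",  # request pointing at the extracted frames
    ContentType="application/json",
)
print(response["OutputLocation"])  # the result is written here when the job completes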
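
A minimal sketch of step 10, re-encoding the original and interpolated frames into a slow motion video with ffmpeg. The frame naming pattern and output frame rate are illustrative; the sample's encoding Lambda chooses its own values.

import subprocess

subprocess.run(
    [
        "ffmpeg", "-y",
        "-framerate", "30",             # playback frame rate of the output
        "-i", "frames/frame_%06d.png",  # originals interleaved with interpolated frames
        "-c:v", "libx264", "-pix_fmt", "yuv420p",
        "slow_mo.mp4",
    ],
    check=True,
)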

Cost

You are responsible for the cost of the AWS services used while running this example. As of December 2023, the cost of running this example with the default settings and the default sample video in the US East (N. Virginia) Region (us-east-1) is approximately $3.50.

Prerequisites

You need an AWS account. Make sure your AWS identity has the requisite permissions, including the ability to create SageMaker resources (Domain, Model, and Endpoints) and S3 access to upload model artifacts. Alternatively, you can attach the AmazonSageMakerFullAccess managed policy to your IAM user or role.

Operating System

This notebook was tested with the default python3 kernel on SageMaker Studio. A GPU instance such as ml.g4dn.xlarge is recommended.

Service limits

You need at least one ml.g5.4xlarge instance for inference, and more if you want to process multiple video chunks in parallel. Make sure your AWS account has sufficient quota for SageMaker inference.
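
If you want to check your quota programmatically, the sketch below queries the Service Quotas API for SageMaker quotas. The quota name filter is only illustrative; confirm the exact quota for ml.g5.4xlarge endpoint usage in the Service Quotas console.

import boto3

sq = boto3.client("service-quotas")
paginator = sq.get_paginator("list_service_quotas")
for page in paginator.paginate(ServiceCode="sagemaker"):
    for quota in page["Quotas"]:
        # Print quotas that mention the instance type used for inference
        if "ml.g5.4xlarge" in quota["QuotaName"]:
            print(quota["QuotaName"], quota["Value"])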

Deployment Steps

  1. To deploy the solution manually, download the AWS CloudFormation template to your local hard drive.

  2. Sign in to the AWS CloudFormation console.

  3. Select Create Stack.

  4. On the Create stack page, in the Specify template section, select Upload a template file.

  5. Under Upload a template file, select Choose file and choose the template you downloaded to your local drive.

  6. Choose Next and follow the steps in Launch the stack. Before you launch the stack, one of the input parameters asks you to choose the VPC and subnets that will host the SageMaker Studio Domain. To get started quickly, you can choose the default VPC, or select any other VPC that has internet connectivity.

  7. Stack creation takes a few minutes and sets up a SageMaker Studio Domain. Follow the instructions here to launch the Studio environment.

  8. Create a JupyterLab space and access your JupyterLab environment by following the instructions under the section To create a space and open JupyterLab. In Step 6, choose the ml.g4dn.xlarge instance, and in Step 8, choose 100 GB of storage. The screenshot below shows the details.

  9. Select "Run Space" and once JupyterLab environment is ready, Select "Open JupyterLab". On the JupyterLab home page, open a new terminal window by selecting "File -> New -> Terminal". On terminal, clone the git repo by running below command.

git clone https://github.com/aws-samples/super-slow-motion-video-creation-using-generative-ai-on-aws.git

Deployment Validation

After successfully cloning the repo, the following files and directories will be present in this directory structure:

|-- assets/                      Assets folder
|-- deployment/                  CloudFormation template to deploy SageMaker environment
|-- source/                      Code directory to host FILM model and generate slow-mo video
|   |-- slow-mo.ipynb
|   |-- helper.py
|   └── slow_mo_generator        Model and inference code for SageMaker Asynchronous Inference
|       |-- interpolator.py
|       |-- model.py
|       |-- requirements.txt
|       |-- serving.properties
|       └── utils.py

Running the Example

  1. cd to the repo folder super-slow-motion-video-creation-using-generative-ai-on-aws

  2. Open the slow-mo.ipynb notebook and follow the instructions to run through each cell.

  3. Note: the notebook automatically provides a sample video for testing. Feel free to replace the sample video with your own (see the upload sketch below).
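
If you try your own footage (step 3), one way to stage it is to upload it to the S3 bucket and prefix the notebook reads from. The bucket and key below are placeholders; use the values defined in slow-mo.ipynb.

import boto3

# Placeholder bucket/key; use the bucket and prefix configured in slow-mo.ipynb
boto3.client("s3").upload_file("my_clip.mp4", "<your-bucket>", "input/my_clip.mp4")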

Next Steps

To further enhance your solution at scale, there are several suggested next steps. First, create automated orchestration using AWS Step Functions to coordinate the workflow. Second, add an automated event trigger so that uploading a video to S3 will automatically trigger the orchestration. Third, incorporate AWS Batch jobs to split and assemble frames in order to maximize system parallelization. Finally, if you want to process 4K videos, you can adjust the model input parameters to split each 4K frame into 4 slices and then process each slice in parallel. Implementing these recommendations will allow you to scale your video processing pipeline to handle higher volumes with optimal performance.
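
As an illustration of the last suggestion (not the sample's code), the sketch below splits a frame into four quadrants so each slice can be interpolated in parallel and then stitched back together.

import numpy as np

def split_into_quadrants(frame: np.ndarray):
    """Split an HxWxC frame into four equal tiles: top-left, top-right, bottom-left, bottom-right."""
    h, w = frame.shape[:2]
    return [
        frame[: h // 2, : w // 2],
        frame[: h // 2, w // 2 :],
        frame[h // 2 :, : w // 2],
        frame[h // 2 :, w // 2 :],
    ]

def stitch_quadrants(tiles):
    """Reassemble the four tiles produced by split_into_quadrants."""
    top = np.hstack([tiles[0], tiles[1]])
    bottom = np.hstack([tiles[2], tiles[3]])
    return np.vstack([top, bottom])

In practice you would likely overlap the slices by a small margin and blend the seams, since the interpolation model has no context across tile boundaries.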

Cleanup

To avoid incurring AWS charges after you are done testing the example, make sure you delete the resources below (a minimal deletion sketch follows the list):

  1. Amazon SageMaker Studio Domain.

  2. Amazon SageMaker Asynchronous Inference Endpoint.
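
A minimal boto3 sketch for deleting the Asynchronous Inference endpoint and its associated resources; the names are placeholders matching the earlier sketch. The Studio Domain itself is easiest to remove by deleting the CloudFormation stack or from the console.

import boto3

sm = boto3.client("sagemaker")
sm.delete_endpoint(EndpointName="film-async")                      # the async inference endpoint
sm.delete_endpoint_config(EndpointConfigName="film-async-config")  # its endpoint configuration
sm.delete_model(ModelName="film-interpolation")                    # the registered model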
