This Streamlit-based application combines computer vision and AI to analyze and understand video content. It sends frames from uploaded videos to OpenAI's GPT-4 API, which generates descriptive text summarizing the content and context of each video.
- Video Upload: Users can upload videos in MP4, AVI, or MOV formats.
- Frame Display: The application displays the first frame of the uploaded video.
- Automated Description: Leverages OpenAI's GPT-4 to generate descriptions for the video content.
- Upload a video file using the Streamlit file uploader.
- Wait as the app processes the video and displays the first frame.
- Read the generated description of the video content.
- The app saves the video to a temporary file for processing.
- Frames are extracted and converted to base64 for analysis.
- Descriptions are generated by sending frames to the OpenAI API.
- The first frame of the video and its description are displayed on the UI.
To run this application locally, follow these steps:

1. Ensure you have Python installed on your system.
2. Clone this repository to your local machine.
3. Install the required dependencies.
4. Create a `.env` file in the root directory of the project and add your OpenAI API key: `OPENAI_API_KEY='your_api_key_here'`
5. Start the Streamlit app: `streamlit run your_script_name.py`, replacing `your_script_name.py` with the name of your Python script.
Make sure you have the following Python packages installed:
- streamlit
- opencv-python-headless
- openai
- python-dotenv
- requests

(The `base64` and `tempfile` modules used by the app are part of the Python standard library and do not need to be installed separately.)
You can install these packages using pip: `pip install streamlit opencv-python-headless openai python-dotenv requests`