Extended LSTM for Video Summarization

This code implements part of the Extended LSTM for video representation, together with experiments on the Charades dataset. Summarization quality is evaluated by comparing the action classification rate obtained from the video summary against the rate obtained from the original videos.
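
As a rough illustration of that comparison, the sketch below computes a classification rate for predictions made from the original videos and from their summaries, then reports how much of the original rate the summary retains. It is written in Python rather than the repository's Lua. It also assumes one label per video and plain accuracy as the "classification rate", which may differ from the metric the repository actually uses. The function names and the label values are hypothetical.

```python
from typing import Sequence


def classification_rate(predicted: Sequence[int], actual: Sequence[int]) -> float:
    """Fraction of videos whose predicted action label matches the ground truth.

    Assumption: one label per video and plain accuracy as the metric.
    """
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if len(actual) == 0:
        raise ValueError("need at least one video")
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return correct / len(actual)


def retained_rate(summary_rate: float, original_rate: float) -> float:
    """Share of the original classification rate that the summary preserves."""
    if original_rate <= 0:
        raise ValueError("original_rate must be positive")
    return summary_rate / original_rate


# Hypothetical labels for four videos (illustrative values only).
actual = [3, 1, 2, 3]
from_original = [3, 1, 2, 0]  # predictions made from the full videos
from_summary = [3, 1, 0, 0]   # predictions made from the summaries

original_rate = classification_rate(from_original, actual)
summary_rate = classification_rate(from_summary, actual)
print(original_rate, summary_rate, retained_rate(summary_rate, original_rate))
```

A retained rate close to 1 would suggest that the summary keeps most of the information the classifier relies on.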

Requirements

This code is written in Lua and requires Torch. For details of the prerequisites, please refer to Karpathy's char-rnn.

Usage

Data

General information about the Charades dataset can be found here.

First, download the RGB features and annotations provided on the Charades website:

  1. two-stream RGB features @8fps
  2. annotation file and evaluation

Uncompress the files and save them under a directory $DATA, which is later passed to the code through the '-dir_data' option. The last package should contain two .csv files, Charades_v1_train.csv and Charades_v1_test.csv. These are the annotation files the code imports.

$ main_charades.lua -dir_data $DATA
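
Before running the command above, it can help to confirm that both annotation files are in place. The following Python sketch is not part of the repository. It assumes the two .csv files sit directly under $DATA rather than in a subdirectory, and the helper name missing_annotations is hypothetical.

```python
import tempfile
from pathlib import Path
from typing import List

# File names taken from the README; their location directly under the
# data directory is an assumption.
REQUIRED_ANNOTATIONS = ("Charades_v1_train.csv", "Charades_v1_test.csv")


def missing_annotations(data_dir: str) -> List[str]:
    """Return the required annotation files that are absent from data_dir."""
    root = Path(data_dir)
    return [name for name in REQUIRED_ANNOTATIONS if not (root / name).is_file()]


# Demonstration on a temporary directory that holds only the training file.
with tempfile.TemporaryDirectory() as demo_dir:
    (Path(demo_dir) / "Charades_v1_train.csv").touch()
    demo_missing = missing_annotations(demo_dir)

print(demo_missing)
```

In practice the function would be called with the same path that is given to '-dir_data', and an empty list means both files were found.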

Preprocessing

Open ./prep/trainData_gen.lua. On line 18, set dir_data, dir_anno, and dir_dest to your own directories. Do the same in ./prep/testData_gen.lua.

Training

models:

There are four versions of our extended LSTM model: LSTM_extended, LSTM_binaryJ, LSTM_cumulative, and LSTM_topK.

  • LSTM_extended: the baseline version
  • LSTM_binaryJ: the j_gates are binary (0/1)
  • LSTM_cumulative: uses a small buffer to take past information into account
  • LSTM_topK: picks the K frames of every video that correspond to the top K j_gates
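
To make the binary and top-K variants more concrete, here is a minimal Python sketch of the two selection rules applied to one video's per-frame j_gate values. It is not the repository's Lua implementation. The 0.5 threshold, the function names, and the gate values are assumptions chosen for illustration, and the sketch only covers frame selection, not the LSTM itself.

```python
from typing import List, Sequence


def binarize_gates(j_gates: Sequence[float], threshold: float = 0.5) -> List[int]:
    """Map each gate value to 1 if it reaches the threshold, otherwise 0.

    The threshold value is an assumption; the README only says the gates are 0/1.
    """
    return [1 if g >= threshold else 0 for g in j_gates]


def top_k_frames(j_gates: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k largest gate values, in temporal order."""
    if k < 0:
        raise ValueError("k must be non-negative")
    # Rank frame indices by gate value, largest first (ties keep frame order).
    ranked = sorted(range(len(j_gates)), key=lambda i: j_gates[i], reverse=True)
    # Keep the first k and restore temporal order for the summary.
    return sorted(ranked[:k])


# Hypothetical gate values for a six-frame video.
gates = [0.1, 0.9, 0.4, 0.8, 0.05, 0.7]
binary = binarize_gates(gates)
selected = top_k_frames(gates, k=2)
print(binary, selected)
```

Returning the selected indices in temporal order keeps the summary frames in the same sequence as the original video.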

examples:

Here's an example of training with the LSTM_topK model.

$ main_charades.lua -model LSTM_topK -
