j-min / adversarial_video_summary Goto Github PK

Unofficial PyTorch Implementation of SUM-GAN from "Unsupervised Video Summarization with Adversarial LSTM Networks" (CVPR 2017)

Python 100.00%

video summarization gan vae pytorch

adversarial_video_summary's Introduction

Adversarial_Video_Summary

PyTorch Implementation of SUM-GAN

from "Unsupervised Video Summarization with Adversarial LSTM Networks (CVPR 2017)"
by Behrooz Mahasseni, Michael Lam and Sinisa Todorovic
Code Author: Jaemin Cho
Used as baseline for unsupervised video summarization

Changes from Original paper

Video feature extractor
- GoogleNet pool5 (1024) => ResNet-101 pool5 (2048)
- Followed by linear projection to 500-dim
Stable GAN Training
- Discriminator's learning rate: 1e-5 (Others: 1e-4)
- Fix Discriminators' parameters for first 15 steps at every epoch.

Model figures

Algorithm

adversarial_video_summary's People

Contributors

Stargazers

Watchers

Forkers

19ai haiantyz zmxheart iqbal-chowdhury benben0413 hephaex haowangxidian malleshamdasari shubhampachori12110095 zongka sungjinlees robbertonly nassimanoufail panna19951227 emininem yuan-wenhua awesome-archive zyayi vinace keode zhique930716 balrajashwath zzthechaos wangxinqi94 chenbohua3 xuqiong1989 harlanhong e-apostolidis yfamy123 junaid112 wannawannawanna linglingzhao shineyusong ammieqi ledduy610 acc-l jacksyu dr-zhuang chencheng1003 smile-zbj sodiqadewole rumace jnzs1836 pdsyaom zlj2015106 speedsters chohyoungseo sudao-he anshul-miglani-17 yuqinghao1 ai-timi markjhonbao iq-scm kshireen trminhnam whyiswmm

adversarial_video_summary's Issues

How can get change points using KTS?

I tried to get change points using KTS code.
But i couldn't get proper change points.

If someone get change points using KTS, please help me?

The worst github repository you can work on.

if you are planning to use this repo, dont waste time and just skip it.

literally 0 support from the author
No single issue was closed or even one usefull information was mentioned.
The porject suppose that you have some dataset that you can not ever find it anywhere
Completly waste of time.

what is the purpose of uploading this project to open source and make it public if no one can contribute to it?

Can you give me the data set provided in the original text?

data

Can you tell me where you downloaded the dataset? I want to run your code has not run through, can I have your contact information?

How to evaluate the result on TVSUM or SUMME dataset.

Sorry to impose, I could only see 360airballoon in your code.
Can you release the code to evaluate on TVSUM or SUMME?

about the score of every frame

I note that the sLSTM output is of [0, 1], so how can i ensure if the frame is a key frame?
If it is better when output the {0, 1}?

How to extract video features and the number of seq_len?

Hi, in this code, the video features is extracted using resnet，but I don't know if the features are extracted for one by one frame and what is the number of seq_len. Is a 2048-dimension feature of just one video frame extracted and saved as a h5df file or the features of whole video frames are saved as just one h5df file? Could give me some instruction about how to extract and save the features of the whole original video frames in h5df file. Thank you very much.

Inclusion of DPP loss as summary length regularization doesn't help in quality summarization

The authors propose DPP loss for Diversity Regularization in their model. Detrimental Point processes are a idea that help in sampling diverse subset of points from a set of points. This is a dire extension without which the model implementation is incomplete. I am willing to help in this. So can you raise a ticket about things to do and add it in read me. Cheers

Can you provide the pre-trained model for testing?

where to download the dataset?

Where is the 360 dataset downloaded?

Where is the 360 dataset downloaded? Thank you very much for your code.

log dir AttributeError: 'PosixPath' object has no attribute 'split'

There is not split attribute...
6nipdb> ipdb> logdir
PosixPath('/content/data1/jmcho/SUM_GAN/360airballoon')
6nipdb> ipdb> n
--Return--
None

/content/Adversarial_Video_Summary/utils.py(13)init()
12 import ipdb; ipdb.set_trace()
---> 13 super(TensorboardWriter, self).init(logdir)
14 self.logdir = self.file_writer.get_logdir()

6nipdb> ipdb> n
AttributeError: 'PosixPath' object has no attribute 'split'

/content/Adversarial_Video_Summary/solver.py(68)build()
67 import ipdb; ipdb.set_trace()
---> 68 self.writer = TensorboardWriter(self.config.log_dir)
69

6nipdb> ipdb> n
--Return--
None

/content/Adversarial_Video_Summary/solver.py(68)build()
67 import ipdb; ipdb.set_trace()
---> 68 self.writer = TensorboardWriter(self.config.log_dir)
69

6nipdb> ipdb> self.config.log_dir.split()
*** AttributeError: 'PosixPath' object has no attribute 'split'

6nipdb> ipdb> type(self.config.log_dir)
<class 'pathlib.PosixPath'>
6nipdb> ipdb> dir(self.config.log_dir)

The original video features also need feed into eLSTM and dLSTM

Base on the paper, the original video features also need feed into eLSTM and dLSTM and then feed it to Discriminator(cLSTM). But this implementation seems feed the original features directly into Discriminator after a linear_compress layer. Is this a Bug here ?

I can not run the code owing to the dataset can you offer the dataset ?

Input format of SumMe dataset

Hello, I am using SumMe dataset. Would you like to use mp4 format or webm format?

poor results applying video summarization on BDD100 dataset

I am trying to apply the network on BDD100 dataset which is for drives so c_loss is -Gan_loss

in the paper we should :

For learning {θs, θe}, minimize
(Lreconst+Lprior+Lsparsity). ==> s_e_epoch
For learning θd, minimize (Lreconst+LGAN). d_epoch
For learning θc, maximize LGAN. which is -c_loss so minimize c_epoch

but i am having this behaviour? what could be the problem ?

Could you tell me how to test this code?

thank you for your code, and could you tell me how to test it ? Very appreciate it.

model train help

hi, thank you for your codes. I have tried the codes, but failed. About the training, could you share a pre-trained one or give more guides?

How to run train processing by data of OVP(open video project)

Hello,
Thank you for your code, I plan to train the algorithm by data of OVP(open video project). but the data of OVP are .mpg format files, it's not directly used to be training SUM-GAN. so, how should I do for running train processing.
Thanks

Version Specifications

Hello and thanks for the code!

Is it possible to add in README the version specifications for the packages used? If they exist, it seems that I've missed them.
Packages like "Pillow" rarely break code in their version upgrades, but the same doesn't seem to happen for deep learning libraries (keras and tensorflow break backwards-compatibility very often).
I'm mostly interested in the "torch" and "torchvision" versions used.

Thanks!
Alex