Kinetics Dataset Downloader
Kinetics-600 is a large-scale high-quality dataset of YouTube video URLs which include a diverse range of human focused actions.
Warn·ing! This work comes with no warranty!
This script downloads the Kinetics-600 dataset according to author guidelines at https://arxiv.org/pdf/1808.01340.pdf. The original work is from https://deepmind.com/research/open-source/open-source-datasets/kinetics/. The original paper can be found at https://arxiv.org/pdf/1705.06950.pdf. This script downloads only ~10 second long clips using the provided youtube video ID. Some videos may be absent because they may have been taken off YouTube.
Pre·req·ui·sites ( dependencies )
-
Environment: Ubuntu 16.04 (This is some-what irrelevant but worth mentioning)
-
Python 3
-
tqdm
pip3 install tqdm
- youtube-dl
sudo wget https://yt-dl.org/downloads/latest/youtube-dl -O /usr/local/bin/youtube-dl
sudo chmod a+rx /usr/local/bin/youtube-dl
- ffmpeg
sudo add-apt-repository ppa:jonathonf/ffmpeg-4
sudo apt-get update
sudo apt-get install ffmpeg
Us·age
python kinetics_dataset.py -t [data_set_categoty] -d [destination_to_where_videos_should_be_downloaded_to]
eg. python kinetics_dataset.py -t train -d ~/Documents/Kinetics_dataset
Command Line Parameters
-t
: Type of dataset category to be downloaded, such as train, validate, test and holdout-d
: Final destintion where all videos should be dwonloaded and saved to. A directory is automatically created in this destination folder according to the parameter of-t
From the above exampple, the final destination folder may look like
|~/Documents/Kinetics_dataset
|-- train
|-- validate
|-- test
|-- holdout