Comments (1)
when i use swin_base_patch244_window877_kinetics400_22k to train my dataset, the config file:
dataset settings
dataset_type = 'VideoDataset' data_root = 'data/kinetics400/train' data_root_val = 'data/kinetics400/val' ann_file_train = 'data/kinetics400/kinetics400_train_list.txt' ann_file_val = 'data/kinetics400/kinetics400_val_list.txt' ann_file_test = 'data/kinetics400/kinetics400_val_list.txt' img_norm_cfg = dict( mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375], to_bgr=False) train_pipeline = [ dict(type='DecordInit'), dict(type='SampleFrames', clip_len=32, frame_interval=2, num_clips=1), dict(type='DecordDecode'), dict(type='Resize', scale=(-1, 256)), dict(type='RandomResizedCrop'), dict(type='Resize', scale=(224, 224), keep_ratio=False), dict(type='Flip', flip_ratio=0.5), dict(type='Normalize', **img_norm_cfg), dict(type='FormatShape', input_format='NCTHW'), dict(type='Collect', keys=['imgs', 'label'], meta_keys=[]), dict(type='ToTensor', keys=['imgs', 'label']) ]
because my frame txt format is :
some/directory-1 163 1 some/directory-2 122 1 some/directory-3 258 2 some/directory-4 234 2 some/directory-5 295 3 some/directory-6 121 3
I want to change dataset_type = 'RawframeDataset', do i need modify "dict(type='DecordInit')" ?
Hello! My dataset format is like yours, could you share your config file?
Thank you very much!
from video-swin-transformer.
Related Issues (20)
- About learning rate and batch size HOT 2
- Error about pretrained model HOT 2
- RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.
- RuntimeError : one of the variables needed for gradient computation has been modified by an inplace operation:
- onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] HOT 1
- Results cannot be reproduced through training.
- ValueError: Expected input batch_size (32) to match target batch_size (1).
- Question about the pretrained model used in Sthv2
- How to train/test on custom dataset like HMDB51 HOT 6
- How to train MyDataset HOT 1
- About the input image shape HOT 2
- Is there any plan to release the video swin transformer code and pre-trained models of swin transformer V2?
- Embeddings
- Error: av_read_frame failed with 1094995529
- Swin-L pretrain
- Can't export ONNX transformer HOT 3
- is there a priority between drop out and patch norm?
- Error found in the code about shift_size calculation
- How much video memory does a single GPU need to run SSV2? I use the 16G ,then it prompt CUDA out of memory
- Pretraining for SSv2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from video-swin-transformer.