Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
The class inherits from the FasterRCNN class. Is it correct that the pretrained option in the config is set to False at initialization, and that the pretrained weights are instead loaded separately by the StillFast class?
Besides, I tried to run the model on Ego4D v2. I am afraid I am doing something wrong, since the results are very poor. For the still images, I extract each frame from the LMDB database instead of loading it from a separately extracted image file (as is done in the code).
The paper says you trained with a batch size of 8 on four V100s. Does that mean 8 per GPU, so 32 in total? Or does it mean 2 per GPU and 8 in total?
I ask because the config file of this codebase sets batch_size = 14, which probably means about 3 per GPU when using 4 GPUs (14 does not divide evenly by 4), and that confused me.
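
To make the readings concrete, here is a small sketch of the arithmetic I have in mind. The helper names are mine and not from the codebase, and I am assuming the global batch is split as evenly as possible across GPUs, which may not be what the training script actually does:

```python
from typing import List


def total_batch_size(per_gpu: int, num_gpus: int) -> int:
    """Reading A: the reported number is per GPU, so the total is the product."""
    return per_gpu * num_gpus


def per_gpu_batch_sizes(global_batch: int, num_gpus: int) -> List[int]:
    """Reading B: the reported number is the global batch.

    Assumption: the batch is split as evenly as possible, with the first
    `remainder` GPUs receiving one extra sample.
    """
    base, remainder = divmod(global_batch, num_gpus)
    return [base + 1 if rank < remainder else base for rank in range(num_gpus)]


# "Batch size of 8 on four V100s" under each reading.
print(total_batch_size(8, 4))      # 32 samples per step in total
print(per_gpu_batch_sizes(8, 4))   # [2, 2, 2, 2]

# batch_size = 14 from the config, read as a global batch on 4 GPUs.
print(per_gpu_batch_sizes(14, 4))  # [4, 4, 3, 3]
```

Under this assumption, batch_size = 14 on 4 GPUs would give an uneven split of 4, 4, 3, 3, which is why I am unsure how the value in the config relates to the 8 reported in the paper.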