Hi, thanks for your great work.
For clarification, it would be great to know the dimensions of the tensors in Section 3.2.
Below is my understanding of the tensor dimensions when using the HICO-DET dataset.
If I have misunderstood anything, please let me know.
\tilde{l}_o : (1, 80)
A_o : (80, 600)
l_v : (1, 117)
A_v : (117, 600)
Therefore, \bar{y} : (1, 600). Is this correct?
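
To make the shape reasoning concrete, here is a minimal NumPy sketch of how I read it. The co-occurrence matrices are random placeholders rather than the real HICO-DET mappings, and combining the two projections with an elementwise AND is my assumption about the composition step, not something I have confirmed against the code.

```python
import numpy as np

NUM_OBJECTS, NUM_VERBS, NUM_HOIS = 80, 117, 600

rng = np.random.default_rng(0)

# Placeholder binary co-occurrence matrices (random, NOT the real HICO-DET mappings).
A_o = rng.integers(0, 2, size=(NUM_OBJECTS, NUM_HOIS))  # (80, 600)
A_v = rng.integers(0, 2, size=(NUM_VERBS, NUM_HOIS))    # (117, 600)

# One-hot object and verb labels; the chosen indices are arbitrary.
l_o = np.zeros((1, NUM_OBJECTS), dtype=int)  # (1, 80)
l_o[0, 3] = 1
l_v = np.zeros((1, NUM_VERBS), dtype=int)    # (1, 117)
l_v[0, 10] = 1

obj_proj = l_o @ A_o    # (1, 80) @ (80, 600)  -> (1, 600)
verb_proj = l_v @ A_v   # (1, 117) @ (117, 600) -> (1, 600)

# Assumed composition: elementwise AND of the two projections.
y_bar = np.logical_and(obj_proj, verb_proj).astype(int)

print(obj_proj.shape, verb_proj.shape, y_bar.shape)
```

Under these assumptions both projections and the composed label come out as (1, 600).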
Also, since the composed HOI label must fall within the original set of 600 HOI triplets, is it correct that this method cannot discover novel HOI triplets, and that the main focus of the work is learning affordances correctly via feature composition?
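
Here is a toy sketch of why I think that. The categories and matrices below are made up for illustration (3 objects, 2 verbs, 3 HOI categories), and the elementwise AND is my assumption about how the label is composed. A verb-object pair that is not among the predefined HOI categories ends up with an all-zero label.

```python
import numpy as np

# Hypothetical toy setup, not HICO-DET.
# Objects: 0=bicycle, 1=horse, 2=apple. Verbs: 0=ride, 1=eat.
# HOI categories: 0=(ride, bicycle), 1=(ride, horse), 2=(eat, apple).
A_o = np.array([[1, 0, 0],   # bicycle
                [0, 1, 0],   # horse
                [0, 0, 1]])  # apple
A_v = np.array([[1, 1, 0],   # ride
                [0, 0, 1]])  # eat


def compose(verb_idx, obj_idx):
    """Compose an HOI label from one-hot verb and object labels (assumed AND)."""
    l_v = np.zeros((1, A_v.shape[0]), dtype=int)
    l_v[0, verb_idx] = 1
    l_o = np.zeros((1, A_o.shape[0]), dtype=int)
    l_o[0, obj_idx] = 1
    return np.logical_and(l_v @ A_v, l_o @ A_o).astype(int)


known = compose(verb_idx=0, obj_idx=1)   # (ride, horse): in the label set
unseen = compose(verb_idx=0, obj_idx=2)  # (ride, apple): not in the label set

print(known)   # [[0 1 0]]
print(unseen)  # [[0 0 0]]
```

If this reading is right, a composed pair outside the predefined set has no slot in the label vector, so it cannot be represented as a new category.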
Again, thanks for sharing your great work.