Hi. Why use fully connected blocks to transform those two scores to two weights? There are some other easy ways like softmax. How do you design this architecture?
Thanks a lot.
By reading your article, the channel wise representativeness score is calculated through variance。But images often have a non Gaussian distribution, is it appropriate to use variance for measurement?
Hi! What is the meaning of the double vertical symbol in Formula 4, and can you provide an explanation? Is it a European distance?Is this value directly proportional to discrimination?