Thanks for your great work! I understand u stress the unsupervised learning of the

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Some Question About the Ablation Exps about maskflownet HOT 2 CLOSED

lidongyv commented on August 27, 2024

Some Question About the Ablation Exps

from maskflownet.

Comments (2)

simon1727 commented on August 27, 2024 1

Hi Lidong, thanks for your interest and your insightful questions!

For question 1, we haven't done any experiment on the comparison. A problem is that the mask used to filter useless information (shaded area) after warping might not be the same with the ground truth occlusion map at each level. Please feel free to draw comparison between our method and others' and we are looking forward to hearing what you discover!

For question 2, we have already released the weights to produce the visualization, so you can use them to produce more! What we think is that, object-object case has smaller size and larger relative motion, so it might be harder than fore-background case, but the mechanisms behind them are the same- background can be seen just as an object with relatively small motion.

from maskflownet.

lidongyv commented on August 27, 2024

@simon1727 Thanks for your answer.

For question 1, the reason I talked it is that I used to conduct some exps on occlusion years ago. I did use the occlusion map(ground truth) to guide the refinement, it gave a great promotion. But the promotion becomes not that obvious when I switch the occlusion map to the supervised learned one. Although my result is lost now, I still remember how hard it is to learn the occlusion map.
It is a great discovery of your work to show that the unsupervised learning of attention maps might be more proper to guide the refinement. An unsupervised activation map from the learned feature map in the last few layers indeed shows the attention for the final activation. Thanks again for your discovery.

For question 2, what I am interested in is actually the big motion cases. Object-object cases are always having bigger relative motion. I may be able to generate some cases on crowded scenes if I can access the data later. The problem might be changed if the motion is too big as we all know both Flownet and PWC is hard to cover a low-frequency video. I might need to conduct more exps to see the result.

Thanks again for your great work and patience.

from maskflownet.

Recommend Projects

Some Question About the Ablation Exps about maskflownet HOT 2 CLOSED

Comments (2)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent