Coder Social home page Coder Social logo

Comments (3)

xuebinqin avatar xuebinqin commented on July 22, 2024

from dis.

faruknane avatar faruknane commented on July 22, 2024

@xuebinqin Thank you for your reply.

Thanks for your interest. There are multiple different skip connections. Even the shallowest skip connections are after at least one convolution layer, which is trying to pass the details to the high-dimensional space. So your comments "the GT encoder can pass all the information from the input to the output without compressing or processing any high-level info since there are skip connections” is not very accurate, since the GT encoder is also deeply supervised and we cannot ignore these layers at the bottom.

I think there is still a possibility (not really) that it can pass the input directly to the output without processing even though there are convolution layers or kernels. One kernel might be just like the below and pass all the information:

image

However, now I see your point. We can't ignore other layers and the random initialization of the kernel values which means such a case will never happen, right? So, with the convolution layer at the beginning, the input mask will be distorted a bit no matter what we do or how we initialize the model. Also, the other layers will play a role in distorting the input or some of them in fixing the distortion. Then, the distorted input will be translated to the original input all the way, which the last layer will take different parts of information from different kernel results of the previous conv layer. We take feature maps before the last conv which gives us richer information about the input map. I am just trying to enrich my understanding. Your comments about this are really important to me.

More importantly, the features produced by the GT encoder used for supervising the training of DIS have more channels than the single-channel ground truth. So the gt encoder can be understood like a ground truth decompositor or descriptor other than a compressor. The GT encoder tries to convert the ground truth to another high-dimensional space, which provides supervisions from different perspectives. The motivation is to provide denser supervision for the training process for reducing the overfitting. Of course, there must be other better ways to encode the ground truth. We believe there are more possibilities that need to be explored. In this paper, we just provide one possible way to do that. You can also try to remove the skip connections and to see if that works better, which would be a very interesting topic.

Now, I look at the GT encoder from a different perspective. I get it that it is like a decomposer providing more meaningful features about the mask. Thank you again @xuebinqin ! I have been following your research for a while and I am happy about it. Hope you will develop more amazing ideas!

from dis.

faruknane avatar faruknane commented on July 22, 2024

Closing the issue.

from dis.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.