Coder Social home page Coder Social logo

cops's People

Contributors

aabbas90 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

nik1806

cops's Issues

Fully differentiable training doesn't work with lower batch size

I don't have available any gpu with enough memory to execute your training scripts with the config you provided.
Because of that I have executed the pretrain on cityscapes with a batchsize of 2 (instead of 12), ending with a network with the following validation metrics (note that not having the small small validation set, I simply used the whole cityscapes validation set):

PQ SQ RQ PQ_th SQ_th RQ_th PQ_st SQ_st RQ_st
54.8540 80.6902 66.7585 42.9238 79.5514 53.7625 63.5305 81.5184 76.2101

(the training total_loss was 1.501 at the final iteration)
I then used the last checkpoint of this pretrain for the fully differentiable training, with a bachsize of 8 (instead of 24), but during the training the total_loss went up to around 9.3, while the pq and sq went down to 0 (both the training and the evaluation ones at the end). I also tried lowering the learning rate, but the results were always the same.

Is this a known issue? Are there any solutions?

PS: in the paper you said you used {1, 4, 16, 32, 64, 128} as edge distances with cityscapes, but in your training configuration the maximum distance is 64. Is it intentional?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.