Coder Social home page Coder Social logo

Comments (2)

aosokin avatar aosokin commented on September 28, 2024

Hi Simon,

The main goal of resizing images is to align object and images-to-detect with model expectations. Object images are internally resized to have maximum size of 240, and this number is baked in the architecture - the number of parameters of the Transformation network (more details here). Os2dHead internally resizes (here) feature maps to match the expected size.

So to get the best performance on your own dataset in the eval mode, you need to use an image pyramid such that at least one of its levels has objects-to-detect of size approximately 240. For training, you need to select the sampling of patches to match the corresponding sizes. All of these can potentially be set by approximately computing dataset_scale for your dataset. We did exactly this for all datasets we touched in the paper (e.g., grozi).

Hope this helps!

Best,
Anton

from os2d.

chasb799 avatar chasb799 commented on September 28, 2024

Thank you for your helpful respone!

from os2d.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.