
Comments (4)

jonkhler commented on August 21, 2024

I just ran it on my laptop. If I see it right, these "weird results" simply come from the distortion introduced by the stereographic projection. As written in the paper, we compare two data sets: one in which each digit is projected onto the northern hemisphere, and one in which each projected digit is additionally randomly rotated (p. 6). Since the two coordinates of the spherical image correspond to the polar and azimuth angles, a projection onto a pole must end up at either the top or the bottom of the image. For the digits this can introduce significant distortion (the whole point of the architecture ;-)), but if you closely inspect your weird images, they make a lot of sense (e.g. the topology of the 0 is wrapped around the pole, the "hole" of the 9 is clearly visible, as are the curves of the 3). If you inspect your own image, you can see that it corresponds to the original image, just wrapped around one pole.
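To make the "wrapped around the pole" effect concrete, here is a minimal NumPy sketch. It is not taken from the s2cnn code: the function name `cap_mask` and the grid layout (rows are polar angle, columns are azimuth, row 0 nearest the north pole) are assumptions for illustration. It shows that even a small cap around the pole fills the full width of the top rows of the image.

```python
import numpy as np

def cap_mask(n_polar, n_azimuth, cap_angle):
    """Boolean (polar, azimuth) image of a spherical cap around the north pole.

    Hypothetical helper: rows sample the polar angle in (0, pi),
    columns sample the azimuth in [0, 2*pi).
    """
    theta = (np.arange(n_polar) + 0.5) * np.pi / n_polar   # polar angle per row
    phi = np.arange(n_azimuth) * 2 * np.pi / n_azimuth     # azimuth per column
    theta_grid, _ = np.meshgrid(theta, phi, indexing="ij")
    # A cap around the pole depends only on the polar angle,
    # so every azimuth column is covered in the affected rows.
    return theta_grid <= cap_angle

mask = cap_mask(32, 32, np.pi / 8)
rows = np.flatnonzero(mask.any(axis=1))
print("rows touched by the cap:", rows.tolist())
print("each touched row spans all columns:", bool(mask[rows].all()))
```

A digit centered on the pole behaves the same way: its strokes get smeared along the horizontal (azimuth) direction at the top of the image.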

If you inspect randomly rotated images/digits (without the flags activated), you will recognize them more easily in the cases where the random rotation of the grid pushes them away from the poles. Then you will see pictures like those in Figure 4 of the original paper. To test that those images do indeed correspond to the original images, just wrapped around the pole of the sphere, you can rotate them (e.g. in Fourier space, as given in the examples) and see how they change with the azimuth and polar angles.
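As a rough illustration of the azimuth part of that check, the sketch below rotates a toy spherical image about the polar axis. It is only a plain grid shift with `np.roll`, not the Fourier-space rotation from the repository's examples, and it assumes a (polar, azimuth) layout with uniformly spaced azimuth columns. A change of polar angle is not a simple shift like this and would need resampling or the Fourier-space approach.

```python
import numpy as np

n_polar, n_azimuth = 8, 16
image = np.zeros((n_polar, n_azimuth))
image[1, 2:5] = 1.0  # a short stroke close to the pole (row 0 is nearest the pole)

# Rotating about the polar axis by 4 grid steps (4 * 360 / 16 = 90 degrees)
# is a cyclic shift along the azimuth axis.
shift = 4
rotated = np.roll(image, shift, axis=1)

stroke_rows = np.flatnonzero(rotated.any(axis=1))
stroke_cols = np.flatnonzero(rotated.any(axis=0))
print("polar rows after rotation:", stroke_rows.tolist())
print("azimuth columns after rotation:", stroke_cols.tolist())
```

The stroke keeps its polar row and only moves along the azimuth axis, which is the behavior to look for when rotating one of the "weird" images.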

EDIT: I realize that the current version of the code must indeed be slightly different from the original version I used to generate the digits in the paper. If I remember correctly, the grid was already rotated such that the digit ends up in the central area of the spherical image. I will check this and probably refactor that part to avoid confusion as soon as I find time.

from s2cnn.

townblack commented on August 21, 2024

Thanks a lot. Your explanation makes it clear to me. I will give it a try as you suggested.


mariogeiger commented on August 21, 2024

[image]

The center of the image (green dot) is mapped to the pole (green line).
The four corners (other colored dots) are mapped to the equator.
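This mapping can be reproduced with a textbook inverse stereographic projection. The sketch below is an assumption-laden illustration, not the s2cnn implementation: the function name `plane_to_sphere_angles`, the choice of the south pole as projection origin, and the scaling of the image so that its corners lie at distance 1 from the center are all chosen here to match the description above.

```python
import numpy as np

def plane_to_sphere_angles(x, y):
    """Inverse stereographic projection of plane points onto the unit sphere.

    Hypothetical helper: projects from the south pole (0, 0, -1) through the
    plane z = 0 and returns (polar angle, azimuth) of the point on the sphere.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    r2 = x**2 + y**2
    z = (1.0 - r2) / (1.0 + r2)               # height of the point on the sphere
    polar = np.arccos(np.clip(z, -1.0, 1.0))  # 0 at the north pole, pi/2 at the equator
    azimuth = np.mod(np.arctan2(y, x), 2 * np.pi)
    return polar, azimuth

# Assumed scaling: square image whose corners are at distance 1 from the center.
half_side = 1 / np.sqrt(2)
corners = half_side * np.array([[1, 1], [-1, 1], [-1, -1], [1, -1]])

center_polar, _ = plane_to_sphere_angles(0.0, 0.0)
corner_polar, corner_azimuth = plane_to_sphere_angles(corners[:, 0], corners[:, 1])

print("center polar angle (deg):", np.degrees(center_polar))
print("corner polar angles (deg):", np.degrees(corner_polar))
print("corner azimuths (deg):", np.degrees(corner_azimuth))
```

Under these assumptions the center lands exactly on the pole, where a single point covers the whole top line of the (polar, azimuth) image, and the four corners land on the equator at four azimuths spaced 90 degrees apart.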


townblack commented on August 21, 2024

Thank you very much! It helps me to understand the projection process.

