Coder Social home page Coder Social logo

Comments (3)

mrezaei92 avatar mrezaei92 commented on July 1, 2024

Dear @TheoCarme,

Please note that TriHorn-Net takes a single image as input. If you plan on performing hand pose estimation on a video stream, the network architecture should be modified. To this end, you need to consider some factors such as:

  • Are you going to predict hand pose for the latest frame in the video, or are you going to perform hand pose estimation on each frame in the sequence by taking advantage of temporal information embedded in the video?
  • What dataset are you going to use? If you plan on working on RGB images, there are some video datasets, but in the realm of depth images, as far as I know, there is only NYU. You should note that there are some discontinuities in the frames of NYU, which have to be detected manually.

from trihorn-net.

TheoCarme avatar TheoCarme commented on July 1, 2024

Dear @mrezaei92 thank you for responding.

When I wrote "on a video stream" I meant : to perform the hand pose estimation one frame at the time as I grab them.
As for the dataset I intend to use MSRA since contrary to ICVL and NYU it has 21 joints per frame.
So could you provide me with some examples please ?

Also I was wondering if it is normal that when I trained on MSRA, at the end of the 60th epoch it automatically restarted the training at the first epoch.

from trihorn-net.

TheoCarme avatar TheoCarme commented on July 1, 2024

Dear @mrezaei92 could you please help me ?

from trihorn-net.

Related Issues (10)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.