Coder Social home page Coder Social logo

Comments (8)

styap94 avatar styap94 commented on August 20, 2024 3

https://github.com/harvitronix/five-video-classification-methods
you may try with this. i tried with my own data, it is able to predict. but the accuracy it not so satisfy.

from two-stream-action-recognition.

11yusuf avatar 11yusuf commented on August 20, 2024 2

@hanako94 hello, did you find a solution for that ? could you even test it with your own videos ?

from two-stream-action-recognition.

newpro avatar newpro commented on August 20, 2024 2

hey @hanako94
Method from this repo requires optical flow as pre-processing data, to target at higher accuracy. If your classification category are limited, speed bottleneck is at OF calcuation in real time. More recent works have been done to eliminated OF calculation from pipelines to speed up and achieve end-to-end training, without sacrifice much of accuracy. For example Hidden two streams model is worth checking out.

Cheers!

from two-stream-action-recognition.

speculaas avatar speculaas commented on August 20, 2024

Dear Hanako94,
Harvitronix's 5-video-classify-methods says passing a video and getting a predicted class is not implemented yet.
Is it true?
Or it can already do demos, and the README is not updated accordingly?
Or you modified it to do demos (by demo I mean passing a video and get a class prediction)?
BR,
JimmyYS

from two-stream-action-recognition.

styap94 avatar styap94 commented on August 20, 2024

yup, i modified myself to accept mobile camera input (every 30 frames) and predict the outcome on phone

from two-stream-action-recognition.

speculaas avatar speculaas commented on August 20, 2024

Dear Hanako,
thx for your help!
do you use this pre trained model:

saved_model = 'data/checkpoints/lstm-features.026-0.239.hdf5'
?
as specified in :

five-video-classification-methods\demo.py

or you use other pre-trained model?

Can you share how you did the demo?
Some guide? Hint?
Or maybe commit patch?

BR,
JimmyYS

from two-stream-action-recognition.

styap94 avatar styap94 commented on August 20, 2024

i'm not using the model mentioned, i trained my own model, by using my own dataset. But my scope is very small, just to recognize 4 classes only.
to modify the demo to read video input or web camera frame input and do real time prediction, you might need to spend some time to study the code written by Harvitronix and modify from there, especially on how he extracts frames from video, convert it and pass it for inference.
i think for lstm and mlp is hard to make real time prediction because it takes time to extract feature from frames before do the prediction. lrcn will be faster, cause it use only frames for prediction (drawback is that the accuracy is not so satisfied)

from two-stream-action-recognition.

speculaas avatar speculaas commented on August 20, 2024

Dear Hanako,
Thanks for your generous advice,
So far, I have read Andrej Karpathy's large scale video classification with conv neural network.
There are at least two more paper I plan to study.
I think Andrej's paper mentioned several interesting ideas, but, unfortunately, their source code is not open source.

I find some repo on github, but till now, I haven't found any pre-trained model yet:
https://github.com/wangheda/youtube-8m
https://github.com/jeffreyhuang1/two-stream-action-recognition
https://github.com/harvitronix/five-video-classification-methods

BR,
JimmyYS

from two-stream-action-recognition.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.