How to obtain multiple hotspots? (vrb issue, open, 56 comments)

ubless607 avatar ubless607 commented on September 28, 2024
How to obtain multiple hotspots?

from vrb.

Comments (56)

ubless607 avatar ubless607 commented on September 28, 2024 2

@Tom-Huang Sent!

from vrb.

jun7-shi avatar jun7-shi commented on September 28, 2024 1

@ubless607 thanks so much.
Using the pretrained weights you provided, i could reproduce the results of multiple hotspots.

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024 1

@ZY199301 sent!

from vrb.

Kimagure7 avatar Kimagure7 commented on September 28, 2024 1

Hi @ubless607 or anyone in the above thread, could you send the weights to me? Thank you very much in advance! :) My email is: [email protected]

sent!

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024 1

@ubless607

Yup, i will try this! May i know what is the cuda and nvidia driver versions that you were using for your case?

Update: I have managed to run the demo.py script, but I used the commit "https://github.com/IDEA-Research/GroundingDINO/tree/da9f1c0751935e9284890572069f4b2fef4cbfdc" instead; otherwise the "supervision" version conflicts with pillow.

I get a CUDA out of memory error, so I probably need to find a better GPU before proceeding.

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024 1

@ubless607

Thanks for the info on the setup! I think it would be best to run GroundingDino at the https://github.com/IDEA-Research/GroundingDINO/tree/da9f1c0751935e9284890572069f4b2fef4cbfdc commit (follow the GPU steps as well), since it dates from around the time VRB was pushed (~April 2023). LangSegAnything can be the latest version, modified based on your recommendations. Just make sure supervision==0.6.0, pillow==9.3.0, and CUDA==1.7 are used, as the system will flag dependency conflicts otherwise.
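A minimal sketch for checking the Python package pins mentioned above before running demo.py. It only uses the standard library; the `PINS` dictionary and the `find_mismatches` helper are illustrative names, not part of VRB, and the distribution names are assumed to match what is on PyPI.

```python
from importlib import metadata

# Pins quoted in the comment above; distribution names are assumed to match PyPI.
PINS = {"supervision": "0.6.0", "pillow": "9.3.0"}


def find_mismatches(pins, get_version=metadata.version):
    """Return {package: (wanted, found)} for every pin that is not satisfied."""
    mismatches = {}
    for name, wanted in pins.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None  # the package is not installed at all
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in find_mismatches(PINS).items():
        print(f"{name}: wanted {wanted}, found {found or 'not installed'}")
```

Running this inside the conda environment lists only the packages whose installed version differs from the pin, which may be quicker than reading pip's conflict report.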

Anyway, I have managed to run the demo.py code on the authors' images and also on some more general images. The generalization does not look great: inference.py is limited to a few items (cup, drawer, potlid, microwave) and performs pretty inconsistently from certain viewpoints. But I think the author has already acknowledged in the #3 (comment) issue that it does not always work well.

Also, do you happen to have the actual input image that you were using above? I wanted to test and maybe see if i have the same results as you but i can't find it on the project page.

The project has a lot of potential and i hope @shikharbahl releases the training code soon for future works!

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024 1

Hey @ubless607, yeap, I tried out a few more use cases, but it is sensitive to the image size and dimensions. As mentioned, I got good results only for certain tasks and angles, and there are also a lot of wrong trajectories haha.

It would be good if they released more tasks and also showed how this can be used for 3D pose/state beyond 2D image coordinates.

What about yourself, did you explore more too?

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024 1

@Hotsburp I didn't try this much further :(

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024 1

@codepk37 Sent!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024 1

@ashiqfauna Sent!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@yolo01826 Can you write your e-mail address?

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@yolo01826 sent!

from vrb.

jun7-shi avatar jun7-shi commented on September 28, 2024

@ubless607 hi, could you please send me the pretrained weights as well? [email protected] , thanks!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@jun7-shi Sent!

from vrb.

ZY199301 avatar ZY199301 commented on September 28, 2024

hi, could you please send me the pretrained weights as well? [email protected], thank you very much!

from vrb.

JaneLin-0409 avatar JaneLin-0409 commented on September 28, 2024

@ubless607 hi, could you please send me the pretrained weights as well? [email protected] , thanks!

from vrb.

wadeKeith avatar wadeKeith commented on September 28, 2024

@ubless607 Could you please send me the pretrained weights as well? [email protected], thank you very much!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@JaneLin-0409 sent!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@wadeKeith sent!

from vrb.

fuyh20 avatar fuyh20 commented on September 28, 2024

@ubless607 I also need the pretrained weights. Could you send me it? [email protected], thanks!

from vrb.

Byo0923 avatar Byo0923 commented on September 28, 2024

@ubless607
I also need the pretrained weights too.
Could you please send me the pretrained weights as well? , [email protected] , thanks!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@fuyh20 Sent!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@Byo0923 Sent!

from vrb.

Byo0923 avatar Byo0923 commented on September 28, 2024

@ubless607
I was able to download the checkpoints. Thank you very, very much. I will try it.

from vrb.

Kimagure7 avatar Kimagure7 commented on September 28, 2024

I also need the pretrained weights too. Could you please send me the pretrained weights as well? [email protected] , thanks! @ubless607

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@Kimagure7 Sent!

from vrb.

fuyh20 avatar fuyh20 commented on September 28, 2024

@ubless607 Thanks for the model weights, I have already used them. The output is random, so you can set a different manual seed to get different outputs.
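A rough sketch of that idea: reseed before each run and collect one result per seed. `predict` is a hypothetical stand-in for whatever call produces the hotspots (the thread does not show VRB's actual function), and `seed_everything` and `sample_per_seed` are illustrative helpers. `torch.manual_seed` is only called if PyTorch is importable.

```python
import random


def seed_everything(seed):
    """Seed Python's RNG and, if PyTorch is installed, torch's RNG as well."""
    random.seed(seed)
    try:
        import torch
    except ImportError:
        return  # torch is missing: only the stdlib RNG is seeded
    torch.manual_seed(seed)


def sample_per_seed(predict, seeds):
    """Call predict() once per seed and collect the results keyed by seed."""
    results = {}
    for seed in seeds:
        seed_everything(seed)
        results[seed] = predict()
    return results
```

For example, `sample_per_seed(run_model, range(5))` would give five outputs, one per seed, assuming `run_model` wraps the inference call and draws its randomness from the seeded generators.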

from vrb.

YujpVc avatar YujpVc commented on September 28, 2024

I also need the pretrained weights, could you send them to me as well? [email protected], thanks

from vrb.

nys601350111 avatar nys601350111 commented on September 28, 2024

@ubless607 I also need the pretrained weights, could you send them to me as well? [email protected] thanks!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@YujpVc sent!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@nys601350111 sent!

from vrb.

YujpVc avatar YujpVc commented on September 28, 2024

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024

Hi @ubless607 or anyone in the above thread, could you send the weights to me? Thank you very much in advance! :) My email is: [email protected]

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024

Did anyone face any issue pip installing the lang-segment-anything portion from the "pip install -U git+https://github.com/luca-medeiros/lang-segment-anything.git" command?

Mine seems to be stuck at this portion for really really long.

Collecting lightning<3.0.0,>=2.0.1
Using cached lightning-2.0.2-py3-none-any.whl (1.8 MB)
Using cached lightning-2.0.1.post0-py3-none-any.whl (1.8 MB)
Using cached lightning-2.0.1-py3-none-any.whl (1.8 MB)

Not sure if i am doing anything wrong or it just takes super long to install. Thanks in advance again!

I will open a new issue but just wanted to ask here since some of you may be active.

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@Hotsburp I don't have any problem.
Did you try a new conda environment? / Did you update pip to the latest version?

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024

Hi @ubless607, thanks for getting back! Yup, I tried a new conda environment and also updated pip.

I think the issue is with the GroundingDino that LangSegmentAnything uses, because recent GroundingDino updates have caused a lot of package version conflicts.

May I know which commit (date or hash) of LangSegmentAnything you used, or did you just use the latest version at that point in time? I ask because this affects the version of GroundingDino used, and I would also like to trace back the date of the version that VRB used.

I have now managed to install it somewhat properly, but I had to manually find an older version of both LangSegmentAnything and GroundingDino.

But by doing so, i had this error when running demo.py using the weights:
=> _IncompatibleKeys(missing_keys=[], unexpected_keys=['label_enc.weight'])

This results in:
RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory

I would appreciate if you all have any insights! :)
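The "failed finding central directory" message is commonly reported when the file handed to `torch.load` is not a complete zip archive, for example after a truncated download. A minimal sketch of a pre-check, assuming the checkpoint was written in PyTorch's zip-based format (older legacy checkpoints are not zips and would fail this check even when intact); `checkpoint_looks_intact` is an illustrative helper name:

```python
import os
import zipfile


def checkpoint_looks_intact(path):
    """Cheap sanity check: the file exists, is non-empty, and is a readable zip."""
    if not os.path.isfile(path) or os.path.getsize(path) == 0:
        return False
    # Checkpoints in torch's zip-based format carry a zip central directory.
    return zipfile.is_zipfile(path)
```

If this returns False for the weights file, downloading it again and comparing the file size with the sender's copy may be worth trying before debugging the code.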

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@Hotsburp It has been a while since I tested this repository. Can you check this issue?
luca-medeiros/lang-segment-anything#66

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024

Yeap, I saw that too. No worries, I will try it out again. Thanks for your help!

I will let you know if i find any insights to your original issue of obtaining multiple hotspots.

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@Hotsburp

  1. Git clone GroundingDINO at this version and install it
    https://github.com/IDEA-Research/GroundingDINO/tree/df5b48a3efbaa64288d8d0ad09b748ac86f22671

  2. Git clone lang-segment-anything

  3. Edit pyproject.toml

  • huggingface-hub = "^0.16.4, <0.17.0"
  • transformers = "4.27.4"
  • lightning = "2.0.1"
  • remove the GroundingDINO line

Can you try?

from vrb.

64327069 avatar 64327069 commented on September 28, 2024

@ubless607 I also need the pretrained weights, could you send them to me as well? [email protected], thanks

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@64327069 Sent!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@Hotsburp I am glad that you were able to reproduce the environment setting.

vrb_initial_v2_compressed

from vrb.

Hotsburp avatar Hotsburp commented on September 28, 2024

@ubless607 When running demo.py on this image, do you get the error "ValueError: tile cannot extend outside image"? I think it may have something to do with the image crop, as the error comes from the input_img in Line 84 of inference.py.

If i input img instead of input_img in Line 84, i get this:

kitchen_out4

I guess there are still multiple hotspots, but the script is sensitive to the cropped image and its dimensions.
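Pillow often raises "tile cannot extend outside image" when a crop ends up with zero or negative size, so one thing to try is clamping the crop box to the image bounds before cropping. The sketch below is plain Python and makes no claim about what Line 84 of inference.py actually does; `clamp_box` is a hypothetical helper and the box is assumed to be in Pillow's (left, upper, right, lower) order.

```python
def clamp_box(box, width, height):
    """Clamp (left, upper, right, lower) to the image and reject empty crops."""
    left, upper, right, lower = box
    left = max(0, min(int(left), width))
    right = max(0, min(int(right), width))
    upper = max(0, min(int(upper), height))
    lower = max(0, min(int(lower), height))
    if right <= left or lower <= upper:
        raise ValueError(f"empty crop after clamping: {(left, upper, right, lower)}")
    return left, upper, right, lower
```

A clamped box can then be passed to `img.crop(...)`. If the helper raises instead, the detected box lies entirely outside the image, which would point at the detection step rather than the crop.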

from vrb.

bethalageetachandraraju avatar bethalageetachandraraju commented on September 28, 2024

can anyone from the above thread send me pretrained weights please, my email: [email protected]
@ubless607 @Byo0923 @wadeKeith @JaneLin-0409 @64327069 @Kimagure7

from vrb.

ojh6404 avatar ojh6404 commented on September 28, 2024

Could anyone send me pretrained weights, too? my email : [email protected]

from vrb.

achrxme-1838 avatar achrxme-1838 commented on September 28, 2024

could anyone from the above thread send me pretrained weights too please, my email: [email protected]
I'd really appreciate it!!
@ubless607 @Byo0923 @wadeKeith @JaneLin-0409 @64327069 @Kimagure7

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@achrxme-1838 @ojh6404 @bethalageetachandraraju Sent!

from vrb.

achrxme-1838 avatar achrxme-1838 commented on September 28, 2024

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@ubless607 When running demo.py on this image, do you get the error "ValueError: tile cannot extend outside image"? I think it may have something to do with the image crop, as the error comes from the input_img in Line 84 of inference.py.

If i input img instead of input_img in Line 84, i get this:

kitchen_out4

I guess there are still multiple hotspots, but the script is sensitive to the cropped image and its dimensions.

I didn't encounter that error message. Have you done any additional work on this?

from vrb.

Tom-Huang avatar Tom-Huang commented on September 28, 2024

could anyone from the above thread send me pretrained weights too please, my email: [email protected]

from vrb.

codepk37 avatar codepk37 commented on September 28, 2024

@ubless607 @Byo0923 @wadeKeith @JaneLin-0409 @64327069 @Kimagure7 Could anyone please send me the pretrained weights as well? (mailto: [email protected])

from vrb.

ashiqfauna avatar ashiqfauna commented on September 28, 2024

@ubless607 Could you please send me the pretrained weights as well? (mailto: ashiq).

Thanks :)

from vrb.

dtbpkmte avatar dtbpkmte commented on September 28, 2024

@ubless607 Can I have the weights please? My email address is [email protected]. Thank you!

from vrb.

ubless607 avatar ubless607 commented on September 28, 2024

@dtbpkmte Sent!

from vrb.

ZTE-dzq avatar ZTE-dzq commented on September 28, 2024

@ubless607 I also need the pretrained weights, could you send me? My email address is [email protected] , thanks!

from vrb.

prac-101 avatar prac-101 commented on September 28, 2024

@ubless607 @Byo0923 @wadeKeith @JaneLin-0409 @64327069 @Kimagure7 I also need the pretrained weights, could anyone please send me the pretrained weights too please? My email address is [email protected], thanks a lot!

from vrb.
