Comments (3)
@nikitakaraevv
Thanks for the update.
It seems that training with DINOv2 features was left out.
Also, Section 3.4, 'Unrolled Window Training', suggests that the batching problem has been resolved, allowing a batch size larger than 1 on a single GPU. Is that the case? I ask because compute_sparse_tracks still forces the batch size to 1.
Hi @sfchen94,
We trained CoTracker with the smallest DINOv2 model, but it was not helping at all. I think DINO can help mostly with semantic correspondences by roughly identifying the corresponding point in another frame. However, it seems that semantic correspondences are not really needed if we have a continuous video, and not just a pair of images of the same object. We do not plan to release the model trained with DINOv2 features for now, but we will keep working on motion estimation, and will try to explore other approaches.
Yes, the batching problem is indeed solved! Thank you for pointing this out. I just removed assert B==1 from the predictor. You can now train and run the model with different batch sizes.
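For readers wondering what lifting a batch-size-1 restriction looks like in general, here is a minimal toy sketch. It is not the CoTracker code: `track_single` and `track_batch` are hypothetical names, the "tracker" simply holds every query point still, and the batch is handled with a plain loop, whereas a real model would normally vectorize over the batch dimension. The only point is that a hard `assert B == 1` gives way to a check that videos and queries share the same batch size.

```python
from typing import List, Tuple

Point = Tuple[float, float]        # (x, y) pixel coordinates
Query = Tuple[int, float, float]   # (start_frame, x, y)


def track_single(num_frames: int, queries: List[Query]) -> List[List[Point]]:
    """Hypothetical stand-in for a tracker: keeps each query point fixed.

    Returns one list of N points per frame, i.e. a T x N structure.
    """
    return [[(x, y) for (_, x, y) in queries] for _ in range(num_frames)]


def track_batch(
    frames_per_video: List[int], queries_per_video: List[List[Query]]
) -> List[List[List[Point]]]:
    """Runs the stand-in tracker on every sample, giving a B x T x N structure."""
    batch_size = len(frames_per_video)
    # Instead of `assert B == 1`, only require that videos and queries line up.
    if batch_size != len(queries_per_video):
        raise ValueError("videos and queries must have the same batch size")
    return [
        track_single(t, q) for t, q in zip(frames_per_video, queries_per_video)
    ]


# Two videos of 3 frames each, with 1 and 2 query points respectively.
tracks = track_batch([3, 3], [[(0, 1.0, 2.0)], [(0, 5.0, 6.0), (0, 7.0, 8.0)]])
print(len(tracks), len(tracks[0]), len(tracks[1][0]))
```

With real tensors the same idea usually means keeping the leading batch dimension through every reshape rather than looping, so each sample in a batch would also need the same number of frames and query points.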
Cool.
I'm deeply grateful for your update.