Coder Social home page Coder Social logo

Comments (8)

sujitpal avatar sujitpal commented on August 18, 2024 1

We didn't use the MEGA data these appear to have been added after we were done with the project. Not sure if one is a subset of the other. If you are going to retrain with more data, then might make sense to download both these archives and check.

And as I mentioned earlier, we are using NMSLib for searching in the HF-spaces demo.

from clip-rsicd.

INF800 avatar INF800 commented on August 18, 2024

Hmm, found this link from somewhere https://mega.nz/folder/wCpSzSoS#RXzIlrv--TDt3ENZdKN8JA

from clip-rsicd.

sujitpal avatar sujitpal commented on August 18, 2024

Hi @INF800 hope this is enough information to start:

  • you can download the images and captions directly from the RSICD repository (https://github.com/201528014227051/RSICD_optimal) -- the link is also available from the README.md file for this project.
  • once done, you can download one of our models (again link and example code available on README.md) and generate the vectors from the image-caption pairs.
  • at this point I suspect you would be ready to upload them to your Elasticsearch instance. We used NMSLib for the demo (code and demo links available in README.md) but in another variation we used the Vespa engine, which offers text, vector and hybrid (text+vector) searching.

from clip-rsicd.

INF800 avatar INF800 commented on August 18, 2024

I am stuck at the first step currently. (Because I am also looking at the training code which I will use in future)

These two files are actually downloadable

  1. The new download source of RSICD-MEGA
dataset.json (2.8MB)
imgs.rar (252.9MB)
  1. The new download source of Sydney-captions and UCM-catpions-MEGA.
annotations_rsicd.rar (836KB)
RSICD_images.rar (459.8MB)
  • The first file ( 252MB) is subset of second file (459MB) right? Or are they both different?
  • And are we using only second file (459MB) for training the best model(s) ?

from clip-rsicd.

INF800 avatar INF800 commented on August 18, 2024

And also, can I know what are we using for searching in HF spaces? The speed is appreciable.

from clip-rsicd.

nikky4D avatar nikky4D commented on August 18, 2024

@sujitpal for the data used, does the RSICD data also contain Sydney and UCM-Merced data? If not, did you use all three in your training or only the RSICD data?

from clip-rsicd.

sujitpal avatar sujitpal commented on August 18, 2024

For the training I believe we used RSICD plus the Sydney and UCM-Merced datasets, but I will defer to @arampacha for the authoritative answer.

from clip-rsicd.

arampacha avatar arampacha commented on August 18, 2024

IIRC the data is not overlapping and we used all 3 datasets. This is also consistent with model cards we have. And don't see a reason not to trust those ;)

from clip-rsicd.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.