Comments (17)
Hi, sorry for the late relay cause I'm too busy these days. I would like to first share th preprocessed .txt
file of COCO box at here and you can use the below script to process it to .pt
file:
cd ./train
python3 preprocess.py --corpus_path train_box.txt \
--vocab_path models/google_uncased_en_coord_vocab.txt \
--dataset_path train_seq.pt --processes_num 8 \
--seq_length 1024 --tgt_seq_length 1024 --data_processor lm
I will try to provide the code for converting box/mask/keypoint annotations of .json
to sequences .txt
in the coming days. :)
from visorgpt.
Hello, I will clean the code and prepare the instruction in the coming days. Maybe a week.
from visorgpt.
Great, looking forward to your update.
from visorgpt.
Hi, thanks @Sierkinhane for show how to create pt file from txt :)
from visorgpt.
Hi, sorry for the late relay cause I'm too busy these days. I would like to first share th preprocessed
.txt
file of COCO box at here and you can use the below script to process it to.pt
file:cd ./train python3 preprocess.py --corpus_path train_box.txt \ --vocab_path models/google_uncased_en_coord_vocab.txt \ --dataset_path train_seq.pt --processes_num 8 \ --seq_length 1024 --tgt_seq_length 1024 --data_processor lm
I will try to provide the code for converting box/mask/keypoint annotations of
.json
to sequences.txt
in the coming days. :)
Hello, if I want to train Object Centric Bounding-Box, the content of corpus is similar to "box; object centric; large; 1; 0; great white shark; [xmin 95 ymin 66 xmax 510 ymax 310]", or "box ; object centric; large; 1; 0; [ great white shark xmin 95 ymin 66 xmax 510 ymax 310 ]?
from visorgpt.
Hi, the second prompt is for continuous generation or scene completion for multiple objects. If only one object is involved in an image, the first prompt is sufficient.
from visorgpt.
Hi, the second prompt is for continuous generation or scene completion for multiple objects. If only one object is involved in an image, the first prompt is sufficient.
Thank you, I really appreciate your reply
from visorgpt.
Hello! Thank you so much for your work! Do you have any plans to make keypoint annotations .txt file public recently?
from visorgpt.
Exactly. I'm quite busy these months, but I plan to update the repository with the complete files next month. The txt files of cocokeypoints and crowdpose are available at here and here.
from visorgpt.
Thank you very much for your reply! May I ask if they are both processed with preprocess.py for pre-processing? Also, the two links you provided both seem to be crowdpose.txt files:)
from visorgpt.
Hi, I have updated the link. You can merge these txt files into one file and process it using preprocess.py.
from visorgpt.
Thank you very much for your prompt reply! There's a question I'd like to ask. I see that in the keypoint data, there is “person, person; [ a ”as well as “[ person a...], [ person a...]"Two types, does this affect the effectiveness of the training? Because I see in the demo, the type of seq_prompt is in the format of [ person.
from visorgpt.
They are two kinds of prompts and will not affect the modeling a lot. Maybe you can refer to the paper for details.
from visorgpt.
Thank you very much, I re-read the paper again. However, I have now trained and saved the file "visorgpt_dagger_train_seq.bin-200000" (430M), how do I handle it as "visorgpt_dagger_train_seq.bin/200000/mp_rank_00_model_states. pt" such file type?
from visorgpt.
It seems that you didn't use the deepspeed strategy. You can try to set --load_model_path as the .bin file.
from visorgpt.
OK! Your suggestion is valid! Looking forward to your complete inference code and your subsequent exciting work :)
from visorgpt.
Great! Thank you.
from visorgpt.
Related Issues (7)
- .cache/torch_extensions/py38_cu117/utils/utils.so HOT 5
- What does the generated_sentence.txt generated after training represent? HOT 2
- Does VISORGPT supports to generate multiple instances with different sizes in an image? HOT 1
- Some issues in training HOT 2
- Questions about Kullback-Leibler divergence calculation in Table 4. HOT 1
- Error while loading Gligen HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from visorgpt.