The metaformer from dqshuai

bert_embedding_cub

作者您好，可以提供一下bert_embedding_cub这个文件吗？我在使用meta训练的时候报错了，非常感谢！

model weights for MetaFormer-2 fine tuned on iNat 2018

are the weights for MetaFormer-2 fine tuned on iNat 2018 available? I'm doing research for this model and it would help a lot in training time and computation resources if I can get them without retraining again, thanks!

TypeError: MetaFG.init() got an unexpected keyword argument 'pretrained_cfg'

Hello, thank you for the achievement I have a problem with using the cub attribute can you provide the run config parameter of the property, thank you very much

I'm using the default parameters and I get an error

about acc of stanfordcars

CUDA Version？

I don`t know the cuda version to run this project?
Maybe conda install pytorch==1.5.1 torchvision==0.6.1 cudatoolkit=10.2 -c pytorch?

Apex and Cuda Version

Ran into issues installing Apex. Just wanted to post a note that the install worked with torch 1.5.1, CUDA 10.1, and rolling apex back via the comment here NVIDIA/apex#1043 (comment)

model zoo

Hi, could you please share model zoo with baidu cloud disk ? Thanks! @dqshuai

I have a question about "linear embbeding" and "non-linear embbeding".

Thanks for all your great work!

I have two questions about the paper.

Is figure 2 on page 4 of the paper and figure 1 on page 10 of the paper referring to the same architecture?
The term "non-linear embbedding" and "linear embbedding" are used to describe embedding meta-information, but if the figures refer to the same architecture, what is the intention behind the different designations?
Neural networks are iterations of processes that perform linear transformations and activation functions that perform nonlinear transformations. Is it correct to say that you used "non-linear embbedding" because you are using an activation function relu that performs a non-linear transformation?

RuntimeError in

Hi,thanks for your patience.
I'm new here,and when I try to run the train in CUB200,meet the error
Could you please help .THANGKS.

Here are my run commend
python3 -m torch.distributed.launch --nproc_per_node 6 --master_port 12345 main.py --cfg ./configs/MetaFG_meta_1_224.yaml --batch-size 12 --tag cub-200_v1 --lr 5e-5 --min-lr 5e-7 --warmup-lr 5e-8 --epochs 2500 --warmup-epochs 20 --dataset cub-200 --accumulation-steps 2 --opts DATA.IMG_SIZE 224

And the error

Traceback (most recent call last):
File "main.py", line 403, in
main(config)
File "main.py", line 163, in main
train_one_epoch_local_data(config, model, criterion, data_loader_train, optimizer, epoch, mixup_fn, lr_scheduler)
File "main.py", line 210, in train_one_epoch_local_data
outputs = model(samples,meta)
File "/home/miniconda3/envs/pytorch/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
result = self.forward(*input, **kwargs)
File "/home/miniconda3/envs/pytorch/lib/python3.8/site-packages/torch/nn/parallel/distributed.py", line 705, in forward
output = self.module(*inputs[0], **kwargs[0])
File "/home/miniconda3/envs/pytorch/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
result = self.forward(*input, **kwargs)
File "/home/MetaFormer/models/MetaFG_meta.py", line 231, in forward
x = self.forward_features(x,meta)
File "/home/MetaFormer/models/MetaFG_meta.py", line 171, in forward_features
metas = torch.split(meta,self.meta_dims,dim=1)
File "/home/miniconda3/envs/pytorch/lib/python3.8/site-packages/torch/functional.py", line 156, in split
return tensor.split(split_size_or_sections, dim)
File "/home/miniconda3/envs/pytorch/lib/python3.8/site-packages/torch/tensor.py", line 499, in split
return super(Tensor, self).split_with_sizes(split_size, dim)
RuntimeError: split_with_sizes expects split_sizes to sum exactly to 32 (input tensor's size at dimension 1), but got split_sizes=[4, 3]

about pre-trained checkpoint

Very nice job!

Could you please provide some pre-trained checkpoints? For example, the 92.3% CUB accuracy MetaFormer-1, and the 92.9% CUB accuracy MetaFormer-2?

Appreciate for your generosity!

about the checkpoint without meta info

Do you have a model trained on inat21 without meta, can you provide it to us, thank you

Regarding Inference code.

Can I create a pull-request for adding Inference code?

Checkpoint on iNaturalist 2018

Thanks for your wonderful work. If it is possible, could you please share your checkpoints on iNaturalist 2018? Thank you very much!

For CUB-200 meta data, i wonder know where is the bert_embedding_cub in the link of CUB-200

There is bert_embedding_root = os.path.join(root,'bert_embedding_cub') in dataset_fg.py, but I can not find any dir called bert_embedding_cub in the CUB-200.

metadata-generation-failed

您好，我在安装环境的时候出现问题：
cwd: /home/gpu/PycharmProjects/MetaFormer-master/apex/
Preparing metadata (setup.py) ... error
error: metadata-generation-failed
× Encountered error while generating package metadata.

能麻烦您看一下是哪里出了问题吗？我每一步都是按照您的readme文件进行的，麻烦您了

bert_embedding_cub

您好，请问您可以提供一下bert_embedding_cub这个文件吗？非常感谢！

Regarding Inferencing.

I have trained the model for 28 epochs on CUB-200 dataset, also I wrote a small inferencing script which accepts a single image along with its meta info. but while predicting it is not at all giving good result. Do I need to train more?

Any suggestions here?

Thank you.

Regarding embedding files for cub.

Can you please suggest is this the right way to generate the embeddings?

from transformers import AutoTokenizer, AutoModel
import pickle

text_file = "file.txt"

with open(text_file, 'r') as rr:
    data = rr.read()

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

inputs = tokenizer(data, return_tensors="pt")
outputs = model(**inputs)
print(outputs[0])

embeddings = {'embedding_words': outputs[0].detach().numpy()}

with open('filepickle', 'wb') as pkl:
    pickle.dump(embeddings, pkl)

Thank You

NAbirds dataset

Hello, can you provide the data set of NAbirds? An official website can't download it. Thank you very much

How to generate bert_embedding_cub for cub-200?

FileNotFoundError: [Errno 2] No such file or directory: 'D:/dataset/CUB_200_2011\\bert_embedding_cub\\001.Black_footed_Albatross/Black_Footed_Albatross_0046_18.pickle'

About running on one GPU

I have only one GPU. I have set local_rank=-1 ,but failed to run the code.
What do I need to revise to successfully run on one GPU?

can you provide the pre-train parameters

Training time on inaturalist

Hi, thanks a lot for publishing the code. I had a question: What is the typical training time for inat17?

Danish Fungi 2020 - Performance Evaluation

Dear All,

This is more a feature request than a bug; anyway, can you test/utilize your method on the DF20 dataset? We include much more metadata within the dataset, thus, the performance evaluation with regular ViT architecture might is interesting.

The link to the paper and web follows:

Best,
Lukas

Some errata found on the code

Hi,
Thanks for sharing wonderful work :)

While running the code, I found some minor errata building the data augmentation module.

MetaFormer/data/build.py

Line 16 in 669bf18

from timm.data.transforms import _pil_interp

I guess this line should fixed with
from timm.data.transforms import str_to_pil_interp
since there is no _pil_interp in timm.data.transforms. https://github.com/rwightman/pytorch-image-models/blob/master/timm/data/transforms.py

If there is something I missed, pleased let me know :)

Is it fair to use larger pretrained model?

Hi,
First of all congratulations for your great work!

I always worried about the effect of pretrainning for FGVC. There is high risk of data overlapping of pretrained dataset and fine tune dataset. Take CUB dataset for example, it already find that CUB200-2011 have overlapping images in test dataset with imagenet1k train dataset see here. So it is highly possible that there will be more overlap of CUB with imagenet21k and iNaturalist. So there seems twio possible sources that can explain the obviously improvement when using pretrained model with larger dataset:

the pretrained model may learned some commonly useful structures which improves performance on CUB task, this is good
the pretrained model with larger dataset just has seem more test image from CUB test dataset, so it performs well, this is bad

So what is your opinion about this risk?

dqshuai / metaformer Goto Github PK

metaformer's People

Contributors

Stargazers

Watchers

Forkers

metaformer's Issues

Recommend Projects

Recommend Topics

Recommend Org