chencodex / bilinear_cnn_dog_classifi Goto Github PK

View Code? Open in Web Editor NEW

92.0 5.0 28.0 298 KB

tensorflow->BCNN + pytorch -> vgg16/resnet/BCNN

Python 100.00%

vgg inception resnet bcnn

bilinear_cnn_dog_classifi's People

Contributors

Stargazers

Watchers

bilinear_cnn_dog_classifi's Issues

问题

在网络模型中，在BCNN的步骤中，做了如下的一个步骤：
self.phi_I = tf.divide(self.phi_I,784.0)
这里为什么要除784啊？

请问这个均值是什么均值，怎么计算的

with tf.name_scope('preprocess') as scope:
mean = tf.constant([110.61220668946963, 127.15494646868024, 135.13440353131017]

这是整个数据集图片的均值吗

image data

您好，这个源码的train_test文件夹里只有.txt文件，请问对应的图像数据在哪里？求这些数据集，可以付费，电子邮箱：[email protected]

about the perfomance

Hi.
Excellent work.
But could you tell me the accuracy that you model can achieve?
Best wishes.

datasets

您好，小白请教您一下，down_pic.py中的test.txt是训练数据集的地址吗？clone下来没有这个文件，可不可以发我一份，万分感谢。[email protected]

生成hdf5数据时错误

我用的时stanford dogs数据集，当对测试集数据生成hdf5格式时成功，但是对训练集数据生成hdf5格式时报错：

请问是什么原因

Can you let me know the specific steps and the correct rate?

Hello!
I am repeating your source code, would like to ask you under the final test of the correct rate can be achieved? Because know the correct rate can give me a reference. Can you provide detailed running steps to help me better reproduce? Thank you for your help!
Thanks very much!

请问这个是参加百度的比赛吗？

Why the feature outputs of BCNN is the outer product of the (only) one CNN and itself?

Hi, chencodeX

I think that the core codes of BCNN is the following codes from bcnn_finetuning.py in the end of class vgg16

        ''' Reshape conv5_3 from [batch_size, height, width, number_of_filters]
           to [batch_size, number_of_filters, height, width]'''
        self.conv5_3 = tf.transpose(self.conv5_3, perm=[0,3,1,2])


        ''' Reshape conv5_3 from [batch_size, number_of_filters, height*width] '''
        self.conv5_3 = tf.reshape(self.conv5_3,[-1,512,784])

        ''' A temporary variable which holds the transpose of conv5_3 '''
        conv5_3_T = tf.transpose(self.conv5_3, perm=[0,2,1])

        '''Matrix multiplication [batch_size,512,784] x [batch_size,784,512] '''
        self.phi_I = tf.matmul(self.conv5_3, conv5_3_T)

But I get confused when I try to understand it.

When I read the paper and other docs, I think that there must have 2 CNNs and they get the outer product when combine the outputs of them, then we can connect it to FC layers or others.

But in the BCNN codes, when they build models, it seems like there is only 1 CNN and it get outer product of it self.

I think 2 CNNs can extract different image features, but I don't understand the code of BCNN.

Anything wrong of my understanding? Could you give me some suggestions?

Thank you!

chencodex / bilinear_cnn_dog_classifi Goto Github PK

bilinear_cnn_dog_classifi's People

Contributors

Stargazers

Watchers

Forkers

bilinear_cnn_dog_classifi's Issues

Recommend Projects

Recommend Topics

Recommend Org