cesc-park / attend2u Goto Github PK

🖼️ Attend to You: Personalized Image Captioning with Context Sequence Memory Networks. In CVPR, 2017. Expanded : Towards Personalized Image Captioning via Multimodal Memory Networks. In IEEE TPAMI, 2018.

License: MIT License

Python 98.84% Shell 1.16%

attend2u's People

Contributors

Stargazers

Watchers

Forkers

jdc08161063 wanjinchang zhhezhhe iamduyang benjamesbabala mindis oppa3109 xiongshufeng ml-lab wavelet303 qingguo123 jolinxql leezqcst gitcha-beginners vikingmew ywang370 jaejaywoo frankiegu dimplesl ziliangwang0505 jtcollins shubhampachori12110095 cfh3c smellly solrefa-csi ai3dvision levelsethu ayanmaj92 adityabantwal kakoedlinnoeslovo junhyuk k-sandhu luckystar1992 xun-yang jacobdanovitch b2220333 louis24 kingsaint freshzy jihyukkim-nlp fendaq onratlgn arundasan91 amirunpri2018 chenghuige bailianfa hareesh-ravi baicalin ammieqi jiazhi412 nikolausn ramesh152 iamyourboss touqeer121 ankitshah009 leekyungmoon hanu14

attend2u's Issues

How to run hashtag prediction

It's default mode is caption generation.
I want to know how to run hashtag prediction

Any Implementation on Baselines?

Thanks for the greatwork! It is creative and the shown results are promising.
I saw from paper you have several baselines to be compared against your proposed CSMN (e.g. 1-nearest neighbor to user contents, RNN seq2seq with active vocabulary)
Would you release the implementations of those baselines?

Besides that, given the recent advancement on NLP (transformer, GPT-2 ... etc) , would you (and how would you) propose your CSMN differently under modern context (as of 2020)?

about superscript a and c

In your paper，I‘ve read the CSMN model.I have trouble with the understanding of superscript a and c，would you mind telling me what the porpose of add superscript a and c to the image memory vector and the user context memory vector?Thanks a lot!

Doubt about steps and InstaPic dataset size

Do we need to train for 5 lac. steps as specified in code ? or for just 20 epochs as per paper ?

Also, earlier I downloaded InstaPic data set, which only had 1.03M samples instead of 1.1M. why ?

Hoping to get a reply soon. Thanks :)

Error in cnn_feature_extractor.py

i keep getting errors saying that unexpected keyword arguments for is_training in line 111

Can anyone help me to resolve it?

Memory Error

혹시 돌리셨던 환경을 좀 구할수 있을까요 ??

Difference between papers

Hi @cesc-park
If I am correct, the only difference between CVPR and TPAMI papers is use of data sets. Both use the same CSMN architecture.
Can you or someone validate this?

InstaPIC-1.1M Dataset availability?

When will the InstaPIC-1.1M Dataset be made available again?

getting error while training on CPU

Traceback (most recent call last):
File "/usr/lib/python2.7/runpy.py", line 174, in _run_module_as_main
"main", fname, loader, pkg_name)
File "/usr/lib/python2.7/runpy.py", line 72, in _run_code
exec code in run_globals
File "/home/lalit/notebooks/Lalit/image_caption/attend2u/train.py", line 211, in
tf.app.run()
File "/home/lalit/.local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 125, in run
_sys.exit(main(argv))
File "/home/lalit/notebooks/Lalit/image_caption/attend2u/train.py", line 208, in main
train()
File "/home/lalit/notebooks/Lalit/image_caption/attend2u/train.py", line 134, in train
loss = _tower_loss(inputs, scope)
File "/home/lalit/notebooks/Lalit/image_caption/attend2u/train.py", line 33, in _tower_loss
net = CSMN(inputs, ModelConfig(FLAGS))
File "utils/configuration.py", line 33, in init
super(ModelConfig, self).init(FLAGS)
File "utils/configuration.py", line 12, in init
attrs = FLAGS.dict['__flags']
KeyError: '__flags'

about the validation dataset part?

In papar's EXperiments part, said "we randomly split the dataset into 90% for training, 5k posts for test and the rest for validation" but I read the code and found the dataset is parted into train.txt test1.txt and test2.txt, but the text2.txt is not used in code, Did I miss something?
looking forward to your reply.

Pretrained model 문의

혹시 Pretrained model 있으면 한번 확인이 가능할지요??
개인적으로 돌려보니 생각보다.. GPU 성능이 부족하네요.

부탁드립니다.

[email protected]

Dataset cannot be downloaded

The YFCC100M data set cannot be downloaded via Google Cloud Disk. Can you provide other download methods? And I hope you can provide the InstaPIC-1.1M Dataset again.

ImportError: libcublas.so.8.0: cannot open shared object file: No such file or directory

Hey, I'm getting error while training the data after extraction, Can you help me with it?

Errors in code for feature extraction

I cloned the repository and followed the instructions as given by you guys on this github repo, but there are too many issues in the code. First, even though I installed the tensorflow as per the version given in requirements.txt but there is no module named preprocessing in slim. Anyways I managed that but then I ran into the following errrors:

Traceback (most recent call last):
  File "cnn_feature_extractor.py", line 148, in <module>
    tf.app.run()
  File "/home/pdguest/Lalit/Selfie/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "cnn_feature_extractor.py", line 112, in main
    processed_images, num_classes=1000, is_training=False
TypeError: resnet_v1_101() got an unexpected keyword argument 'is_training'
Traceback (most recent call last):
  File "cnn_feature_extractor.py", line 148, in <module>
    tf.app.run()
  File "/home/pdguest/Lalit/Selfie/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "cnn_feature_extractor.py", line 112, in main
    processed_images, num_classes=1000, is_training=False
TypeError: resnet_v1_101() got an unexpected keyword argument 'is_training'
Traceback (most recent call last):
  File "cnn_feature_extractor.py", line 148, in <module>
    tf.app.run()
  File "/home/pdguest/Lalit/Selfie/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "cnn_feature_extractor.py", line 112, in main
    processed_images, num_classes=1000, is_training=False
TypeError: resnet_v1_101() got an unexpected keyword argument 'is_training'
Traceback (most recent call last):
  File "cnn_feature_extractor.py", line 148, in <module>
    tf.app.run()
  File "/home/pdguest/Lalit/Selfie/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "cnn_feature_extractor.py", line 112, in main
    processed_images, num_classes=1000, is_training=False
TypeError: resnet_v1_101() got an unexpected keyword argument 'is_training'
Traceback (most recent call last):
  File "cnn_feature_extractor.py", line 148, in <module>
    tf.app.run()
  File "/home/pdguest/Lalit/Selfie/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "cnn_feature_extractor.py", line 112, in main
    processed_images, num_classes=1000, is_training=False
TypeError: resnet_v1_101() got an unexpected keyword argument 'is_training'
Traceback (most recent call last):
  File "cnn_feature_extractor.py", line 148, in <module>
    tf.app.run()
  File "/home/pdguest/Lalit/Selfie/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "cnn_feature_extractor.py", line 112, in main
    processed_images, num_classes=1000, is_training=False
TypeError: resnet_v1_101() got an unexpected keyword argument 'is_training'

Can you please check your code once again and update it ?

[ERROR:2017-05-08 12:42:08,943] Exception in QueueRunner: 0-th value returned by pyfunc_0 is double, but expects float
	 [[Node: PyFunc = PyFunc[Tin=[DT_STRING], Tout=[DT_FLOAT], token="pyfunc_0", _device="/job:localhost/replica:0/task:0/cpu:0"](DecodeCSV)]]

Caused by op u'PyFunc', defined at:
  File "/usr/lib/python2.7/runpy.py", line 162, in _run_module_as_main
    "__main__", fname, loader, pkg_name)
  File "/usr/lib/python2.7/runpy.py", line 72, in _run_code
    exec code in run_globals
  File "/home/HashtagPred/train.py", line 211, in <module>
    tf.app.run()
  File "/home/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "/home/HashtagPred/train.py", line 208, in main
    train()
  File "/home/HashtagPred/train.py", line 98, in train
    tower_caption_mask = enqueue(False)
  File "utils/data_utils.py", line 240, in enqueue
    answer_id, context_mask, caption_mask = read_numpy_format_and_label( filename_queue)
  File "utils/data_utils.py", line 160, in read_numpy_format_and_label
    tf.float32
  File "/home/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/ops/script_ops.py", line 189, in py_func
    input=inp, token=token, Tout=Tout, name=name)
  File "/home/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/ops/gen_script_ops.py", line 40, in _py_func
    name=name)
  File "/home/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/framework/op_def_library.py", line 768, in apply_op
    op_def=op_def)
  File "/home/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 2336, in create_op
    original_op=self._default_original_op, op_def=op_def)
  File "/home/HashtagPred/hashtag/local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 1228, in __init__
    self._traceback = _extract_stack()

InvalidArgumentError (see above for traceback): 0-th value returned by pyfunc_0 is double, but expects float
	 [[Node: PyFunc = PyFunc[Tin=[DT_STRING], Tout=[DT_FLOAT], token="pyfunc_0", _device="/job:localhost/replica:0/task:0/cpu:0"](DecodeCSV)]]

Exception in thread Thread-6: