@rpautrat
When i run the step6) "python3 experiment.py train configs/superpoint_coco.yaml superpoint_coco", I got the following statements, so how should i do? is there anything wrong? I need some advice, Thank you very much.
$ python3 experiment.py train configs/superpoint_coco.yaml superpoint_coco
[01/11/2019 00:29:25 INFO] Running command TRAIN
[01/11/2019 00:29:25 INFO] Number of GPUs detected: 1
[01/11/2019 00:29:28 INFO] Caching data, fist access will take some time.
[01/11/2019 00:29:29 INFO] Caching data, fist access will take some time.
[01/11/2019 00:29:30 INFO] Caching data, fist access will take some time.
2019-01-11 00:29:31.373688: I tensorflow/core/platform/cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2019-01-11 00:29:31.441013: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:898] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2019-01-11 00:29:31.441328: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1212] Found device 0 with properties:
name: GeForce GTX 1070 major: 6 minor: 1 memoryClockRate(GHz): 1.695
pciBusID: 0000:01:00.0
totalMemory: 7.92GiB freeMemory: 7.11GiB
2019-01-11 00:29:31.441340: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1312] Adding visible gpu devices: 0
2019-01-11 00:29:31.610840: I tensorflow/core/common_runtime/gpu/gpu_device.cc:993] Creating TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 6868 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1070, pci bus id: 0000:01:00.0, compute capability: 6.1)
[01/11/2019 00:29:31 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:31 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:32 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
[01/11/2019 00:29:33 INFO] Scale of 0 disables regularizer.
2019-01-11 00:29:33.541473: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1312] Adding visible gpu devices: 0
2019-01-11 00:29:33.541570: I tensorflow/core/common_runtime/gpu/gpu_device.cc:993] Creating TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 135 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1070, pci bus id: 0000:01:00.0, compute capability: 6.1)
[01/11/2019 00:29:41 INFO] Start training
2019-01-11 00:29:45.965791: I tensorflow/core/kernels/cuda_solvers.cc:159] Creating CudaSolver handles for stream 0x7f8dc016dbb0
2019-01-11 00:29:57.274840: W tensorflow/core/common_runtime/bfc_allocator.cc:275] Allocator (GPU_0_bfc) ran out of memory trying to allocate 4.12GiB. Current allocation summary follows.
2019-01-11 00:29:57.274945: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (256): Total Chunks: 50, Chunks in use: 46. 12.5KiB allocated for chunks. 11.5KiB in use in bin. 5.1KiB client-requested in use in bin.
2019-01-11 00:29:57.274977: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (512): Total Chunks: 26, Chunks in use: 20. 13.5KiB allocated for chunks. 10.2KiB in use in bin. 10.0KiB client-requested in use in bin.
2019-01-11 00:29:57.275005: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (1024): Total Chunks: 17, Chunks in use: 12. 17.8KiB allocated for chunks. 12.5KiB in use in bin. 12.0KiB client-requested in use in bin.
2019-01-11 00:29:57.275037: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (2048): Total Chunks: 0, Chunks in use: 0. 0B allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275060: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (4096): Total Chunks: 0, Chunks in use: 0. 0B allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275093: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (8192): Total Chunks: 1, Chunks in use: 0. 15.0KiB allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275118: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (16384): Total Chunks: 1, Chunks in use: 0. 30.5KiB allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275137: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (32768): Total Chunks: 1, Chunks in use: 0. 56.8KiB allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275157: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (65536): Total Chunks: 1, Chunks in use: 0. 69.5KiB allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275180: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (131072): Total Chunks: 3, Chunks in use: 3. 432.0KiB allocated for chunks. 432.0KiB in use in bin. 432.0KiB client-requested in use in bin.
2019-01-11 00:29:57.275201: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (262144): Total Chunks: 3, Chunks in use: 2. 828.0KiB allocated for chunks. 544.0KiB in use in bin. 544.0KiB client-requested in use in bin.
2019-01-11 00:29:57.275221: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (524288): Total Chunks: 7, Chunks in use: 5. 4.90MiB allocated for chunks. 3.45MiB in use in bin. 3.45MiB client-requested in use in bin.
2019-01-11 00:29:57.275245: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (1048576): Total Chunks: 13, Chunks in use: 10. 20.95MiB allocated for chunks. 16.95MiB in use in bin. 16.95MiB client-requested in use in bin.
2019-01-11 00:29:57.275267: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (2097152): Total Chunks: 15, Chunks in use: 13. 49.42MiB allocated for chunks. 42.39MiB in use in bin. 40.43MiB client-requested in use in bin.
2019-01-11 00:29:57.275289: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (4194304): Total Chunks: 8, Chunks in use: 8. 57.09MiB allocated for chunks. 57.09MiB in use in bin. 56.25MiB client-requested in use in bin.
2019-01-11 00:29:57.275312: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (8388608): Total Chunks: 11, Chunks in use: 10. 149.70MiB allocated for chunks. 140.91MiB in use in bin. 140.62MiB client-requested in use in bin.
2019-01-11 00:29:57.275331: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (16777216): Total Chunks: 0, Chunks in use: 0. 0B allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275354: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (33554432): Total Chunks: 8, Chunks in use: 8. 455.36MiB allocated for chunks. 455.36MiB in use in bin. 450.00MiB client-requested in use in bin.
2019-01-11 00:29:57.275372: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (67108864): Total Chunks: 0, Chunks in use: 0. 0B allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275389: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (134217728): Total Chunks: 0, Chunks in use: 0. 0B allocated for chunks. 0B in use in bin. 0B client-requested in use in bin.
2019-01-11 00:29:57.275409: I tensorflow/core/common_runtime/bfc_allocator.cc:630] Bin (268435456): Total Chunks: 2, Chunks in use: 1. 5.99GiB allocated for chunks. 4.19GiB in use in bin. 4.12GiB client-requested in use in bin.
2019-01-11 00:29:57.275429: I tensorflow/core/common_runtime/bfc_allocator.cc:646] Bin for 4.12GiB was 256.00MiB, Chunk State:
2019-01-11 00:29:57.275459: I tensorflow/core/common_runtime/bfc_allocator.cc:652] Size: 1.80GiB | Requested Size: 17.16MiB | in_use: 0, prev: Size: 4.19GiB | Requested Size: 4.12GiB | in_use: 1
2019-01-11 00:29:57.275481: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200000 of size 1280
2019-01-11 00:29:57.275498: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200500 of size 1280
2019-01-11 00:29:57.275522: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200a00 of size 256
2019-01-11 00:29:57.275545: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200b00 of size 256
2019-01-11 00:29:57.275568: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200c00 of size 256
2019-01-11 00:29:57.275592: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200d00 of size 256
2019-01-11 00:29:57.275615: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200e00 of size 256
2019-01-11 00:29:57.275638: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a200f00 of size 256
2019-01-11 00:29:57.275660: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201000 of size 256
2019-01-11 00:29:57.275676: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201100 of size 256
2019-01-11 00:29:57.275690: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201200 of size 256
2019-01-11 00:29:57.275710: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201300 of size 256
2019-01-11 00:29:57.275730: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201400 of size 256
2019-01-11 00:29:57.275745: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201500 of size 256
2019-01-11 00:29:57.275760: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201600 of size 256
2019-01-11 00:29:57.275777: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201700 of size 256
2019-01-11 00:29:57.275812: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201800 of size 256
2019-01-11 00:29:57.275831: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201900 of size 256
2019-01-11 00:29:57.275846: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201a00 of size 256
2019-01-11 00:29:57.275866: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201b00 of size 256
2019-01-11 00:29:57.275889: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201c00 of size 256
2019-01-11 00:29:57.275914: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201d00 of size 256
2019-01-11 00:29:57.275939: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201e00 of size 256
2019-01-11 00:29:57.275963: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a201f00 of size 256
2019-01-11 00:29:57.275995: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a202000 of size 256
2019-01-11 00:29:57.276012: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a202100 of size 256
2019-01-11 00:29:57.276035: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a202200 of size 256
2019-01-11 00:29:57.276059: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a202300 of size 256
2019-01-11 00:29:57.276088: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a202400 of size 589824
2019-01-11 00:29:57.276106: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a292400 of size 512
2019-01-11 00:29:57.276121: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a292600 of size 512
2019-01-11 00:29:57.276136: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a292800 of size 589824
2019-01-11 00:29:57.276153: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a322800 of size 512
2019-01-11 00:29:57.276174: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a322a00 of size 512
2019-01-11 00:29:57.276215: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a322c00 of size 512
2019-01-11 00:29:57.276232: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a322e00 of size 512
2019-01-11 00:29:57.276247: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a323000 of size 589824
2019-01-11 00:29:57.276262: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a3b3000 of size 512
2019-01-11 00:29:57.276277: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a3b3200 of size 512
2019-01-11 00:29:57.276292: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a3b3400 of size 768
2019-01-11 00:29:57.276307: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a3b3700 of size 512
2019-01-11 00:29:57.276322: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a3b3900 of size 294912
2019-01-11 00:29:57.276337: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a3fb900 of size 256
2019-01-11 00:29:57.276352: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a3fba00 of size 512
2019-01-11 00:29:57.276367: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a3fbc00 of size 512
2019-01-11 00:29:57.276382: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a3fbe00 of size 262144
2019-01-11 00:29:57.276398: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43be00 of size 1024
2019-01-11 00:29:57.276421: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a43c200 of size 1024
2019-01-11 00:29:57.276442: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43c600 of size 256
2019-01-11 00:29:57.276467: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43c700 of size 256
2019-01-11 00:29:57.276499: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43c800 of size 256
2019-01-11 00:29:57.276515: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43c900 of size 256
2019-01-11 00:29:57.276537: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43ca00 of size 256
2019-01-11 00:29:57.276561: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43cb00 of size 512
2019-01-11 00:29:57.276587: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43cd00 of size 768
2019-01-11 00:29:57.276612: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a43d000 of size 1024
2019-01-11 00:29:57.276638: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43d400 of size 256
2019-01-11 00:29:57.276658: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a43d500 of size 256
2019-01-11 00:29:57.276684: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a43d600 of size 147456
2019-01-11 00:29:57.276711: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a461600 of size 256
2019-01-11 00:29:57.276731: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a461700 of size 1179648
2019-01-11 00:29:57.276756: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a581700 of size 1280
2019-01-11 00:29:57.276785: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a581c00 of size 1024
2019-01-11 00:29:57.276807: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a582000 of size 1024
2019-01-11 00:29:57.276835: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a582400 of size 256
2019-01-11 00:29:57.276854: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a582500 of size 1179904
2019-01-11 00:29:57.276879: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a6a2600 of size 256
2019-01-11 00:29:57.276906: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a6a2700 of size 71168
2019-01-11 00:29:57.276927: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a6b3d00 of size 147456
2019-01-11 00:29:57.276954: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a6d7d00 of size 256
2019-01-11 00:29:57.276980: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a6d7e00 of size 256
2019-01-11 00:29:57.277006: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a6d7f00 of size 256
2019-01-11 00:29:57.277026: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a6d8000 of size 147456
2019-01-11 00:29:57.277057: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a6fc000 of size 921600
2019-01-11 00:29:57.277078: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a7dd000 of size 921856
2019-01-11 00:29:57.277094: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1020a8be100 of size 58112
2019-01-11 00:29:57.277120: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020a8cc400 of size 62613248
2019-01-11 00:29:57.277140: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1020e482b00 of size 58982400
2019-01-11 00:29:57.277165: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x10211cc2b00 of size 15360
2019-01-11 00:29:57.277191: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6700 of size 256
2019-01-11 00:29:57.277208: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6800 of size 256
2019-01-11 00:29:57.277223: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6900 of size 256
2019-01-11 00:29:57.277243: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6a00 of size 256
2019-01-11 00:29:57.277258: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6b00 of size 256
2019-01-11 00:29:57.277273: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6c00 of size 256
2019-01-11 00:29:57.277298: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6d00 of size 256
2019-01-11 00:29:57.277324: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x10211cc6e00 of size 256
2019-01-11 00:29:57.277343: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc6f00 of size 256
2019-01-11 00:29:57.277372: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7000 of size 256
2019-01-11 00:29:57.277400: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7100 of size 512
2019-01-11 00:29:57.277418: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7300 of size 512
2019-01-11 00:29:57.277436: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7500 of size 512
2019-01-11 00:29:57.277467: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7700 of size 512
2019-01-11 00:29:57.277494: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7900 of size 512
2019-01-11 00:29:57.277515: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7b00 of size 1024
2019-01-11 00:29:57.277542: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc7f00 of size 512
2019-01-11 00:29:57.277567: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc8100 of size 512
2019-01-11 00:29:57.277593: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x10211cc8300 of size 512
2019-01-11 00:29:57.277610: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc8500 of size 512
2019-01-11 00:29:57.277628: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc8700 of size 512
2019-01-11 00:29:57.277654: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc8900 of size 1024
2019-01-11 00:29:57.277676: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc8d00 of size 1024
2019-01-11 00:29:57.277701: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc9100 of size 1024
2019-01-11 00:29:57.277722: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc9500 of size 1024
2019-01-11 00:29:57.277748: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cc9900 of size 1024
2019-01-11 00:29:57.277770: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x10211cc9d00 of size 31232
2019-01-11 00:29:57.277798: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10211cd1700 of size 60823552
2019-01-11 00:29:57.277823: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102156d2f00 of size 58982400
2019-01-11 00:29:57.277851: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10218f12f00 of size 58982400
2019-01-11 00:29:57.277878: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1021c752f00 of size 58982400
2019-01-11 00:29:57.277905: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1021ff92f00 of size 14745600
2019-01-11 00:29:57.277928: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10220da2f00 of size 14745600
2019-01-11 00:29:57.277955: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10221bb2f00 of size 14745600
2019-01-11 00:29:57.277982: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102229c2f00 of size 14745600
2019-01-11 00:29:57.278009: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102237d2f00 of size 14745600
2019-01-11 00:29:57.278037: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102245e2f00 of size 3686400
2019-01-11 00:29:57.278065: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10224966f00 of size 59129856
2019-01-11 00:29:57.278092: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102281caf00 of size 58982400
2019-01-11 00:29:57.278122: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022ba0af00 of size 7667712
2019-01-11 00:29:57.278149: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022c15af00 of size 14745600
2019-01-11 00:29:57.278177: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022cf6af00 of size 7372800
2019-01-11 00:29:57.278204: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022d672f00 of size 14893056
2019-01-11 00:29:57.278231: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022e4a6f00 of size 14745600
2019-01-11 00:29:57.278259: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022f2b6f00 of size 7962624
2019-01-11 00:29:57.278287: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1022fa4ef00 of size 7372800
2019-01-11 00:29:57.278315: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10230156f00 of size 1843200
2019-01-11 00:29:57.278343: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10230318f00 of size 14893056
2019-01-11 00:29:57.278371: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1023114cf00 of size 14745600
2019-01-11 00:29:57.278394: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10231f5cf00 of size 2695168
2019-01-11 00:29:57.278423: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102321eef00 of size 1843200
2019-01-11 00:29:57.278443: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x102323b0f00 of size 1024
2019-01-11 00:29:57.278473: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102323b1300 of size 512
2019-01-11 00:29:57.278493: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102323b1500 of size 512
2019-01-11 00:29:57.278522: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102323b1700 of size 1024
2019-01-11 00:29:57.278548: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102323b1b00 of size 1024
2019-01-11 00:29:57.278576: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x102323b1f00 of size 290816
2019-01-11 00:29:57.278603: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102323f8f00 of size 2138112
2019-01-11 00:29:57.278630: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10232602f00 of size 1843200
2019-01-11 00:29:57.278656: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102327c4f00 of size 3686400
2019-01-11 00:29:57.278683: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x10232b48f00 of size 3686400
2019-01-11 00:29:57.278709: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10232eccf00 of size 1843200
2019-01-11 00:29:57.278736: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x1023308ef00 of size 1843200
2019-01-11 00:29:57.278762: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10233250f00 of size 7372800
2019-01-11 00:29:57.278788: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10233958f00 of size 7372800
2019-01-11 00:29:57.278814: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10234060f00 of size 3686400
2019-01-11 00:29:57.278837: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x102343e4f00 of size 3686400
2019-01-11 00:29:57.278864: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10234768f00 of size 3686400
2019-01-11 00:29:57.278892: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10234aecf00 of size 7372800
2019-01-11 00:29:57.278919: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x102351f4f00 of size 1179648
2019-01-11 00:29:57.278945: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10235314f00 of size 7372800
2019-01-11 00:29:57.278972: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x10235a1cf00 of size 936192
2019-01-11 00:29:57.278998: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10235b01800 of size 2750208
2019-01-11 00:29:57.279024: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10235da0f00 of size 3686400
2019-01-11 00:29:57.279051: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10236124f00 of size 1843200
2019-01-11 00:29:57.279077: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x102362e6f00 of size 589824
2019-01-11 00:29:57.279104: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10236376f00 of size 1843200
2019-01-11 00:29:57.279130: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10236538f00 of size 1843200
2019-01-11 00:29:57.279158: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x102366faf00 of size 1843200
2019-01-11 00:29:57.279184: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102368bcf00 of size 3686400
2019-01-11 00:29:57.279211: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10236c40f00 of size 3686400
2019-01-11 00:29:57.279238: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10236fc4f00 of size 3686400
2019-01-11 00:29:57.279264: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10237348f00 of size 1843200
2019-01-11 00:29:57.279291: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x1023750af00 of size 9216000
2019-01-11 00:29:57.279318: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10237dd4f00 of size 3686400
2019-01-11 00:29:57.279344: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x10238158f00 of size 3686400
2019-01-11 00:29:57.279373: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Chunk at 0x102384dcf00 of size 4497120000
2019-01-11 00:29:57.279400: I tensorflow/core/common_runtime/bfc_allocator.cc:665] Free at 0x103445a6a00 of size 1930332416
2019-01-11 00:29:57.279424: I tensorflow/core/common_runtime/bfc_allocator.cc:671] Summary of in-use Chunks by size:
2019-01-11 00:29:57.279448: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 46 Chunks of size 256 totalling 11.5KiB
2019-01-11 00:29:57.279469: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 19 Chunks of size 512 totalling 9.5KiB
2019-01-11 00:29:57.279486: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 768 totalling 768B
2019-01-11 00:29:57.279506: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 10 Chunks of size 1024 totalling 10.0KiB
2019-01-11 00:29:57.279523: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 2 Chunks of size 1280 totalling 2.5KiB
2019-01-11 00:29:57.279543: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 3 Chunks of size 147456 totalling 432.0KiB
2019-01-11 00:29:57.279561: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 262144 totalling 256.0KiB
2019-01-11 00:29:57.279580: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 294912 totalling 288.0KiB
2019-01-11 00:29:57.279597: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 3 Chunks of size 589824 totalling 1.69MiB
2019-01-11 00:29:57.279617: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 921600 totalling 900.0KiB
2019-01-11 00:29:57.279642: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 921856 totalling 900.2KiB
2019-01-11 00:29:57.279671: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 1179648 totalling 1.12MiB
2019-01-11 00:29:57.279701: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 9 Chunks of size 1843200 totalling 15.82MiB
2019-01-11 00:29:57.279731: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 2138112 totalling 2.04MiB
2019-01-11 00:29:57.279759: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 2695168 totalling 2.57MiB
2019-01-11 00:29:57.279789: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 2750208 totalling 2.62MiB
2019-01-11 00:29:57.279819: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 10 Chunks of size 3686400 totalling 35.16MiB
2019-01-11 00:29:57.279850: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 6 Chunks of size 7372800 totalling 42.19MiB
2019-01-11 00:29:57.279879: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 7667712 totalling 7.31MiB
2019-01-11 00:29:57.279907: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 7962624 totalling 7.59MiB
2019-01-11 00:29:57.279937: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 8 Chunks of size 14745600 totalling 112.50MiB
2019-01-11 00:29:57.279968: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 2 Chunks of size 14893056 totalling 28.41MiB
2019-01-11 00:29:57.279997: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 5 Chunks of size 58982400 totalling 281.25MiB
2019-01-11 00:29:57.280028: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 59129856 totalling 56.39MiB
2019-01-11 00:29:57.280058: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 60823552 totalling 58.01MiB
2019-01-11 00:29:57.280088: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 62613248 totalling 59.71MiB
2019-01-11 00:29:57.280115: I tensorflow/core/common_runtime/bfc_allocator.cc:674] 1 Chunks of size 4497120000 totalling 4.19GiB
2019-01-11 00:29:57.280143: I tensorflow/core/common_runtime/bfc_allocator.cc:678] Sum Total of in-use chunks: 4.89GiB
2019-01-11 00:29:57.280175: I tensorflow/core/common_runtime/bfc_allocator.cc:680] Stats:
Limit: 7202206516
InUse: 5249080064
MaxInUse: 6401664768
NumAllocs: 531
MaxAllocSize: 4497120000
2019-01-11 00:29:57.280248: W tensorflow/core/common_runtime/bfc_allocator.cc:279] *************************************************************************x__________________________
2019-01-11 00:29:57.280303: W tensorflow/core/framework/op_kernel.cc:1202] OP_REQUIRES failed at cwise_ops_common.cc:70 : Resource exhausted: OOM when allocating tensor with shape[3,30,40,30,40,256] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
Traceback (most recent call last):
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1361, in _do_call
return fn(*args)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1340, in _run_fn
target_list, status, run_metadata)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/errors_impl.py", line 516, in exit
c_api.TF_GetCode(self.status.status))
tensorflow.python.framework.errors_impl.ResourceExhaustedError: OOM when allocating tensor with shape[3,30,40,30,40,256] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
[[Node: superpoint/train_tower0/gradients/superpoint/train_tower0/mul_3_grad/mul_1 = Mul[T=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:GPU:0"](superpoint/train_tower0/Reshape_6, superpoint/train_tower0/gradients/superpoint/train_tower0/Sum_grad/Tile)]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.
[[Node: superpoint/train_tower0/gradients/AddN_17/_427 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_2157_superpoint/train_tower0/gradients/AddN_17", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "experiment.py", line 154, in
args.func(config, output_dir, args)
File "experiment.py", line 92, in _cli_train
train(config, config['train_iter'], output_dir)
File "experiment.py", line 33, in train
keep_checkpoints=config.get('keep_checkpoints', 1))
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 313, in train
options=options, run_metadata=run_metadata)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 905, in run
run_metadata_ptr)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1137, in _run
feed_dict_tensor, options, run_metadata)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1355, in _do_run
options, run_metadata)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1374, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.ResourceExhaustedError: OOM when allocating tensor with shape[3,30,40,30,40,256] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
[[Node: superpoint/train_tower0/gradients/superpoint/train_tower0/mul_3_grad/mul_1 = Mul[T=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:GPU:0"](superpoint/train_tower0/Reshape_6, superpoint/train_tower0/gradients/superpoint/train_tower0/Sum_grad/Tile)]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.
[[Node: superpoint/train_tower0/gradients/AddN_17/_427 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_2157_superpoint/train_tower0/gradients/AddN_17", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.
Caused by op 'superpoint/train_tower0/gradients/superpoint/train_tower0/mul_3_grad/mul_1', defined at:
File "experiment.py", line 154, in
args.func(config, output_dir, args)
File "experiment.py", line 92, in _cli_train
train(config, config['train_iter'], output_dir)
File "experiment.py", line 27, in train
with _init_graph(config) as net:
File "/opt/python3.6/lib/python3.6/contextlib.py", line 82, in enter
return next(self.gen)
File "experiment.py", line 77, in _init_graph
n_gpus=n_gpus, **config['model'])
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 122, in init
self._build_graph()
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 264, in _build_graph
self._train_graph(data)
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 188, in _train_graph
data, Mode.TRAIN, self.config['batch_size'])
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 164, in _gpu_tower
grad = tf.gradients(loss, model_params)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/gradients_impl.py", line 611, in gradients
lambda: grad_fn(op, *out_grads))
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/gradients_impl.py", line 377, in _MaybeCompile
return grad_fn() # Exit early
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/gradients_impl.py", line 611, in
lambda: grad_fn(op, *out_grads))
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/math_grad.py", line 798, in _MulGrad
array_ops.reshape(math_ops.reduce_sum(x * grad, ry), sy))
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/math_ops.py", line 934, in binary_op_wrapper
return func(x, y, name=name)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/math_ops.py", line 1161, in _mul_dispatch
return gen_math_ops._mul(x, y, name=name)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/gen_math_ops.py", line 2789, in _mul
"Mul", x=x, y=y, name=name)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 3271, in create_op
op_def=op_def)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 1650, in init
self._traceback = self._graph._extract_stack() # pylint: disable=protected-access
...which was originally created as op 'superpoint/train_tower0/mul_3', defined at:
File "experiment.py", line 154, in
args.func(config, output_dir, args)
[elided 6 identical lines from previous traceback]
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 188, in _train_graph
data, Mode.TRAIN, self.config['batch_size'])
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/base_model.py", line 159, in _gpu_tower
loss = self._loss(net_outputs, shards[i], **self.config)
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/super_point.py", line 82, in _loss
valid_mask=inputs['warped']['valid_mask'], **config)
File "/home/ivip/lidongjiang/SuperPoint/superpoint/models/utils.py", line 103, in descriptor_loss
dot_product_desc = tf.reduce_sum(descriptors * warped_descriptors, -1)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/math_ops.py", line 934, in binary_op_wrapper
return func(x, y, name=name)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/math_ops.py", line 1161, in _mul_dispatch
return gen_math_ops._mul(x, y, name=name)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/ops/gen_math_ops.py", line 2789, in _mul
"Mul", x=x, y=y, name=name)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 3271, in create_op
op_def=op_def)
File "/home/ivip/lidongjiang/SP-venv/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 1650, in init
self._traceback = self._graph._extract_stack() # pylint: disable=protected-access
ResourceExhaustedError (see above for traceback): OOM when allocating tensor with shape[3,30,40,30,40,256] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
[[Node: superpoint/train_tower0/gradients/superpoint/train_tower0/mul_3_grad/mul_1 = Mul[T=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:GPU:0"](superpoint/train_tower0/Reshape_6, superpoint/train_tower0/gradients/superpoint/train_tower0/Sum_grad/Tile)]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.
[[Node: superpoint/train_tower0/gradients/AddN_17/_427 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_2157_superpoint/train_tower0/gradients/AddN_17", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.