Comments (12)
Are you sure your COCO directory is set up correctly?
from keras-retinanet.
Yes. Before I updated Keras, it worked fine.
I am hitting this problem too. Did you solve it? Thanks. @wanghuok
@complicatedlee It was my fault: my COCO path was not correct. But after correcting the COCO path, I get the error below. My GPU is a GTX 1060 with 6 GB of video memory.
loading annotations into memory...
Done (t=12.17s)
creating index...
index created!
loading annotations into memory...
Done (t=0.38s)
creating index...
index created!
Epoch 1/20
2017-11-08 17:41:09.261753: E tensorflow/stream_executor/cuda/cuda_blas.cc:366] failed to create cublas handle: CUBLAS_STATUS_NOT_INITIALIZED
2017-11-08 17:41:09.261779: W tensorflow/stream_executor/stream.cc:1901] attempting to perform BLAS operation using StreamExecutor without BLAS support
Traceback (most recent call last):
File "examples/train_coco.py", line 106, in <module>
keras.callbacks.ReduceLROnPlateau(monitor='loss', factor=0.1, patience=2, verbose=1, mode='auto', epsilon=0.0001, cooldown=0, min_lr=0),
File "/home/wh/.local/lib/python2.7/site-packages/keras/legacy/interfaces.py", line 87, in wrapper
return func(*args, **kwargs)
File "/home/wh/.local/lib/python2.7/site-packages/keras/engine/training.py", line 2077, in fit_generator
class_weight=class_weight)
File "/home/wh/.local/lib/python2.7/site-packages/keras/engine/training.py", line 1797, in train_on_batch
outputs = self.train_function(ins)
File "/home/wh/.local/lib/python2.7/site-packages/keras/backend/tensorflow_backend.py", line 2332, in call
**self.session_kwargs)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 889, in run
run_metadata_ptr)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 1120, in _run
feed_dict_tensor, options, run_metadata)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 1317, in _do_run
options, run_metadata)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 1336, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InternalError: Blas SGEMM launch failed : m=22500, n=256, k=64
[[Node: res2a_branch1/convolution = Conv2D[T=DT_FLOAT, data_format="NHWC", padding="VALID", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](pool1/MaxPool, res2a_branch1/kernel/read)]]
[[Node: loss/add/_2591 = _Recvclient_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_12603_loss/add", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
Caused by op u'res2a_branch1/convolution', defined at:
File "examples/train_coco.py", line 60, in <module>
model = create_model(weights=args.weights)
File "examples/train_coco.py", line 38, in create_model
return ResNet50RetinaNet(image, num_classes=90, weights=weights)
File "/home/wh/gitmodel/keras-retinanet/keras_retinanet/models/resnet.py", line 36, in ResNet50RetinaNet
resnet = keras_resnet.models.ResNet50(image, include_top=False, freeze_bn=True)
File "/home/wh/.local/lib/python2.7/site-packages/keras_resnet/models/_2d.py", line 181, in ResNet50
return ResNet(inputs, blocks, block=keras_resnet.blocks.bottleneck_2d, include_top=include_top, classes=classes, *args, **kwargs)
File "/home/wh/.local/lib/python2.7/site-packages/keras_resnet/models/_2d.py", line 70, in ResNet
x = block(features, stage_id, block_id, numerical_name=(blocks[stage_id] > 6), freeze_bn=freeze_bn)(x)
File "/home/wh/.local/lib/python2.7/site-packages/keras_resnet/blocks/_2d.py", line 139, in f
shortcut = keras.layers.Conv2D(filters * 4, (1, 1), strides=stride, name="res{}{}_branch1".format(stage_char, block_char), **parameters)(x)
File "/home/wh/.local/lib/python2.7/site-packages/keras/engine/topology.py", line 603, in call
output = self.call(inputs, **kwargs)
File "/home/wh/.local/lib/python2.7/site-packages/keras/layers/convolutional.py", line 164, in call
dilation_rate=self.dilation_rate)
File "/home/wh/.local/lib/python2.7/site-packages/keras/backend/tensorflow_backend.py", line 3185, in conv2d
data_format=tf_data_format)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/ops/nn_ops.py", line 751, in convolution
return op(input, filter)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/ops/nn_ops.py", line 835, in call
return self.conv_op(inp, filter)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/ops/nn_ops.py", line 499, in call
return self.call(inp, filter)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/ops/nn_ops.py", line 187, in call
name=self.name)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/ops/gen_nn_ops.py", line 631, in conv2d
data_format=data_format, name=name)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/framework/op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 2956, in create_op
op_def=op_def)
File "/home/wh/.local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 1470, in init
self._traceback = self._graph._extract_stack() # pylint: disable=protected-access
InternalError (see above for traceback): Blas SGEMM launch failed : m=22500, n=256, k=64
[[Node: res2a_branch1/convolution = Conv2D[T=DT_FLOAT, data_format="NHWC", padding="VALID", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](pool1/MaxPool, res2a_branch1/kernel/read)]]
[[Node: loss/add/_2591 = _Recvclient_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_12603_loss/add", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
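For a sense of scale, the shape reported in `Blas SGEMM launch failed : m=22500, n=256, k=64` can be turned into buffer sizes. This is a rough sketch, assuming the usual GEMM convention (an m x k matrix times a k x n matrix) and 4 bytes per element, since the log reports `DT_FLOAT`. The helper name `gemm_tensor_bytes` is made up for illustration.

```python
BYTES_PER_FLOAT32 = 4  # DT_FLOAT in the log is a 32-bit float


def gemm_tensor_bytes(m, n, k, bytes_per_element=BYTES_PER_FLOAT32):
    """Return the size in bytes of the three matrices in an (m x k) * (k x n) GEMM."""
    return {
        "input (m x k)": m * k * bytes_per_element,
        "weights (k x n)": k * n * bytes_per_element,
        "output (m x n)": m * n * bytes_per_element,
    }


# Shape taken from the error message above.
sizes = gemm_tensor_bytes(m=22500, n=256, k=64)
for name, nbytes in sizes.items():
    print("{}: {:.2f} MiB".format(name, nbytes / 2.0 ** 20))
```

These figures cover only this one operation, not the memory held by the rest of the network or by other processes using the GPU.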
@complicatedlee
The structure of your COCO dataset should be:
/path/to/COCO/
/path/to/COCO/annotations/
/path/to/COCO/images/train2017/
/path/to/COCO/images/val2017/
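A quick way to check a path against the layout above before starting training is sketched below. It assumes Python 3 (for `pathlib`); the function name `missing_coco_dirs` is hypothetical and not part of keras-retinanet.

```python
from pathlib import Path

# Sub-directories taken from the layout listed above.
EXPECTED_SUBDIRS = ("annotations", "images/train2017", "images/val2017")


def missing_coco_dirs(coco_root):
    """Return the expected sub-directories that do not exist under coco_root."""
    root = Path(coco_root)
    return [sub for sub in EXPECTED_SUBDIRS if not (root / sub).is_dir()]
```

Calling `missing_coco_dirs("/path/to/COCO")` returns an empty list when all three directories are present, and otherwise the names of the ones that are missing.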
Maybe you should reduce your batch size.
I found my problem too: I had just set the wrong path. Thanks for your help! @wanghuok
@complicatedlee You are welcome. My batch_size is already set to 1. Is it because my GPU has too little memory?
If you use Anaconda, you could try updating all packages with the following command:
conda update --all
Then try again. I'm not sure this will help with your problem, though. @wanghuok
I'm afraid 6 GB might not be enough. You can try to reduce the size of the images further by modifying the image_min_side and image_max_side values in the generator.
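To see how smaller image_min_side and image_max_side values shrink the network input, here is a minimal sketch of the common min/max-side resize rule: scale the shorter side to the minimum, unless that would push the longer side past the maximum. It is an assumption that the generator follows exactly this rule, and both function names are hypothetical.

```python
def compute_resize_scale(height, width, min_side, max_side):
    """Scale so the shorter side equals min_side, capped so the longer side fits max_side."""
    smallest = min(height, width)
    largest = max(height, width)
    scale = float(min_side) / smallest
    if largest * scale > max_side:
        # The longer side would overshoot, so fit it to max_side instead.
        scale = float(max_side) / largest
    return scale


def resized_shape(height, width, min_side, max_side):
    """Return the (height, width) an image would have after resizing."""
    scale = compute_resize_scale(height, width, min_side, max_side)
    return int(round(height * scale)), int(round(width * scale))


# A 480 x 640 image under two illustrative settings (the values are examples only).
print(resized_shape(480, 640, min_side=600, max_side=1000))  # (600, 800)
print(resized_shape(480, 640, min_side=400, max_side=600))
```

Activation memory in the convolutional layers grows roughly with the number of input pixels, so lowering both values together is what reduces the footprint.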
I don't know what the problem was, but it works after I switched to Python 3.