hadusam / blueoil-sagemaker Goto Github PK
View Code? Open in Web Editor NEWRun Blueoil on Amazon Sagemaker
License: Apache License 2.0
Run Blueoil on Amazon Sagemaker
License: Apache License 2.0
In the blueoil_cifar10_example.ipynb
, a training job invoked by estimator.fit({'dataset': train_data, 'config': config_data})
will fail with the exception
Exception during training: Return Code: 1, CMD: ['/usr/local/pyenv/versions/python3.6/bin/python', '/home/blueoil/blueoil/cmd/main.py', 'train', '--experiment_id', 'cifar10_sample', '--config', 'cifar10_sample.yml'], Err: b'/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/pandas/compat/__init__.py:117: UserWarning: Could not import the lzma module. Your installed Python is incomplete. Attempting to use lzma compression will result in a RuntimeError.\n warnings.warn(msg)\nWARNING:tensorflow:From /usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/horovod/tensorflow/__init__.py:117: The name tf.global_variables is deprecated. Please use tf.compat.v1.global_variables instead.\n\nWARNING:tensorflow:From /usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/horovod/tensorflow/__init__.py:143: The name tf.get_default_graph is deprecated. Please use tf.compat.v1.get_default_graph instead.\n\nTraceback (most recent call last):\n File "/home/blueoil/blueoil/cmd/main.py", line 169, in <module>\n main()\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 764, in __call__\n return self.main(*args, **kwargs)\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 717, in main\n rv = self.invoke(ctx)\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 1137, in invoke\n return _process_result(sub_ctx.command.invoke(sub_ctx))\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 956, in invoke\n return ctx.invoke(self.callback, **ctx.params)\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 555, in invoke\n return callback(*args, **kwargs)\n File "/home/blueoil/blueoil/cmd/main.py", line 79, in train\n experiment_id, checkpoint_name = run_train(config, experiment_id, recreate)\n File "/home/blueoil/blueoil/cmd/train.py", line 302, in train\n run(None, None, config_file, experiment_id, recreate)\n File "/home/blueoil/blueoil/cmd/train.py", line 273, in run\n config = config_util.load(config_file)\n File "/home/blueoil/blueoil/utils/config.py", line 124, in load\n check_config(config)\n File "/home/blueoil/blueoil/utils/config.py", line 106, in check_config\n raise KeyError("config file should include {} parameter".format(key))\nKeyError: \'config file should include IS_DEBUG parameter\'\n'
Traceback (most recent call last):
File "/opt/ml/code/train", line 119, in <module>
_run(train_cmd)
File "/opt/ml/code/train", line 60, in _run
raise Exception(error_msg)
Exception: Return Code: 1, CMD: ['/usr/local/pyenv/versions/python3.6/bin/python', '/home/blueoil/blueoil/cmd/main.py', 'train', '--experiment_id', 'cifar10_sample', '--config', 'cifar10_sample.yml'], Err: b'/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/pandas/compat/__init__.py:117: UserWarning: Could not import the lzma module. Your installed Python is incomplete. Attempting to use lzma compression will result in a RuntimeError.\n warnings.warn(msg)\nWARNING:tensorflow:From /usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/horovod/tensorflow/__init__.py:117: The name tf.global_variables is deprecated. Please use tf.compat.v1.global_variables instead.\n\nWARNING:tensorflow:From /usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/horovod/tensorflow/__init__.py:143: The name tf.get_default_graph is deprecated. Please use tf.compat.v1.get_default_graph instead.\n\nTraceback (most recent call last):\n File "/home/blueoil/blueoil/cmd/main.py", line 169, in <module>\n main()\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 764, in __call__\n return self.main(*args, **kwargs)\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 717, in main\n rv = self.invoke(ctx)\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 1137, in invoke\n return _process_result(sub_ctx.command.invoke(sub_ctx))\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 956, in invoke\n return ctx.invoke(self.callback, **ctx.params)\n File "/usr/local/pyenv/versions/python3.6/lib/python3.6/site-packages/click/core.py", line 555, in invoke\n return callback(*args, **kwargs)\n File "/home/blueoil/blueoil/cmd/main.py", line 79, in train\n experiment_id, checkpoint_name = run_train(config, experiment_id, recreate)\n File "/home/blueoil/blueoil/cmd/train.py", line 302, in train\n run(None, None, config_file, experiment_id, recreate)\n File "/home/blueoil/blueoil/cmd/train.py", line 273, in run\n config = config_util.load(config_file)\n File "/home/blueoil/blueoil/utils/config.py", line 124, in load\n check_config(config)\n File "/home/blueoil/blueoil/utils/config.py", line 106, in check_config\n raise KeyError("config file should include {} parameter".format(key))\nKeyError: \'config file should include IS_DEBUG parameter\'\n'
which indicates KeyError: \'config file should include IS_DEBUG parameter\'
.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.