Comments (2)
Hi @OfirKP , the trains package was designed with always on network connection in mind, unfortunately there is no 'offline' option at the moment.
That said let me expand on the network issues you seem to run into:
If a connection is dropped (such as in your case), it will retry to connect to the server, these are the retry messages you keep getting.
If a connection was dropped after a socket was created (post handshake) it will use the socket request timeout (imagine a very very very slow download connection, this is essentially the same timeout). This timeout is defined to 10 minutes, after 10min passed and the request didn't return fully, a new connection will be established.
Setting retry limit to zero will not effect the socket timeout, and I suspect that this is was you observed, when you completely cut the network connections.
My quickest solution for you is to mark out the Task.init()
line, whenever you are in unstable network environment (such as, commuting on a train 😃 ).
from clearml.
Closing, due to lack of activity.
from clearml.
Related Issues (20)
- Scrolling log problem when using tqdm as training process bar HOT 5
- ClearML feature for integration KerasTuner is broken HOT 1
- Fix typo in docs and default sdk config HOT 1
- Executing clearml-task from cli with "-m" modules HOT 1
- Dynamic GPU/Queue Allocation for Workers in ClearML
- Add tag with Clearm-task (cli tools) HOT 1
- Problem creating datasets with Azure storage when multi file HOT 5
- Task creation failed!Always searching for this project? But I don't have it! HOT 1
- Support Megatron-LM training job on k8s cluster HOT 4
- Model.get_local_copy with specific download path. HOT 1
- "413 Request Entity Too Large" when uploading files to ClearML HOT 4
- legend titles broken in experiment comparison HOT 1
- Preview text files HOT 1
- Registering models from lightning not working (different than pytorch-lightning) HOT 2
- GPU monitoring failed getting GPU reading, switching off GPU monitoring HOT 6
- async variant of get_mutable_local_copy HOT 1
- Light theme for the dashboard HOT 1
- Plot comparison in a single figure not working for plots other than barplots HOT 3
- API calls fail for model with deleted parent task. HOT 1
- Scalar logging bug with Fire HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from clearml.