Comments (3)
Multi-node experiments setup usually depends on your local cluster setup. I would recommend following the PyTorch Lightning documentation at https://pytorch-lightning.readthedocs.io/en/stable/clouds/cluster.html. Otherwise, the current code is already setup for doing multi-node experiments. And would just require setting the environment variable NODES
to the correct number and setting --trainer.num_nodes
to the correct number.
from climax.
Thanks for the reply.
-
By "Multi-node", I was referring to frameworks such as Horovod or Colossal, but Pytorch DDP is alright, too.
-
I run into another problem with the conda install. Our cluster run on Centos 8.5, which only has upto "glibc-2.28". I followed the conda install instructions without any error. But there is a runtime error, "torchdata" is looking for "glibc-2.29".
Does Microsoft has any conda build or do you know any build channel that uses "glibc-2.28" for Pytorch products?
or, Can you suggest any workaround ?
I look forward to your reply.
from climax.
torchdata
is easy to expunge from the current code (although you could also just use docker instead). At the moment it's only being used to get a list of files: dp.iter.FileLister
. You can replace that with your own function that does the file listing. Something like:
def get_files_from_root(root):
if os.path.isfile(root):
path = root
yield path
else:
for path, dirs, files in os.walk(root):
files.sort()
for f in files:
yield os.path.join(path, f)
dirs.sort()
from climax.
Related Issues (20)
- Regional forecasting lat long boundaries HOT 4
- Fine tuning regional forecasts at a higher resolution HOT 2
- The pre-train code has a bug with the number of nodes HOT 5
- Cannot access pretrained weight HOT 5
- Possible bug in lr scheduler HOT 4
- Training ClimaX without pre-training HOT 3
- Questions regarding pre-training HOT 5
- Question about Using GlobalForecast Code with Pre-cropped Data HOT 2
- Pretrain dataset prepare and Out-of-memory problem HOT 1
- Would it be possible to kindly share the downscaling data? HOT 3
- How can I check for early stopping conditions in this code? HOT 1
- The replication issues with the downscaling task. HOT 7
- How to download the IFS data? HOT 1
- How to use trained ClimaX model for predictions? HOT 1
- What is the point of the hrs_each_step variable? HOT 1
- Required training time HOT 3
- If use Docker to build the image as introductions,the name should obey dns rules,and so the name of image must be lowercase? HOT 7
- Predict Range and hrs_each_step
- How to handle Nan values in training data? HOT 1
- How to view log files in tensorboard format? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from climax.