Comments (9)
Hey @denisbertini
You might be able to use the file-based mode that veloc offers. You call VELOC_Route_file
with the file path that you would normally write to and instead use the passed-back file path. Veloc then handles the files and moves them around / provides fault tolerance.
from veloc.
Hi @gonsie
Thanks a lot, i will try to use VeloC to leverage the checkpointing IO on our plasma physics simulation.
This is using extensively MPI-IO, i hope to make it runnuing. Do you know examples already that i can look at ?
Thanks
Denis
from veloc.
They only examples I know of are provided in the documentation (at the bottom of the page).
from veloc.
These example are using fwrite/fread but i suppose one can substitute the corresponding MPI_WRITE/MPI_READ functions in my case ...
from veloc.
Is there a particular reason why you need MPI-IO? It should be easier for you to protect the memory regions directly instead of writing them into a file using MPI-IO.
from veloc.
Well the program that i use is already using MPI-IO to do chekpointing and to dump the data files.
Is there some limitation in this case ? I mean combining MPI IO and VeloC ?
from veloc.
When you say "protecting memory region directly" you mean what is done in the heatdis_mem example ?
from veloc.
Yes, this is what I mean. If you are using MPI-IO, then you are already writing to a parallel file system so there is no point in using VELOC. The idea of using VELOC is to checkpoint asynchronously and avoid paying for expensive I/O (which you do if you wait for MPI-IO to finish).
from veloc.
Also, please note that we have a mailing list you can subscribe to: [email protected].
This is the right place to discuss such considerations. We use github issues primarily for bug reports.
from veloc.
Related Issues (20)
- restart-in-place: copy cray aprun variant from scr HOT 2
- Down node detection on LSF has wrong node count HOT 1
- SLURM restart-in-place script hangs when forcing prolog on down node HOT 1
- SLURM restart-in-place script double counts down node HOT 1
- Build VELOC as a static library HOT 1
- VELOC install 64 bit libraries in /usr/lib instead of /usr/lib64 HOT 1
- Node down, VeloC XOR restart on the new allocated node HOT 5
- error using test/heatdis example HOT 7
- Fortran 90 bindingd to VeloC? HOT 2
- Alternative to OpenSSL for md5 HOT 2
- Program not finishing in async mode HOT 19
- Build fails at linking with undefined reference to `kvtree_xxx` on Cori (NERSC) HOT 1
- can't build with AXL 4.0.0 HOT 4
- Unable to run the example program HOT 2
- example: function call within assert HOT 2
- Interop with GPU compute kernels HOT 5
- Use MPI_Exscan to compute offsets?
- Component releases for Veloc v1.7 HOT 1
- MPI_Comm_split with uninitialized key value? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from veloc.