Comments (11)
Heyo! I wanted to give you updates - I have a build in a container alongside flux, and it looks like the example is running OK!
and the tests passed I think:
The results look like this:
# cat RESULTS.dat
step = 0339, dt = 0.000702, |e| = 4.9695537349e+01
step = 1041, dt = 0.000121, |e| = 3.3909635545e+03
step = 1154, dt = 0.001655, |e| = 4.6303396053e+01
step = 0560, dt = 0.002449, |e| = 1.3408616722e+02
step = 0413, dt = 0.000470, |e| = 3.2012077410e+01
step = 2872, dt = 0.000064, |e| = 5.6547039096e+01
step = 0858, dt = 0.000474, |e| = 5.6691500623e+01
step = 0776, dt = 0.000045, |e| = 4.0982431726e+02
The automated build is happening now, and once I have the container in a registry it should be fairly trivial to test with the Flux Operator. I will update here again when that is done! In the meantime if you find any more complex workflows please let me know! We are selecting three to include in a paper on the Flux Operator and still in the learning / testing phase for various tools at the lab.
from laghos.
Sounds great, feel free to reopen if we can help.
from laghos.
Hi @vsoch,
Not sure, could you give an example of what you need?
from laghos.
It could be anything from running a few steps (or more generally a workflow with several components) to having a service like a database or task queue alongside. Basically - anything more complex than a simple, single command to submit!
from laghos.
I don't know of such workflows with Laghos
. Maybe @kmittal2 has such workflows, but with Remhos
, related to his machine learning explorations.
from laghos.
Thanks - so there is no small public collection of workflows using this tool?
from laghos.
Actually there's the autotest workflow, see here.
from laghos.
This should be good to get me started! And if @kmittal2 has something more complex to share that would be great. I’ll do some work this weekend and update you here. Thank you and happy Friday!
from laghos.
Hi @vsoch, unfortunately I do not have anything more complex to share.
from laghos.
@vsoch I'm closing this for now, let us know if we can help more.
from laghos.
okay done! I wound up taking out the command from make tests
and giving it directly to flux, and that seemed to go faster. I was also very happy to see that command, because it looks like there are some parameters we could tune to test this at different scales.
We will have a minicluster.yaml example with laghos merged a bit later tonight.
And I really appreciate how easy this was to get working - going through all the actions that you use to install mfem/hypre/etc. was probably the hardest part - a container build that shows them all in one place would do wonders! If you ever need to point someone at a container, this build likely isn't perfect, but it got the job done (as shown above). It starts on top of a flux base built with conda (which I've done since a lot of projects like to do ML stuffs), which likely if you wanted to roll a custom build wouldn't be necessary (it makes the image quite a bit larger!) For keeping a record in this issue, here is the directory with the Dockerfile: https://github.com/rse-ops/flux-hpc/tree/main/laghos-demos. We also have an automated build, here: https://github.com/rse-ops/flux-hpc/pkgs/container/laghos-demos. The demo for the Flux Operator is just a yaml file that uses that image and defines the entrypoint, etc (as shown in the picture above).
Thank you again! We will definitely ping you if we have any more questions! And please ping me for any simulations / workflows / something the lab is interested in that could be run in the Flux Operator, even if they aren't containerized. I'm pretty good at that bit, and I'm trying to port / test as many things as I can get my hands on. 😆
from laghos.
Related Issues (20)
- NC or NURBS meshes HOT 3
- Definition of Q1D HOT 2
- R-Z coordinates with partial assembly HOT 1
- blast wave propagation HOT 2
- Other Equation of State in Laghos HOT 11
- Scaling Laghos / Picking number of processes/tasks HOT 1
- "spack install laghos" on LLNL quartz with gcc/8.3.1 is failing HOT 2
- Multi GPU runs HOT 4
- Excessive device memory wastage HOT 4
- Failing tests with Hypre-cuda and full assembly HOT 2
- Building Laghos on Crusher HOT 9
- GPU run of verification result #5 does not match values in table HOT 2
- Question about Non-AMR and AMR version of Laghos
- multi-material implementation HOT 8
- 'laghos' cores when run against 'box01_hex.mesh' with perfect cube of ranks HOT 2
- CUDA-awareness question HOT 5
- Question about performance (mainly cuda code) HOT 52
- L2 space for the energy variable discretization HOT 1
- ResetTimeStepEstimate logic HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from laghos.