Coder Social home page Coder Social logo

Laghos in a workflow? about laghos HOT 11 CLOSED

ceed avatar ceed commented on June 11, 2024
Laghos in a workflow?

from laghos.

Comments (11)

vsoch avatar vsoch commented on June 11, 2024 1

Heyo! I wanted to give you updates - I have a build in a container alongside flux, and it looks like the example is running OK!

image

and the tests passed I think:

image

The results look like this:

# cat RESULTS.dat 
step = 0339, dt = 0.000702, |e| = 4.9695537349e+01
step = 1041, dt = 0.000121, |e| = 3.3909635545e+03
step = 1154, dt = 0.001655, |e| = 4.6303396053e+01
step = 0560, dt = 0.002449, |e| = 1.3408616722e+02
step = 0413, dt = 0.000470, |e| = 3.2012077410e+01
step = 2872, dt = 0.000064, |e| = 5.6547039096e+01
step = 0858, dt = 0.000474, |e| = 5.6691500623e+01
step = 0776, dt = 0.000045, |e| = 4.0982431726e+02

The automated build is happening now, and once I have the container in a registry it should be fairly trivial to test with the Flux Operator. I will update here again when that is done! In the meantime if you find any more complex workflows please let me know! We are selecting three to include in a paper on the Flux Operator and still in the learning / testing phase for various tools at the lab.

from laghos.

vladotomov avatar vladotomov commented on June 11, 2024 1

Sounds great, feel free to reopen if we can help.

from laghos.

vladotomov avatar vladotomov commented on June 11, 2024

Hi @vsoch,

Not sure, could you give an example of what you need?

from laghos.

vsoch avatar vsoch commented on June 11, 2024

It could be anything from running a few steps (or more generally a workflow with several components) to having a service like a database or task queue alongside. Basically - anything more complex than a simple, single command to submit!

from laghos.

vladotomov avatar vladotomov commented on June 11, 2024

I don't know of such workflows with Laghos. Maybe @kmittal2 has such workflows, but with Remhos, related to his machine learning explorations.

from laghos.

vsoch avatar vsoch commented on June 11, 2024

Thanks - so there is no small public collection of workflows using this tool?

from laghos.

vladotomov avatar vladotomov commented on June 11, 2024

Actually there's the autotest workflow, see here.

from laghos.

vsoch avatar vsoch commented on June 11, 2024

This should be good to get me started! And if @kmittal2 has something more complex to share that would be great. I’ll do some work this weekend and update you here. Thank you and happy Friday!

from laghos.

kmittal2 avatar kmittal2 commented on June 11, 2024

Hi @vsoch, unfortunately I do not have anything more complex to share.

from laghos.

vladotomov avatar vladotomov commented on June 11, 2024

@vsoch I'm closing this for now, let us know if we can help more.

from laghos.

vsoch avatar vsoch commented on June 11, 2024

okay done! I wound up taking out the command from make tests and giving it directly to flux, and that seemed to go faster. I was also very happy to see that command, because it looks like there are some parameters we could tune to test this at different scales.

image

We will have a minicluster.yaml example with laghos merged a bit later tonight.

And I really appreciate how easy this was to get working - going through all the actions that you use to install mfem/hypre/etc. was probably the hardest part - a container build that shows them all in one place would do wonders! If you ever need to point someone at a container, this build likely isn't perfect, but it got the job done (as shown above). It starts on top of a flux base built with conda (which I've done since a lot of projects like to do ML stuffs), which likely if you wanted to roll a custom build wouldn't be necessary (it makes the image quite a bit larger!) For keeping a record in this issue, here is the directory with the Dockerfile: https://github.com/rse-ops/flux-hpc/tree/main/laghos-demos. We also have an automated build, here: https://github.com/rse-ops/flux-hpc/pkgs/container/laghos-demos. The demo for the Flux Operator is just a yaml file that uses that image and defines the entrypoint, etc (as shown in the picture above).

Thank you again! We will definitely ping you if we have any more questions! And please ping me for any simulations / workflows / something the lab is interested in that could be run in the Flux Operator, even if they aren't containerized. I'm pretty good at that bit, and I'm trying to port / test as many things as I can get my hands on. 😆

from laghos.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.