Coder Social home page Coder Social logo

Fails with Slurm 18.08.8 about slurm-drmaa HOT 5 CLOSED

natefoo avatar natefoo commented on May 26, 2024
Fails with Slurm 18.08.8

from slurm-drmaa.

Comments (5)

natefoo avatar natefoo commented on May 26, 2024

I've just tried reproducing this with 18.08.8 and it worked for me. Can you include the debug log leading up to the exception?

from slurm-drmaa.

natefoo avatar natefoo commented on May 26, 2024

Nevermind, I see I have a bit more in email from you:

d #2af1 [     0.00]  * # Setting defaults for tasks and processors
d #2af1 [     0.00]  * # Native specification: -A pscstaff -p RM-small
t #2af1 [     0.00] -> slurmdrmaa_parse_native
d #2af1 [     0.00]  * # account = pscstaff
d #2af1 [     0.00]  * # partition = RM-small
d #2af1 [     0.00]  * finalizing job constraints
d #2af1 [     0.00]  * set min_cpus to ntasks: 1
t #2af1 [     0.00] <- slurmdrmaa_parse_native
E #2af1 [     0.77]  * fsd_exc_new(1001,slurm_submit_batch_job error (-1): Unspecified error,1)
t #2af1 [     0.77] -> slurmdrmaa_free_job_desc
t #2af1 [     0.77] <- slurmdrmaa_free_job_desc
t #2af1 [     0.77] <- drmaa_run_job=1: slurm_submit_batch_job error (-1): Unspecified error
F #2af1 [     0.77]  * Failed to submit a job: slurm_submit_batch_job error (-1): Unspecified error

This could be an issue with the native spec, I'll have a look at that.

from slurm-drmaa.

pdblood avatar pdblood commented on May 26, 2024

It turns out this error was being caused by a configuration issue that requires a job name be specified. With jobs submitted via sbatch, the name of the script was used when no job name was specified. Once the admin changed job_script.lua to handle nil values for job name, the tests with drmaa-run started working with Slurm 18 08.8. This did not fix my related issue with submitting jobs from Galaxy using slurm-drmaa, but drmaa-run now works as expected with Slurm 18.08.8.

from slurm-drmaa.

pdblood avatar pdblood commented on May 26, 2024

Closing this issue since this failure appears to have been due to a specific configuration detail in job_script.lua on the system running Slurm 18.08.8 that was different from the system I tested with Slurm 18.08.7, leading me to believe that there was an incompatibility with Slurm 18.08.8. After further testing, with drmaa-run, Slurm 18.08.8 appears to work as expected.

from slurm-drmaa.

natefoo avatar natefoo commented on May 26, 2024

Thanks for the update, I'd tried with Python drmaa and couldn't get it to fail, it's good to know what the issue was.

from slurm-drmaa.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.