Comments (4)
Interesting, I will check on Monday morning and see if I can reproduce this.
from fyrd.
Hi @jbloom, I am really sorry this took so long. I have been very busy at work and just didn't have time to work on this at all.
I can't reproduce this on any of my machines, so it is likely specific to your system, but I would still like to figure it out. Could you run the same test again and then use sacct
to see whether the job entered the queue? It has been given a job number, so my guess is that one of three things is happening:
- There is a lag in the queue being updated on your machine, so squeue and sacct are not reporting up-to-date information when you run job.wait().
- The queue is being cleared very quickly after job completion.
- squeue/sacct are returning slightly different information on your system and it isn't being parsed correctly.
So, could you try again and let me know whether the jobs make it into the queue at all when this happens?
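If it helps, the sacct check could be scripted roughly like this. This is only a sketch, not how fyrd itself queries the queue: the helper names `parse_sacct` and `job_states` are made up for illustration, and it assumes `sacct` is on your PATH and that accounting is enabled on your cluster.

```python
import subprocess


def parse_sacct(output):
    """Map JobID -> State from `sacct --parsable2 --noheader` output.

    Each line is expected to look like `12345|COMPLETED`; blank or
    malformed lines are skipped.
    """
    states = {}
    for line in output.splitlines():
        fields = line.strip().split("|")
        if len(fields) >= 2 and fields[0]:
            states[fields[0]] = fields[1]
    return states


def job_states(job_id):
    """Ask sacct about one job (hypothetical helper, not part of fyrd).

    Returns an empty dict if sacct reports nothing for the job.
    """
    result = subprocess.run(
        ["sacct", "-j", str(job_id), "--format=JobID,State",
         "--noheader", "--parsable2"],
        capture_output=True, text=True, check=False,
    )
    return parse_sacct(result.stdout)
```

Calling `job_states(<your job number>)` right after `job.wait()` fails, and again a few seconds later, should show whether the job is missing entirely or just slow to appear.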
Thanks!
@jbloom: I added a workaround that should address the first possibility (queue lag) by making more attempts to query the queue before failing. I also made the Slurm queue parsing more robust, so there is a good chance that the combined changes solve this issue for you. Please let me know.
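To illustrate the idea behind the workaround, here is a minimal sketch of retrying a queue lookup before giving up. It is not the actual fyrd code: `query_with_retries`, the `query` callable, and the default attempt count and delay are all assumptions chosen for the example.

```python
import time


def query_with_retries(query, job_id, attempts=5, delay=1.0, sleep=time.sleep):
    """Call `query(job_id)` until it returns a state or attempts run out.

    `query` is any callable returning the job state, or None when the
    job is not (yet) visible in the queue. The wait doubles after each
    miss so a lagging scheduler gets progressively more time.
    """
    for attempt in range(attempts):
        state = query(job_id)
        if state is not None:
            return state
        if attempt < attempts - 1:  # no point sleeping after the last try
            sleep(delay)
            delay *= 2
    return None
```

Only if every attempt comes back empty is the job treated as missing, which should cover the case where squeue and sacct lag behind submission.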
Closing now as I think this issue is fixed; let me know if it isn't.