Comments (6)
@RussTreadon-NOAA I think we saw something similar in the UFSWM here: ufs-community/ufs-weather-model#2015. The TL;DR was to add export I_MPI_EXTRA_FILESYSTEM=ON to the job card.
Can you give that a try to see if you still see that issue?
from global-workflow.
While this seems like a different issue, workflow is not yet supported on Hercules due to a Lustre issue with ln on Rocky 9. We've had a ticket open for quite a while now.
@BrianCurtis-NOAA, thank you for sharing your insight.
env/HERCULES.env currently contains
export I_MPI_EXTRA_FILESYSTEM=1
export I_MPI_EXTRA_FILESYSTEM_LIST=lustre
I replaced the first line above with
export I_MPI_EXTRA_FILESYSTEM=ON
The 202112200 18Z gdasfcst and enkfgdasfcst_mem002 still aborted with
24: file: module_write_netcdf.F90 line: 761 NetCDF: HDF error
24: Abort(1) on node 24 (rank 24 in comm 496): application called MPI_Abort(comm=0x84000002, 1) - process 24
In contrast, enkfgdasfcst_mem001 successfully ran to completion.
As a follow-on test, I removed
export I_MPI_EXTRA_FILESYSTEM_LIST=lustre
while retaining
export I_MPI_EXTRA_FILESYSTEM=ON
A rerun of gdasfcst still aborted with the NetCDF: HDF error. However, this time enkfgdasfcst_mem002 successfully ran to completion.
The seemingly random nature of this behavior is disturbing.
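Because the failures move between tasks from one rerun to the next, it may help to tally which task logs contain the error signature after each attempt. The following is only a minimal sketch: the function names, the flat directory of per-task logs, and the *.log naming are assumptions for illustration, not the actual global-workflow log layout.

```shell
#!/usr/bin/env bash
# Sketch only: the log directory layout and *.log naming are assumed,
# not taken from global-workflow.

# Print the number of log files in a directory that contain the HDF error.
count_hdf_failures() {
    local log_dir="$1"
    # -l lists each matching file once; -F treats the pattern as a fixed string.
    grep -lF "NetCDF: HDF error" "${log_dir}"/*.log 2>/dev/null | wc -l | tr -d ' '
}

# Print the names of the log files that contain the HDF error, sorted.
list_hdf_failures() {
    local log_dir="$1"
    grep -lF "NetCDF: HDF error" "${log_dir}"/*.log 2>/dev/null | sort
}
```

Running list_hdf_failures on the log directory after each rerun, and noting the I_MPI_EXTRA_FILESYSTEM settings in effect, gives a record of which tasks failed under which configuration.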
Thank you @WalterKolczynski-NOAA for letting me know that g-w does not support Hercules. It is unfortunate that we cannot reliably run cycled global parallels on Hercules. Have we elevated the ln and NetCDF: HDF error issues to management?
The ln issue has been elevated. The HDF error is new to me.
OK, we can keep this issue open for awareness. Hercules is not, at present, a viable option for running global parallels.