Comments (8)

josephzhang8 avatar josephzhang8 commented on August 29, 2024 1

from schism.

jamal919 avatar jamal919 commented on August 29, 2024

Interesting! Any idea how the model performs on desktop-grade AMD processors with GCC? To put it a different way, is the performance comparable between an Intel i7 and a Ryzen 5 processor? Thanks.

SorooshMani-NOAA avatar SorooshMani-NOAA commented on August 29, 2024

@josephzhang8, should setting UCX_UNIFIED_MODE=y at runtime fix the crash, or are there other things I need to change as well?

josephzhang8 avatar josephzhang8 commented on August 29, 2024

SorooshMani-NOAA avatar SorooshMani-NOAA commented on August 29, 2024

I see, thank you

SorooshMani-NOAA avatar SorooshMani-NOAA commented on August 29, 2024

I still see the same issue on the hpc6a platform with the

ulimit -s unlimited
export UCX_UNIFIED_MODE=y

environment. I get the following errors in my run logs. First, one of the following lines for each core:

MPI startup(): Warning: I_MPI_PMI_LIBRARY will be ignored since the hydra process manager was found

which I think is due to how the ParallelWorks environment is set up. Then, one of these for each core:

forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image              PC                Routine            Line        Source
pschism_PAHM_TVD-  00000000006F71DA  for__signal_handl     Unknown  Unknown
libpthread-2.17.s  00002AFBEEFD8630  Unknown               Unknown  Unknown
libshm-fi.so       00002AFCFA21A98A  Unknown               Unknown  Unknown
libshm-fi.so       00002AFCFA2078BE  Unknown               Unknown  Unknown
libshm-fi.so       00002AFCFA2026B9  Unknown               Unknown  Unknown
libshm-fi.so       00002AFCFA202F23  Unknown               Unknown  Unknown
libefa-fi.so       00002AFCFAA08E31  Unknown               Unknown  Unknown
libefa-fi.so       00002AFCFAA11945  Unknown               Unknown  Unknown
libefa-fi.so       00002AFCFAA077A9  Unknown               Unknown  Unknown
libefa-fi.so       00002AFCFAA07865  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDB26E84  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDE1117B  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDE18094  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDA0746A  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDA7BAF0  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDA6616B  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDA54748  MPI_Comm_dup          Unknown  Unknown
libmpifort.so.12.  00002AFBED4F260B  pmpi_comm_dup_        Unknown  Unknown
pschism_PAHM_TVD-  0000000000448D6E  Unknown               Unknown  Unknown
pschism_PAHM_TVD-  0000000000410794  Unknown               Unknown  Unknown
pschism_PAHM_TVD-  00000000004106A2  Unknown               Unknown  Unknown
libc-2.17.so       00002AFBEF207555  __libc_start_main     Unknown  Unknown
pschism_PAHM_TVD-  00000000004105A9  Unknown               Unknown  Unknown
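
A trace like the one above can be summarized by counting frames per image, which makes it easier to see that the fault appears to sit in the MPI library's transport layers (the libshm-fi.so and libefa-fi.so frames beneath MPI_Comm_dup) rather than in the model code itself. The following is a minimal sketch, assuming the whitespace-separated forrtl traceback layout shown above; frames_per_image and the shortened sample are hypothetical names used only for illustration and are not part of SCHISM or Intel MPI.

```python
from collections import Counter


def frames_per_image(trace: str) -> Counter:
    """Count traceback frames per image (executable or shared library)."""
    counts = Counter()
    for line in trace.splitlines():
        parts = line.split()
        # A frame line starts with an image name followed by a 16-digit hex PC.
        if len(parts) >= 2 and len(parts[1]) == 16:
            try:
                int(parts[1], 16)
            except ValueError:
                continue  # second column is not a program counter
            counts[parts[0]] += 1
    return counts


# Shortened sample taken from the trace above (illustration only).
sample = """\
Image              PC                Routine            Line        Source
libshm-fi.so       00002AFCFA21A98A  Unknown               Unknown  Unknown
libefa-fi.so       00002AFCFAA08E31  Unknown               Unknown  Unknown
libefa-fi.so       00002AFCFAA11945  Unknown               Unknown  Unknown
libmpi.so.12.0.0   00002AFBEDA54748  MPI_Comm_dup          Unknown  Unknown
"""

counts = frames_per_image(sample)
for image, n in counts.most_common():
    print(image, n)
```

Running the same function over the full per-core log would show whether every rank fails along the same path.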

josephzhang8 avatar josephzhang8 commented on August 29, 2024

josephzhang8 avatar josephzhang8 commented on August 29, 2024
