Comments (2)
That totally depends on your exact architecture. You will have to play around with different permutations to find the "best" version.
from hpcg.
HPCG code is meant to run with one or many OpenMP threads. At the same time, the reference implementation is meant for using one or many MPI ranks. And the reference code is meant to build on systems with no OpenMP nor MPI for maximum flexibility of understanding the code base and running basic tests. It is hard to recommend a particular number of threads and ranks for a particular system without knowing more details about the distribution of the threads (per cache level, per-socket, or per NUMA island). The MPI ranks could be distributed across a single node and thus might conflict with the thread counts. The use of hardware accelerators such as GPUs complicates this picture even harder. So please refrain from recommending a particular configuration to be applicable in general for many systems.
from hpcg.
Related Issues (20)
- compile error HOT 2
- Unit tests in `unittesting` directory fail to compile HOT 2
- I get a problem in the build step HOT 1
- If the --rt parameter is read from file, it does not get used HOT 1
- There exits code bug in graph multicoloring in OptimizeProblem.cpp
- Matlab example HOT 1
- HPCG Memory Output HOT 4
- Volta-enabled HPCG compilation HOT 1
- Undefined data attribute in parallel region with default(none) HOT 2
- HPCG Cuda Binary with MPI support not working properly for multiple hosts HOT 1
- HPCG crash when nx=440 ny=440 nz=424 HOT 4
- Does having warnings invalidate the benchmark results ? HOT 1
- Visualizing computational results ? HOT 1
- Formula to derive HPCG problem size - that fits in system memory HOT 1
- Number of Smoother Steps HOT 1
- HPCG Cuda Binary with multiple GPUs? HOT 5
- ComputeResidual.cpp:60:5: error: variable 'n' must have explicitly specified data sharing attributes HOT 4
- Loop upper bound implicitly shared by GCC causes error in ComputerResidual.cpp:60
- Hello. How to generate trace for HPCG? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hpcg.