Coder Social home page Coder Social logo

Comments (3)

t-hishinuma avatar t-hishinuma commented on July 3, 2024 2

Thanks for the reply.

for the near term EU exascale computers

I'm sorry, If you need MPI communication, MPI support in monolish will be next year.

The first goal of monolish is to develop a complete numerical library for any hardware on one node.

Asynchronous communication is important, but we're not going to get around to it for now.
We look forward to your pull requests.

When you have an API design for asynchronous communication, please share it with us.

regards.

from monolish.

v1j4y avatar v1j4y commented on July 3, 2024 1

Thanks for the detailed comments.

I am quite interested in this project due to the lack of standard libraries which are hardware agnostic and are written in C/C++. We're also working on a similar project albeit for Qauantum Monte Carlo applications. Here the goal is also to have a backend agnostic QMC kernel library for use by physicists for development of new methods for the near term EU exascale computers.

In our efforts, we have realized asynchronous copy/compute is essential for optimal saturation of the GPUs. We're are currently developing interfaces (for Fortran and C/C++) which includes the async copy/compute features along with the ability to simultaneously use CPU/GPU for the given workload in order to fully saturate the compute node.

I can have a look at your interfaces for GPU based Linear Algebra functions in details and can eventually send a pull request so that we can perhaps discuss in details what I mean by a unified interface.

from monolish.

t-hishinuma avatar t-hishinuma commented on July 3, 2024

Hi, Thanks for looking at this library.

There are no plans to support asynchrony in the near future.
This is because we don't have an idea for a unified and intuitive interface.
If you have an idea for an interface, we would be glad to hear from you.

AMD support is only a matter of time.
We have chosen OpenMP Offloading to support AMD Radeon and Intel Xe.
However, in the current version, many functions call cuBLAS, cuSPARSE, or cuSOLVER.
We will support AMD Radeon in the near future, either by implementing Rocm calls, or by implementing Reference BLAS for devices using OpenMP Offloading.

Regards,
T.Hishinuma

from monolish.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.