CUTLASS results incorrect (closed)

nvidia avatar nvidia commented on August 15, 2024
CUTLASS results incorrect

from cutlass.

Comments (10)

kerrmudgeon avatar kerrmudgeon commented on August 15, 2024

Yes! We have root caused this problem and are preparing a patch coming soon.

Thank you for being patient.

d4l3k avatar d4l3k commented on August 15, 2024

@kerrmudgeon Any thoughts/eta for a fix?

YSZhuoyang avatar YSZhuoyang commented on August 15, 2024

Same here.

Built successfully with CUTLASS v1.1, CUDA V10.0.130, and NVIDIA driver 410.66 on Ubuntu 18.04 with a GTX 960M GPU, but the tests failed (all of the failing tests appear to be sgemm... and dgemm...).

kerrmudgeon avatar kerrmudgeon commented on August 15, 2024

Unfortunately, we believe this problem affects all GPUs in the Maxwell architecture. We have not yet root caused it.

Pascal, Volta, and Turing are okay.

Will report back when we have updates.

d4l3k avatar d4l3k commented on August 15, 2024

@kerrmudgeon Any updates?

kerrmudgeon avatar kerrmudgeon commented on August 15, 2024

CUTLASS 1.2 mainly contributes optimizations for batched GEMM targeting Tensor Cores on Volta and Turing as well as optimizations for problems with small GEMM-M and GEMM-N dimensions.

We have not addressed the issue you and others have reported regarding CUTLASS on the Maxwell architecture, however. We are looking into it.

azazhu avatar azazhu commented on August 15, 2024

hi Andrew, any updates?

kerrmudgeon avatar kerrmudgeon commented on August 15, 2024

I have pushed a patch to the master branch. Can you verify that it resolves the correctness problem on Maxwell?

YSZhuoyang avatar YSZhuoyang commented on August 15, 2024

Tried it again in the same environment and all tests passed.

kerrmudgeon avatar kerrmudgeon commented on August 15, 2024

Good to hear.

I think this is safe to close. Feel free to re-open if you still have problems.
