
Comments (3)

lkdvos commented on June 16, 2024

Let me first elaborate a bit on the cuTENSOR side of things. As far as I understand, cuTENSOR allows for a dynamic way of changing the floating point accuracy, which is allowed to be of higher precision than the output array. The exact details are mostly mentioned in the docs page you linked, but here are some important notes:

  • the specified accuracy applies to intermediate results. For example, when doing matrix multiplication, you get expressions of the form a_11 * b_11 + a_12 * b_21 + ..., where the specified accuracy will be used for the multiplications and summations. This means that a_11 * b_11 will be computed at the specified accuracy, even though the final result may still involve some truncation if the output array is of lower accuracy (see the sketch just after this list).
  • the specified accuracy is a lower bound, i.e. if you specify Float32, cuTENSOR may decide to increase the accuracy to Float64 if all input/output arrays and scalars support this. However, it will never decrease it to Float16.
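
As a toy illustration of why the intermediate accuracy matters even when the output precision is fixed (plain Julia floating point arithmetic, not cuTENSOR itself): accumulating in Float64 and only truncating the final result to Float32 can preserve a contribution that a pure Float32 accumulation loses entirely.

julia> sum(Float32[1.0, 1f-8, -1.0])  # Float32 accumulation: 1f-8 is absorbed by 1.0f0
0.0f0

julia> Float32(sum(Float64[1.0, 1e-8, -1.0]))  # Float64 accumulation, truncated at the end
1.0f-8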

Over to the TensorOperations side of things: in principle we do not expose the additional option of increased intermediate accuracy when the output array is of lower accuracy. TensorOperations (not only for CuArrays) chooses the output type by promoting the types of all input arrays, and this promoted precision is also the guarantee we then ask of cuTENSOR.
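
As a minimal sketch of this promotion behaviour on plain CPU arrays (assuming TensorOperations v4 or later):

julia> using TensorOperations

julia> A = rand(Float32, 4, 4); B = rand(Float64, 4, 4);

julia> @tensor C[i, j] := A[i, k] * B[k, j];

julia> eltype(C)  # promoted from Float32 and Float64
Float64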

Finally, if you are using Julia, be careful with the conversion from Array to CuArray. cu(A) is defined to silently change the precision to Float32 from whatever it was before, as this is typically the most optimized precision for your GPU. This may not be what you want, in which case you should use CuArray(A) instead, which preserves the element type.
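
For instance (assuming CUDA.jl and a CUDA-capable device):

julia> using CUDA

julia> A = rand(Float64, 2, 2);

julia> eltype(cu(A))  # cu silently converts to Float32
Float32

julia> eltype(CuArray(A))  # CuArray preserves the element type
Float64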

Thus, answering your first question: cuTENSOR will in general not lower the floating point accuracy, but it may increase it if all input/output arrays and scalars support this. TensorOperations only asks cuTENSOR to guarantee the precision of the output array, which is determined by promoting the inputs.

For your second question, this is rather application-dependent. In generic cases, there really is no way of knowing whether a value like 1e-2 is a genuinely small result or a floating point artifact of something that should be zero. This is just inherent to working with floating point numbers. As a pathological example, the following holds:

julia> 1e-1 + 1e20 - 1e20 - 1e-1
-0.1

Nevertheless, when dealing with addition and multiplication only, this mostly occurs when your input floats are of vastly different scales. This is precisely what eps tells you: eps(x) is the spacing between x and the next representable float, so any contribution smaller than this is lost. If your use case puts some bounds on the scales of the input floats, you could exclude these pathological cases and decide that anything larger than some predefined limit is definitely not zero, and anything smaller than it is actually zero. Often people use sqrt(eps) or eps^(3/4) for this, but this is mostly phenomenological and you should experiment to see what works for you. However, unless you absolutely require this, you could just leave the values that are probably zero and continue, as these should only contribute about as much as the floating point errors you are already making to the final result.
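
As a concrete sketch of such a cutoff (using the phenomenological sqrt(eps) choice mentioned above; the threshold is something you should tune for your application):

julia> atol = sqrt(eps(Float32))
0.00034526698f0

julia> vals = Float32[1e-2, 1e-7, 1e-15];

julia> map(x -> abs(x) < atol ? zero(x) : x, vals)  # zero out anything below the cutoff
3-element Vector{Float32}:
 0.01
 0.0
 0.0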


Jutho commented on June 16, 2024

I don't understand the question very well, but if you are working with Float32, the expected precision is of order 1e-7. Check:

julia> eps(Float32)
1.1920929f-7

Hence, entries of the arrays that should be zero can easily end up being of order 1e-7 due to the finite numerical precision.


ejmeitz commented on June 16, 2024

Yeah, I understand the significant digits thing. I guess I have two questions:

  • Does the floating point accuracy ever change when calling cuTENSOR? There are some things on this docs page from CUDA that are unclear to me about the accuracy guarantees.
  • Not necessarily a TensorOperations-specific question, but is there a recommended way to differentiate between a value being zero and a value just being small when output by @tensor? I have values all the way from 1e-2 down to 1e-15. Some of those should definitely be zero but are not due to precision issues, while others should be non-zero and just small.

