Op/optimization TODO: log determinant about theano HOT 22 OPEN

theano commented on June 22, 2024

Op/optimization TODO: log determinant

from theano.

Comments (22)

jaberg commented on June 22, 2024 1

Related: there is an optimization in linalg for the log det of PSD
matrices, it it the sum of the log of the diagonal of the cholesky
decomposition or something. I think in the case of PSD matrices that's
the best way to do it, it's better than slogdet.

On Mon, Oct 24, 2011 at 11:46 AM, David Warde-Farley
[email protected]
wrote:

Now that we have a determinant Op we should probably look into this.

There are numerically stable ways to compute the natural logarithm of the determinant, a quantity which is often needed in evaluating the log probability of a multivariate Gaussian distribution. NumPy provides this as numpy.slogdet for "sign log determinant" (it computes log(abs(det(X))) and also returns the sign of the determinant).

The derivative of the log determinant also has a particularly simple form as inv(X).T == inv(X.T).

Reply to this email directly or view it on GitHub:
#150

from theano.

nouiz commented on June 22, 2024

David, can you be more explicit in what is needed? A new op that call numpy.slogdet and an optimization that replace log(abs(det(X)) by it? We need to do some speed comparison before enabling it by default.

from theano.

jaberg commented on June 22, 2024

This would be a stability optimization, the speed shouldn't prevent
its application.

On Mon, Oct 24, 2011 at 11:57 AM, nouiz
[email protected]
wrote:

David, can you be more explicit in what is needed? A new op that call numpy.slogdet and an optimization that replace log(abs(det(X)) by it? We need to do some speed comparison before enabling it by default.

Reply to this email directly or view it on GitHub:
#150 (comment)

from theano.

nouiz commented on June 22, 2024

I would be interested to know how much slower it is. Having a stability optimization that is 10 times slower is not the best. In that case, we should document it well and tell people that don't want to how to disable it.

from theano.

dwf commented on June 22, 2024

We should be able to replace log(det(X)) with an slogdet Op as well, and just raise an error if the sign is negative (I guess this would be accomplished by an allow_negative keyword argument, the would default to False if log(det(X)) is replaced, and True if log(abs(det(X))) is replaced.

from theano.

dwf commented on June 22, 2024

@nouiz: Newer versions of NumPy actually call slogdet when you call det, so I don't think it's much of a speed hit, if at all. In either case you are doing an LU factorization.

We have an old version of NumPy at the lab which does not include slogdet, so I looked at the source code and indeed it calls the same LAPACK routine, dgetrf (zgetrf for complex) as slogdet in the newer version.

I actually just did a comparison of the old det() implementation and slogdet:

In [28]: timeit linalg.slogdet(X)
100 loops, best of 3: 15.1 ms per loop

In [29]: timeit det(X)  # old implementation, copied from NumPy 1.4.1
100 loops, best of 3: 15.4 ms per loop

from theano.

dwf commented on June 22, 2024

@jaberg That raises the interesting issue of how to know that certain intermediate results are positive-definite matrices, and how to mark them as such.

Seems like something that the user would need to specify in most cases, like x = posdef(tensor.matrix('x')) would mark in the graph somehow to assume that x is positive definite (either by wrapping in an Op that just computes the identity function, or by setting a property on the Variable object.

Any thoughts on the right way to do this?

from theano.

nouiz commented on June 22, 2024

So for the speed, it seam good.

James implemented an Hint Env feature to allow the user to give hint to the compiler. He used it for the other optimization he talked about. This approch seam good for me, but it can take much time to implement correctly all case correctly. At least that is what took time with the ShapeFeature.

from theano.

jaberg commented on June 22, 2024

On Mon, Oct 24, 2011 at 2:29 PM, David Warde-Farley
[email protected]
wrote:

@jaberg That raises the interesting issue of how to know that certain intermediate results are positive-definite matrices, and how to mark them as such.

Seems like something that the user would need to specify in most cases, like x = posdef(tensor.matrix('x')) would mark in the graph somehow to assume that x is positive definite (either by wrapping in an Op that just computes the identity function, or by setting a property on the Variable object.

Any thoughts on the right way to do this?

Check out how it's done in sandbox.linalg. there is a function
PSD_hint(x) that does exactly that.

James

from theano.

harpone commented on June 22, 2024

Just out of curiosity, why hasn't this been done yet? I haven't coded any Theano ops, but I could give it a shot... maybe just a log abs det, since that's probably the most common case (in MLE estimation etc)?

FYI I'm hitting the stability issue pretty badly...

from theano.

lamblin commented on June 22, 2024

I'm not sure why this has not been done yet, or maybe some people ended up implementing that in their own repositories.
If you want to give it a shot, it would be welcome. There should be a section on how to extend theano in the online doc. As long as you use python/numpy for the implementation and do not need a GPU version, it should be quite straightforward.
Please let us know if you have any issues or questions, either in a pull request or at theano-dev.

from theano.

harpone commented on June 22, 2024

OK I'll give it a try!

On Wed, Jan 27, 2016, 22:39 Pascal Lamblin [email protected] wrote:

I'm not sure why this has not been done yet, or maybe some people ended up
implementing that in their own repositories.
If you want to give it a shot, it would be welcome. There should be a
section on how to extend theano in the online doc. As long as you use
python/numpy for the implementation and do not need a GPU version, it
should be quite straightforward.
Please let us know if you have any issues or questions, either in a pull
request or at theano-dev.

—
Reply to this email directly or view it on GitHub
#150 (comment).

from theano.

harpone commented on June 22, 2024

Found a couple of implementations on Github. This one does an SVD first and takes the log of the diagonal (not sure why it's diagonal squared... need to check the math; also has a funny expression for the gradient... it's correct but computationally expensive).

This one uses numpy's slogdet.

Any preferences? SVD on GPU should be a simple copy/paste job I think, but I haven't found any CUDA/PyCUDA/gnumpy code for the slogdet...

from theano.

nouiz commented on June 22, 2024

You could check inside numpy code how slogdet is implemented. Maybe it does
it with an svd? Can you compate the output of both op to see if the result
is the same?

There is svd code on the GPU somewhere, so with that, it wouldn't be too
hard to make a GPU implementation

On Thu, Jan 28, 2016 at 3:25 AM, Heikki Arponen [email protected]
wrote:

Found a couple of implementations on Github. This one
https://github.com/FlorianSeidel/GOL/blob/9f7716e78a78fbdd7ae8ecac5a58b701f6a4da41/gol/goal/constraints.py
does an SVD first and takes the log of the diagonal (not sure why it's
diagonal squared... need to check the math; also has a funny expression for
the gradient... it's correct but computationally expensive).

This one
https://github.com/kolia/subunits/blob/05f40cbc45214d80567b132c95f16913ca2a0cc7/kolia_theano.py#L12
uses numpy's slogdet.

Any preferences? SVD on GPU should be a simple copy/paste job I think, but
I haven't found any CUDA/PyCUDA/gnumpy code for the slogdet...

—
Reply to this email directly or view it on GitHub
#150 (comment).

from theano.

harpone commented on June 22, 2024

It calls something in lapack routines in numpy/numpy/linalg/umath_linalg.c.src, which says in docstring that it "computes logdet from factored diagonal" so I guess it is SVD (or really an eigendecomposition since it's for square matrices)...

The square in the svd code was wrong as I suspected, but fixing that, the SVD and slogdet versions give slightly different results! Difference of about 1E-5 for 100*100 random normal matrices about 80% of time and zero difference 20% of time... happens for pure numpy/scipy versions too. Maybe there's some numerical stability issues in first doing the svd, then taking log and then sum??

I'll post the code below if anyone wants to take a look. I'm actually taking log abs det, because why take a log of a negative number...

So I'm guessing the slogdet version is numerically more accurate, which raises the question of how to do the SVD version accurately on a GPU... dammit!

PS. This will (hopefully) be my first ever PR, so I need some time to figure out how to code the tests, where to put them etc...

Numpy slogdet:

from theano.gof import Op, Apply
import numpy

matrix_inverse = T.nlinalg.MatrixInverse()

class LogAbsDet(Op):

    def make_node(self, x):
        x = theano.tensor.as_tensor_variable(x)
        o = theano.tensor.scalar(dtype=x.dtype)
        return Apply(self, [x], [o])

    def perform(self, node, (x,), (z,)):
        try:
            _, logabsdet = numpy.linalg.slogdet(x)
            z[0] = numpy.asarray(logabsdet, dtype=x.dtype)
        except Exception:
            print('Failed to compute determinant of {}.'.format(x))
            raise

    def grad(self, inputs, g_outputs):
        gz, = g_outputs
        x, = inputs
        return [gz * matrix_inverse(x).T]

    def __str__(self):
        return "LogAbsDet"

logabsdet = LogAbsDet()`

The SVD is the same, except now there's this inside the perform:

s = svd(x, compute_uv=False)
z[0] = np.asarray(np.sum(np.log(s)), dtype=x.dtype)

from theano.

harpone commented on June 22, 2024

For float64 the difference is 1E-14:

from numpy.linalg import slogdet
from numpy.linalg import svd

n = 100
testmat = np.random.randn(n, n)

_, logabsdet_numpy_test_slogdet = slogdet(testmat)
logabsdet_numpy_test_svd = np.sum(np.log(svd(testmat, compute_uv=False)))

logabsdet_numpy_test_slogdet - logabsdet_numpy_test_svd

For float32 the difference is 1E-5!

from numpy.linalg import slogdet
from numpy.linalg import svd

n = 100
testmat = np.random.randn(n, n).astype(np.float32)

_, logabsdet_numpy_test_slogdet = slogdet(testmat)
logabsdet_numpy_test_svd = np.sum(np.log(svd(testmat, compute_uv=False)))

logabsdet_numpy_test_slogdet - logabsdet_numpy_test_svd

Anyone know the reason for this?

EDIT: craaaap something wrong with my numpy/scipy installation... can anyone test if you get the same errors? Probably not... need to reinstall/recompile numpy and scipy I guess...
EDIT2: the difference is definitely still there after fixing numpy!!

from theano.

harpone commented on June 22, 2024

Given that the logdet values for the examples above are of order 1E+2, the relative error is around 1E-7...1E-8 for float32... maybe that's acceptable?

I think I should go with the direct SVD method then for the 'perform' method, if the GPU version is going to be also SVD...

from theano.

lamblin commented on June 22, 2024

Yes, this kind of differences is really in the normal range for float32, these are even quite low.

from theano.

harpone commented on June 22, 2024

OK did a PR here: #3954

By the way, SVD on GPU seems to be useful only when matrix dims go above 1000 * 1000, so maybe the GPU implementation is not that important yet? Or will there be some overhead from loading parameters back and forth between GPU memory and RAM??

This image is from this paper (not very fresh though...):

from theano.

harpone commented on June 22, 2024

New PR due to me being a n00b: #3959

from theano.

nouiz commented on June 22, 2024

The transfer to/from the GPU/CPU is to be avoided as much as possible. So
even if the op isn't faster, just for that reason, it can be useful to add
it.

On Sun, Jan 31, 2016 at 3:45 PM, Heikki Arponen [email protected]
wrote:

New PR due to me being a n00b: #3959
#3959

—
Reply to this email directly or view it on GitHub
#150 (comment).

from theano.

harpone commented on June 22, 2024

OK got it

from theano.

Op/optimization TODO: log determinant about theano HOT 22 OPEN

Comments (22)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent