Comments (5)
Triage Notes:
I was able to reproduce the issue on tensorflow v2.15, v2.16 and tf-nightly. Kindly find the gist of it here.
The tf.custom_gradients are working as expected when z = bar(x, y) & z = bar(x=x, y=y) and it was failing with z = bar(x, y=y) by throwing the error.
from tensorflow.
@tilakrayal when running your gist, the gradient is NOT as expected when z = bar(x=x, y=y). the expected output should be the custom gradient (which is [[-2000], [-4000], [-6000]] for dz/dx), given the function is wrapped in tf.custom_gradient, but instead returns the gradient produced by autograd (which is [[-0.66], [-1.33], [-2.]]).
The issue is that the custom gradient is silently ignored when using keyword arguments for every parameter.
from tensorflow.
The decorator @tf.custom_gradient
uses the input to deduce which arguments are differentiable inputs, and which arguments are non-differentiable parameters to the function. Positional arguments are considered differentiable, keyword arguments non-differentiable.
When passing all keyword arguments, the custom gradient is silently ignored and defaults to the autograd.
It thinks your function has no differentiable inputs, so simply forwards the upstream gradient and continues the computation.
When passing a mix of positional and keyword arguments, an exception is thrown.
Because you're returning multiple gradients when only the one non-keyword input is considered a differentiable input.
As for what to do: we will not change the behavior of the custom_gradient
decorator. You can update the documentation with a PR if you like to reflect this behavior.
from tensorflow.
This issue is stale because it has been open for 7 days with no activity. It will be closed if no further activity occurs. Thank you.
from tensorflow.
Are you satisfied with the resolution of your issue?
Yes
No
from tensorflow.
Related Issues (20)
- TFLiteConverter produces model that doesn't conform to GPUv2 (TfLiteGpuDelegate Init: FULLY_CONNECTED: Amount of input channels should match weights width)
- TextVectorization does not convert Cyrillic characters to lowercase HOT 1
- 2.12.0: memory leak in TFLite's tflite::Interpreter::Invoke() HOT 2
- GPUv2 numerical inaccuracy in simple Add + Mul
- tensorflow/tsl/cuda/cudart_stub.cc:28] Could not find cuda drivers on your machine, GPU will not be used. HOT 1
- Aborted (core dumped) with `tf.raw_ops.CombinedNonMaxSuppression` HOT 1
- Aborted (core dumped) with `tf.raw_ops.Dilation2DBackpropFilter`
- Aborted (core dumped) with `tf.raw_ops.FakeQuantWithMinMaxVarsPerChannelGradient` HOT 1
- Segmentation fault (core dumped) in `tf.raw_ops.FractionalMaxPoolGrad` HOT 1
- Aborted (core dumped) with `tf.raw_ops.LRNGrad` HOT 1
- Aborted (core dumped) with `tf.raw_ops.LSTMBlockCell`
- Aborted (core dumped) with `tf.raw_ops.LSTMBlockCellGrad` HOT 1
- Check fail in `tf.raw_ops.MaxPoolGradWithArgmax` HOT 1
- Aborted (core dumped) in `tf.raw_ops.NearestNeighbors`
- Aborted (core dumped) in `tf.raw_ops.SparseBincount`
- Aborted (core dumped) in `TensorScatterOp` HOT 1
- Buffer size mismatch in tensorflow/lite/kernels/stablehlo_pad.cc
- libtensorflow-cpu-windows-x86_64-2.15.0
- Dataset sharding warning
- libhexagon_interface.so for non Android - eLinux platform
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tensorflow.