mhaut / angulargrad Goto Github PK
View Code? Open in Web Editor NEWPytorch implementation of AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks
License: GNU General Public License v3.0
Pytorch implementation of AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks
License: GNU General Public License v3.0
Hi, I'm a machine learning engineer working in medical data company in south korea.
I found your paper and thought that it was state-of-the-art. I was totally impressed by your brilliant idea, so I wanted to make them available in Tensorflow. Now, I'm implementing it as tensorflow optimizer. In order to do so, I had to fully understand and analyze your codes. While reading codes, a question came to mind during the analysis and I decided to post an issue.
Whether its type is boolean or tensor is not important. ans1[0] == "False"
cannot be True and always be False. This consequence makes codes below not approchable.
except:
if (ans1[0] == "False"):
min = angle
diff = abs(previous_grad - grad)
final_cos_theta = cos_theta.clone()
I'm not familiar with Pytorch. There could be possibility that I misunderstood the codes so i would appreciated it if you could explain this codes.
I am trying to reproduce the results of this work, and I encountered a methodological issue regarding angles computation.
The tangent of the angle between consecutive gradients is here computed as abs((previous_grad - grad) / (1 + previous_grad * grad))
:
AngularGrad/myoptims/tanangulargrad.py
Line 80 in cd8cbec
However, the result of this operation is not a scalar, but a vector with the same shape as grad
. What is the interpretation of such quantity? The paper states that this is the tangent of the angle between consecutive gradients, but it does not seem to be the case.
Thank you in advance!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.