Comments (1)
The one mentioned is in the task-adaptation (inner loop) step, where gradient descent is used to optimize the task-specific parameter (denoted as q
in the code). Hence, we have to calculate the gradient w.r.t. q
. The first order or higher order is at the argument retain_graph
or create_graph
. According to the document of PyTorch, retain_graph
keeps the graph used to compute the grad, while create_graph
will construct the graph of the derivative. In the case of create_graph
, higher order derivative will be calculated when a backward()
is subsequently called.
I think I know what went wrong here. This is a bug caused by argparse
, where boolean
variables are always set to True
no matter what. Therefore, when you change the first_order
to False
, it would still be the same. I corrected it in the new code. Please make a git pull
to get the latest update. You can give higher order a try by setting the first_order
as False. Note that this will lead to a higher amount of memory and longer computation time.
from few_shot_meta_learning.
Related Issues (15)
- Some questions about this code. HOT 1
- Loss is NaN in PLATIPUS HOT 2
- Platipus loss function potentially doesn't match paper HOT 2
- Question about the implementation of VAMPIRE HOT 4
- test in Platius model HOT 2
- NaN loss when training with sine HOT 4
- error in Platipus model with sineline data source
- Models not training HOT 4
- Potential Problem of the loss function in ABML HOT 2
- Loss function for implementation of BMAML HOT 2
- Question about the initialization of theta0 in abml HOT 4
- getting NaN's in ABML at about epoch 14 HOT 4
- Consultation about the code HOT 1
- Regression code HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from few_shot_meta_learning.