Comments (2)
Hi Cuong,
Thanks for your reply.
The first thing I want to mention is that the logSigma's prior is Gamma Distribution not Gaussian. But I think it's a minor point.
Also, I totally agree that it's inappropriate to implement q(theta)=Dirac(theta-theta*) from the analytic perspective. Perhaps it's just an engineering approximation.
Anyway, I think it's better to add the weight-decay term if you really want to exactly reproduce the paper.
from few_shot_meta_learning.
Hi Zichen,
You are right. I did not implement the prior of theta into the calculation. Instead of setting the priors as shown in the paper, we can simply use the L2 regularization (weight_decay
of the op_theta
in Pytorch). In such case, the both the priors for the mean and logSigma would be Gaussian.
Besides, the main reason I did not do that is my disagreement with the assumption that sets q(theta) = Dirac_delta(theta - theta*). At the very first equation on page 2 of the paper, Jensen's in equality is used to derive the lower-bound. That derivation is associated with the assumption that q(theta) > 0. Therefore, setting q(theta) as a Dirac delta function violates that assumption.
Nevertheless, let me know if the explanation above resolves your concern.
from few_shot_meta_learning.
Related Issues (15)
- Some questions about this code. HOT 1
- Loss is NaN in PLATIPUS HOT 2
- Platipus loss function potentially doesn't match paper HOT 2
- Question about the implementation of VAMPIRE HOT 4
- test in Platius model HOT 2
- NaN loss when training with sine HOT 4
- error in Platipus model with sineline data source
- Models not training HOT 4
- Loss function for implementation of BMAML HOT 2
- Question about the initialization of theta0 in abml HOT 4
- First order approximate typo? HOT 1
- getting NaN's in ABML at about epoch 14 HOT 4
- Consultation about the code HOT 1
- Regression code HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from few_shot_meta_learning.