I'm trying to use Recurrent and Concat together in the same network. In particular, I'

Great question. First up. I don't think your <code class="notranslat

Ah, good to know about how Crop works. <p dir="au

You can just use the literal 0 . <co

Please see <a class="issue-link js-issue-link" data-error-text="Failed to load title"

Using Recurrent and Concat together about grenade HOT 10 CLOSED

huwcampbell commented on May 22, 2024

Using Recurrent and Concat together

from grenade.

Comments (10)

HuwCampbell commented on May 22, 2024

Great question.

First up. I don't think your Crop layers are correct. The 164 doesn't seem right in CropPlayer. (The numbers are how many are taken from the left and right, not the resulting width).

Seems like I should add a version of Crop which works on 1D shapes, you can do this too if you like in your own code.

I believe right now it's probably not possible*. As really, RecNet should be a recurrent network, with an recurrent Concat layer (R instead of F).

Now the problem is that Concat isn't an instance of RecurrentLayer. I can't think of a fundamental reason that it shouldn't be, or at least that a layer just like Concat (RecConcat) couldn't exist which takes two layers which are tagged with F or R and makes a new recurrent layer.

Would you like to try writing it?

EDIT:

With the current Concat layer (or without an orphan instance). One can write their own layers downstream

from grenade.

cpennington commented on May 22, 2024

Ah, good to know about how Crop works.

I'll take a stab at a 1D Crop and making Concat an instance of RecurrentLayer. Hopefully the types should guide me in the right direction (and I'll drop back here for advice if I get stuck).

from grenade.

HuwCampbell commented on May 22, 2024

Ahh, there's actually another problem. I haven't yet written an instance of RecurrentLayer for RecurrentNetwork.

I think it's possible, but requires packing all the recurrent (sideways travelling) shapes into a single vector.

from grenade.

HuwCampbell commented on May 22, 2024

You might have to run both LSTM networks forwards individually for now. The GAN mnist example gives a non-recurrent example of something like this.

from grenade.

cpennington commented on May 22, 2024

Ah, ok. I had thought about doing that, but hadn't looked closely enough at runBackwards/runGradient to see that they spit out something input-shaped.

Seems like runNetwork for both LSTMs, then combine their output, and feed that into runNetwork for the combining network. Then take the target output, and runBackward through the combining network to get target results for the two LSTMs, and then runGradient/applyUpdate for all networks should do the trick. I'll give it a try, see how it works out.

Thanks for your help!

from grenade.

HuwCampbell commented on May 22, 2024

That's right.
Only difference is you'll need runRecurrent and backPropagateRecurrent for the LSTM nets.

Edit. Sorry:
runRecurrentForwards and runRecurrentBackwards would also be useful.

from grenade.

cpennington commented on May 22, 2024

Cool, I'm making progress on this. One question that came up as I was working is whether there's an easy way to construct an all-zero vector for a particular RecurrentInput shape. I want to make sure my network is always starting from the same state at the start of every game.

from grenade.

HuwCampbell commented on May 22, 2024

You can just use the literal 0.

S is an instance of Num so has fromInteger. In fact RecurrentInputs xs is also an instance of Num, so that should work for the entire stack.

If you look at the code for backPropagateRecurrent you can see I do this (for the back propagated sideways gradients at least).

from grenade.

HuwCampbell commented on May 22, 2024

Please see #32

In that branch, this will compile

type R = Recurrent
type F = FeedForward

type ShapeInput = 'D1 10

type LearnPlayer = RecurrentNetwork
   '[ R (LSTM 10 20) ]
   '[ ShapeInput , D1 20 ]

type LearnOpponent = RecurrentNetwork
   '[ R (LSTM 10 20) ]
   '[ ShapeInput, D1 20 ]

type RecNet = RecurrentNetwork
    '[ R (
        ConcatRecurrent
          (D1 20)
          (R LearnPlayer)
          (D1 20)
          (R LearnOpponent)
        )
    ]
   '[ ShapeInput, 'D1 40 ]

randomNet :: MonadRandom m => m RecNet
randomNet = randomRecurrent

from grenade.

HuwCampbell commented on May 22, 2024

I believe this is fixed, but feel free to follow up with any problems you're having.

from grenade.

Using Recurrent and Concat together about grenade HOT 10 CLOSED

Comments (10)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent