Comments (4)
Hi, this is my guess. It would definitely work.
But at a certain moment, the depth of the graph can be so large that one risks stack overflow.
It feels like a tradeoff between simplicity and robustness of the example. But I am not sure if that was the real concern.
My concern was that we are creating the order with each call of the back propagation, while it typically remains constant over the whole optimization process. So the order could easily be cached somewhere. But that is just a performance problem; I understand that this was meant as an explanation of the principle.
And it is a great one! ❤
from micrograd.
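The caching idea above could be sketched like this (hypothetical: micrograd itself rebuilds the order on every `backward()` call, and the names `_topo_cache` and `backward_order` are invented here for illustration):

```python
class Value:
    """Minimal autograd-style node (a sketch, not micrograd's actual class)."""
    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._prev = set(_children)
        self._topo_cache = None  # hypothetical cache of the topological order

    def _build_topo(self):
        topo, visited = [], set()
        def visit(v):
            if v not in visited:
                visited.add(v)
                for child in v._prev:
                    visit(child)
                topo.append(v)  # post-order: children before parents
        visit(self)
        return topo

    def backward_order(self):
        # Reuse the cached order; valid only while the graph shape is fixed,
        # which is the common case across optimization steps.
        if self._topo_cache is None:
            self._topo_cache = self._build_topo()
        return self._topo_cache
```

The catch is invalidation: the cache is only safe while the graph topology stays constant between steps, which is exactly the assumption the comment makes.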
Great question, I was wondering the same 👍🏽
Topological sort in micrograd is implemented using recursion, so the stack-overflow concern is the same as when calling backward() recursively.
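If recursion depth ever became a real problem, the same traversal can be written with an explicit stack (a sketch assuming nodes expose a `_prev` set as in micrograd; the `Node` class and function name are invented here, this is not micrograd's actual code):

```python
class Node:
    """Minimal stand-in for a value node; only `_prev` matters here."""
    def __init__(self, children=()):
        self._prev = set(children)

def topo_order_iterative(root):
    """Post-order (topological) traversal with an explicit stack instead of
    recursion, so very deep graphs cannot overflow the Python call stack."""
    topo, visited = [], set()
    stack = [(root, False)]
    while stack:
        node, children_done = stack.pop()
        if children_done:
            topo.append(node)           # all children already emitted
        elif node not in visited:
            visited.add(node)
            stack.append((node, True))  # revisit this node after its children
            for child in node._prev:
                stack.append((child, False))
    return topo

# A chain deeper than CPython's default recursion limit (~1000) still works.
node = Node()
for _ in range(5000):
    node = Node((node,))
order = topo_order_iterative(node)
```
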
The graph is not necessarily a tree, it can have diamonds, see tinygrad/tinygrad#165 (comment)
Checking visited nodes during the topological sort prevents repeated backward calls on the same node.
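A minimal sketch of that pattern (modeled on micrograd's `build_topo`, with a toy `Node` class invented here), showing that the visited check makes a shared node appear exactly once even in a diamond-shaped graph:

```python
class Node:
    def __init__(self, name, children=()):
        self.name = name
        self._prev = set(children)

def build_topo(root):
    topo, visited = [], set()
    def visit(v):
        if v not in visited:   # skip nodes already placed in the order
            visited.add(v)
            for child in v._prev:
                visit(child)
            topo.append(v)     # post-order: children before parents
    visit(root)
    return topo

# Diamond: a feeds both b and c, which both feed d.
a = Node("a")
b = Node("b", (a,))
c = Node("c", (a,))
d = Node("d", (b, c))
order = build_topo(d)
```

Iterating the resulting order in reverse then calls each node's `_backward` once, so `a`'s gradient accumulates contributions from both paths without being processed twice.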
Related Issues (20)
- Homework Assignment Error with softmax activation function
- Issue with zero_grad?
- _backward as lambdas?
- Sequential MLP implementation
- Zero_grad only zeros the weight and bias nodes, not the nodes for addition and multiplication
- A tensor version for micrograd inspired by this work
- Rename engine.py to value.py
- Vectorized implementation with PyTorch flavor
- Vectorized modification with GPU support
- Another MiniGrad with the RAdam optimizer
- Ensure backward() is idempotent
- Regarding the gradient update of the __sub__ operation
- Resetting the grad of weights and biases is not enough
- Adjusting parameters by sign and magnitude of gradient
- Topological sort - bug
- backward member implementation question
- Type annotations lacking / maybe also add docstrings
- For addition, incrementing the grad makes sense; I can't make sense of incrementing it for multiplication too - potential bug?
- Do you have an application that uses it, or an open-source developer who wants others to see it via your readme.md?