Backpropagation

An algorithm to compute gradients efficiently

Chain Rule:

dz/dx = dz/dy × dy/dx    (where z depends on y, and y depends on x)

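  • Forward pass: compute the output step by step, storing intermediate values
  • Backward pass: propagate gradients from the output back to the input using the chain rule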
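
The two passes can be sketched on a tiny example. The functions below are illustrative assumptions, not part of the original notes: the composition y = x², z = sin(y) is chosen only because its derivatives are easy to check by hand, and the names forward, backward, and numerical_grad are hypothetical.

```python
import math


def forward(x):
    """Forward pass: compute y = x^2 and z = sin(y), caching intermediates."""
    y = x * x
    z = math.sin(y)
    return z, (x, y)


def backward(cache):
    """Backward pass: apply the chain rule dz/dx = dz/dy * dy/dx."""
    x, y = cache
    dz_dy = math.cos(y)   # derivative of sin(y) with respect to y
    dy_dx = 2.0 * x       # derivative of x^2 with respect to x
    return dz_dy * dy_dx


def numerical_grad(x, eps=1e-6):
    """Central finite difference, used only as a sanity check."""
    z_plus, _ = forward(x + eps)
    z_minus, _ = forward(x - eps)
    return (z_plus - z_minus) / (2.0 * eps)


x = 1.5
z, cache = forward(x)
grad = backward(cache)
print("z =", z)
print("backprop gradient  =", grad)
print("numerical gradient =", numerical_grad(x))
```

The backward pass reuses the values cached during the forward pass, which is where the efficiency comes from: each intermediate value is computed once and each local derivative is evaluated once.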

 

 
