Autograd Checkpoint

Test your understanding of automatic differentiation.


1. In what order does the backward pass traverse the nodes of a computational graph?
2. Why do we use += instead of = when accumulating gradients?
3. What is the derivative of tanh(x) with respect to x?
4. What is the gradient of x^3 with respect to x, evaluated at x = 2?
5. What should we do before calling backward()?
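
Once you have attempted the questions, one way to check your reasoning is to run a tiny scalar autograd engine and inspect the gradients it produces. The sketch below is a minimal illustration, not any particular library's implementation: the `Value` class and its `_parents` and `_backward` attributes are hypothetical names chosen for this example, and only multiplication and tanh are supported.

```python
import math


class Value:
    """Hypothetical scalar autograd node, for illustration only."""

    def __init__(self, data, parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # leaf nodes have nothing to propagate

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # Accumulate, so a node used several times sums every contribution.
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def tanh(self):
        t = math.tanh(self.data)
        out = Value(t, (self,))

        def _backward():
            # Local derivative of tanh, written in terms of its own output t.
            self.grad += (1.0 - t * t) * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Build a topological order by depth-first search over the parents.
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0  # seed: derivative of the output with respect to itself
        # Walk from the output back toward the inputs.
        for node in reversed(order):
            node._backward()


x = Value(2.0)
y = x * x * x  # x appears three times in this graph
y.backward()
print(x.grad)  # gradient of x^3 at x = 2

# Reusing x in a second graph without resetting x.grad adds to the old value.
y2 = x * x * x
y2.backward()
print(x.grad)
```

Comparing the two printed values, and then repeating the second run after setting `x.grad = 0.0`, is a quick way to see what questions 2 and 5 are getting at. Replacing `+=` with `=` inside `__mul__` and rerunning shows how the result changes when a node feeds into more than one operation.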