Akshay Kiran Jose edited this page Oct 7, 2022 · 4 revisions

Learning to learn by gradient descent by gradient descent

For motivation, take a look at the plot below, or find the latest code here:

An LSTM-based optimizer learns to minimize functions, and does so better on average than tried-and-tested optimizers such as Adam, RMSProp, and SGD.
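The core idea can be sketched as follows. This is a minimal, untrained illustration (random LSTM weights, a toy quadratic objective, and hypothetical names like `TinyLSTMOptimizer`), not the repository's actual implementation: the optimizer is a small per-coordinate LSTM that reads a parameter's gradient and emits an update, carrying its own hidden state across optimization steps.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class TinyLSTMOptimizer:
    """Sketch of a coordinatewise LSTM optimizer: one scalar gradient in,
    one scalar parameter update out. Weights are random (untrained)."""

    def __init__(self, hidden=8):
        self.hidden = hidden
        # Gates for input [grad, h]: input, forget, output, candidate.
        self.W = rng.normal(scale=0.1, size=(4 * hidden, 1 + hidden))
        self.b = np.zeros(4 * hidden)
        # Linear readout mapping the hidden state to a scalar update.
        self.w_out = rng.normal(scale=0.1, size=hidden)

    def step(self, grad, state):
        h, c = state
        z = self.W @ np.concatenate(([grad], h)) + self.b
        i, f, o, g = np.split(z, 4)
        i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
        c = f * c + i * g          # new cell state
        h = o * np.tanh(c)         # new hidden state
        return self.w_out @ h, (h, c)

def optimize(theta, grad_fn, opt, steps=20):
    # One LSTM state per coordinate (the "coordinatewise" design).
    states = [(np.zeros(opt.hidden), np.zeros(opt.hidden)) for _ in theta]
    for _ in range(steps):
        g = grad_fn(theta)
        for j in range(len(theta)):
            update, states[j] = opt.step(g[j], states[j])
            theta[j] += update     # the LSTM's output *is* the update rule
    return theta

# Toy optimizee: f(theta) = ||theta||^2, so grad = 2 * theta.
theta = optimize(np.array([1.0, -2.0, 0.5]),
                 lambda t: 2.0 * t,
                 TinyLSTMOptimizer())
print(theta)
```

In the paper's setup, the LSTM's weights would themselves be trained by gradient descent on the optimizee's loss accumulated over an unrolled trajectory; the sketch above omits that outer training loop.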

*Figure: loss versus iterations*
