The exploding gradient problem was first described in an academic paper titled "The problem of learning long-term dependencies in recurrent networks". It is a difficulty that can arise when training artificial neural networks with gradient descent and backpropagation.
When large error gradients accumulate, the resulting weight updates can become very large, making the network unstable and impairing learning; in extreme cases the weight values grow so large that they overflow. A gradient expresses the direction and magnitude computed during training and is used to nudge the network weights in the right direction by the right amount. When gradients explode, the error-gradient components can grow exponentially as they are propagated backward through the layers.
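As a rough illustration of this exponential growth (a minimal NumPy sketch, not taken from any specific source; the depth, width, and weight scale are arbitrary), repeatedly backpropagating a gradient through layers whose weights are slightly too large multiplies its norm at every step:

    import numpy as np

    rng = np.random.default_rng(0)
    width, depth = 100, 50
    grad = rng.normal(size=width)                 # gradient arriving at the last layer

    # A weight matrix whose typical singular values exceed 1, so each
    # backward step stretches the gradient rather than preserving it.
    W = rng.normal(scale=1.2 / np.sqrt(width), size=(width, width))

    for layer in range(depth):
        grad = W.T @ grad                          # backpropagate through one linear layer
        if layer % 10 == 0:
            print(f"layer {layer:2d}: gradient norm = {np.linalg.norm(grad):.3e}")

Running this prints gradient norms that grow by orders of magnitude over the fifty layers, which is the behaviour the term "exploding gradient" describes.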
The exploding gradient problem can be addressed by redesigning the network model, using rectified linear (ReLU) activations, using long short-term memory (LSTM) networks, applying gradient clipping, or applying weight regularization. Gradient clipping prevents gradients from becoming too big by placing a predefined threshold on them: the clipped gradients keep pointing in the same direction but are shortened so their magnitude stays within the threshold.
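As a hedged sketch of how gradient clipping is commonly applied in practice (using PyTorch's clip_grad_norm_ utility; the model, data, and threshold of 1.0 here are placeholders), the gradients are rescaled after the backward pass whenever their combined norm exceeds the chosen threshold:

    import torch
    import torch.nn as nn

    model = nn.Linear(10, 1)                       # placeholder model
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
    loss_fn = nn.MSELoss()

    x, y = torch.randn(32, 10), torch.randn(32, 1)  # placeholder batch

    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()

    # Rescale all gradients so their total norm is at most 1.0;
    # the update direction is preserved, only its length shrinks.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)

    optimizer.step()

Clipping by norm, as above, preserves the relative proportions of the individual gradient components; an alternative is to clip each component to a fixed value, which changes the direction as well.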