Домой United States USA — software New logarithmic step size for stochastic gradient descent

New logarithmic step size for stochastic gradient descent

265
0
ПОДЕЛИТЬСЯ

The step size, often referred to as the learning rate, plays a pivotal role in optimizing the efficiency of the stochastic gradient descent (SGD) algorithm. In recent times, multiple step size strategies have emerged for .
The step size, often referred to as the learning rate, plays a pivotal role in optimizing the efficiency of the stochastic gradient descent (SGD) algorithm. In recent times, multiple step size strategies have emerged for enhancing SGD performance. However, a significant challenge associated with these step sizes is related to their probability distribution, denoted as ηt/ΣTt=1ηt .
This distribution has been observed to avoid assigning exceedingly small values to the final iterations.

Continue reading...