What is the long-run distribution of stochastic gradient descent? A large deviations analysis
In this paper, we examine the long-run distribution of stochastic gradient descent (SGD) in
general, non-convex problems. Specifically, we seek to understand which regions of the …
general, non-convex problems. Specifically, we seek to understand which regions of the …