
An overview of gradient descent optimization algorithms
Sep 15, 2016 · Gradient descent optimization algorithms, while increasingly popular, are often used as black-box optimizers, as practical explanations of their strengths and weaknesses are …
SGDR: Stochastic Gradient Descent with Warm Restarts
Aug 13, 2016 · In this paper, we propose a simple warm restart technique for stochastic gradient descent to improve its anytime performance when training deep neural networks. We …
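The warm-restart idea in this result is usually paired with a cosine-annealed learning rate that is periodically reset to its maximum. A minimal sketch of such a schedule follows; the parameter names and values (`lr_min`, `lr_max`, `T_0`, `T_mult`) are illustrative choices, not values taken from the paper:

```python
import math

def sgdr_lr(t, lr_min=1e-4, lr_max=0.1, T_0=10, T_mult=2):
    """Cosine-annealed learning rate with warm restarts.

    Each cycle decays the rate from lr_max to lr_min along a cosine
    curve, then restarts; cycle lengths grow by a factor of T_mult.
    Parameter values here are illustrative only.
    """
    T_i, t_cur = T_0, t
    while t_cur >= T_i:          # locate the cycle containing step t
        t_cur -= T_i
        T_i *= T_mult
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * t_cur / T_i))
```

Calling `sgdr_lr(0)` returns `lr_max`; the rate then decays toward `lr_min` until step `T_0`, where it jumps back to `lr_max` and a longer cycle begins.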
Constrained Stochastic Gradient Descent: The Good Practice
Nov 29, 2017 · Stochastic Gradient Descent (SGD) is the method of choice for large scale problems, most notably in deep learning. Recent studies target improving convergence and …
[2501.08425] Is Stochastic Gradient Descent Effective? A PDE ...
Jan 14, 2025 · In this paper we analyze the behaviour of the stochastic gradient descent (SGD), a widely used method in supervised learning for optimizing neural network weights via …
[Paper Reading] A Survey of Gradient Descent Optimization Algorithms - Zhihu
Batch gradient descent (BGD), stochastic gradient descent (SGD), and mini-batch gradient descent are essentially the same algorithm; the difference lies in how many samples the gradient is computed over. BGD computes the gradient over all samples …
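The distinction this result draws, that the three variants differ only in how many samples enter each gradient estimate, can be illustrated on a toy least-squares problem. The data, learning rate, and step count below are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))            # toy design matrix
w_true = np.array([1.0, -2.0, 0.5])      # ground-truth weights
y = X @ w_true + 0.1 * rng.normal(size=200)

def grad(w, Xb, yb):
    """Gradient of mean squared error on a batch."""
    return 2.0 * Xb.T @ (Xb @ w - yb) / len(yb)

def descend(batch_size, lr=0.1, steps=300):
    """Same update rule for all variants; only batch_size differs."""
    w = np.zeros(3)
    for _ in range(steps):
        idx = rng.choice(len(X), size=batch_size, replace=False)
        w -= lr * grad(w, X[idx], y[idx])
    return w

w_bgd  = descend(batch_size=len(X))   # BGD: all samples per step
w_sgd  = descend(batch_size=1)        # SGD: one sample per step
w_mini = descend(batch_size=32)       # mini-batch: a small subset
```

All three estimates land near `w_true`; the smaller the batch, the noisier each step and the noisier the final iterate.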
Recent Advances in Stochastic Gradient Descent in Deep Learning
Jan 29, 2023 · Among machine learning models, stochastic gradient descent (SGD) is not only simple but also very effective. This study provides a detailed analysis of contemporary state-of …
Stochastic Gradient Descent and Its Variants in Machine Learning …
Feb 12, 2019 · Stochastic gradient descent (SGD) is a fundamental algorithm which has had a profound impact on machine learning. This article surveys some important results on SGD and …
SGD Explained - Papers With Code
Stochastic Gradient Descent is an iterative optimization technique that uses minibatches of data to form an expectation of the gradient, rather than the full gradient using all available data. …
Multi-epoch, small-batch, Stochastic Gradient Descent (SGD) has been the method of choice for learning with large over-parameterized models. A popular theory for explaining why SGD …
This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand the algorithm’s convergence properties in non-convex problems. We first show that the sequence …