T
traeai
Sign in

概念

Stochastic Gradient Descent

别名:SGD

一种优化算法,每次迭代仅使用一个或少量样本计算梯度,用于大规模机器学习模型训练。

相关材料

已收录 1 条与 Stochastic Gradient Descent 相关的内容,按评分排序。

Towards Data Science 图标

Why Gradient Descent Became Stochastic

Towards Data Science4695 字 (约 19 分钟)
78

The core reason gradient descent evolved into stochastic gradient descent (SGD) is computational scalability: as dataset size grows, batch gradient descent (BGD) becomes prohibitively expensive, while SGD updates parameters using only one or a few samples per iteration—reducing cost and leveraging noise to escape local minima; the article illustrates this via linear regression, deriving the closed-form solution from MSE and naturally motivating iterative optimization.

入选理由:线性回归中β₀=27315.74、β₁=9020.66的解析解可通过MSE对β₀/β₁求偏导并令其为0推导得出

FeaturedArticle#Gradient Descent#Stochastic Gradient Descent#Linear Regression#Optimization#Machine Learning英文

跨材料问答 · Stochastic Gradient Descent

回答基于:Stochastic Gradient Descent 相关 1 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.