basecls.solver.optimizer.sgd#
- class basecls.solver.optimizer.sgd.SGD(params, lr, momentum=0.0, nesterov=False, weight_decay=0.0)[源代码]#
基类:
OptimizerImplements stochastic gradient descent.
Nesterov momentum is based on the formula from “On the importance of initialization and momentum in deep learning”.
- 参数
params (
Union[Iterable[Parameter],dict]) – iterable of parameters to optimize or dicts defining parameter groups.lr (
float) – learning rate.momentum (
float) – momentum factor. Default:0.0nesterov (
bool) – enables Nesterov momentum. Default:Falseweight_decay (
float) – weight decay (L2 penalty). Default:0.0