Stochastic Gradient Descent
class StochasticGradientDescent(val batchSize: Int, val learningRate: LearningRateSchedule, val entropy: Random = Random.Default, val discardExtras: Boolean = false) : GradientDescent
Stochastic Gradient Descent (SGD) optimizer with adjustable learning rate.
SGD operates on a subset of the training data (a single sample or a mini-batch) randomly selected from the entire dataset.
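The mini-batch procedure described above can be illustrated with a minimal, self-contained sketch. It does not use this library's API: `miniBatches` and `sgdEpoch` are hypothetical helpers, the learning rate is a plain `Double` rather than a `LearningRateSchedule`, and the reading of `discardExtras` as "drop the final, smaller batch" is an assumption based only on the parameter name.

```kotlin
import kotlin.random.Random

// Hypothetical helper (not part of the library): shuffles sample indices
// and splits them into mini-batches of at most batchSize elements.
fun miniBatches(
    sampleCount: Int,
    batchSize: Int,
    entropy: Random = Random.Default,
    discardExtras: Boolean = false, // assumed meaning: drop the final, smaller batch
): List<List<Int>> {
    require(batchSize > 0) { "batchSize must be positive" }
    val batches = (0 until sampleCount).shuffled(entropy).chunked(batchSize)
    return if (discardExtras) batches.filter { it.size == batchSize } else batches
}

// Hypothetical helper: one epoch of SGD for the 1-D linear model y = w * x
// with squared-error loss, using a constant learning rate for simplicity.
fun sgdEpoch(
    xs: DoubleArray,
    ys: DoubleArray,
    initialWeight: Double,
    learningRate: Double,
    batchSize: Int,
    entropy: Random = Random.Default,
): Double {
    var w = initialWeight
    for (batch in miniBatches(xs.size, batchSize, entropy)) {
        // Mean gradient of (w * x - y)^2 with respect to w over the batch.
        val gradient = batch.sumOf { i -> 2.0 * (w * xs[i] - ys[i]) * xs[i] } / batch.size
        w -= learningRate * gradient
    }
    return w
}
```

Calling `sgdEpoch` repeatedly on data generated from `y = 2x`, with a small learning rate such as `0.01`, moves `w` toward `2.0`. Passing a seeded `Random` as `entropy` makes the batch order reproducible.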
Parameters
learningRate
The learning rate schedule that determines the step size the optimizer uses for each update.
Constructors
constructor(batchSize: Int, learningRate: LearningRateSchedule, entropy: Random = Random.Default, discardExtras: Boolean = false)