hey, for the nesterov momentum optimizer, is it only for weights and not biases? what does lambda and delta represent