I am experimenting with the optimizers pattern to build an optimizer that tries its best to reduce the learning rate whenever some loss-increase condition happens. I think the best way to do this is to include the step_size in the optimizer state. That part is fine, but the step_size update condition depends on the previous loss and the current loss, which means one has to pass both the gradients and the loss into update, and the checks in the optimizer complain about this.
import numpy as np
from jax.example_libraries.optimizers import optimizer  # jax.experimental.optimizers in older JAX

@optimizer
def sgd_custom(step_size):
  """Construct optimizer triple for stochastic gradient descent, but with a dynamic step_size.

  Args:
    step_size: positive scalar, or a callable representing a step size schedule
      that maps the iteration index to a positive scalar.

  Returns:
    An (init_fun, update_fun, get_params) triple.
  """
  # step_size = make_schedule(step_size)

  def init(x0, loss0=None, step_size0=step_size):
    if loss0 is None:
      loss0 = np.inf
    return x0, loss0, step_size0

  def update(i, g_loss, state):
    # This breaks the API to some extent: the loss value now has to be
    # passed in together with the gradients.
    g, loss = g_loss
    x, prev_loss, step_size = state
    # Halve the step size whenever the loss grew by more than 10%.
    if loss / prev_loss > 1.1:
      step_size *= 0.5
    x = x - step_size * g
    return x, loss, step_size

  def get_params(state):
    x, _, _ = state
    return x

  return init, update, get_params
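For reference, here is a rough sketch of how I am driving it (loss_fn and the initial params are just placeholders); the first opt_update call is what triggers the error shown below, because the (gradients, loss) tuple no longer matches the pytree structure of the params:

# Rough sketch only; loss_fn and the initial params are placeholders.
import jax
import jax.numpy as jnp

def loss_fn(x):
  return jnp.sum(x ** 2)

opt_init, opt_update, get_params = sgd_custom(step_size=0.1)
opt_state = opt_init(jnp.ones(3))

for i in range(10):
  x = get_params(opt_state)
  loss, g = jax.value_and_grad(loss_fn)(x)
  # Passing (g, loss) instead of just g is what trips the pytree check.
  opt_state = opt_update(i, (g, loss), opt_state)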
If you try using this, you get an error like:
TypeError: optimizer update function was passed a gradient tree that did not match the parameter tree structure with which it was initialized: parameter tree PyTreeDef(*) and grad tree PyTreeDef((*, *)).
I am wondering: (a) is this a bad idea in general for other reasons, and (b) if not, is it worth thinking about allowing this pattern?
Or maybe this multi-step pattern exists elsewhere in the optimizer code and I have not yet come across it.
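In case it helps frame the question, the workaround I have at the moment is to drop the @optimizer decorator and write the triple by hand, so update is free to take the loss as a separate argument and the gradient/parameter tree check never runs. Rough sketch (sgd_custom_manual is just a throwaway name):

import numpy as np
import jax.numpy as jnp
from jax import tree_util

def sgd_custom_manual(step_size):
  # Same idea, but without the @optimizer decorator: update takes the loss
  # as an extra argument, and no gradient/parameter tree check is performed.
  def init(x0):
    return x0, np.inf, step_size

  def update(i, g, loss, state):
    x, prev_loss, lr = state
    # jnp.where keeps this jit-friendly; halve lr if the loss grew by >10%.
    lr = jnp.where(loss / prev_loss > 1.1, lr * 0.5, lr)
    new_x = tree_util.tree_map(lambda p, gp: p - lr * gp, x, g)
    return new_x, loss, lr

  def get_params(state):
    x, _, _ = state
    return x

  return init, update, get_params

It works, but it duplicates the pytree plumbing that the decorator normally handles, which is part of why I am asking whether a supported pattern exists.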