Skip to content

Implement policy learning task  #3

@bruno-f-cruz

Description

@bruno-f-cruz

Requires:

  • Choice on each trial is given by the animal stabilizing its force within a range (min<force<max) for N seconds. In practice this can be done by running a min-max rolling window of N samples and thresholding the resulting value. This can be further simplified by operating on top of the paired boolean threshold output.
  • Once a choice is made, implements feedback modes:
  • Gradient (feedback is continuous based on Force_choice - Force_target)
  • UnsignedGradient (feedback is continuous based on abs(Force_choice - Force_target))
  • StepGradient (feedback is discrete (i.e. higher, lower))
  • ReinforcementLearning (feedback is discrete and unsigned)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions