Added feature to allow different learning rates per layer in the NN #143
Open
Tgaaly wants to merge 4 commits into rasmusbergpalm:master
… useful for transfer learning, pre-training parts of the NN and fine-tuning other parts
… useful for transfer learning, pre-training parts of the NN and fine-tuning other parts - updated nntrain.m - where learningRatePerLayer is also scaled with the nn.scaling_learningRate
… useful for transfer learning, pre-training parts of the NN and fine-tuning other parts - error fix
This is really useful. In my PR: #128
Added a feature to allow different learning rates per layer in the NN. This is useful for transfer learning, where you pre-train parts of the NN and then fine-tune additional layers on top. The learning rates should be higher for the new layers and lower for the pre-trained layers. This is similar to what was done here: http://caffe.berkeleyvision.org/gathered/examples/finetune_flickr_style.html
By default nn.learningRatePerLayer=[], which does not cause an error in the default case because I check whether it is empty. The changes are very simple and straightforward.
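
For readers who want to see the idea in isolation, here is a minimal sketch in Python. It is not the code from this PR or from the toolbox. The function names (`effective_rates`, `sgd_step`, `decay`) are hypothetical, weights are plain flat lists, and the decay step assumes that the scaling factor multiplies every rate once per epoch. An empty per-layer list falls back to the single global rate, mirroring the empty-check described above.

```python
def effective_rates(global_rate, per_layer_rates, n_weight_layers):
    """Return one learning rate per weight layer.

    An empty per_layer_rates stands in for learningRatePerLayer = []
    and falls back to the global rate for every layer (assumption:
    this matches the default behaviour described in the PR).
    """
    if not per_layer_rates:
        return [global_rate] * n_weight_layers
    if len(per_layer_rates) != n_weight_layers:
        raise ValueError("expected one learning rate per weight layer")
    return list(per_layer_rates)


def sgd_step(weights, grads, rates):
    """Plain gradient descent, W <- W - rate * dW, with one rate per layer."""
    return [
        [w - rate * g for w, g in zip(layer_w, layer_g)]
        for layer_w, layer_g, rate in zip(weights, grads, rates)
    ]


def decay(rates, scaling):
    """Scale every rate by the same factor (assumed to run once per epoch)."""
    return [rate * scaling for rate in rates]


# Hypothetical fine-tuning setup: a low rate for the pre-trained first
# layer and a higher rate for the newly added second layer.
rates = effective_rates(0.1, [0.25, 0.5], n_weight_layers=2)
weights = sgd_step([[1.0], [1.0]], [[1.0], [1.0]], rates)
rates = decay(rates, 0.5)
```

Keeping the fallback in one helper means the update step never has to know whether per-layer rates were configured, so the default single-rate behaviour is unchanged.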