[ENH] Add parameters and similarity measures to tSNE by pavlin-policar · Pull Request #2510 · biolab/orange3

pavlin-policar · 2017-07-29T20:08:36Z

Issue

The Manifold Learning widget did provide tSNE, but it lacked the important parameters needed to tune the embedding. Most notably, I find learning_rate to be the most important parameter; it has a huge effect on the result.

Description of changes

Add relevant parameters (perplexity, early exaggeration, learning rate, max iterations and initialization).

Includes

Code changes
Tests
Documentation

pavlin-policar · 2018-01-12T18:04:27Z

@lanzagar I've added the missing parameters to tSNE. This can probably be merged easily.

I decided to leave out the Orange metrics because that would require many more changes, e.g. setting precomputed parameters and computing PCA manually in the widget. All of this is doable, but it should go into a separate PR.

codecov-io · 2018-01-12T18:36:14Z

Codecov Report

Merging #2510 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff            @@
##           master   #2510      +/-   ##
=========================================
+ Coverage    82.1%   82.1%   +<.01%     
=========================================
  Files         328     328              
  Lines       56210   56220      +10     
=========================================
+ Hits        46151   46161      +10     
  Misses      10059   10059

lanzagar · 2018-01-19T11:39:10Z

Orange/widgets/unsupervised/owmanifoldlearning.py

+    n_iter = Setting(1000)
+
+    init_index = Setting(0)
+    init_values = [("random", "Random"), ("pca", "PCA")]


Can we leave the default init as pca?
I would also switch their positions so that pca is the first (default) option, followed by random.
This would also make it consistent with mds (pca = first & default).

I know this goes against the sklearn defaults, but we already had pca as the only option before and I think it makes more sense for users (random can show a completely different figure each time). The consistency with mds is an added bonus.

As far as I can tell, both the MDS widget and manifold MDS control pane have the option to initialize randomly. I have, however, made PCA the default value, since that is what MDS does in both cases. Is there a third MDS widget that I missed?

This is fine now. That is what I meant - just change the order of pca and random, so that pca is default (but we still have both options, like in mds)
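The ordering agreed on in this thread can be illustrated with a minimal sketch. It assumes, as the quoted diff suggests, that the stored setting is an index into a list of (keyword, label) pairs. The names `INIT_VALUES`, `DEFAULT_INIT_INDEX`, `init_keyword` and `init_labels` are hypothetical stand-ins, not the widget's actual attributes.

```python
# Hypothetical stand-ins for the widget attributes quoted in the diff above.
# PCA is listed first so that index 0, the default, selects it.
INIT_VALUES = [("pca", "PCA"), ("random", "Random")]
DEFAULT_INIT_INDEX = 0


def init_keyword(index=DEFAULT_INIT_INDEX):
    """Return the keyword that would be passed on as `init` for a combo index."""
    keyword, _label = INIT_VALUES[index]
    return keyword


def init_labels():
    """Return the labels shown to the user, in display order."""
    return [label for _keyword, label in INIT_VALUES]
```

With this layout, swapping the two tuples is enough to change both the display order and the default, because the default is stored as an index rather than as a keyword.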

lanzagar · 2018-01-19T11:39:56Z

Orange/widgets/unsupervised/owmanifoldlearning.py

+        self.lr_spin = self._create_spin_parameter(
+            "learning_rate", 1, 1000, "Learning rate:")
+        self.n_iter_spin = self._create_spin_parameter(
+            "n_iter", 0, 1e5, "Max iterations:")


The max iterations spin allows values as low as 0, but running it with <250 results in an error. Change the min value of the spin control to 250.

lanzagar · 2018-01-19T11:42:12Z

Orange/widgets/unsupervised/owmanifoldlearning.py

            "metric", "Metric:")
-        self.parameters["init"] = "pca"
+        self.perplexity_spin = self._create_spin_parameter(
+            "perplexity", 0, 1000, "Perplexity:")


I am not sure about this one, but sklearn doc says to consider values 5-50. I think spin limits of 1-100 should be fine?

lanzagar · 2018-01-19T11:43:19Z

Orange/widgets/unsupervised/owmanifoldlearning.py

+        self.perplexity_spin = self._create_spin_parameter(
+            "perplexity", 0, 1000, "Perplexity:")
+        self.early_exaggeration_spin = self._create_spin_parameter(
+            "early_exaggeration", 1, 1000, "Early exaggeration:")


Not sure about this one either, but 1000 sounds really high. Probably 1-100 should be fine?

lanzagar · 2018-01-19T11:47:42Z

Orange/widgets/unsupervised/owmanifoldlearning.py

+    perplexity = Setting(30)
+    early_exaggeration = Setting(4)
+    learning_rate = Setting(1000)
+    n_iter = Setting(1000)


I suggest we use the same defaults as the latest sklearn version (they have been changing them recently):
perplexity=30, early_exaggeration=12, learning_rate=200, n_iter=1000

I didn't realize they had changed them. Fixed now.
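The defaults and spin limits proposed across the review comments above can be collected in one place, as a rough sketch. The values come from those comments and from the quoted diff (learning rate 1 to 1000, max iterations up to 1e5); the merged code may use different limits. `SpinSpec`, `TSNE_SPINS` and `default_parameters` are hypothetical names, not part of Orange.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpinSpec:
    """Hypothetical description of one spin box: bounds plus default."""
    minimum: int
    maximum: int
    default: int

    def clamp(self, value):
        """Clamp a value into the closed range [minimum, maximum]."""
        return max(self.minimum, min(self.maximum, value))


# Values suggested in the review comments; the merged code may differ.
TSNE_SPINS = {
    "perplexity": SpinSpec(1, 100, 30),
    "early_exaggeration": SpinSpec(1, 100, 12),
    "learning_rate": SpinSpec(1, 1000, 200),
    "n_iter": SpinSpec(250, 100_000, 1000),  # <250 was reported to raise an error
}


def default_parameters():
    """Return the default value of every tSNE parameter as a plain dict."""
    return {name: spec.default for name, spec in TSNE_SPINS.items()}
```

Keeping the lower bound of `n_iter` at 250 in the control itself means an invalid value can never reach the embedding code, which is the approach the review asks for.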

pavlin-policar force-pushed the refactor-tsne branch from fbecd39 to 9b2cedb on July 31, 2017 07:01

thocevar self-assigned this Aug 18, 2017

thocevar removed their assignment Sep 8, 2017

lanzagar self-assigned this Jan 12, 2018

pavlin-policar force-pushed the refactor-tsne branch from 9b2cedb to 12f8c72 on January 12, 2018 18:00

lanzagar added this to the 3.10 milestone Jan 19, 2018

lanzagar requested changes Jan 19, 2018

pavlin-policar added 2 commits January 19, 2018 14:18

OWManifold: Add missing parameters to tSNE

a65302d

OWManifold: Update settings to match sklearn defaults, change limits on controls

554d55d

pavlin-policar force-pushed the refactor-tsne branch from 12f8c72 to 554d55d on January 19, 2018 13:22

lanzagar approved these changes Jan 19, 2018

lanzagar merged commit 471b417 into biolab:master Jan 19, 2018

pavlin-policar deleted the refactor-tsne branch January 19, 2018 14:45

lanzagar merged 2 commits into biolab:master from pavlin-policar:refactor-tsne
