
Conversation

@mfalkiewicz

Refactoring the kernel normalization and the transition matrix computation as independent functions will allow for more code re-use.

- metric=self.metric)
- return self.affinity_matrix_
+ self.affinity_matrix_ = self.affinity(X)
+ return self.affinity_matrix_
Collaborator

The function description should be modified if this function is not going to return the affinity matrix.

Author

I re-added the return of affinity_matrix_, but not as a field on self. The reason is that in the code I am currently writing I want to be able to bind the results of several affinity calculations to different fields in the object.
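
A minimal sketch of the pattern being described (illustrative names and a stand-in kernel, not the package's actual API): because affinity() returns the matrix rather than always assigning self.affinity_matrix_, the caller decides where each result lives.

    import numpy as np

    class Embedding:
        def affinity(self, X):
            # Stand-in kernel: a simple Gaussian affinity on squared distances.
            D2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
            return np.exp(-D2 / np.median(D2))

    emb = Embedding()
    X_a, X_b = np.random.rand(5, 2), np.random.rand(6, 2)
    emb.affinity_a_ = emb.affinity(X_a)  # bind each result to its own field
    emb.affinity_b_ = emb.affinity(X_b)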

raise ImportError('Checks require scikit-learn, but not found')

ndim = L.shape[0]
if overwrite:
Collaborator

overwrite is no longer supported in the rewrite. I think it should still be available as a memory-saving option.

Collaborator

@mfalkiewicz - this is not supported in the refactor

- k = int(max(2, np.round(D.shape[0] * 0.01)))
- eps = 2 * np.median(np.sort(D, axis=0)[k+1, :])**2
+ k = int(max(2, np.round(D.shape[0] * f_neighbors)))
+ eps = np.median(np.sort(D, axis=0)[0:k+1, :]**2)
Collaborator

If I remember this correctly, eps is based on the (k+1)-th entry rather than everything up to that point. It basically provides a scaling factor given the sorted distances.

Author

The approach currently implemented is more liberal than the one I propose, and both of them are correct. All manifold learning approaches are based on local similarities, and these should be emphasized; I think the median of all the values up to the k-th neighbor captures this property more accurately. Besides, I think that parametrizing the kernel in terms of average distances to a certain fraction of nearest neighbors is more natural, because it doesn't depend on the scale of the affinities one uses.
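
For concreteness, a self-contained sketch (toy data; variable names other than D and f_neighbors are illustrative, not the package's code) contrasting the two bandwidth choices under discussion:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.standard_normal((100, 3))
    # Pairwise Euclidean distance matrix D.
    D = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(-1))
    sorted_D = np.sort(D, axis=0)

    # Current approach: scale from the single (k+1)-th sorted distance.
    k = int(max(2, np.round(D.shape[0] * 0.01)))
    eps_single = 2 * np.median(sorted_D[k + 1, :]) ** 2

    # Proposed approach: median of squared distances up to the k-th
    # neighbor, with k set by a fraction f_neighbors of the sample size.
    f_neighbors = 0.05
    k = int(max(2, np.round(D.shape[0] * f_neighbors)))
    eps_median = np.median(sorted_D[0 : k + 1, :] ** 2)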

- L_alpha = d_alpha[:, np.newaxis] * L_alpha
-
- M = L_alpha
+ M = _compute_markov_matrix(L, use_sparse=use_sparse, alpha=alpha)
Collaborator

M = _compute_markov_matrix(L if overwrite else L.copy(), use_sparse=use_sparse, alpha=alpha)

Author

Regarding this comment and overwrite: I agree that it would be nice to have a memory-saving option, but I am not sure this refactor allows for it. I don't know the details of Python's scoping and argument-passing rules, so maybe you could enlighten me. _compute_markov_matrix creates a local binding A. If we pass an alias (L) as the argument rather than a fresh object (L.copy()), is A also going to be an alias? In other words, if we pass an alias to a variable in the lexical scope of compute_diffusion_map as an argument to _compute_markov_matrix, is _compute_markov_matrix going to mutate the object the alias refers to, or create its own local copy and mutate that instead?

Sorry for the slightly confusing description, but I don't know how to phrase this more simply.
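
For reference, a minimal sketch (illustrative, not the package's code) of the semantics the question turns on: Python passes object references, so the parameter A is a new name bound to the same array, and in-place mutation is visible to the caller unless a copy is passed.

    import numpy as np

    def mutate(A):
        A *= 2        # in-place operation: modifies the array A is bound to
        return A

    L = np.ones((2, 2))
    mutate(L)         # pass the alias: the caller's L is doubled
    print(L[0, 0])    # 2.0

    L = np.ones((2, 2))
    mutate(L.copy())  # pass a copy: the caller's L is untouched
    print(L[0, 0])    # 1.0

Note the distinction: an in-place operation like A *= 2 mutates the shared object, whereas rebinding (A = A * 2) creates a new array and leaves the caller's L unchanged.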

@satra (Collaborator) commented Jan 2, 2018

@mfalkiewicz - should #13 still be a PR?

@mfalkiewicz (Author)

@satra: no, I merged them both together and deleted #13.

@satra (Collaborator) commented Feb 1, 2018

Also, the tests are not passing, since the Markov computation has changed.

I would recommend creating a second Markov option with your changes, unless you think the current implementation is an error. This will help keep the API consistent, and you can create a new set of embedding tests with the alternate calculation.
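
One possible shape for that suggestion (a hypothetical signature and stand-in normalizations, not the package's actual API): keep the existing computation as the default and expose the alternate one behind an explicit option.

    import numpy as np

    def _compute_markov_matrix(L, alpha=0.5, variant='original'):
        # Hypothetical dispatch illustrating the "second option" idea;
        # both branches are simplified stand-ins, not the real computations.
        d = L.sum(axis=1)
        if variant == 'original':
            d_alpha = d ** -alpha              # current normalization
        elif variant == 'alternate':
            d_alpha = (d / d.size) ** -alpha   # alternate normalization (stand-in)
        else:
            raise ValueError('unknown variant: %r' % variant)
        A = d_alpha[:, np.newaxis] * L * d_alpha[np.newaxis, :]
        return A / A.sum(axis=1)[:, np.newaxis]  # row-normalize to a Markov matrix

    W = np.ones((4, 4))
    M = _compute_markov_matrix(W)                        # default behavior
    M2 = _compute_markov_matrix(W, variant='alternate')  # opt-in alternative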
