Why is the kmeans algorithm column-oriented instead of row-oriented? #79

In the docs (below), the `kmeans` algorithm takes a matrix in which each column `X[:, i]` corresponds to an observed sample. This goes against the idea of [tidy data](https://www.jstatsoft.org/article/view/v059i10), and it differs from both [Python's scikit-learn implementation of kmeans](http://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html) and [R's base implementation of kmeans](https://stat.ethz.ch/R-manual/R-devel/library/stats/html/kmeans.html), which expect one sample per row.

Is there a good reason for this? Should this algorithm be changed from column-oriented to row-oriented so as to be consistent with R and Python as well as with the concept of tidy data? 

URL: http://clusteringjl.readthedocs.io/en/stable/overview.html

## Inputs

A clustering algorithm, depending on its nature, may accept an input matrix in either of the following forms:

* Sample matrix X, where each column X[:,i] corresponds to an observed sample.
* Distance matrix D, where D[i,j] indicates the distance between samples i and j, or the cost of assigning one to the other.
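
To make the two input forms concrete, here is a minimal sketch in plain Python (standard library only, not the package's own API). It starts from row-oriented data, as scikit-learn and R expect, and transposes it into the column-oriented sample matrix described above. The helper names `transpose`, `sample`, and `distance_matrix` are hypothetical, and Euclidean distance is an assumption; the docs only say "distance" or "cost". Indices are 0-based here, whereas the quoted docs use 1-based `X[:,i]`.

```python
import math

# Row-oriented layout (the scikit-learn / R convention): one sample per row.
rows = [
    [1.0, 2.0],  # sample 0
    [4.0, 6.0],  # sample 1
    [1.0, 2.0],  # sample 2
]


def transpose(matrix):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*matrix)]


# Column-oriented layout (the convention in the quoted docs):
# X[d][i] is feature d of sample i, so each column is one sample.
X = transpose(rows)


def sample(X, i):
    """Return sample i, i.e. column i of the sample matrix X."""
    return [feature_row[i] for feature_row in X]


def distance_matrix(X):
    """Pairwise distances: D[i][j] is the distance between samples i and j.

    Euclidean distance is assumed here purely for illustration.
    """
    n = len(X[0])  # number of samples = number of columns
    return [
        [math.dist(sample(X, i), sample(X, j)) for j in range(n)]
        for i in range(n)
    ]


D = distance_matrix(X)
```

Converting between the two conventions is a single transpose, so the question in this issue is about which layout the API should expect by default, not about what data can be represented.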




