Commit 76da511
feat: Add MultiViewKMeans estimator for multi-feature clustering
Implements MultiViewKMeans estimator with:
- Per-view divergences (different distance measures for each view)
- Per-view weights (importance weighting)
- Combine strategies: "weighted" (default), "max", "min"
- ViewSpec case class for view configuration
- Full persistence support (save/load)
- 21 comprehensive tests
Use cases:
- Document clustering (content + metadata + citations)
- Image clustering (pixels + captions + metadata)
- Multi-modal data (text + audio + video features)
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>1 parent 8877c87 commit 76da511
File tree
4 files changed
+1489
-3
lines changed- src
- main/scala/com/massivedatascience/clusterer/ml
- test/scala/com/massivedatascience/clusterer/ml
4 files changed
+1489
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
31 | | - | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
32 | 38 | | |
33 | 39 | | |
34 | 40 | | |
35 | 41 | | |
36 | 42 | | |
37 | 43 | | |
| 44 | + | |
38 | 45 | | |
39 | 46 | | |
40 | 47 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
28 | | - | |
| 28 | + | |
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
| |||
57 | 57 | | |
58 | 58 | | |
59 | 59 | | |
60 | | - | |
| 60 | + | |
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
| |||
87 | 87 | | |
88 | 88 | | |
89 | 89 | | |
| 90 | + | |
90 | 91 | | |
91 | 92 | | |
92 | 93 | | |
| |||
0 commit comments