Question: How to determine optimal K using ADAMIXTURE?


Hi ADAMIXTURE team,

Thank you for developing and sharing this excellent tool! I’m writing to ask for some guidance regarding model selection.

## Background

In ADMIXTURE, I usually determine the optimal number of clusters (K) using cross-validation, for example:

admixture pop.bed 16 --cv=10 -j8 --seed=12345

which reports:

CV error (K=16): 0.21043

This makes it straightforward to compare different K values and select the one with the lowest CV error.

## My question

In ADAMIXTURE, I did not find a `--cv` option in the CLI. I understand that ADAMIXTURE reports log-likelihood and other metrics, but I am unsure about the recommended approach for selecting the best K.

Could you please advise:

* What is the recommended way to choose the optimal K in ADAMIXTURE?
* Is there an equivalent to ADMIXTURE’s cross-validation workflow?


If there is documentation, an example workflow, or a recommended strategy that I may have overlooked, I would greatly appreciate being pointed to it.

Thank you very much for your time and help!

Best regards,
Aaron


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: How to determine optimal K using ADAMIXTURE? #2

Background

My question

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question: How to determine optimal K using ADAMIXTURE? #2

Description

Background

My question

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions