My understanding is that combined_score is currently the most important metric, and that in some contexts the average of the other metric values is used. Analyzing the metrics in more detail might be helpful. For example, when sampling versions, it might work well to include versions with complementary metric profiles. And when deciding which versions to evict, it might be useful to look at some kind of Pareto hull, i.e. to prefer evicting versions that another version matches or beats on every metric. Is anyone already thinking along these lines? Would the maintainers be interested in PRs in this direction?
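
To make the eviction idea more concrete, here is a minimal sketch of a Pareto-based filter. It assumes each version carries a dict of metric values where higher is better; the names `dominates`, `pareto_front`, and the metric keys are hypothetical and not taken from the project's code.

```python
from typing import Dict, List

# Hypothetical representation: metric name -> value, higher is better.
Metrics = Dict[str, float]


def dominates(a: Metrics, b: Metrics) -> bool:
    """Return True if `a` is at least as good as `b` on every shared metric
    and strictly better on at least one."""
    keys = a.keys() & b.keys()
    if not keys:
        return False
    return all(a[k] >= b[k] for k in keys) and any(a[k] > b[k] for k in keys)


def pareto_front(versions: Dict[str, Metrics]) -> List[str]:
    """Return the names of versions that no other version dominates."""
    return [
        name
        for name, metrics in versions.items()
        if not any(
            dominates(other, metrics)
            for other_name, other in versions.items()
            if other_name != name
        )
    ]


# Made-up example data, for illustration only.
versions = {
    "v1": {"accuracy": 0.9, "speed": 0.2},
    "v2": {"accuracy": 0.5, "speed": 0.8},
    "v3": {"accuracy": 0.4, "speed": 0.7},  # worse than v2 on both metrics
}

front = pareto_front(versions)
# Versions off the front are the natural eviction candidates.
evictable = [name for name in versions if name not in front]
```

In this toy example, v1 and v2 trade off against each other and both stay, while v3 becomes an eviction candidate. The pairwise comparison is quadratic in the number of versions, which is probably fine for small populations but may need a smarter approach for large ones.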