A few interesting clustering ideas: * Behavioral clustering: Active Ratio, Commits/Day * Defect clustering: Files, Churn Ratio * Identify similar developers