Skip to content

Conversation

@x-tabdeveloping
Copy link
Owner

I already started working on a fighting-words based feature importance estimation method here: #75
Since then I have found a better method for measuring semantic difference between clusters based on linear classifiers.
I decided to use LinearDiscrimantAnalysis since it is orders of magnitudes faster than other methods, and it takes forever to estimate feature importance for each of the levels of the hierarchy otherwise when reducing the number of topics.

I am also planning to add supervised topic modelling based on these feature importance methods in the near future.

@x-tabdeveloping x-tabdeveloping merged commit 57e7d02 into main Jun 23, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants