Skip to content

adding semsim distribution plot analysis#188

Open
souzadevinicius wants to merge 5 commits intomainfrom
187-create-new-pheval-utils-function-to-generate-semsim-score-distribution-plots
Open

adding semsim distribution plot analysis#188
souzadevinicius wants to merge 5 commits intomainfrom
187-create-new-pheval-utils-function-to-generate-semsim-score-distribution-plots

Conversation

@souzadevinicius
Copy link
Member

@souzadevinicius souzadevinicius commented Aug 16, 2023

Fixes #187 and #180

Including semsim score distribution plots and improving the performance of data handling through the pandas to polars library replacement

Copy link
Member

@yaseminbridges yaseminbridges left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! I just had a few minor comments

semsim_score_distribution_plot(input, score_column, output)


def filter_non_0_score(data: pl.DataFrame, col: str) -> pd.DataFrame:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Have you got tests for all these other semsim util functions?

Copy link
Member Author

@souzadevinicius souzadevinicius Nov 2, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They are implement on test_cli.py file.
But have to be improved

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can see that there is a test for the CLI command in test_cli.py, what I meant is do you have tests for all the smaller methods that you are calling? Like the diff_semsim method in semsim_utils.py

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Create new pheval-utils function to generate semsim score distribution plots

2 participants