Member

@Samoed Samoed commented May 12, 2025

No description provided.

@Samoed Samoed requested review from Copilot and voorhs May 12, 2025 21:18

Copilot AI left a comment

Pull Request Overview

This PR adds a new script for comparing AutoML frameworks. It introduces evaluation functions for frameworks such as Fedot, H2O, LAMA, GAMA, and AutoGluon. The script accepts command-line arguments for the dataset and the framework, loads data with the Hugging Face datasets library, evaluates a model with the specified framework, and logs the results via wandb.
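
The flow described in the overview (parse the dataset and framework arguments, then dispatch to the matching evaluation function) could be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the `EVALUATORS` registry, the stub function bodies, and the `--dataset` / `--framework` flag names are assumptions, and data loading and wandb logging are left out.

```python
import argparse
from typing import Callable


# Hypothetical stand-ins for the per-framework evaluation functions.
# The real functions would train a model and compute metrics.
def evaluate_fedot(dataset: str) -> dict:
    return {"framework": "fedot", "dataset": dataset}


def evaluate_h2o(dataset: str) -> dict:
    return {"framework": "h2o", "dataset": dataset}


# Assumed registry mapping a framework name to its evaluation function.
EVALUATORS: dict[str, Callable[[str], dict]] = {
    "fedot": evaluate_fedot,
    "h2o": evaluate_h2o,
}


def parse_args(argv=None) -> argparse.Namespace:
    parser = argparse.ArgumentParser(description="Compare AutoML frameworks")
    parser.add_argument("--dataset", required=True, help="dataset name")
    # Restricting choices makes argparse reject unknown frameworks early.
    parser.add_argument("--framework", required=True, choices=sorted(EVALUATORS))
    return parser.parse_args(argv)


def main(argv=None) -> dict:
    args = parse_args(argv)
    # Look up the evaluation function by name and run it.
    return EVALUATORS[args.framework](args.dataset)


if __name__ == "__main__":
    print(main())
```

Keeping the functions in a dictionary means a new framework only needs one new entry, and the `choices` list stays in sync with what can actually be dispatched.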

Comments suppressed due to low confidence (1)

scripts/automl_eval.py:44

  • The function name 'evalute_fedot' appears to be misspelled. Consider renaming it to 'evaluate_fedot' for clarity and consistency.
def evalute_fedot(train_df: pd.DataFrame, test_df: pd.DataFrame):

Comment on lines +15 to +21
Load a dataset from the Hugging Face datasets library.

Args:
dataset_name (str): The name of the dataset to load.

Returns:
DatasetDict: A dictionary containing the train, validation, and test splits of the dataset.

Copilot AI May 12, 2025

The docstring for load_data specifies a return type of 'DatasetDict', but the function actually returns two DataFrames. Update the docstring to describe the returned values accurately.

Suggested change
- Load a dataset from the Hugging Face datasets library.
- Args:
- dataset_name (str): The name of the dataset to load.
- Returns:
- DatasetDict: A dictionary containing the train, validation, and test splits of the dataset.
+ Load a dataset from the Hugging Face datasets library and return it as pandas DataFrames.
+ Args:
+ dataset_name (str): The name of the dataset to load.
+ Returns:
+ tuple[pd.DataFrame, pd.DataFrame]: A tuple containing two DataFrames:
+ - The first DataFrame contains the training data.
+ - The second DataFrame contains the testing data.
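
For illustration, a function whose signature agrees with the suggested docstring might look like the sketch below. The body is an assumption, since the PR's implementation of load_data is not shown here. In particular, the split names "train" and "test" are hypothetical, and the lazy import is a choice made for this sketch only.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Only needed for the type annotation, not at runtime.
    import pandas as pd


def load_data(dataset_name: str) -> "tuple[pd.DataFrame, pd.DataFrame]":
    """Load a dataset from the Hugging Face datasets library as pandas DataFrames.

    Args:
        dataset_name (str): The name of the dataset to load.

    Returns:
        tuple[pd.DataFrame, pd.DataFrame]: The training data, then the testing data.
    """
    # Imported lazily so that importing this module does not require
    # the datasets package (a choice made for this sketch).
    from datasets import load_dataset

    dataset = load_dataset(dataset_name)
    # Assumes the dataset exposes splits named "train" and "test".
    return dataset["train"].to_pandas(), dataset["test"].to_pandas()
```

With the annotation and the docstring stating the same return type, a type checker can flag any caller that still treats the result as a DatasetDict.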

@Samoed Samoed closed this May 12, 2025
@Samoed Samoed deleted the automl_eval.py branch May 12, 2025 21:21