Commit 9b9796d

Safoora Yousefi authored
Read me updates + docstr pipeline (#170)
Co-authored-by: Safoora Yousefi <[email protected]>
1 parent 866f746 · commit 9b9796d

File tree

10 files changed (+156 −44 lines)

README.md

Lines changed: 7 additions & 7 deletions

```diff
@@ -3,18 +3,18 @@
 <a href='https://arxiv.org/abs/2409.10566'><img src=https://img.shields.io/badge/arXiv-2409.10566-b31b1b.svg></a>
 <a href='https://arxiv.org/pdf/2504.00294'><img src=https://img.shields.io/badge/arXiv-2504.00294-b31b1b.svg></a>
 <a href='https://huggingface.co/datasets/microsoft/Eureka-Bench-Logs/tree/main'><img src=https://huggingface.co/front/assets/huggingface_logo-noborder.svg width="16">Eureka Evaluation Logs</a>
-<a href='https://microsoft.github.io/eureka-ml-insights'><img src=docs/figures/github.png width="16">Project Website</a>
+<a href='https://microsoft.github.io/eureka-ml-insights'><img src=readme_docs/figures/github.png width="16">Project Website</a>
 </p>
 
 This repository contains the code for the Eureka ML Insights framework. The framework is designed to help researchers and practitioners run reproducible evaluations of generative models using a variety of benchmarks and metrics efficiently. The framework allows the user to define custom pipelines for data processing, inference, and evaluation, and provides a set of pre-defined evaluation pipelines for key benchmarks.
 
 ## 📰 News
 
-- **[2025/5/20]**: We have uploaded logs from all experiments reported in our papers on [HuggingFace](https://huggingface.co/datasets/microsoft/Eureka-Bench-Logs/tree/main).
-- **[2025/4/29]**: New blog post out [Eureka Inference-Time Scaling Insights: Where We Stand and What Lies Ahead](https://www.microsoft.com/en-us/research/articles/eureka-inference-time-scaling-insights-where-we-stand-and-what-lies-ahead/)
-- **[2025/3/31]**: We have a new paper out [Inference-Time Scaling for Complex Tasks: Where We Stand and What Lies Ahead](https://arxiv.org/abs/2504.00294)
-- **[2024/9/17]**: New blog post out [Eureka: Evaluating and understanding progress in AI](https://aka.ms/eureka-ml-insights-blog)
-- **[2024/9/17]**: New paper out [Eureka: Evaluating and Understanding Large Foundation Models](https://arxiv.org/abs/2409.10566)
+- **[2025/5/20]**: <img src=https://huggingface.co/front/assets/huggingface_logo-noborder.svg width="16"> We have uploaded logs from all experiments reported in our papers on [HuggingFace](https://huggingface.co/datasets/microsoft/Eureka-Bench-Logs/tree/main).
+- **[2025/4/29]**: <img src=readme_docs/figures/msr_blog.png width="16"> New blog post out [Eureka Inference-Time Scaling Insights: Where We Stand and What Lies Ahead](https://www.microsoft.com/en-us/research/articles/eureka-inference-time-scaling-insights-where-we-stand-and-what-lies-ahead/)
+- **[2025/3/31]**: <img src=readme_docs/figures/arxiv_logo.svg width="16"> New technical report out [Inference-Time Scaling for Complex Tasks: Where We Stand and What Lies Ahead](https://arxiv.org/abs/2504.00294)
+- **[2024/9/17]**: <img src=readme_docs/figures/msr_blog.png width="16"> New blog post out [Eureka: Evaluating and understanding progress in AI](https://aka.ms/eureka-ml-insights-blog)
+- **[2024/9/17]**: <img src=readme_docs/figures/arxiv_logo.svg width="16"> New technical report out [Eureka: Evaluating and Understanding Large Foundation Models](https://arxiv.org/abs/2409.10566)
 ## Table of Contents
 - [Eureka ML Insights Framework](#eureka-ml-insights-framework)
 - [📰 News](#-news)
@@ -98,7 +98,7 @@ The results of the experiment will be saved in a directory under `logs/FlenQA_Ex
 For other available experiment pipelines and model configurations, see the `eureka_ml_insights/user_configs` and `eureka_ml_insights/configs` directories, respectively. In [model_configs.py](eureka_ml_insights/configs/model_configs.py) you can configure the model classes to use your API keys, Key Vault urls, endpoints, and other model-specific configurations.
 
 ## 🗺️ Overview of Experiment Pipelines
-![Components](./docs/figures/transparent_uml.png)
+![Components](./readme_docs/figures/transparent_uml.png)
 Experiment pipelines define the sequence of components that are run to process data, run inference, and evaluate the model outputs. You can find examples of experiment pipeline configurations in the `user_configs` directory. To create a new experiment configuration, you need to define a class that inherits from `ExperimentConfig` and implements the `configure_pipeline` method. In the `configure_pipeline` method you define the Pipeline config (arrangement of Components) for your Experiment. Once your class is ready, add it to the `user_configs/__init__.py` import list.
```

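The README paragraph above describes the extension pattern in prose; a minimal sketch of what such a class looks like follows. The class name `MY_PIPELINE` and the empty component list are placeholders for illustration, not part of this commit (the `DOCSTR_PIPELINE` added later in this diff is a complete, real example):

```python
# Minimal sketch of the ExperimentConfig extension pattern described in the
# README excerpt above. MY_PIPELINE and its empty component list are
# placeholders; see DOCSTR_PIPELINE below for the real example in this commit.
from typing import Any

from eureka_ml_insights.configs import ExperimentConfig, ModelConfig, PipelineConfig


class MY_PIPELINE(ExperimentConfig):
    def configure_pipeline(self, model_config: ModelConfig, **kwargs: dict[str, Any]) -> PipelineConfig:
        # Arrange data processing, inference, and evaluation components here,
        # then return them as a PipelineConfig.
        return PipelineConfig([])
```
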
docs/figures/Benchmarks.png

-280 KB
Binary file not shown.

docs/figures/huggingface_logo-noborder.svg

Lines changed: 0 additions & 37 deletions
This file was deleted.
eureka_ml_insights/prompt_templates/doc_str.jinja

Lines changed: 9 additions & 0 deletions

```jinja
You are given the contents of a Python file between the <module_content> </module_content> tags. This module contains classes and functions. Your task is to write docstrings for the module, classes, and functions in Google style.
If docstrings already exist, you should rewrite them to be in Google style.
If there are no docstrings, you should write them in Google style.
For dataclasses, since there is no explicit init method, make sure to add attribute docstrings to the class docstring. Note that if a dataclass inherits from other dataclasses, make sure to add all attributes of the base classes as well as the new attributes to the docstring.
Rewrite the whole file with no changes to the code. Only the docstrings should be changed.

<module_content>
{{ file_content }}
</module_content>
```
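For context, `{{ file_content }}` is a Jinja variable that the pipeline below fills with the text of each Python file (the `file_content` column produced by `FileReaderTransform`). A minimal sketch of the substitution using the `jinja2` package directly, outside the framework; the input path is illustrative:

```python
# Minimal sketch: render the docstring prompt for a single file with jinja2.
# Inside the framework, this substitution is performed by the PromptProcessing
# component configured in the pipeline below; the module path is an example.
from jinja2 import Template

with open("eureka_ml_insights/prompt_templates/doc_str.jinja") as f:
    template = Template(f.read())

with open("some_module.py") as f:  # any Python file you want to document
    prompt = template.render(file_content=f.read())

print(prompt)
```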
Lines changed: 139 additions & 0 deletions

```python
"""
This module defines transformations for reading and writing file content
and an experiment pipeline configuration for generating docstrings.
"""

import os
from dataclasses import dataclass
from typing import Any

from eureka_ml_insights.configs import (
    DataSetConfig,
    ExperimentConfig,
    InferenceConfig,
    ModelConfig,
    PipelineConfig,
    PromptProcessingConfig,
)
from eureka_ml_insights.core import DataProcessing, Inference, PromptProcessing
from eureka_ml_insights.data_utils import (
    DataReader,
    DFTransformBase,
    MMDataLoader,
)


@dataclass
class FileReaderTransform(DFTransformBase):
    """A transformation that reads file content from a specified file path column."""

    file_path_column: str = "file_path"

    def transform(self, df):
        """Transforms the DataFrame by reading content based on file paths.

        Args:
            df (pandas.DataFrame): The input DataFrame with the column specified
                by file_path_column.

        Returns:
            pandas.DataFrame: The DataFrame with a new column 'file_content'.
        """

        def _read_file(path):
            # Use a context manager so file handles are closed promptly.
            with open(path) as f:
                return f.read()

        df["file_content"] = df[self.file_path_column].apply(_read_file)
        return df


@dataclass
class FileWriterTransform(DFTransformBase):
    """A transformation that writes file content to a specified file path."""

    file_path_column: str = "file_path"
    file_content_column: str = "model_output"

    def transform(self, df):
        """Transforms the DataFrame by writing file content to disk.

        For each row, writes the content of file_content_column to the file
        specified by file_path_column, overwriting the file in place.

        Args:
            df (pandas.DataFrame): The input DataFrame containing file path and content columns.

        Returns:
            pandas.DataFrame: The original DataFrame after writing the content to disk.
        """
        for _, row in df.iterrows():
            with open(row[self.file_path_column], "w") as f:
                f.write(row[self.file_content_column])
        return df


class DOCSTR_PIPELINE(ExperimentConfig):
    """An experiment configuration for a docstring generation pipeline."""

    def configure_pipeline(
        self, model_config: ModelConfig, resume_from: str = None, **kwargs: dict[str, Any]
    ) -> PipelineConfig:
        """Configures the pipeline components for docstring generation.

        Args:
            model_config (ModelConfig): Configuration for the model.
            resume_from (str, optional): Path to a checkpoint to resume from. Defaults to None.
            **kwargs (dict[str, Any]): Additional keyword arguments.

        Returns:
            PipelineConfig: The configured pipeline consisting of data processing,
                inference, and post-processing components.
        """
        # The input file should be a CSV file with a 'file_path' column containing
        # paths to the Python files you want to add docstrings to.
        input_file_path = kwargs.get(
            "input_file_path", os.path.join(os.path.dirname(__file__), "../python_file_list.csv")
        )

        # Read the listed files and render the docstring prompt for each one.
        data_processing_comp = PromptProcessingConfig(
            component_type=PromptProcessing,
            prompt_template_path=os.path.join(os.path.dirname(__file__), "../prompt_templates/doc_str.jinja"),
            data_reader_config=DataSetConfig(
                DataReader,
                {
                    "path": input_file_path,
                    "format": ".csv",
                    "header": 0,
                    "index_col": None,
                    "transform": FileReaderTransform(file_path_column="file_path"),
                },
            ),
            output_dir=os.path.join(self.log_dir, "data_processing_output"),
        )
        # Run model inference on the rendered prompts.
        inference_comp = InferenceConfig(
            component_type=Inference,
            model_config=model_config,
            data_loader_config=DataSetConfig(
                MMDataLoader,
                {"path": os.path.join(data_processing_comp.output_dir, "transformed_data.jsonl")},
            ),
            output_dir=os.path.join(self.log_dir, "inference_output"),
            resume_from=resume_from,
            max_concurrent=5,
        )
        # Write the model output back to the original file paths.
        post_process_comp = PromptProcessingConfig(
            component_type=DataProcessing,
            data_reader_config=DataSetConfig(
                DataReader,
                {
                    "path": os.path.join(inference_comp.output_dir, "inference_result.jsonl"),
                    "transform": FileWriterTransform(),
                },
            ),
            output_dir=os.path.join(self.log_dir, "data_post_processing_output"),
        )
        return PipelineConfig([data_processing_comp, inference_comp, post_process_comp])
```
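As the comment in `configure_pipeline` notes, the pipeline expects a CSV with a `file_path` column; the default location resolves to `eureka_ml_insights/python_file_list.csv`. A minimal sketch for producing that file (the directory scanned here is an assumption for illustration):

```python
# Minimal sketch: build the input CSV expected by DOCSTR_PIPELINE. The
# 'file_path' column name and the python_file_list.csv default location come
# from the config above; the directory scanned is an arbitrary example.
import glob

import pandas as pd

files = glob.glob("eureka_ml_insights/**/*.py", recursive=True)
pd.DataFrame({"file_path": files}).to_csv("eureka_ml_insights/python_file_list.csv", index=False)
```

Note that `FileWriterTransform` defaults to writing the `model_output` column back to the original `file_path`, so the post-processing step overwrites the listed source files in place.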

readme_docs/figures/arxiv_logo.svg

Lines changed: 1 addition & 0 deletions
4 files renamed without changes (figures moved from docs/figures/ to readme_docs/figures/).
