pytorch
diff --git a/‎.ci/scripts/benchmark_tooling/README.md‎
Lines changed: 116 additions & 25 deletions b/‎.ci/scripts/benchmark_tooling/README.md‎
Lines changed: 116 additions & 25 deletions
@@ -2,51 +2,142 @@
 
 A library providing tools for benchmarking ExecutorchBenchmark data.
 
-## Read Benchmark Data
-`get_benchmark_analysis_data.py` fetches benchmark data from HUD Open API and processes it, grouping metrics by private and public devices.
-
-### Quick Start
+## Installation
 
 Install dependencies:
 ```bash
 pip install -r requirements.txt
 ```
 
-Run with default output (CLI):
+## Tools
+
+### get_benchmark_analysis_data.py
+
+This script fetches benchmark data from HUD Open API and processes it, grouping metrics by private and public devices.
+## Quick start
+
+generates the matching_list json:
+```
+python get_benchmark_analysis_data.py get_matching_list \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T00:00:00 \
+  --category private_mv3_iphone15 \
+  --filter "include=private,mv3;"\
+  --outputType json
+```
+
+if everything looks good, generate the private csv output:
+```
+python3 get_benchmark_analysis_data.py generate_data \
+--startTime "2025-06-11T00:00:00" \
+--endTime "2025-06-17T18:00:00" \
+--private-matching-json-path "./private_mv3_iphone15.json" --outputType csv \
+--includePublic false
+```
+
+
+#### Generate Benchmark Data
+
 ```bash
-python3 .ci/scripts/benchmark_tooling/get_benchmark_analysis_data.py --startTime "2025-06-11T00:00:00" --endTime "2025-06-17T18:00:00"
+python get_benchmark_analysis_data.py generate_data \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T18:00:00
 ```
 
-Additional options:
+##### Options:
 - `--silent`: Hide processing logs, show only results
 - `--outputType df`: Display results in DataFrame format
-- `--outputType excel --outputDir "{YOUR_LOCAL_DIRECTORY}"`: Generate Excel file with multiple sheets (`res_private.xlsx` and `res_public.xlsx`)
-- `--outputType csv --outputDir "{YOUR_LOCAL_DIRECTORY}"`: Generate CSV files in folders (`private` and `public`)
+- `--outputType print`: Display results in dictionary format
+- `--outputType json --outputDir "/path/to/dir"`: Generate JSON file 'benchmark_results.json'
+- `--outputType csv --outputDir "/path/to/dir"`: Generate CSV files in folders (`private` and `public`)
 
+#### Get Matching Lists
 
-### Python API Usage
+The `get_matching_list` command allows you to filter benchmark data based on specific criteria.
 
-To use the benchmark fetcher in your own scripts:
+##### Get All Matching Lists
+```bash
+python get_benchmark_analysis_data.py get_matching_list \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T00:00:00 \
+  --category all \
+  --outputType json
+```
 
-```python
-import ExecutorchBenchmarkFetcher from benchmark_tooling.get_benchmark_analysis_data
-fetcher = ExecutorchBenchmarkFetcher()
-# Must call run first
-fetcher.run()
-private, public = fetcher.to_df()
+##### Get Private Device Matching Lists
+```bash
+python get_benchmark_analysis_data.py get_matching_list \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T00:00:00 \
+  --category private \
+  --filter "include=private;"
 ```
 
-## analyze_benchmark_stability.py
-`analyze_benchmark_stability.py` analyzes the stability of benchmark data, comparing the results of private and public devices.
+##### Get Public Device Matching Lists
+```bash
+python get_benchmark_analysis_data.py get_matching_list \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T00:00:00 \
+  --category public \
+  --filter "exclude=private;"
+```
 
-### Quick Start
-Install dependencies:
+##### Advanced Filtering Examples
+Filter for specific models and devices:
 ```bash
-pip install -r requirements.txt
+# Get all mv3 models on iPhone 15 except apple_iphone_15_plus
+python get_benchmark_analysis_data.py get_matching_list \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T00:00:00 \
+  --category private_mv3_iphone5 \
+  --filter "include=private,mv3,iphone_15;exclude=apple_iphone_15_plus"
 ```
 
+Multiple filters (using union logic):
+```bash
+# Get both mv3 and resnet50 models on iPhone 15 except apple_iphone_15_plus
+python get_benchmark_analysis_data.py get_matching_list \
+  --startTime 2025-06-11T00:00:00 \
+  --endTime 2025-06-17T00:00:00 \
+  --category private_models_iphone15 \
+  --filter "include=private,mv3,iphone_15;exclude=apple_iphone_15_plus" \
+  --filter "include=private,resnet50,iphone_15;exclude=apple_iphone_15_plus"
 ```
-python .ci/scripts/benchmark_tooling/analyze_benchmark_stability.py \
-    Benchmark\ Dataset\ with\ Private\ AWS\ Devices.xlsx \
-    --reference_file Benchmark\ Dataset\ with\ Public\ AWS\ Devices.xlsx
+
+##### Output Options
+- `--outputType json --outputDir "/path/to/dir"`: Generate JSON file '{category}.json'
+
+#### Python API Usage
+
+To use the benchmark fetcher in your own scripts:
+
+```python
+from benchmark_tooling.get_benchmark_analysis_data import ExecutorchBenchmarkFetcher
+
+# Initialize the fetcher
+fetcher = ExecutorchBenchmarkFetcher()
+
+# Fetch data for a specific time range
+fetcher.run(
+    "2025-06-11T00:00:00",
+    "2025-06-17T00:00:00",
+    private_device_matching_list,
+    public_device_matching_list
+)
+
+# Get results as DataFrames
+private_dfs, public_dfs = fetcher.toDataFrame()
+
+# Export results to Excel
+fetcher.output_data(OutputType.CSV, (output_dir="./results")
+```
+
+### analyze_benchmark_stability.py
+
+This script analyzes the stability of benchmark data, comparing the results of private and public devices.
+
+```bash
+python analyze_benchmark_stability.py \
+    "Benchmark Dataset with Private AWS Devices.xlsx" \
+    --reference_file "Benchmark Dataset with Public AWS Devices.xlsx"
 ```