Commit 0fb4cc1

Update base for Update on "[ET-VK] New Implementation of `permute` operator"
## Changes

* Introduce `permute_buffer.glsl` and `permute_texture.glsl` compute shader templates to implement the permute operator

## Motivation

The existing implementation of permute produced incorrect outputs for width packed textures. Furthermore, there was no buffer implementation for the permute operator. My goal with this diff is to introduce a more flexible implementation of permute that could work for any tensor representation.

## Performance impact

None expected.

Differential Revision: [D76483755](https://our.internmc.facebook.com/intern/diff/D76483755/)

[ghstack-poisoned]
2 parents d714d28 + 89bdd1d commit 0fb4cc1

110 files changed: +3674 -942 lines changed

.ci/scripts/benchmark_tooling/README.md

Lines changed: 4 additions & 20 deletions
@@ -77,16 +77,16 @@ python3 .ci/scripts/benchmark_tooling/analyze_benchmark_stability.py \
 
 ##### Filtering Options:
 
-- `--device-pools`: Filter by private device pool names (e.g., "samsung-galaxy-s22-5g", "samsung-galaxy-s22plus-5g")
+- `--device-pools`: Filter by device pool names (e.g., "apple_iphone_15_private", "samsung_s22_private")
 - `--backends`: Filter by specific backend names (e.g.,"xnnpack_q8")
-- `--models`: Filter by specific model names (e.g., "mv3", "meta-llama-llama-3.2-1b-instruct-qlora-int4-eo8")
+- `--models`: Filter by specific model names (e.g., "mv3", "meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8")
 
 #### Example Usage
 
 Filter by multiple private device pools and models:
 ```bash
 # This fetches all private table data for models 'llama-3.2-1B' and 'mv3'
-python3 get_benchmark_analysis_data.py \
+python3 .ci/scripts/benchmark_tooling/get_benchmark_analysis_data.py \
 --startTime "2025-06-01T00:00:00" \
 --endTime "2025-06-11T00:00:00" \
 --device-pools 'apple_iphone_15_private' 'samsung_s22_private' \
@@ -97,7 +97,7 @@ Filter by specific device pool and models:
 ```bash
 # This fetches all private iPhone table data for models 'llama-3.2-1B' and 'mv3',
 # and associated public iPhone data
-python3 get_benchmark_analysis_data.py \
+python3 .ci/scripts/benchmark_tooling/get_benchmark_analysis_data.py \
 --startTime "2025-06-01T00:00:00" \
 --endTime "2025-06-11T00:00:00" \
 --device-pools 'apple_iphone_15_private' \
@@ -140,22 +140,6 @@ fetcher.run(
 end_time="2025-06-17T18:00:00"
 )
 
-# Get results in different formats
-# As DataFrames
-df_results = fetcher.to_df()
-
-# Export to Excel
-fetcher.to_excel(output_dir="./results")
-
-# Export to CSV
-fetcher.to_csv(output_dir="./results")
-
-# Export to JSON
-json_path = fetcher.to_json(output_dir="./results")
-
-# Get raw dictionary results
-dict_results = fetcher.to_dict()
-
 # Use the output_data method for flexible output
 results = fetcher.output_data(output_type="excel", output_dir="./results")
 ```
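
Reviewer note: the README examples above pass several space-separated values to one flag (for example `--device-pools 'apple_iphone_15_private' 'samsung_s22_private'`). The script's actual argument parser is not part of this diff, so the following is only a minimal sketch of how such flags could be parsed, assuming a standard `argparse` setup. The function names `build_parser` and `parse_example` are hypothetical; only the flag names are taken from the README.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags named in the README diff."""
    parser = argparse.ArgumentParser(
        description="Sketch of the documented filtering flags (not the real script)."
    )
    # Time window, passed as ISO-like strings in the README examples.
    parser.add_argument("--startTime", required=True)
    parser.add_argument("--endTime", required=True)
    # nargs="*" lets one flag accept zero or more space-separated values.
    parser.add_argument("--device-pools", nargs="*", default=[])
    parser.add_argument("--backends", nargs="*", default=[])
    parser.add_argument("--models", nargs="*", default=[])
    return parser


def parse_example(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse an explicit argv list so the sketch can run without a shell."""
    return build_parser().parse_args(argv)


if __name__ == "__main__":
    args = parse_example(
        [
            "--startTime", "2025-06-01T00:00:00",
            "--endTime", "2025-06-11T00:00:00",
            "--device-pools", "apple_iphone_15_private", "samsung_s22_private",
            "--models", "mv3",
        ]
    )
    # argparse maps "--device-pools" to the attribute "device_pools".
    print(args.device_pools, args.models, args.backends)
```

Under this assumption, omitting a filter flag leaves its list empty, which a caller could treat as "no filtering" for that dimension.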
