You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -87,7 +87,7 @@ weco --source optimize.py \
87
87
--metric speedup \
88
88
--maximize true \
89
89
--steps 15 \
90
-
--model claude-3-7-sonnet-20250219 \
90
+
--model gemini-2.5-pro-exp-03-25 \
91
91
--additional-instructions "Fuse operations in the forward method while ensuring the max float deviation remains small. Maintain the same format of the code."
92
92
```
93
93
@@ -105,7 +105,7 @@ weco --source optimize.py \
105
105
--metric speedup \
106
106
--maximize true \
107
107
--steps 30 \
108
-
--model o3-mini \
108
+
--model gemini-2.5-pro-exp-03-25 \
109
109
--additional-instructions examples.rst
110
110
```
111
111
@@ -122,7 +122,7 @@ Given how useful causal multihead self attention is to transformers, we've seen
122
122
--metric speedup \
123
123
--maximize true \
124
124
--steps 30 \
125
-
--model gemini-2.5-pro-preview-03-25 \
125
+
--model gemini-2.5-pro-exp-03-25 \
126
126
--additional-instructions "Use triton to optimize the code while ensuring a small max float diff. Maintain the same code format."
127
127
```
128
128
@@ -135,7 +135,7 @@ Given how useful causal multihead self attention is to transformers, we've seen
0 commit comments