@@ -278,6 +278,59 @@ cd examples/mlx_metal_kernel_opt
278278python run_benchmarks.py --mode compare # Compare standard vs optimized
279279```
280280
281+ ## 🧪 ** NEW: Simple Testing Tools**
282+
283+ ### ** Quick Performance Testing**
284+
285+ We've added simple tools to easily test your optimized attention kernel:
286+
287+ #### ** 1. Verify Setup**
288+ ``` bash
289+ python verify_setup.py # Check dependencies and files
290+ ```
291+
292+ #### ** 2. Quick Demo**
293+ ``` bash
294+ python quick_demo.py # Run demo with multiple test prompts
295+ ```
296+
297+ #### ** 3. Custom Testing**
298+ ``` bash
299+ # Test with default best_program.py
300+ python test_optimized_attention.py
301+
302+ # Test with custom program
303+ python test_optimized_attention.py path/to/your/best_program.py
304+
305+ # Test with custom prompt
306+ python test_optimized_attention.py --prompt " Write a Python function:" --max-tokens 200
307+ ```
308+
309+ #### ** 4. Cleanup**
310+ ``` bash
311+ python cleanup.py # Move temporary files to temp/ directory
312+ ```
313+
314+ ### ** What These Tools Do:**
315+
316+ - ** 🔧 test_optimized_attention.py** : Monkey patches mlx-lm with your optimized attention and runs side-by-side performance comparison
317+ - ** 🚀 quick_demo.py** : Automated demo with multiple test prompts showing performance improvements
318+ - ** 🔍 verify_setup.py** : Checks dependencies, files, and setup before running tests
319+ - ** 🧹 cleanup.py** : Organizes temporary files created during testing
320+
321+ ### ** Expected Output:**
322+
323+ ```
324+ 🚀 PERFORMANCE COMPARISON:
325+ Speed Improvement: +9.8%
326+ Memory Change: -0.04 GB
327+ Time Improvement: +9.6%
328+
329+ 🎯 SIGNIFICANT IMPROVEMENT achieved!
330+ ```
331+
332+ See ` TESTING_GUIDE.md ` for detailed usage instructions.
333+
281334## 📈 ** Expected Evolution Trajectory**
282335
283336### ** Generation 1-10: Broadcasting Optimizations**
0 commit comments