Commit 33f1140

readme mp4
1 parent fd265a4 commit 33f1140

2 files changed: +23 -14 lines changed

README.md

Lines changed: 23 additions & 14 deletions
@@ -19,6 +19,11 @@ It exists to make it easy for researchers and engineers to **prototype**, **exte

 Vortex allows you to express novel sparse attention concisely while relying on an optimized execution engine.

+<video controls width="600">
+<source src="assets/demov2.0.mp4" type="video/mp4">
+Your browser does not support the video tag.
+</video>
+
 ---

 ## ✨ Key Features
@@ -48,6 +53,24 @@ pip install -e .

 ---

+## 🤖 AI-Generated Sparse Attention
+
+Vortex is designed not only for hand-crafted sparsity patterns but also for AI-generated sparse attention.
+
+Our demo shows how to use SOTA agents OpenHands (https://openhands.dev/) to generate sparse attention algorithms.
+
+```bash
+export LLM_API_KEY=YOUR_API_KEY
+python openhands_gen.py
+
+```
+
+The usage and installation guide of OpenHands can be found in https://docs.openhands.dev/sdk.
+
+Note: Some operators are not yet fused or fully optimized, which may lead to increased memory usage. Tune down the `mem_fraction_static` if CUDA OOM. This can also impact generation speed during inference.
+
+---
+
 ## 🧩 Quick Example: Custom Sparse Attention

 ```python
@@ -117,20 +140,6 @@ If `vortex_module_path` is not provided, Vortex will automatically search in

 ---

-## 🤖 AI-Generated Sparse Attention
-Vortex is designed not only for hand-crafted sparsity patterns but also for AI-generated sparse attention.
-
-Our demo shows how to use SOTA agents OpenHands (https://openhands.dev/) to generate sparse attention algorithms.
-
-```bash
-export LLM_API_KEY=YOUR_API_KEY
-python openhands_gen.py
-
-```
-
-The usage and installation guide of OpenHands can be found in https://docs.openhands.dev/sdk.
-
-Note: Some operators are not yet fused or fully optimized, which may lead to increased memory usage. Tune down the `mem_fraction_static` if CUDA OOM. This can also impact generation speed during inference.

 ## 📘 API Reference

assets/demov2.0.mp4

15.2 MB
Binary file not shown.
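
The note relocated by this commit advises lowering `mem_fraction_static` when CUDA runs out of memory. A minimal sketch of that tuning loop follows, under stated assumptions: `launch_with_backoff` and `fake_launch` are hypothetical helpers that are not part of Vortex, the candidate fractions are arbitrary, and the OOM is assumed to surface as a `RuntimeError` whose message contains "out of memory" (as PyTorch's CUDA OOM error does). How a real engine accepts the setting or reports the failure may differ.

```python
from typing import Callable, Iterable, Tuple, TypeVar

T = TypeVar("T")


def launch_with_backoff(
    launch: Callable[[float], T],
    fractions: Iterable[float] = (0.9, 0.8, 0.7, 0.6),
) -> Tuple[T, float]:
    """Call `launch(fraction)` with each candidate until one does not OOM.

    `launch` is a hypothetical callable that starts the engine with the
    given `mem_fraction_static` value and returns a handle to it.
    """
    last_error = None
    for fraction in fractions:
        try:
            return launch(fraction), fraction
        except RuntimeError as err:
            # Assumption: OOM shows up as a RuntimeError mentioning
            # "out of memory"; any other error is re-raised untouched.
            if "out of memory" not in str(err).lower():
                raise
            last_error = err
    raise RuntimeError("all candidate fractions ran out of memory") from last_error


def fake_launch(fraction: float) -> str:
    """Stand-in launcher that simulates OOM for fractions above 0.75."""
    if fraction > 0.75:
        raise RuntimeError("CUDA out of memory (simulated)")
    return f"engine(mem_fraction_static={fraction})"


engine, used = launch_with_backoff(fake_launch)
print(engine, used)
```

With the simulated launcher, the helper settles on the first candidate that fits. In practice the launcher would start the real engine with the chosen value, and a lower fraction can still affect generation speed, as the note says.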
