Commit 64f5d58

readme mp4
1 parent 33f1140 commit 64f5d58

2 files changed, with 9 additions and 5 deletions

README.md

Lines changed: 9 additions & 5 deletions
@@ -11,18 +11,22 @@ Vortex: A Flexible and Efficient Sparse Attention Framework
 </h3>
 
 <p align="center">
-<a href="https://github.com/Infini-AI-Lab/vortex_torch"><b>Documentation</b></a>
+<a href="https://github.com/Infini-AI-Lab/vortex_torch"><b>Documentation</b></a>
+&nbsp;&nbsp;|&nbsp;&nbsp;
+<a href="https://infini-ai-lab.github.io/ai-environment-architect/"><b>Blog 1</b></a>
 </p>
 
+
 Vortex is a lightweight, modular framework for building **custom sparse attention algorithms** for LLM inference.
 It exists to make it easy for researchers and engineers to **prototype**, **extend**, and **deploy** advanced sparsity patterns on modern inference backends such as SGLang—without modifying core model code.
 
 Vortex allows you to express novel sparse attention concisely while relying on an optimized execution engine.
 
-<video controls width="600">
-<source src="assets/demov2.0.mp4" type="video/mp4">
-Your browser does not support the video tag.
-</video>
+<figure>
+<img src="assets/demo.gif" alt="Demo" />
+<figcaption align="center"><em>OpenHands generate a sparse attention algorithm (up to 2.7X speedup in this example).</em></figcaption>
+</figure>
+
 
 ---
 

assets/demo.gif

7.59 MB