Skip to content

Commit f124fb0

Browse files
committed
Update docs
1 parent 4c0d1dd commit f124fb0

File tree

134 files changed

+1866
-2
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

134 files changed

+1866
-2
lines changed

_sources/deeplearning_operators/gemv.md.txt

Lines changed: 466 additions & 0 deletions
Large diffs are not rendered by default.

api/modules.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.autotuner.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.cache.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.cache.kernel_cache.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.carver.analysis.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.carver.arch.arch_base.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.carver.arch.cdna.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.carver.arch.cpu.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

api/tilelang.carver.arch.cuda.html

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,13 @@
178178
<ul>
179179
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/elementwise.html">ElementWise Operators</a></li>
180180
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html">General Matrix-Vector Multiplication (GEMV)</a></li>
181+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#triton-implementation">Triton implementation</a></li>
182+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#naive-implementation-in-tilelang">Naive Implementation in TileLang</a></li>
183+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#more-concurrency">More concurrency</a></li>
184+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#vectorized-reads">Vectorized Reads</a></li>
185+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#tvm-thread-allreduce-instead-of-atomicadd"><code class="docutils literal notranslate"><span class="pre">tvm_thread_allreduce</span></code> Instead of <code class="docutils literal notranslate"><span class="pre">atomicAdd</span></code></a></li>
186+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#autotune">Autotune</a></li>
187+
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/gemv.html#conclusion">Conclusion</a></li>
181188
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul.html">General Matrix-Matrix Multiplication with Tile Library</a></li>
182189
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/matmul_dequant.html">General Matrix-Matrix Multiplication with Dequantization</a></li>
183190
<li class="toctree-l1"><a class="reference internal" href="../deeplearning_operators/flash_attention.html">Flash Attention</a></li>

0 commit comments

Comments
 (0)