Skip to content

Commit a306a5c

Browse files
committed
post : Efficient Attention
proof
1 parent b40cf35 commit a306a5c

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

_posts/DeepLearning/Kernel Fusion/2025-03-07-fused.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -63,14 +63,14 @@ $$ y_i = \frac{e^{x_i - \max_{k=1}^V x_k}}{\sum_{j=1}^V e^{x_j - \max_{k=1}^V x_
6363

6464
**Induction**: \(V = S\)
6565

66-
1. $$ Assume \quad m_{S-1} = \max_{k=1}^{S-1} x_k $$
66+
1. $$ Assume \quad m_{S-1} = \max_{k=1}^{S-1} x_k $$
6767
$$m_S \leftarrow \max(m_{S-1}, x_S) $$
6868
$$= \max\bigl(\max_{k=1}^{S-1} x_k,\; x_S\bigr) $$
6969
$$ = \max_{k=1}^{S} x_k $$
7070

7171

7272

73-
2. $$ Assume \quad d_{S-1} = \sum_{j=1}^{S-1} e^{\,x_j - m_S} $$
73+
2. $$ Assume \quad d_{S-1} = \sum_{j=1}^{S-1} e^{\,x_j - m_S} $$
7474
$$d_S \leftarrow d_{S-1} \, e^{\,m_{S-1} - m_S} + e^{\,x_S - m_S} $$
7575
$$= \left(\sum_{j=1}^{S-1} e^{\,x_j - m_{S-1}}\right) e^{\,m_{S-1} - m_S} + e^{\,x_S - m_S} $$
7676
$$ = \sum_{j=1}^{S} e^{\,x_j - m_S} $$

0 commit comments

Comments
 (0)