Commit 3341e3b

post : Efficient Attention
proof
1 parent e2979d6 commit 3341e3b

1 file changed: +2 -2 lines changed

_posts/DeepLearning/Kernel Fusion/2025-03-07-fused.md

Lines changed: 2 additions & 2 deletions
@@ -123,8 +123,8 @@ The proof for max is omitted.
 1. $$ Assume \quad d_{j} = \sum_{k=0}^{j} e^{\,x_k - m_i} \quad d_{i} = \sum_{k=i+1}^{j+i} e^{\,x_k - m_j} \quad m_{i+j} = \max(m_i, m_j) $$
 $$d_i \oplus d_{j} = d_i \, e^{\,m_i - \max(m_i, m_j)} \;+\; d_j \, e^{\,m_j - \max(m_i, m_j)} $$
 $$= \left(\sum_{k=0}^{i} e^{\,x_k - m_{i}}\right) e^{\,m_{i} - \max(m_i, m_j)} + \left(\sum_{k=i+1}^{i+j} e^{\,x_k - m_{i}}\right) e^{\,m_{j} - \max(m_i, m_j)} $$
-$$= \left(\sum_{k=0}^{i} e^{\,x_k - m_{i}}\right) e^{\,\max(m_i, m_j)} + \left(\sum_{k=i+1}^{i+j} e^{\,x_k - m_{i}}\right) e^{\,\max(m_i, m_j)} $$
-$$ = \sum_{j=1}^{S} e^{\,x_j - m_S} $$
+$$= \left(\sum_{k=0}^{i} e^{\,x_k - m_{i}}\right) e^{\,m_{i} - m_{i+j}} + \left(\sum_{k=i+1}^{i+j} e^{\,x_k - m_{i}}\right) e^{\,m_{j} - m_{i+j}} $$
+$$ = \sum_{k=0}^{i+j} e^{\,x_k - m_{i+j}} $$
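
The identity in the added lines can be checked numerically. The sketch below assumes the first block covers $x_0,\dots,x_i$ with maximum $m_i$ and the second covers $x_{i+1},\dots,x_{i+j}$ with maximum $m_j$, so each term of the second sum is read as $e^{\,x_k - m_j}$. Under that reading every term rescales as $e^{\,x_k - m_i}\,e^{\,m_i - m_{i+j}} = e^{\,x_k - m_{i+j}}$ (and likewise with $m_j$), which is the step that collapses the two sums into the final line. The names `block_stats` and `merge`, the sample values, and the split point are hypothetical and chosen only for illustration; they are not taken from the post.

```python
import math


def block_stats(xs):
    """Return (m, d) for one block: its maximum and the sum of exp(x - m)."""
    m = max(xs)
    d = sum(math.exp(x - m) for x in xs)
    return m, d


def merge(stat_a, stat_b):
    """Combine two (m, d) pairs by rescaling each sum to the shared maximum."""
    m_a, d_a = stat_a
    m_b, d_b = stat_b
    m = max(m_a, m_b)  # m_{i+j} = max(m_i, m_j)
    d = d_a * math.exp(m_a - m) + d_b * math.exp(m_b - m)
    return m, d


# Hypothetical sample data, split into two consecutive blocks.
xs = [0.5, -1.25, 3.0, 2.0, 7.5, -4.0, 1.0]
split = 3

merged = merge(block_stats(xs[:split]), block_stats(xs[split:]))
direct = block_stats(xs)

print("merged:", merged)
print("direct:", direct)
```

If the rescaling rule holds, the merged pair should agree with the pair computed over the whole sequence, up to floating-point rounding in the sum.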