
Commit 8ddfd28

doc: Super tiny fix doc math (#1747)
1 parent 7bac369 commit 8ddfd28

File tree

1 file changed: +1 −1 lines changed


docs/tutorials/recursive_attention.rst

Lines changed: 1 addition & 1 deletion
@@ -24,7 +24,7 @@ We can also generalize the value on index :math:`i` to index set :math:`I`:
     \mathbf{v}(I) = \sum_{i\in I}\textrm{softmax}(s_i) \mathbf{v}_i = \frac{\sum_{i\in I}\exp\left(s_i\right)\mathbf{v}_i}{\exp(s(I))}
 
 The :math:`softmax` function is restricted to the index set :math:`I`. Note that :math:`\mathbf{v}(\{1,2,\cdots, n\})` is the self-attention output of the entire sequence.
-The *attention state* of the index set :math:`i` can be defined as a tuple :math:`(s(I), \mathbf{v}(I))`, then we can define a binary **merge** operator :math:`\oplus` of two attention states as ((in practice we will minus $s$ with maximum value to guarantee numerical stability and here we omit them for simplicity):
+The *attention state* of the index set :math:`I` can be defined as a tuple :math:`(s(I), \mathbf{v}(I))`, then we can define a binary **merge** operator :math:`\oplus` of two attention states as ((in practice we will minus :math:`s` with maximum value to guarantee numerical stability and here we omit them for simplicity):
 
 .. math::
 
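For context on the sentence touched by this commit, the merge operator :math:`\oplus` it describes can be sketched in a few lines of code. The following is a minimal illustration only, assuming the merge combines two attention states :math:`(s(I), \mathbf{v}(I))` and :math:`(s(J), \mathbf{v}(J))` by weighting each value with :math:`\exp(s)` and renormalizing, subtracting the larger score first for numerical stability as the changed sentence mentions; the function name merge_state and the use of PyTorch tensors are illustrative assumptions here, not part of the tutorial or the library API.

    import torch

    def merge_state(s_a: torch.Tensor, v_a: torch.Tensor,
                    s_b: torch.Tensor, v_b: torch.Tensor):
        # Hypothetical sketch of the binary merge operator on two attention
        # states (s, v); not the library's actual API.
        # Subtract the larger of the two scores before exponentiating,
        # as the tutorial's sentence notes, to keep exp() numerically stable.
        s_max = torch.maximum(s_a, s_b)
        w_a = torch.exp(s_a - s_max)  # exp(s(I) - s_max)
        w_b = torch.exp(s_b - s_max)  # exp(s(J) - s_max)
        # Re-weight and renormalize the two partial outputs.
        v = (w_a.unsqueeze(-1) * v_a + w_b.unsqueeze(-1) * v_b) / (w_a + w_b).unsqueeze(-1)
        # The merged score is the log-sum-exp of the two scores.
        s = s_max + torch.log(w_a + w_b)
        return s, v

Because a merge of this form is associative and commutative, attention over a long sequence can be computed on disjoint index subsets in any order and the partial states combined afterwards, which is the idea behind the tutorial's recursive formulation.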