vchernoy
diff --git a/‎404.html‎
Lines changed: 2 additions & 0 deletions b/‎404.html‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎content/post/binomial-modulo-prime.md‎
Lines changed: 95 additions & 0 deletions b/‎content/post/binomial-modulo-prime.md‎
Lines changed: 95 additions & 0 deletions
diff --git a/‎content/post/efficient-implementation-non-adjacent-selection.md‎
Lines changed: 3 additions & 81 deletions b/‎content/post/efficient-implementation-non-adjacent-selection.md‎
Lines changed: 3 additions & 81 deletions
diff --git a/‎content/post/two-var-recursive-func.md‎
Lines changed: 2 additions & 1 deletion b/‎content/post/two-var-recursive-func.md‎
Lines changed: 2 additions & 1 deletion
diff --git a/‎index.html‎
Lines changed: 91 additions & 19 deletions b/‎index.html‎
Lines changed: 91 additions & 19 deletions
@@ -570,6 +570,8 @@ <h2>Latest</h2>
 
       <li><a href="/post/">Posts</a></li>
 
+      <li><a href="/post/binomial-modulo-prime/">Binomial Coefficients Modulo a Prime: Fermat&#39;s Theorem and the Non-Adjacent Selection Problem</a></li>
+    
       <li><a href="/post/efficient-implementation-non-adjacent-selection/">Efficient Implementation of the Non-Adjacent Selection Formula</a></li>
 
       <li><a href="/post/two-var-recursive-func/">Cracking Multivariate Recursive Equations Using Generating Functions</a></li>
 
@@ -0,0 +1,95 @@
++++
+date = "2017-07-08T07:29:43Z"
+highlight = true
+math = true
+tags = ["math", "python", "binomial", "combinatorics", "programming", "modular arithmetic", "competitive programming"]
+title = "Binomial Coefficients Modulo a Prime: Fermat's Theorem and the Non-Adjacent Selection Problem"
+
+[header]
+  caption = ""
+  image = ""
+
++++
+
+In the [previous post][efficient-impl], we implemented the closed form $F_{n,m} = \binom{n-m+1}{m}$ using Python's `math.factorial`, and with `scipy` and `sympy`. Here we cover the common competitive-programming case: computing the answer **modulo a large prime** $M$ (e.g. $M = 10^9+7$).
+
+## Why modulo?
+
+In counting problems, the result can be huge even for moderate input. Often the problem asks for the answer modulo a big prime so that it fits in a standard integer type. We could compute the full number and then take the remainder, but that forces expensive long-integer arithmetic. Computing **everything** modulo $M$ from the start is much faster.
+
+## From binomials to modular inverses
+
+We have $F_{n,m} = \binom{n-m+1}{m} = \frac{(n-m+1)!}{m!\,(n-2m+1)!}$. To compute this mod $M$, we need factorials mod $M$ and division mod $M$. Division mod $M$ is multiplication by the **modular inverse**: for prime $M$ and $0 < x < M$, the inverse of $x$ is $x^{M-2} \bmod M$ by [Fermat's little theorem](https://en.wikipedia.org/wiki/Fermat%27s_little_theorem). In Python we can use `pow(x, M - 2, M)`.
+
+## Implementation
+
+```python
+import functools
+
+M = 10**9 + 7
+
+def f_binom_mod(n, m):
+    assert n >= 0 and m >= 0
+
+    if n + 1 < 2*m:
+        return 0
+
+    return binom_mod(n - m + 1, m)
+
+def binom_mod(n, m):
+    assert 0 <= m <= n
+
+    return ((fact_mod(n) * inv_mod(fact_mod(m))) % M * inv_mod(fact_mod(n - m))) % M
+
+@functools.lru_cache(maxsize=None)
+def fact_mod(m):
+    if m <= 1:
+        return 1
+
+    return (m * fact_mod(m - 1)) % M
+
+def inv_mod(x):
+    return pow(x, M - 2, M)
+```
+
+All operations stay in the ring of integers mod $M$. The only non-obvious part is modular division: we replace division by $d$ with multiplication by `inv_mod(d)` using Fermat's little theorem.
+
+## Benchmarks
+
+Compared to computing the full binomial and then taking the remainder, the modular version avoids long arithmetic and is much faster:
+
+```python
+fact_mod(10000)  # for caching factorials
+
+funcs = [f_binom_mod, f_binom, f_sci, f_sym]
+
+test(10000, 1000, funcs)
+test(10000, 2000, funcs)
+test(10000, 3000, funcs)
+```
+
+Example output:
+
+```
+f(10000,1000): 450169549
+  f_binom_mod:   0.0000 sec, x 1.00
+      f_binom:   0.0073 sec, x 337.60
+        f_sci:   0.0011 sec, x 49.33
+        f_sym:   0.0076 sec, x 353.22
+
+f(10000,2000): 75198348
+  f_binom_mod:   0.0000 sec, x 1.00
+      f_binom:   0.0063 sec, x 368.94
+        f_sci:   0.0026 sec, x 153.33
+        f_sym:   0.0053 sec, x 308.93
+
+f(10000,3000): 679286557
+  f_binom_mod:   0.0000 sec, x 1.00
+      f_binom:   0.0060 sec, x 361.12
+        f_sci:   0.0056 sec, x 338.13
+        f_sym:   0.0053 sec, x 319.02
+```
+
+The same pattern—factorials mod $M$ plus Fermat-based inverses—works for any combinatorial formula that can be written in terms of factorials and binomials modulo a prime.
+
+[efficient-impl]: /post/efficient-implementation-non-adjacent-selection/
@@ -15,7 +15,7 @@ In the [previous post][two-var-recursive], we derived the closed form for the no
 
 $$ F_{n, m} = {n - m + 1 \choose m} $$
 
-Now we discuss how to implement this efficiently in Python—from a simple factorial-based solution to library implementations and modular arithmetic for competitive programming.
+Now we discuss how to implement this efficiently in Python—from a simple factorial-based solution to library implementations. For the common case of computing the answer **modulo a large prime** (e.g. in competitive programming), see the [next post][binom-mod].
 
 ## Fast Solutions Based on Binomials
 
@@ -142,86 +142,8 @@ You can play with running tests on different $n$ and $m$.
 What I saw that actually there is no clear winner between the last 3 implementations.
 Probably, the most of the time is spent on the long arithmetic computation.
 
-## Modular Arithmetics
-
-In questions where it is required to count some objects, not rarely the answer might be very big even on very small input.
-In such case, typically it is asked to print the answer modulo some big prime integer, let's say, $M=1000^3+7$.
-Since Python has built-in long arithmetics, we can apply modulo on the final result,
-but executing the entire algorithm with long arithmetics while knowing that only small part of it is really important is very costly,
-and of course, not that efficient.
-
-Let's look, briefly, at very simple change we can do for `f_binom` function that will speed up the computation significantly:
-
-```python
-import functools
-
-M = 10**9 + 7
-
-def f_binom_mod(n, m):
-    assert n >= 0 and m >= 0
-
-    if n + 1 < 2*m:
-        return 0
-
-    return binom_mod(n - m + 1, m)
-
-def binom_mod(n, m):
-    assert 0 <= m <= n
-
-    return ((fact_mod(n) * inv_mod(fact_mod(m))) % M * inv_mod(fact_mod(n - m))) % M
-
-@functools.lru_cache(maxsize=None)
-def fact_mod(m):
-    if m <= 1:
-        return 1
-
-    return (m * fact_mod(m - 1)) % M
-
-def inv_mod(x):
-    return pow(x, M - 2, M)
-```
-
-As we can see, all the operations are computed modulo $M$.
-The function `fact_mod` is recursive but uses Memoization.
-The most tricky part is how to implement modular-division.
-From [Fermat's little theorem](https://en.wikipedia.org/wiki/Fermat%27s_little_theorem),
-we know that if $M$ is prime and $0 < x < M$, then $x^{-1} \equiv x^{M-2} \pmod M$.
-This allows to compute the multiplicative inverse of $x$ using the Python's built-in function
-[pow](https://docs.python.org/3/library/functions.html#pow).
-
-Let's test the new approach against other implementations:
-
-```python
-fact_mod(10000) # for caching factorials
-
-funcs = [f_binom_mod, f_binom, f_sci, f_sym]
-
-test(10000, 1000, funcs)
-test(10000, 2000, funcs)
-test(10000, 3000, funcs)
-```
-
-It is not a surprise that taking the benefits of modular computations results in the huge speedup in running-time:
-
-```
-f(10000,1000): 450169549
-  f_binom_mod:   0.0000 sec, x 1.00
-      f_binom:   0.0073 sec, x 337.60
-        f_sci:   0.0011 sec, x 49.33
-        f_sym:   0.0076 sec, x 353.22
-
-f(10000,2000): 75198348
-  f_binom_mod:   0.0000 sec, x 1.00
-      f_binom:   0.0063 sec, x 368.94
-        f_sci:   0.0026 sec, x 153.33
-        f_sym:   0.0053 sec, x 308.93
-
-f(10000,3000): 679286557
-  f_binom_mod:   0.0000 sec, x 1.00
-      f_binom:   0.0060 sec, x 361.12
-        f_sci:   0.0056 sec, x 338.13
-        f_sym:   0.0053 sec, x 319.02
-```
+When the problem asks for the answer **modulo a large prime** (e.g. $10^9+7$), computing everything mod $M$ from the start is much faster than using long integers. We cover that in a [separate post][binom-mod]: binomial coefficients modulo a prime using Fermat's little theorem.
 
 [intro-to-dp]: /post/intro-to-dp/
 [two-var-recursive]: /post/two-var-recursive-func/
+[binom-mod]: /post/binomial-modulo-prime/
@@ -81,9 +81,10 @@ Which actually equals to
 
 $$ F\_{n, m} = {n - m + 1 \choose m} $$
 
-In the [next post][efficient-impl], we discuss how to implement this closed form efficiently in Python—from a simple factorial-based solution to library implementations and modular arithmetic for competitive programming.
+In the [next post][efficient-impl], we implement this closed form in Python (factorial-based and with scipy/sympy). For computing the answer modulo a large prime, see [Binomial Coefficients Modulo a Prime][binom-mod].
 
 [intro-to-dp]: /post/intro-to-dp/
 [gen-func-art]: /post/gen-func-art/
 [efficient-impl]: /post/efficient-implementation-non-adjacent-selection/
+[binom-mod]: /post/binomial-modulo-prime/
 
@@ -970,6 +970,95 @@ <h1 class="mb-0">Recent Posts</h1>
 
 
 
+
+<div class="media stream-item view-compact">
+  <div class="media-body">
+
+    <div class="section-subheading article-title mb-0 mt-0">
+      <a href="/post/binomial-modulo-prime/" >Binomial Coefficients Modulo a Prime: Fermat&#39;s Theorem and the Non-Adjacent Selection Problem</a>
+    </div>
+
+    
+    <a href="/post/binomial-modulo-prime/"  class="summary-link">
+      <div class="article-style">
+        <p>In the <a href="/post/efficient-implementation-non-adjacent-selection/">previous post</a>, we implemented the closed form $F_{n,m} = \binom{n-m+1}{m}$ using Python&rsquo;s <code>math.factorial</code>, and with <code>scipy</code> and <code>sympy</code>. Here we cover the common competitive-programming case: computing the answer <strong>modulo a large prime</strong> $M$ (e.g. $M = 10^9+7$).</p>
+<h2 id="why-modulo">Why modulo?</h2>
+<p>In counting problems, the result can be huge even for moderate input. Often the problem asks for the answer modulo a big prime so that it fits in a standard integer type. We could compute the full number and then take the remainder, but that forces expensive long-integer arithmetic. Computing <strong>everything</strong> modulo $M$ from the start is much faster.</p>
+      </div>
+    </a>
+    
+
+    <div class="stream-meta article-metadata">
+
+      
+
+      
+        
+
+
+<div class="article-metadata">
+
+  
+  
+
+  
+  <span class="article-date">
+    
+    
+      
+    
+    Jul 8, 2017
+  </span>
+  
+
+  
+
+  
+  <span class="middot-divider"></span>
+  <span class="article-reading-time">
+    2 min read
+  </span>
+  
+
+  
+  
+  
+  
+
+  
+  
+
+</div>
+
+      
+    </div>
+
+    
+
+  </div>
+  <div class="ml-3">
+    
+    
+  </div>
+</div>
+
+  
+    
+
+
+
+
+
+
+
+
+
+
+
+
+
+  
+
 
 <div class="media stream-item view-compact">
   <div class="media-body">
@@ -983,26 +1072,9 @@ <h1 class="mb-0">Recent Posts</h1>
       <div class="article-style">
         <p>In the <a href="/post/two-var-recursive-func/">previous post</a>, we derived the closed form for the non-adjacent selection problem:</p>
 <p>$$ F_{n, m} = {n - m + 1 \choose m} $$</p>
-<p>Now we discuss how to implement this efficiently in Python—from a simple factorial-based solution to library implementations and modular arithmetic for competitive programming.</p>
+<p>Now we discuss how to implement this efficiently in Python—from a simple factorial-based solution to library implementations. For the common case of computing the answer <strong>modulo a large prime</strong> (e.g. in competitive programming), see the <a href="/post/binomial-modulo-prime/">next post</a>.</p>
 <h2 id="fast-solutions-based-on-binomials">Fast Solutions Based on Binomials</h2>
 <p>We can reflect the closed form in very trivial Python code:</p>
-<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-Python" data-lang="Python"><span style="display:flex;"><span><span style="color:#f92672">import</span> math
-</span></span><span style="display:flex;"><span>
-</span></span><span style="display:flex;"><span><span style="color:#66d9ef">def</span> <span style="color:#a6e22e">f_binom</span>(n, m):
-</span></span><span style="display:flex;"><span>    <span style="color:#66d9ef">assert</span> n <span style="color:#f92672">&gt;=</span> <span style="color:#ae81ff">0</span> <span style="color:#f92672">and</span> m <span style="color:#f92672">&gt;=</span> <span style="color:#ae81ff">0</span>
-</span></span><span style="display:flex;"><span>
-</span></span><span style="display:flex;"><span>    <span style="color:#66d9ef">if</span> n <span style="color:#f92672">+</span> <span style="color:#ae81ff">1</span> <span style="color:#f92672">&lt;</span> <span style="color:#ae81ff">2</span><span style="color:#f92672">*</span>m:
-</span></span><span style="display:flex;"><span>        <span style="color:#66d9ef">return</span> <span style="color:#ae81ff">0</span>
-</span></span><span style="display:flex;"><span>
-</span></span><span style="display:flex;"><span>    <span style="color:#66d9ef">return</span> binom(n <span style="color:#f92672">-</span> m <span style="color:#f92672">+</span> <span style="color:#ae81ff">1</span>, m)
-</span></span><span style="display:flex;"><span>
-</span></span><span style="display:flex;"><span><span style="color:#66d9ef">def</span> <span style="color:#a6e22e">binom</span>(n, m):
-</span></span><span style="display:flex;"><span>    <span style="color:#66d9ef">assert</span> <span style="color:#ae81ff">0</span> <span style="color:#f92672">&lt;=</span> m <span style="color:#f92672">&lt;=</span> n
-</span></span><span style="display:flex;"><span>
-</span></span><span style="display:flex;"><span>    <span style="color:#66d9ef">return</span> math<span style="color:#f92672">.</span>factorial(n) <span style="color:#f92672">//</span> math<span style="color:#f92672">.</span>factorial(m) <span style="color:#f92672">//</span> math<span style="color:#f92672">.</span>factorial(n <span style="color:#f92672">-</span> m)
-</span></span></code></pre></div><p>This implementation overperforms significantly the initial DP and memoization solutions from <a href="/post/intro-to-dp/">Introduction to Dynamic Programming and Memoization</a>.
-A naive implementation of <code>math.factorial()</code> might make $n$ multiplications.
-This could still be faster than doing $\Theta(n)$ additions in DP approach.</p>
       </div>
     </a>
 
@@ -1035,7 +1107,7 @@ <h2 id="fast-solutions-based-on-binomials">Fast Solutions Based on Binomials</h2
 
   <span class="middot-divider"></span>
   <span class="article-reading-time">
-    6 min read
+    4 min read
   </span>