Commit d1484ac

Added blue noise section and code
1 parent dc29c88 commit d1484ac

File tree

9 files changed: 419 additions & 84 deletions


01-preface.rst

Lines changed: 10 additions & 9 deletions
@@ -91,21 +91,22 @@ How to contribute

 If you want to contribute to this book, you can:

 * Review chapters (please contact me)
-* Report issues (https://github.com/rougier/numpy-book/issues)
-* Suggest improvements (https://github.com/rougier/numpy-book/pulls)
-* Correct English (https://github.com/rougier/numpy-book/issues)
+* Report issues (https://github.com/rougier/from-python-to-numpy/issues)
+* Suggest improvements (https://github.com/rougier/from-python-to-numpy/pulls)
+* Correct English (https://github.com/rougier/from-python-to-numpy/issues)
 * Design a better and more responsive html template for the book.
-
+* Star the project (https://github.com/rougier/from-python-to-numpy)

 Publishing
 ++++++++++

 If you're an editor interested in publishing this book, you can contact me if
-you agree to have this open access version online, you know how to deal with
-`restructured text <http://docutils.sourceforge.net/rst.html>`_ (Word is not an
-option), you provide a real added-value as well as supporting services, and
-more importantly, you have a truly amazing latex book template (and be warned
-that I'm a bit picky about typography & design: E.Tufte is my hero).
+you agree to have this version and all subsequent versions open access
+(i.e. online), you know how to deal with `restructured text
+<http://docutils.sourceforge.net/rst.html>`_ (Word is not an option), you
+provide real added value as well as supporting services, and more importantly,
+you have a truly amazing LaTeX book template (and be warned that I'm a bit
+picky about typography & design: E. Tufte is my hero).

 Still here?

05-problem-vectorization.rst

Lines changed: 129 additions & 10 deletions
@@ -333,6 +333,13 @@ illustrated below (reading from left to right, top to bottom). Once this is
 done, we can ascend the gradient from the starting node. You can check on the
 figure that this leads to the shortest path.

+.. admonition:: **Figure**
+   :class: legend
+
+   Value iteration algorithm on a simple maze. Once the entrance has been
+   reached, it is easy to find the shortest path by ascending the value
+   gradient.
+

 .. image:: data/value-iteration-1.pdf
    :width: 19%
@@ -561,13 +568,21 @@ References
 * `Animating Sand as a Fluid <https://www.cs.ubc.ca/%7Erbridson/docs/zhu-siggraph05-sandfluid.pdf>`_, Yongning Zhu & Robert Bridson, 2005.


-Blue noise
-----------
+Blue noise sampling
+-------------------
+
+Blue noise refers to sample sets that have a random yet uniform distribution
+with no spectral bias. Such noise is very useful in a variety of graphics
+applications such as rendering, dithering or stippling. Many different methods
+have been proposed to achieve such noise, the simplest of which is certainly
+the DART method.

 .. admonition:: **Figure 10**
    :class: legend

-   Detail of "The Starry Night", Vincent van Gogh, 1889.
+   Detail of "The Starry Night", Vincent van Gogh, 1889. The detail has been
+   resampled using Voronoi cells whose centers form a blue noise sample set.

 .. image:: data/mosaic.png
    :width: 100%
@@ -577,23 +592,117 @@ Blue noise
 DART method
 +++++++++++

-Numpy implementation
-++++++++++++++++++++
+The DART method is one of the earliest and simplest methods. It works by
+sequentially drawing uniform random points and only accepting those that lie
+at a minimum distance from every previously accepted sample. This sequential
+method is therefore extremely slow because each new candidate needs to be
+tested against all previously accepted candidates. The more points you
+accept, the slower the method becomes. Let's consider the unit surface and a
+minimum radius `r` to be enforced between any two points.
+
+Knowing that the densest packing of circles in the plane is the hexagonal
+lattice of the bee's honeycomb, we know this density is :math:`d =
+\frac{1}{6}\pi\sqrt{3}` (in fact `I learned it
+<https://en.wikipedia.org/wiki/Circle_packing>`_ while writing this book).
+Considering circles with radius r, we can pack at most :math:`\frac{d}{\pi r^2}
+= \frac{\sqrt{3}}{6r^2} = \frac{1}{2r^2\sqrt{3}}` of them. We know the
+theoretical upper limit for the number of discs we can pack onto the surface,
+but we'll likely not reach this upper limit because of random placements.
+Furthermore, because a lot of points will be rejected after a few have been
+accepted, we need to set a limit on the number of successive failed trials
+before we stop the whole process.

-.. admonition:: **Figure 11**
-   :class: legend

-   Comparison of uniform, grid-jittered and Poisson disc sampling.

+.. code:: python
+
+   import math
+   import random
+
+   def DART_sampling(width=1.0, height=1.0, r=0.025, k=100):
+       def distance(p0, p1):
+           dx, dy = p0[0]-p1[0], p0[1]-p1[1]
+           return math.hypot(dx, dy)
+
+       points = []
+       i = 0
+       last_success = 0
+       while True:
+           # Draw a candidate uniformly over the domain...
+           x = random.uniform(0, width)
+           y = random.uniform(0, height)
+           # ...and test it against every accepted point so far.
+           accept = True
+           for p in points:
+               if distance(p, (x, y)) < r:
+                   accept = False
+                   break
+           if accept is True:
+               points.append((x, y))
+               # Stop once an acceptance required more than k trials
+               # since the previous one.
+               if i-last_success > k:
+                   break
+               last_success = i
+           i += 1
+       return points
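As a quick sanity check, the number of accepted samples can be compared with
the theoretical limit derived above; a minimal sketch, assuming the
`DART_sampling` function defined just above (variable names are illustrative
only):

.. code:: python

   import math

   r = 0.025
   # Theoretical upper bound on the unit square: 1/(2*r**2*sqrt(3)),
   # i.e. roughly 462 discs for r = 0.025.
   bound = 1 / (2 * r * r * math.sqrt(3))
   points = DART_sampling(r=r, k=100)
   print(len(points), "samples accepted, theoretical maximum", int(bound))

Because of the random placement and the early-exit criterion, the accepted
count typically stays well below this bound, which is part of what motivates
the Bridson method below.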
+I leave the vectorization of the DART method as an exercise. The idea is to
+pre-compute enough uniform random samples as well as their paired distances,
+and to test for their sequential inclusion.
+
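One possible reading of this idea is sketched below; the function name
`DART_sampling_numpy`, the candidate count `n` and the use of scipy's `cdist`
are assumptions, and the DART-sampling-numpy.py file listed in the sources is
the actual solution:

.. code:: python

   import numpy as np
   from scipy.spatial.distance import cdist

   def DART_sampling_numpy(width=1.0, height=1.0, r=0.025, n=2000):
       # Pre-compute n uniform random candidates at once...
       P = np.random.uniform(0, 1, (n, 2)) * (width, height)
       # ...as well as all their paired distances.
       D = cdist(P, P)
       # Sequentially keep candidates lying at distance >= r from
       # every previously accepted one.
       accepted = []
       for i in range(n):
           if not accepted or (D[i, accepted] >= r).all():
               accepted.append(i)
       return P[accepted]

Note that the distance matrix is n×n, so memory bounds how many candidates can
be pre-computed in a single batch.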
+
+Bridson method
+++++++++++++++
+
+If the vectorization of the previous method poses no real difficulty, the
+speed improvement is not so good and the quality remains low and dependent on
+the `k` parameter: the higher it is, the better, since it basically governs
+how hard we try to insert a new sample. But when there is already a large
+number of accepted samples, only chance allows us to find a position where a
+new sample can be inserted. We could increase the `k` value, but this would
+make the method even slower without any guarantee in quality. It's time to
+think out of the box and, luckily enough, Robert Bridson did that for us and
+proposed a simple yet efficient method:
+
+**Step 0**. *Initialize an n-dimensional background grid for storing samples
+and accelerating spatial searches. We pick the cell size to be bounded by
+r/√n, so that each grid cell will contain at most one sample, and thus the
+grid can be implemented as a simple n-dimensional array of integers: the
+default −1 indicates no sample, a non-negative integer gives the index of the
+sample located in a cell.*
+
+**Step 1**. *Select the initial sample, x0, randomly chosen uniformly from the
+domain. Insert it into the background grid, and initialize the “active list”
+(an array of sample indices) with this index (zero).*
+
+**Step 2**. *While the active list is not empty, choose a random index from it
+(say i). Generate up to k points chosen uniformly from the spherical annulus
+between radius r and 2r around xi. For each point in turn, check if it is
+within distance r of existing samples (using the background grid to only test
+nearby samples). If a point is adequately far from existing samples, emit it
+as the next sample and add it to the active list. If after k attempts no such
+point is found, instead remove i from the active list.*
+
+Implementation poses no real problem and is left as an exercise for the
+reader. Note that not only is this method fast, it also offers a better
+quality (more samples) than the DART method, even with a high `k` parameter.
+
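A minimal, non-vectorized 2-D sketch of these three steps could look as
follows; the function name and parameter defaults are assumptions, and sample
coordinates are stored directly in the grid cells rather than integer indices
(the Bridson-sampling.py file listed in the sources is the actual solution):

.. code:: python

   import math
   import random

   def Bridson_sampling(width=1.0, height=1.0, r=0.025, k=30):
       # Step 0: background grid with cell size r/sqrt(2) so that each
       # cell holds at most one sample (None means "no sample yet").
       cellsize = r / math.sqrt(2)
       cols = int(math.ceil(width / cellsize))
       rows = int(math.ceil(height / cellsize))
       grid = [[None] * cols for _ in range(rows)]

       def grid_coords(p):
           return (min(int(p[1] / cellsize), rows - 1),
                   min(int(p[0] / cellsize), cols - 1))

       def fits(p):
           # Test p only against samples stored in nearby cells.
           row, col = grid_coords(p)
           for i in range(max(row - 2, 0), min(row + 3, rows)):
               for j in range(max(col - 2, 0), min(col + 3, cols)):
                   q = grid[i][j]
                   if q is not None and math.hypot(p[0]-q[0], p[1]-q[1]) < r:
                       return False
           return True

       # Step 1: initial sample, chosen uniformly from the domain.
       p0 = (random.uniform(0, width), random.uniform(0, height))
       samples = [p0]
       active = [0]
       row, col = grid_coords(p0)
       grid[row][col] = p0

       # Step 2: grow the sample set from the active list.
       while active:
           index = random.choice(active)
           base = samples[index]
           for _ in range(k):
               # Candidate in the annulus between r and 2r (radius-uniform draw).
               theta = random.uniform(0, 2 * math.pi)
               rho = random.uniform(r, 2 * r)
               p = (base[0] + rho * math.cos(theta),
                    base[1] + rho * math.sin(theta))
               if 0 <= p[0] < width and 0 <= p[1] < height and fits(p):
                   samples.append(p)
                   active.append(len(samples) - 1)
                   row, col = grid_coords(p)
                   grid[row][col] = p
                   break
           else:
               # k failed attempts: retire this sample from the active list.
               active.remove(index)
       return samples

For the same radius, this typically yields noticeably more samples than the
DART version, in line with the quality remark above.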
+.. admonition:: **Figure**
+   :class: legend

+   Comparison of uniform, grid-jittered and Bridson sampling.

 .. image:: data/sampling.png
    :width: 100%

+
+
 Sources
 +++++++

-* `sampling.py <code/sampling.py>`_
+* `DART-sampling-python.py <code/DART-sampling-python.py>`_
+* `DART-sampling-numpy.py <code/DART-sampling-numpy.py>`_ (solution to the exercise)
+* `Bridson-sampling.py <code/Bridson-sampling.py>`_ (solution to the exercise)
+* `sampling.py <code/sampling.py>`_
 * `mosaic.py <code/mosaic.py>`_
 * `voronoi.py <code/voronoi.py>`_

@@ -606,9 +715,19 @@ References
   Jose Esteve, 2012.
 * `Poisson Disk Sampling <http://devmag.org.za/2009/05/03/poisson-disk-sampling/>`_
   Herman Tulleken, 2009.
-* `Fast Poisson Disk Sampling in Arbitrary Dimensions <http://www.cs.ubc.ca/~rbridson/docs/bridson-siggraph07-poissondisk.pdf>`_
+* `Fast Poisson Disk Sampling in Arbitrary Dimensions <http://www.cs.ubc.ca/~rbridson/docs/bridson-siggraph07-poissondisk.pdf>`_,
   Robert Bridson, SIGGRAPH, 2007.


 Conclusion
 ----------
+
+The last example we've been studying is indeed a nice example where it is more
+important to vectorize the problem than to vectorize the code (and too early).
+In this specific case we were lucky enough to have the work done for us, but
+it won't always be the case, and the temptation might then be high to
+vectorize the first solution we've found. I hope you're now convinced that it
+is generally a good idea to look for alternative solutions once you've found
+one. You'll (almost) always improve speed by vectorizing your code, but in the
+process, you may miss huge improvements.

07-beyond-numpy.rst

Lines changed: 63 additions & 58 deletions
Original file line numberDiff line numberDiff line change
@@ -127,8 +127,8 @@ important to wonder if you need an actual instance of your result or if a
127127
simple generator might do the job.
128128

129129

130-
Friends of Numpy
131-
----------------
130+
Numpy & co
131+
----------
132132

133133
Beyond numpy, there are several other Python packages that are worth a look
134134
because they address similar yet different class of problems using different
@@ -306,72 +306,77 @@ and other massively parallel compute devices from Python.
306306
307307
308308
309-
..
310-
Friends of Scipy
311-
----------------
312-
313-
Here is a very short list of packages that are well-maintained, well tested and
314-
may simplify your scientific life (depending on your domain). There are of
315-
course many more and depending on your specific needs, chances are you do not
316-
have to program everything by yourself. But it is a good exercise if you have
317-
some spare time. For an extensive list, have a look at the `Awesome python list
318-
<https://awesome-python.com>`_.
319-
320-
scikit-learn
321-
++++++++++++
322-
323-
`scikit-learn <http://scikit-learn.org/stable/>`_ is a free software machine
324-
learning library for the Python programming language. It features various
325-
classification, regression and clustering algorithms including support vector
326-
machines, random forests, gradient boosting, k-means and DBSCAN, and is
327-
designed to interoperate with the Python numerical and scientific libraries
328-
NumPy and SciPy.
329-
330-
331-
scikit-image
332-
++++++++++++
333-
334-
`scikit-image <http://scikit-image.org>`_ is a Python package dedicated to
335-
image processing, and using natively NumPy arrays as image objects. This
336-
chapter describes how to use scikit-image on various image processing tasks,
337-
and insists on the link with other scientific Python modules such as NumPy and
338-
SciPy.
309+
Scipy & co
310+
----------
339311

340-
SympPy
341-
++++++
312+
If there are several additional packages for Numpy, there is a trillion
313+
additional packages for scipy. In fact, every domain of science probably has
314+
its own package and most of the examples we've been studying until now could
315+
have been solved in two or three calls to a method in the relevant package.
316+
But of course, it was not the goal an programming things yourself is generally
317+
a good exercise if you have some spare time. The biggest difficulty at this
318+
point is to find these relevant packages. Here is a very short list of packages
319+
that are well-maintained, well tested and may simplify your scientific life
320+
(depending on your domain). There are of course many more and depending on your
321+
specific needs, chances are you do not have to program everything by
322+
yourself. For an extensive list, have a look at the `Awesome python list
323+
<https://awesome-python.com>`_.
324+
325+
scikit-learn
326+
++++++++++++
327+
328+
`scikit-learn <http://scikit-learn.org/stable/>`_ is a free software machine
329+
learning library for the Python programming language. It features various
330+
classification, regression and clustering algorithms including support vector
331+
machines, random forests, gradient boosting, k-means and DBSCAN, and is
332+
designed to interoperate with the Python numerical and scientific libraries
333+
NumPy and SciPy.
334+
335+
336+
scikit-image
337+
++++++++++++
338+
339+
`scikit-image <http://scikit-image.org>`_ is a Python package dedicated to
340+
image processing, and using natively NumPy arrays as image objects. This
341+
chapter describes how to use scikit-image on various image processing tasks,
342+
and insists on the link with other scientific Python modules such as NumPy and
343+
SciPy.
344+
345+
SympPy
346+
++++++
342347

343-
`SymPy <http://www.sympy.org/en/index.html>`_ is a Python library for symbolic
344-
mathematics. It aims to become a full-featured computer algebra system (CAS)
345-
while keeping the code as simple as possible in order to be comprehensible and
346-
easily extensible. SymPy is written entirely in Python.
348+
`SymPy <http://www.sympy.org/en/index.html>`_ is a Python library for symbolic
349+
mathematics. It aims to become a full-featured computer algebra system (CAS)
350+
while keeping the code as simple as possible in order to be comprehensible and
351+
easily extensible. SymPy is written entirely in Python.
347352

348-
Astropy
349-
+++++++
353+
Astropy
354+
+++++++
350355

351-
The `Astropy <http://www.astropy.org>`_ project is a community effort to
352-
develop a single core package for astronomy in Python and foster
353-
interoperability between Python astronomy packages.
356+
The `Astropy <http://www.astropy.org>`_ project is a community effort to
357+
develop a single core package for astronomy in Python and foster
358+
interoperability between Python astronomy packages.
354359

355360

356-
Cartopy
357-
+++++++
361+
Cartopy
362+
+++++++
358363

359-
`Cartopy <http://scitools.org.uk/cartopy/>`_ is a Python package designed to
360-
make drawing maps for data analysis and visualisation as easy as
361-
possible. Cartopy makes use of the powerful PROJ.4, numpy and shapely libraries
362-
and has a simple and intuitive drawing interface to matplotlib for creating
363-
publication quality maps.
364+
`Cartopy <http://scitools.org.uk/cartopy/>`_ is a Python package designed to
365+
make drawing maps for data analysis and visualisation as easy as
366+
possible. Cartopy makes use of the powerful PROJ.4, numpy and shapely libraries
367+
and has a simple and intuitive drawing interface to matplotlib for creating
368+
publication quality maps.
364369

365370

366-
Brian
367-
+++++
371+
Brian
372+
+++++
368373

369-
`Brian <http://www.briansimulator.org>`_ is a free, open source simulator for
370-
spiking neural networks. It is written in the Python programming language and
371-
is available on almost all platforms. We believe that a simulator should not
372-
only save the time of processors, but also the time of scientists. Brian is
373-
therefore designed to be easy to learn and use, highly flexible and easily
374-
extensible.
374+
`Brian <http://www.briansimulator.org>`_ is a free, open source simulator for
375+
spiking neural networks. It is written in the Python programming language and
376+
is available on almost all platforms. We believe that a simulator should not
377+
only save the time of processors, but also the time of scientists. Brian is
378+
therefore designed to be easy to learn and use, highly flexible and easily
379+
extensible.
375380

376381

377382
Conclusion

08-conclusion.rst

Lines changed: 22 additions & 3 deletions
@@ -1,8 +1,27 @@
 Conclusion
 ===============================================================================

-|WIP|
+You've reached the end of this book. I hope you've learned something while
+reading it; I sure learned a lot writing it. Trying to explain something is
+generally a good exercise to test your knowledge of it. Of course, we've only
+scratched the surface of Numpy and there are many things left to discover.
+Have a look at the bibliography for books written by true experts, at the
+documentation written by the people making Numpy, and don't hesitate to ask
+your questions on the mailing lists, because the Numpy community is very
+friendly.

-.. contents:: **Contents**
-   :local:
+If there's a single message to retain from this book, it is "premature
+optimization is the root of all evil". We've seen that code vectorization can
+drastically improve your computation, by several orders of magnitude in some
+cases. Still, problem vectorization is generally much more powerful. If you
+apply code vectorization too early in your design process, you won't be able
+to think out of the box and you'll certainly miss some really powerful
+alternatives, because you won't be able to identify your problem properly
+anymore, as we've seen in the problem vectorization chapter. This requires
+some experience and you have to be patient: experience is not an overnight
+process.
+
+Finally, custom vectorization is an option worth considering once you've
+looked at the alternatives to numpy. When nothing works for you, Numpy still
+offers you a clever framework to forge your own tools. And who knows, this
+could be the start of an exciting adventure for you and the community, as it
+happened to me with the glumpy and vispy packages.
