#%config InlineBackend.figure_format = 'svg'
from pylab import *
from scipy.interpolate import barycentric_interpolate
Polynomials are easy to evaluate since they require only basic arithmetic operations: $+, -, \times$. They can provide a good approximation to continuous functions, as shown by the Weierstrass theorem.
We seek a polynomial of degree $N$
$$p(x) = a_0 + a_1 x + \ldots + a_N x^N$$
that satisfies the interpolation conditions $$p(x_i) = y_i, \qquad i=0,1,\ldots, N$$ This is a linear system of $N+1$ equations for the $N+1$ coefficients $a_0, a_1, \ldots, a_N$ that can be written in matrix form $Va = y$, where $V$ is the Vandermonde matrix with entries $V_{ij} = x_i^j$.
This has a unique solution if the Vandermonde matrix $V$ is non-singular. Since
$$
\det V = \prod_{j=0}^N \prod_{k=j+1}^N (x_k - x_j)
$$
we can solve the problem provided the points $\{ x_i \}$ are distinct, for then $\det V \ne 0$ and $V$ is non-singular.
We can show this without computing the determinant. Assume that the $\{ x_i \}$ are distinct. It is enough to show that the only solution of $Va=0$ is $a=0$. Note that the set of $N+1$ equations $Va=0$ is of the form $$p(x_i) = 0, \qquad i=0,1,\ldots,N$$ which implies that $p(x)$ has $N+1$ distinct roots. But since $p$ is a polynomial of degree $N$, this implies that $p(x) \equiv 0$ and hence each $a_i = 0$; the matrix $V$ is non-singular.
+++
Condition number of $V$
If we want to solve the interpolation problem by solving the matrix problem, then the condition number of the matrix becomes important. The condition number of a square matrix with respect to some matrix norm is defined as
$$
\kappa(A) = \norm{A} \cdot \norm{A^{-1}}
$$
Matrices with large condition numbers cannot be solved accurately on a computer by Gaussian elimination due to possible growth of round-off errors.
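For instance, `numpy.linalg.cond` computes this quantity directly; a minimal sketch (the matrix `A` below is an arbitrary example):

```python
from numpy import array
from numpy.linalg import cond, inv, norm

# Arbitrary example matrix, chosen only for illustration
A = array([[1.0, 2.0],
           [3.0, 4.0]])

# kappa(A) = ||A|| * ||A^{-1}|| in the 2-norm
kappa = norm(A, 2) * norm(inv(A), 2)
```

The same value is returned by `cond(A, 2)`.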
+++
:::{prf:example}
Take $(x_0,x_1,x_2) = (100,101,102)$; then with $p(x) = a_0 + a_1 x + a_2 x^2$, the Vandermonde matrix is severely ill-conditioned. The condition number can be improved by scaling and/or shifting the variables.
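A minimal numerical sketch of this (the shift by $101$, which maps the points to $(-1,0,1)$, is one possible rescaling):

```python
from numpy import array
from numpy.linalg import cond

x = array([100.0, 101.0, 102.0])

# Vandermonde matrix with columns 1, x, x^2
V = array([x**0, x**1, x**2]).T
kappa_raw = cond(V)

# Shift the data to (-1, 0, 1): conditioning improves dramatically
xs = x - 101.0
Vs = array([xs**0, xs**1, xs**2]).T
kappa_shifted = cond(Vs)
```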
:::
+++
:::{prf:remark}
It is usually better to map the data to the interval $[-1,+1]$ or $[0,1]$ and then solve the interpolation problem on the mapped interval.
Assuming that $x_0 < x_1 < \ldots < x_N$, define
$$
\xi = \frac{x - x_0}{x_N - x_0}
$$
and find a polynomial of the form
$$
p(\xi) = a_0 + a_1 \xi + \ldots + a_N \xi^N
$$
But still the condition number increases rapidly with $N$. For $N=20$ we have a condition number of $8 \times 10^8$ even when using the interval $[-1,+1]$.
We will use the numpy.linalg.cond function to compute the condition number. By default, it uses the 2-norm.
Nvalues, Cvalues = [], []
for N in range(1,30):
    x = linspace(-1.0,+1.0,N+1)
    V = zeros((N+1,N+1))
    for j in range(0,N+1):
        V[:,j] = x**j
    Nvalues.append(N), Cvalues.append(cond(V))
semilogy(Nvalues, Cvalues, 'o-')
xlabel('N'), ylabel('cond(V)'), grid(True);
The condition number is large even for moderate values of $N$.
:::
+++
Interpolating polynomials
:::{prf:theorem}
Suppose $f(x)$ is a polynomial of degree $\le N$ and we interpolate this
at $N+1$ distinct points to construct a polynomial $p(x)$ of degree
$N$. Then $f(x) \equiv p(x)$.
:::
:::{prf:proof}
Define
$$
r(x) = f(x) - p(x)
$$
which is of degree $\le N$. But $r(x)$ vanishes at the $N+1$ distinct interpolation points, and hence it must be the zero polynomial. Thus $p(x) \equiv f(x)$.
:::
+++
:::{prf:example}
If $f(x) = a + b x$ and we interpolate this with $p(x) = a_0 + a_1 x + a_2 x^2$ at three distinct points, then the solution will be
$$
a_0 = a, \qquad a_1 = b, \qquad a_2 = 0
$$
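This can be checked numerically; a small sketch (the values $a=2$, $b=3$ and the three nodes are arbitrary choices):

```python
from numpy import array
from numpy.linalg import solve

a, b = 2.0, 3.0
x = array([-1.0, 0.0, 1.0])      # three distinct points
y = a + b*x                      # data from f(x) = a + b x

# Solve the Vandermonde system for p(x) = a0 + a1 x + a2 x^2
V = array([x**0, x**1, x**2]).T
a0, a1, a2 = solve(V, y)
```

The solver returns $a_0 = a$, $a_1 = b$ and $a_2 = 0$ up to round-off.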
:::
+++
:::{exercise}
If $f(x)$ is a polynomial of degree $m$ and $p(x)$ interpolates it at $n+1$ distinct points with $n > m$, then show that $p(x)$ is actually a polynomial of degree $m$.
:::
+++
Lagrange interpolation
Lagrange interpolation provides the solution without having to solve a
matrix problem. Define the Lagrange basis polynomials
$$
\ell_j(x) = \prod_{\substack{k=0 \\ k \ne j}}^{N} \frac{x - x_k}{x_j - x_k}, \qquad j = 0,1,\ldots,N
$$
each of degree $N$, which satisfy $\ell_j(x_i) = 1$ if $i = j$ and $\ell_j(x_i) = 0$ if $i \ne j$.
Then consider the polynomial of degree $N$ given by
$$
p_N(x) = \sum_{j=0}^N y_j \ell_j(x)
$$
By construction this satisfies
$$
p_N(x_i) = y_i, \qquad i=0,1,\ldots,N
$$
and hence is the solution to the interpolation problem.
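A direct, if not the most efficient, implementation of the Lagrange formula is sketched below; the function name `lagrange_eval` is our own:

```python
import numpy as np

def lagrange_eval(x, y, xp):
    """Evaluate the Lagrange interpolant of the data (x, y) at points xp."""
    p = np.zeros_like(xp)
    for j in range(len(x)):
        # ell_j(xp) = prod_{k != j} (xp - x_k) / (x_j - x_k)
        ell = np.ones_like(xp)
        for k in range(len(x)):
            if k != j:
                ell = ell * (xp - x[k]) / (x[j] - x[k])
        p = p + y[j] * ell
    return p
```

Each evaluation point costs $O(N^2)$ operations; the barycentric form discussed in the next chapter is a cheaper and more stable rearrangement of the same formula.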
+++
:::{prf:example}
First, let us use library functions to compute the interpolating polynomial. In particular, we use the scipy.interpolate.barycentric_interpolate function, which we will explain in the next chapter. We demonstrate it for the function
$$
f(x) = \cos(4\pi x), \qquad x \in [0,1]
$$
Define the function
xmin, xmax = 0.0, 1.0
f = lambda x: cos(4*pi*x)
Make a grid and evaluate function at those points.
N = 8 # is the degree, we need N+1 points
x = linspace(xmin, xmax, N+1)
y = f(x)
Now we evaluate the interpolant on a finer grid for better visualization.
M = 100
xe = linspace(xmin, xmax, M)
ye = f(xe) # exact function
yp = barycentric_interpolate(x, y, xe)
plot(x,y,'o',xe,ye,'--',xe,yp,'-')
legend(('Data points','Exact function','Polynomial'))
grid(True), title('Degree '+str(N)+' interpolation');
:::
+++
:::{prf:example}
Interpolate the following function on uniformly spaced points
$$
f(x) = \cos(x), \qquad x \in [0,2\pi]
$$
for $N=2,4,6,\ldots,12$.
xmin, xmax = 0.0, 2.0*pi
fun = lambda x: cos(x)
xx = linspace(xmin,xmax,100);
ye = fun(xx);
figure(figsize=(9,8))
for i in range(1,7):
    N = 2*i
    subplot(3,2,i)
    x = linspace(xmin,xmax,N+1)
    y = fun(x)
    yy = barycentric_interpolate(x,y,xx)
    plot(x,y,'o',xx,ye,'--',xx,yy)
    axis([xmin, xmax, -1.1, +1.1])
    text(3.0,0.0,'N='+str(N),ha='center')
The interpolating polynomials seem to converge to the true function as $N$ increases.
:::
+++
Error estimate
:::{prf:theorem}
Let $p_N(x)$ be the degree $N$ polynomial which interpolates the given
data $(x_i,y_i)$, $i=0,1,\ldots,N$, where $y_i = f(x_i)$ and $f$ is $N+1$ times differentiable. Then, for each $x$, there is a $\xi$ in the smallest interval containing $x$ and the nodes such that the error is given by
$$
f(x) - p_N(x) = \frac{f^{(N+1)}(\xi)}{(N+1)!} \omega_N(x), \qquad
\omega_N(x) = \prod_{i=0}^{N}(x - x_i)
$$
:::
:::{prf:theorem}
Assume that $|f^{(n)}(x)| \le M$ for all $x$ and $n$. Then the error of
polynomial interpolation on uniformly spaced points is bounded by
$$
|f(x) - p(x)| \le \frac{M h^{N+1}}{4(N+1)}
$$
and the error goes to zero as $N \to \infty$.
:::
:::{prf:proof}
The error bound follows from the two previous theorems. As $N$ increases, $h$ becomes less than one at some point, and beyond this, the right hand side in the error bound will converge to zero.
:::
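We can check the bound numerically for $f(x) = \cos x$ on $[0,2\pi]$, where $M=1$; a sketch using `numpy.polyfit` to construct the interpolant (with $N+1$ points and degree $N$ the least-squares fit is the interpolant, up to round-off):

```python
import numpy as np

N = 8
x = np.linspace(0.0, 2*np.pi, N+1)   # uniform points with spacing h
h = x[1] - x[0]

# Degree-N interpolant of cos through the N+1 points
c = np.polyfit(x, np.cos(x), N)

# Maximum interpolation error on a fine grid
xp = np.linspace(0.0, 2*np.pi, 1000)
err = np.max(np.abs(np.cos(xp) - np.polyval(c, xp)))

# Theoretical bound with M = 1, since all derivatives of cos are bounded by 1
bound = h**(N+1) / (4*(N+1))
```

The measured error stays below the bound.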
+++
:::{prf:remark}
Functions like $\cos x$, $\sin x$, $\exp(x)$ satisfy the conditions of the above theorem. These conditions are quite strong and can be relaxed considerably. E.g., if $f(x) = \sin(\alpha x)$ then $|f^{(n)}(x)| \le |\alpha|^n$ and if $|\alpha| > 1$, the derivatives can increase with $n$. If the derivatives satisfy
$$
|f^{(n)}(x)| \le C \alpha^n, \qquad \alpha > 0
$$
then the error is bounded by $\frac{C (\alpha h)^{N+1}}{4(N+1)}$. As $N$ increases, $h$ decreases and eventually $\alpha h < 1$; beyond this point, the right hand side goes to zero.
:::
+++
:::{exercise}
Modify the previous code to apply it to $f(x) = \cos(4x)$ for $x \in [0,2\pi]$ and observe convergence for moderate values of $N$ and uniformly spaced points. For very high degrees, you should see the error start to grow.
:::
+++
:::{prf:example} Runge phenomenon
Consider interpolating the following two functions on $[-1,1]$
The two functions look visually similar and both are infinitely differentiable.
def interp(f,points):
    xmin, xmax = -1.0, +1.0
    xx = linspace(xmin,xmax,100)
    ye = f(xx)
    figure(figsize=(9,8))
    for i in range(1,7):
        N = 2*i
        subplot(3,2,i)
        if points == 'uniform':
            x = linspace(xmin,xmax,N+1)
        else:
            theta = linspace(0,pi,N+1)
            x = cos(theta)
        y = f(x)
        yy = barycentric_interpolate(x,y,xx)
        plot(x,y,'o',xx,ye,'--',xx,yy)
        axis([xmin, xmax, -1.0, +1.1])
        text(-0.1,0.0,'N = '+str(N))
Interpolate $f_1(x)$ on uniformly spaced points.
interp(f1,'uniform')
Interpolate $f_2(x)$ on uniformly spaced points.
interp(f2,'uniform')
The above results are not good. Let us try $f_2(x)$ on Chebyshev points.
interp(f2,'chebyshev')
What about interpolating $f_1(x)$ on Chebyshev points ?
interp(f1,'chebyshev')
This also seems fine. So uniform points work for only one of the functions, while Chebyshev points work for both.
:::
+++
Difficulty of polynomial interpolation
Do the polynomial approximations $p_N$ converge to the true function $f$ as $N \to \infty$? The error formula seems to suggest so, due to the factor $\frac{1}{(N+1)!}$, provided

- the higher order derivatives of $f$ are small,
- the function $\omega_N(x)$, which depends on the point distribution, is small.
Size of derivatives
On uniformly spaced points, we have seen the interpolants of $\cos(x)$ converge but those of the rational function $\frac{1}{1+16x^2}$ do not. This must be related to the behaviour of the derivatives of these functions.
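The divergence for the rational function can be observed numerically; a sketch using `numpy.polyfit` as the interpolant (round-off in the fit is negligible compared to the interpolation error here):

```python
import numpy as np

# Runge-type rational function from the text
f = lambda x: 1.0 / (1.0 + 16.0 * x**2)
xp = np.linspace(-1.0, 1.0, 1001)

# Maximum error of interpolation on uniform points, for increasing degree
errors = {}
for N in (4, 8, 16):
    x = np.linspace(-1.0, 1.0, N+1)
    c = np.polyfit(x, f(x), N)
    errors[N] = np.max(np.abs(f(xp) - np.polyval(c, xp)))
```

The maximum error grows with the degree instead of decreasing.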
+++
:::{prf:example}
Consider $f(x)=\ln(x)$. Its derivatives are
$$
f^{(n)}(x) = \frac{(-1)^{n-1}(n-1)!}{x^n}, \qquad n \ge 1
$$
Even though the curve $y=\ln(x)$ looks smooth near any value of $x$, as $n$ gets large the derivatives become very large in size, and tend to behave like $n!$ or worse.
:::
+++
This is the general situation; for most functions, some higher order
derivatives tend to grow as $n!$. Even for a polynomial $p_N(x)$, the
derivatives grow in size until the $N$'th one, which is $a_N N!$, after
which they suddenly all become zero.
Case $N = 3$. On uniformly spaced points with spacing $h$,
\begin{align}
\max_{x_1 \le x \le x_2}|\omega_3(x)| &= \frac{9}{16}h^4 \approx 0.56 h^4 \\
\max_{x_0 \le x \le x_3}|\omega_3(x)| &= h^4
\end{align}
In this case, the error near the endpoints can be twice as large as the error near the middle.
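These values can be verified numerically; here we take $h=1$ and $x_i = i$ (an arbitrary normalization, since $\omega_3$ scales as $h^4$):

```python
import numpy as np

# omega_3(x) = (x-x0)(x-x1)(x-x2)(x-x3) on x_i = 0, 1, 2, 3 (h = 1)
x = np.array([0.0, 1.0, 2.0, 3.0])
xp = np.linspace(0.0, 3.0, 30001)
w = np.ones_like(xp)
for xi in x:
    w = w * (xp - xi)

mid = (xp >= 1.0) & (xp <= 2.0)
max_mid = np.max(np.abs(w[mid]))   # maximum over the middle interval
max_all = np.max(np.abs(w))        # maximum over the whole interval
```

The two maxima come out as $9/16$ and $1$, matching the formulas above.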
Case $N = 6$.
The behaviour exhibited for $N=3$ is accentuated for larger degree. For
$N=6$,
$$
\max_{x_2 \le x \le x_4}|\omega_6(x)| \approx 12.36 h^7, \qquad
\max_{x_0 \le x \le x_6}|\omega_6(x)| \approx 95.8 h^7
$$
and the error near the ends can be almost 8 times that near the center.
The next functions evaluate $\omega_N(x)$ and plot it.
# x = (x0,x1,x2,...,xN)
# xp = array of points where to evaluate
def omega(x,xp):
    fp = ones_like(xp)
    for xi in x:
        fp = fp * (xp - xi)
    return fp

def plot_omega(x):
    M = 1000
    xx = linspace(-1.0,1.0,M)
    f = omega(x, xx)
    plot(xx,f,'b-',x,0*x,'o')
    title("N = "+str(len(x)-1)), grid(True);
+++
:::{prf:example} $\omega_N(x)$ on uniformly spaced points
For a given set of points $x_0, x_1, \ldots, x_N \in [-1,+1]$ we plot the function $\omega_N(x) = (x-x_0)(x-x_1)\cdots(x-x_N)$.
Near the end points, the function $\omega_N$ does not go to zero as fast as near the middle. For the Runge example, we observed convergence near the middle but not at the ends.
:::
For a given function $f(x)$, we cannot do anything about the derivative term in the error estimate. For uniformly spaced data points, $\omega_N$ has large value near the end points which is also where the Runge phenomenon is observed. But we can try to minimize the magnitude of $\omega_N(x)$ by choosing a different set of nodes for interpolation. For the following discussion, let us assume that the $x_i$ are ordered and contained in the interval $[-1,+1]$.
:::{hint} Question
Given $N$, can we choose $N+1$ distinct nodes $\{x_i\}$ in $[-1,1]$ so that $\max_{x \in [-1,1]} |\omega_N(x)|$ is as small as possible?
:::
The answer involves the Chebyshev polynomials, which satisfy the recurrence
$$
T_0(x) = 1, \qquad T_1(x) = x, \qquad T_n(x) = 2 x T_{n-1}(x) - T_{n-2}(x)
$$
In Python, we can use recursion to compute them.
def chebyshev(n, x):
    if n == 0:
        y = ones_like(x)
    elif n == 1:
        y = x.copy()
    else:
        y = 2 * x * chebyshev(n-1,x) - chebyshev(n-2,x)
    return y
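A quick sanity check uses the standard identity $T_n(\cos\theta) = \cos(n\theta)$; the sketch below redefines the recursion with `numpy` imported explicitly so that it is self-contained:

```python
import numpy as np

def chebyshev(n, x):
    # T_0 = 1, T_1 = x, T_n = 2 x T_{n-1} - T_{n-2}
    if n == 0:
        return np.ones_like(x)
    elif n == 1:
        return x.copy()
    return 2 * x * chebyshev(n-1, x) - chebyshev(n-2, x)

theta = np.linspace(0.0, np.pi, 21)
x = np.cos(theta)
```

The recursion reproduces $\cos(n\theta)$ at all sample points.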
The first few of these are shown below.
:tags: hide-input
N = 200
x = linspace(-1.0,1.0,N)
figure(figsize=(8,6))
for n in range(0,6):
    y = chebyshev(n,x)
    plot(x,y,label='n='+str(n))
legend(), grid(True), xlabel('x'), ylabel('$T_n(x)$');
We can write $x \in [-1,+1]$ in terms of an angle $\theta \in [0,\pi]$ as $x = \cos\theta$; then
$$
T_n(x) = \cos(n\theta)
$$
which shows that $|T_n(x)| \le 1$ on $[-1,+1]$.
:::{prf:definition} Monic polynomial
A polynomial whose term of highest degree has coefficient one is called a monic polynomial.
:::
+++
:::{prf:remark}
In $T_n(x)$, the coefficient of $x^n$ is $2^{n-1}$ for $n \ge 1$, which can be observed from the recursion relation. Hence $2^{1-n} T_n(x)$ is a monic polynomial of degree $n$.
:::
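This can be checked with `numpy.polynomial.chebyshev.cheb2poly`, which converts Chebyshev-basis coefficients to monomial coefficients; here for $T_5$, an arbitrary choice:

```python
from numpy.polynomial.chebyshev import cheb2poly

n = 5
# Coefficient vector selecting T_5 in the Chebyshev basis
c = cheb2poly([0]*n + [1])

leading = c[-1]   # coefficient of x^5, expected to be 2**(n-1) = 16
```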
+++
:::{prf:theorem}
If $p : [-1,1] \to \re$ is a monic polynomial of degree $n$, then
$$
\max_{-1 \le x \le 1} |p(x)| \ge 2^{1-n}
$$
and the bound is attained by $2^{1-n} T_n(x)$. The roots of $T_n$ are
$$
x_j = \cos\left( \frac{(2j+1)\pi}{2n} \right), \qquad j = 0,1,\ldots,n-1
$$
and are shown below for $n=10,11,\ldots,19$.
:tags: hide-input
c = 1
for n in range(10,20):
    j = arange(0,n)
    theta = (2*j+1)*pi/(2*n)
    x = cos(theta)
    y = 0*x
    subplot(10,1,c)
    plot(x,y,'.')
    xticks([]), yticks([])
    ylabel(str(n))
    c += 1
Note that the roots are clustered near the end points and are contained in $(-1,1)$; the endpoints of $[-1,+1]$ are not nodes. The nodes are ordered as $x_0 > x_1 > \ldots > x_N$; we can relabel them in increasing order if desired.
:::
+++
:::{prf:theorem}
If the nodes $\{x_i\}$ are the $N+1$ roots of the Chebyshev polynomial
$T_{N+1}$, then
$$
\omega_N(x) = 2^{-N} T_{N+1}(x), \qquad |\omega_N(x)| \le 2^{-N}
$$
and the error formula for polynomial interpolation in the
interval $[-1,+1]$ yields the bound
$$
|f(x) - p_N(x)| \le \frac{1}{2^N (N+1)!} \max_{t \in [-1,1]} |f^{(N+1)}(t)|
$$
:::
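Numerically, with the roots of $T_{N+1}$ as nodes, $\max_{[-1,1]}|\omega_N(x)| = 2^{-N}$, since $|T_{N+1}| \le 1$ with equality attained; a sketch for $N=16$:

```python
import numpy as np

N = 16
# Roots of T_{N+1}: x_j = cos((2j+1) pi / (2N+2))
j = np.arange(N+1)
xc = np.cos((2*j + 1) * np.pi / (2*N + 2))

# Evaluate omega_N on a fine grid and take its maximum magnitude
xp = np.linspace(-1.0, 1.0, 20001)
w = np.ones_like(xp)
for xi in xc:
    w = w * (xp - xi)

wmax = np.max(np.abs(w))   # close to 2**(-N)
```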
:::{prf:remark} Chebyshev points of second kind
In practice, we don't use the Chebyshev nodes derived above. The important point is how the points cluster near the ends of the interval, and this type of clustering can be achieved by other node sets. If we want $N+1$ nodes, divide $[0,\pi]$ into $N$ equal intervals, so that
$$
\theta_i = \frac{i\pi}{N}, \qquad x_i = -\cos\theta_i, \qquad i = 0,1,\ldots,N
$$
These are called Chebyshev points of second kind. In Python they can be obtained as
theta = linspace(0,pi,N+1)
x = -cos(theta)
which returns them in the order
$$
-1 = x_0 < x_1 < \ldots < x_N = +1
$$
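A quick check of this ordering (here $N=8$, an arbitrary choice):

```python
import numpy as np

N = 8
theta = np.linspace(0.0, np.pi, N+1)
x = -np.cos(theta)   # Chebyshev points of second kind
```

The nodes come out in increasing order with both endpoints included.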
They can also be obtained as projections of uniformly spaced points on the unit circle onto the $x$-axis.
:tags: hide-input
t = linspace(0,pi,1000)
xx, yy = cos(t), sin(t)
plot(xx,yy)
n = 10
theta = linspace(0,pi,n)
plot(cos(theta),sin(theta),'o')
plot(cos(theta),zeros(n),'sr',label='Chebyshev')
for i in range(n):
    x1 = [cos(theta[i]), cos(theta[i])]
    y1 = [0.0, sin(theta[i])]
    plot(x1,y1,'k--')
plot([-1.1,1.1],[0,0],'-')
legend(), xlabel('x')
axis([-1.1, 1.1, 0.0, 1.1])
axis('equal'), title(str(n)+' Chebyshev points');
Below, we compare the polynomial $\omega_N(x)$ for uniform and Chebyshev points for $N=16$.
:tags: hide-input
M = 1000
xx = linspace(-1.0,1.0,M)
N = 16
xu = linspace(-1.0,1.0,N+1) # uniform points
xc = cos(linspace(0.0,pi,N+1)) # chebyshev points
fu = omega(xu,xx)
fc = omega(xc,xx)
plot(xx,fu,'b-',xx,fc,'r-')
legend(("Uniform","Chebyshev"))
grid(True), title("Degree N = "+str(N));
With Chebyshev points, this function is of similar size throughout the interval.
:::
+++
:::{exercise}
Plot the function $\omega_N(x)$ for $N=16$ using Chebyshev points of the first and second kind. Write Python code to produce a plot like the one shown.
:tags: remove-input
N = 16
# First kind
theta1 = pi*(2*arange(0,N+1) + 1)/(2*N + 2)
x1 = cos(theta1)
# Second kind
theta2 = linspace(0,pi,N+1)
x2 = cos(theta2)
x = linspace(-1,1,1000)
f1 = omega(x1,x)
f2 = omega(x2,x)
figure(figsize=(10,8))
plot(x,f1,label='First kind')
plot(x,f2,label='Second kind')
plot([-1,1],[2**(-N),2**(-N)],'--',label='$y=2^{-N}$')
plot([-1,1],[-2**(-N),-2**(-N)],'--',label='$y=-2^{-N}$')
legend(), grid(True), xlabel('x'), ylabel('$\\omega_N(x)$')
title('Degree N = ' + str(N));
:::