Lecture 4¶

The story so far¶

The proposed elimination procedure can be implemented using numerical rootfinding methods
It can be difficult to work with high-order polymials in floating-point arithmetic
Elimination may be unstable in floating-point arithmetic

Can we tackle the system directly?

Contents¶

Newton's method in multiple variables
Dealing with non-invertible systems
Computational efficiency

Newton in multiple variables¶

Given $f: \mathbb{R}^{n} \rightarrow \mathbb{R}^{m}$ ($m$ equations in $n$ unknowns), how do we find a root?

Newton's method in multiple variables reads

$$x_{k+1} = x_k - Df(x_k)^{-1}f(x_k),$$

with $Df : \mathbb{R}^{n} \rightarrow \mathbb{R}^{m\times n}$ is the Jacobian of $f$. It has quadratic convergence when starting close to a root when $Df(x^*)$ is invertible; linear convergence otherwise.

Inverting a matrix¶

How do we invert the Jacobian at each step?

Gaussian elimination
Iterative methods

Richardson iteration for $Ax = b$¶

$$x_{k+1} = x_k + \alpha (b - Ax_k).$$

convergence when $\|I - \alpha A\| < 1$
how about indefinite matrices?

We are in fact approximating the solution via a Neumann series (for $x_0 = 0$, $\alpha = 1$)

$$x_n = \sum_{k=0}^{n-1}(I-A)^kb = P_{n-1}(A)b.$$

Can we find a better polynomial approximation?

Ideally, we want a polynomial such that $P_{n}(\lambda_i) = \lambda_i^{-1}$ for all eigenvalues $\{\lambda_i\}_{i=1}^n$ of $A$.
Equivalently, we want to find $Q_n(A) = AP_{n}(A) - I$ which as roots at $\{\lambda_i\}_{i=1}^n$.

Krylov methods¶

The Richardson iteration generates a solution in the Krylov subspace

$$K_n(A,b) = \text{span}\{b, Ab, A^2b, \ldots, A^{n-1}b\}.$$

Can we find a better solution in this space?

Solve

$$\min_{x\in K_n(A,b)} \|Ax - b\|_2^2.$$

form orthogonal basis for $K_n$
compute minimum-residual solution
update basis and repeat

N	Residual	Error
20	1.80e-07	1.12e-06
40	4.27e-07	5.53e-06
60	7.67e-07	1.37e-05
80	9.70e-07	2.20e-05
100	9.53e-07	2.70e-05

Dealing with non-invertible systems¶

What to do when we have more / less equations than unknowns?

The pseudo-inverse¶

The Moore-Penrose pseudo-inverse of a matrix $A$ is the unique matrix $A^\dagger$ satisfying:

$AA^\dagger A = A$
$A^\dagger A A^\dagger = A^\dagger$
$AA^\dagger$ is Hermitian
$A^\dagger A$ is Hermitian

Two special cases:

$A^\dagger = (A^*\!A)^{-1}A^*$ when $A^*\!A$ is invertible (left inverse)
$A^\dagger = A^*(A\!A^*)^{-1}$ when $A\!A^*$ is invertible (right inverse)

We can define the pseudo-inverse generally through the singular value decomposition:

$$A = U\Sigma V^*,$$$$A^\dagger = V_k \Sigma_k^{-1} U_k^*,$$

with $k$ the rank of $A$.

Iterative methods¶

How do we compute the minimum-norm solution of

$$\min_x \|Ax - b\|_2^2.$$

Use Krylov (again)

Quasi-Newton methods¶

it may be too computationally expensive to form and invert the Jacobian at each iteration
the secant method circumvents this in the scalar case by approximating $f'(x_k) \approx \frac{f(x_k) - f(x_{k-1})}{x_k - x_{k-1}}$
can we generalise this to the multivariate case?

The secant equation¶

The Jacobian satisfies

$$Df(\xi_k)(x_{k} - x_{k-1}) = f(x_k) - f(x_{k-1}),$$

for $\xi_k$ an convex combination of $x_k, x_{k-1}$.

How do we get a usefull approximation $B_k$ or $H_k$ satisfying

$$H_k(x_{k} - x_{k-1}) = f(x_k) - f(x_{k-1}),$$

$$(x_{k} - x_{k-1}) = B_k(f(x_k) - f(x_{k-1})),$$

Assuming we have some $H_k$ (or $B_k$) satisfying the secant relation, how do we update it to obtain $H_{k+1}$ (or $B_{k+1}$)?

SR1-update:

$$H_{k+1} = H_k + \frac{(\Delta f_k - H_k\Delta x_k)(\Delta f_k - H_k\Delta x_k)^T}{(\Delta f_k - H_k\Delta x_k)^T\Delta x_k}$$$$B_{k+1} = B_k + \frac{(\Delta x_k - B_k\Delta f_k)(\Delta x_k - B_k\Delta f_k)^T}{(\Delta x_k - B_k\Delta f_k)^T\Delta f_k}$$

Some non-Newton methods¶

Fixed point iteration¶

$$x_{k+1} = g(x_k),$$

with $g(x) = x - \alpha f(x)$.

Minimisation¶

$$\min_x \textstyle{\frac{1}{2}}\|f(x)\|_2^2.$$

which can be solved by

$$x_{k+1} = x_k - \alpha Df(x_k)^*\cdot f(x_k)$$

Need to pick $\alpha$ small enough
Guaranteed to converge to stationary point

Summary¶

Can solve system of non-linear equations with Newton's method
Requires solution of linearised system at each iteration
Alternatives exist which avoid this
All these methods find a solution

Ideas for your project¶

Derive a system of equations for your robot
Describe / visualise the configuration space (which positions can the robot achieve)
Describe / visualise the kinematic singularities of the robot
Implement a method / methods to solve the inverse kinematic problem of the robot
Compare the methods on a well-chosen set of problem (i.e., several points in configuration space)
Discuss the results in terms of computational complexity, robustness, accuracy

Interdisciplinarity, own initiatives and originality are highly encouraged!