LU Decomposition for Solving Linear Equations

Learning objectives

Describe the factorization $\mathbf{A} = \mathbf{L}\mathbf{U}$.
Compare the cost of LU with other operations such as matrix-matrix multiplication.
Identify the problems with using LU factorization.
Implement an LU decomposition algorithm.
Given an LU decomposition for $\mathbf{A}$, solve the system $\mathbf{A}\mathbf{x} = \mathbf{b}$.
Give examples of matrices for which pivoting is needed.
Implement an LUP decomposition algorithm.
Manually compute LU and LUP decompositions.
Compute and use LU decompositions using library functions.

Forward substitution algorithm

The forward substitution algorithm solves the linear system $\mathbf{L}\mathbf{x} = \mathbf{b}$ where $\mathbf{L}$ is a lower-triangular matrix.

A lower-triangular linear system $\mathbf{L}\mathbf{x} = \mathbf{b}$ can be written in matrix form:

$$
\begin{bmatrix} \ell_{11} & 0 & \cdots & 0 \\ \ell_{21} & \ell_{22} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ \ell_{n1} & \ell_{n2} & \cdots & \ell_{nn} \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix}
=
\begin{bmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{bmatrix}.
$$

This can also be written as the set of linear equations:

$$
\begin{array}{ccccccccc}
\ell_{11} x_1 & & & & & & & = & b_1 \\
\ell_{21} x_1 & + & \ell_{22} x_2 & & & & & = & b_2 \\
\vdots & + & \vdots & + & \ddots & & & = & \vdots \\
\ell_{n1} x_1 & + & \ell_{n2} x_2 & + & \dots & + & \ell_{nn} x_n & = & b_n.
\end{array}
$$

The forward substitution algorithm solves a lower-triangular linear system by working from the top down and solving each variable in turn. In math this is:

$$
\begin{aligned}
x_1 &= \frac{b_1}{\ell_{11}} \\
x_2 &= \frac{b_2 - \ell_{21} x_1}{\ell_{22}} \\
&\;\;\vdots \\
x_n &= \frac{b_n - \sum_{j=1}^{n-1} \ell_{nj} x_j}{\ell_{nn}}.
\end{aligned}
$$

The properties of the forward substitution algorithm are:

If any diagonal element $\ell_{ii}$ is zero then the system is singular: it has no unique solution, and forward substitution fails with a division by zero.
If all diagonal elements of $\mathbf{L}$ are non-zero then the system has a unique solution.
The number of operations for the forward substitution algorithm is $O(n^2)$ as $n \to \infty$.
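
The $O(n^2)$ count can be checked with a direct tally. As an assumption made here for illustration, each multiplication, subtraction and division is counted as one operation. Row $i$ then needs $i - 1$ multiplications, $i - 1$ subtractions and one division, so the total is:

$$
\sum_{i=1}^{n} \bigl( 2(i-1) + 1 \bigr) = \sum_{i=1}^{n} (2i - 1) = n^2 .
$$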

The code for the forward substitution algorithm to solve $\mathbf{L}\mathbf{x} = \mathbf{b}$ is:

import numpy as np
def forward_sub(L, b):
    """x = forward_sub(L, b) is the solution to L x = b
       L must be a lower-triangular matrix
       b must be a vector of the same leading dimension as L
    """
    n = L.shape[0]
    x = np.zeros(n)
    for i in range(n):
        tmp = b[i]
        # subtract the contributions of the already-computed x[0], ..., x[i-1]
        for j in range(i):
            tmp -= L[i,j] * x[j]
        x[i] = tmp / L[i,i]
    return x
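
As a rough illustration of the same idea, the inner loop can be replaced by a dot product over the entries of $\mathbf{x}$ that are already known. The function name `forward_sub_vec` and the small test system below are assumptions made for this sketch; they are not part of the notes' own code.

```python
import numpy as np

def forward_sub_vec(L, b):
    """Sketch of forward substitution for L x = b with a vectorized inner sum.
       (hypothetical helper, introduced only for this illustration)
    """
    n = L.shape[0]
    x = np.zeros(n)
    for i in range(n):
        # L[i, :i] @ x[:i] is the sum over j < i of L[i, j] * x[j];
        # for i = 0 both slices are empty and the product is 0.0
        x[i] = (b[i] - L[i, :i] @ x[:i]) / L[i, i]
    return x

# Illustrative lower-triangular system whose solution is (1, 2, 3)
L = np.array([[2.0, 0.0, 0.0],
              [1.0, 3.0, 0.0],
              [4.0, 5.0, 6.0]])
b = np.array([2.0, 7.0, 32.0])
x = forward_sub_vec(L, b)
print(x)
```

Working top-down by hand gives the same values: $x_1 = 2/2 = 1$, $x_2 = (7 - 1)/3 = 2$ and $x_3 = (32 - 4 - 10)/6 = 3$.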

Back substitution algorithm

The back substitution algorithm solves the linear system $\mathbf{U}\mathbf{x} = \mathbf{b}$ where $\mathbf{U}$ is an upper-triangular matrix. It is the backwards version of forward substitution.

The upper-triangular system $\mathbf{U}\mathbf{x} = \mathbf{b}$ can be written as the set of linear equations:

$$
\begin{array}{ccccccccc}
u_{11} x_1 & + & u_{12} x_2 & + & \dots & + & u_{1n} x_n & = & b_1 \\
& & u_{22} x_2 & + & \dots & + & u_{2n} x_n & = & b_2 \\
& & & & \ddots & & \vdots & = & \vdots \\
& & & & & & u_{nn} x_n & = & b_n.
\end{array}
$$

The back substitution solution works from the bottom up to give:

$$
\begin{aligned}
x_n &= \frac{b_n}{u_{nn}} \\
x_{n-1} &= \frac{b_{n-1} - u_{n-1,n} x_n}{u_{n-1,n-1}} \\
&\;\;\vdots \\
x_1 &= \frac{b_1 - \sum_{j=2}^{n} u_{1j} x_j}{u_{11}}.
\end{aligned}
$$

The properties of the back substitution algorithm are:

If any diagonal element $u_{ii}$ is zero then the system is singular: it has no unique solution, and back substitution fails with a division by zero.
If all diagonal elements of $\mathbf{U}$ are non-zero then the system has a unique solution.
The number of operations for the back substitution algorithm is $O(n^2)$ as $n \to \infty$.

The code for the back substitution algorithm to solve $\mathbf{U}\mathbf{x} = \mathbf{b}$ is:

import numpy as np
def back_sub(U, b):
    """x = back_sub(U, b) is the solution to U x = b
       U must be an upper-triangular matrix
       b must be a vector of the same leading dimension as U
    """
    n = U.shape[0]
    x = np.zeros(n)
    for i in range(n-1, -1, -1):
        tmp = b[i]
        for j in range(i+1, n):
            tmp -= U[i,j] * x[j]
        x[i] = tmp / U[i,i]
    return x
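
A quick way to sanity-check a back substitution routine is to build the right-hand side from a known solution and confirm that the solve recovers it. The sketch below does this with a compact variant; the name `back_sub_vec` and the test matrix are assumptions chosen for illustration.

```python
import numpy as np

def back_sub_vec(U, b):
    """Sketch of back substitution for U x = b, working from the last row up.
       (hypothetical helper, introduced only for this illustration)
    """
    n = U.shape[0]
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        # U[i, i+1:] @ x[i+1:] is the sum over j > i of U[i, j] * x[j]
        x[i] = (b[i] - U[i, i+1:] @ x[i+1:]) / U[i, i]
    return x

# Illustrative upper-triangular matrix and a known solution
U = np.array([[2.0, 1.0, 1.0],
              [0.0, 3.0, 2.0],
              [0.0, 0.0, 4.0]])
x_true = np.array([1.0, 2.0, 3.0])
b = U @ x_true            # b = (7, 12, 12)
x = back_sub_vec(U, b)
print(x)
```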

LU decomposition

The LU decomposition of a matrix $\mathbf{A}$ is the pair of matrices $\mathbf{L}$ and $\mathbf{U}$ such that:

$\mathbf{A} = \mathbf{L}\mathbf{U}$
$\mathbf{L}$ is a lower-triangular matrix with all diagonal entries equal to 1
$\mathbf{U}$ is an upper-triangular matrix.

The properties of the LU decomposition are:

The LU decomposition may not exist for a matrix $\mathbf{A}$.
If $\mathbf{A}$ is non-singular and its LU decomposition exists then the decomposition is unique.
The LU decomposition provides an efficient means of solving linear equations.
The reason that $\mathbf{L}$ has all diagonal entries set to 1 is that this normalization removes the freedom to rescale the two factors, which is what makes the decomposition unique. This choice is somewhat arbitrary (we could instead have required $\mathbf{U}$ to have 1 on the diagonal) but it is the standard choice.
We use the terms decomposition and factorization interchangeably to mean writing a matrix as a product of two or more other matrices, generally with some defined properties (such as lower/upper triangular).

Example: LU decomposition

Consider the matrix $\mathbf{A} = \begin{bmatrix} 1 & 2 & 2 \\ 4 & 4 & 2 \\ 4 & 6 & 4 \end{bmatrix}.$

The LU factorization is $\mathbf{A} = \mathbf{L}\mathbf{U} = \begin{bmatrix} 1 & 0 & 0 \\ 4 & 1 & 0 \\ 4 & 0.5 & 1 \end{bmatrix} \begin{bmatrix} 1 & 2 & 2 \\ 0 & -4 & -6 \\ 0 & 0 & -1 \end{bmatrix}.$
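
The factorization in this example can be checked numerically. The short sketch below, which assumes only NumPy, multiplies the two factors back together and confirms that each has the required triangular structure.

```python
import numpy as np

A = np.array([[1.0, 2.0, 2.0],
              [4.0, 4.0, 2.0],
              [4.0, 6.0, 4.0]])
L = np.array([[1.0, 0.0, 0.0],
              [4.0, 1.0, 0.0],
              [4.0, 0.5, 1.0]])
U = np.array([[1.0, 2.0, 2.0],
              [0.0, -4.0, -6.0],
              [0.0, 0.0, -1.0]])

# The product of the factors should reproduce A
product_matches = np.allclose(L @ U, A)
# L should be lower triangular with ones on the diagonal
L_is_unit_lower = np.allclose(L, np.tril(L)) and np.allclose(np.diag(L), 1.0)
# U should be upper triangular
U_is_upper = np.allclose(U, np.triu(U))

print(product_matches, L_is_unit_lower, U_is_upper)  # prints: True True True
```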

Example: matrix for which LU decomposition fails

An example of a matrix which has no LU decomposition is

$$
\mathbf{A} = \begin{bmatrix} 0 & 1 \\ 2 & 1 \end{bmatrix}.
$$

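If we try to find the LU decomposition of this matrix then we get

$$
\begin{bmatrix} 0 & 1 \\ 2 & 1 \end{bmatrix}
=
\begin{bmatrix} 1 & 0 \\ \ell_{21} & 1 \end{bmatrix}
\begin{bmatrix} u_{11} & u_{12} \\ 0 & u_{22} \end{bmatrix}
=
\begin{bmatrix} u_{11} & u_{12} \\ \ell_{21} u_{11} & \ell_{21} u_{12} + u_{22} \end{bmatrix}.
$$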

Equating the individual entries gives us four equations to solve. The top-left and bottom-left entries give the two equations:

$$
\begin{aligned}
u_{11} &= 0 \\
\ell_{21} u_{11} &= 2.
\end{aligned}
$$

These equations have no solution, so $\mathbf{A}$ does not have an LU decomposition.
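
The same obstruction shows up numerically as a zero pivot. The sketch below assumes only NumPy; the permutation matrix `P` is introduced here purely to illustrate the row swap that pivoting (mentioned in the learning objectives) would perform, and is not part of the derivation above.

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [2.0, 1.0]])

# u11 equals a11, so a zero top-left entry leaves l21 = a21 / u11 undefined
u11 = A[0, 0]
zero_pivot = (u11 == 0.0)

# Swapping the two rows moves a non-zero entry into the pivot position
P = np.array([[0.0, 1.0],
              [1.0, 0.0]])
PA = P @ A

# One elimination step on the row-swapped matrix now succeeds
l21 = PA[1, 0] / PA[0, 0]
L = np.array([[1.0, 0.0],
              [l21, 1.0]])
U = np.array([[PA[0, 0], PA[0, 1]],
              [0.0, PA[1, 1] - l21 * PA[0, 1]]])

print(zero_pivot, np.allclose(L @ U, PA))
```

So although $\mathbf{A}$ itself has no LU decomposition, the row-swapped matrix in this sketch does.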

Solving LU decomposition linear systems

Knowing the LU decomposition for a matrix $\mathbf{A}$ allows us to solve the linear system $\mathbf{A}\mathbf{x} = \mathbf{b}$ using a combination of forward and back substitution. In equations this is:

$$
\begin{aligned}
\mathbf{A}\mathbf{x} &= \mathbf{b} \\
\mathbf{L}\mathbf{U}\mathbf{x} &= \mathbf{b} \\
\mathbf{U}\mathbf{x} &= \mathbf{L}^{-1}\mathbf{b} \\
\mathbf{x} &= \mathbf{U}^{-1}(\mathbf{L}^{-1}\mathbf{b}),
\end{aligned}
$$

where we first evaluate $\mathbf{L}^{-1}\mathbf{b}$ using forward substitution and then evaluate $\mathbf{x} = \mathbf{U}^{-1}(\mathbf{L}^{-1}\mathbf{b})$ using back substitution.

An equivalent way to write this is to introduce a new vector $\mathbf{y}$ defined by $\mathbf{y} = \mathbf{U}\mathbf{x}$. This means we can rewrite $\mathbf{A}\mathbf{x} = \mathbf{b}$ as:

$$
\begin{aligned}
\mathbf{A}\mathbf{x} &= \mathbf{b} \\
\mathbf{L}\mathbf{U}\mathbf{x} &= \mathbf{b} \\
\mathbf{L}\mathbf{y} &= \mathbf{b} \qquad \text{use forward substitution to obtain } \mathbf{y} \\
\mathbf{U}\mathbf{x} &= \mathbf{y} \qquad \text{use backward substitution to obtain } \mathbf{x}
\end{aligned}
$$

We have thus replaced $\mathbf{A}\mathbf{x} = \mathbf{b}$ with two linear systems: $\mathbf{L}\mathbf{y} = \mathbf{b}$ and $\mathbf{U}\mathbf{x} = \mathbf{y}$. These two linear systems can then be solved one after the other using forward and back substitution.
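
As a worked illustration, take the $3 \times 3$ factors from the LU decomposition example earlier in these notes together with the right-hand side $\mathbf{b} = (5, 10, 14)^T$, which is chosen here purely for illustration. Forward substitution on $\mathbf{L}\mathbf{y} = \mathbf{b}$ gives:

$$
\begin{aligned}
y_1 &= 5 \\
y_2 &= 10 - 4 \cdot 5 = -10 \\
y_3 &= 14 - 4 \cdot 5 - 0.5 \cdot (-10) = -1.
\end{aligned}
$$

Back substitution on $\mathbf{U}\mathbf{x} = \mathbf{y}$ then gives:

$$
\begin{aligned}
x_3 &= \frac{-1}{-1} = 1 \\
x_2 &= \frac{-10 - (-6) \cdot 1}{-4} = 1 \\
x_1 &= \frac{5 - 2 \cdot 1 - 2 \cdot 1}{1} = 1.
\end{aligned}
$$

Substituting back confirms the result, since the rows of $\mathbf{A}$ sum to $5$, $10$ and $14$.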

The LU solve algorithm for solving the linear system $\mathbf{L}\mathbf{U}\mathbf{x} = \mathbf{b}$ written as code is:

import numpy as np
def lu_solve(L, U, b):
    """x = lu_solve(L, U, b) is the solution to L U x = b
       L must be a lower-triangular matrix
       U must be an upper-triangular matrix of the same size as L
       b must be a vector of the same leading dimension as L
    """
    y = forward_sub(L, b)
    x = back_sub(U, y)
    return x

The number of operations for the LU solve algorithm is $O(n^2)$ as $n \to \infty$.

The LU decomposition algorithm

Given a matrix $\mathbf{A}$ there are many different algorithms to find the matrices $\mathbf{L}$ and $\mathbf{U}$ for the LU decomposition. Here we will use the recursive leading-row-column LU algorithm. This algorithm is based on writing $\mathbf{A} = \mathbf{L}\mathbf{U}$ in block form as:

$$
\begin{bmatrix} a_{11} & \boldsymbol{a}_{12} \\ \boldsymbol{a}_{21} & \mathbf{A}_{22} \end{bmatrix}
=
\begin{bmatrix} 1 & \boldsymbol{0} \\ \boldsymbol{\ell}_{21} & \mathbf{L}_{22} \end{bmatrix}
\begin{bmatrix} u_{11} & \boldsymbol{u}_{12} \\ \boldsymbol{0} & \mathbf{U}_{22} \end{bmatrix}
=
\begin{bmatrix} u_{11} & \boldsymbol{u}_{12} \\ u_{11} \boldsymbol{\ell}_{21} & \boldsymbol{\ell}_{21} \boldsymbol{u}_{12} + \mathbf{L}_{22} \mathbf{U}_{22} \end{bmatrix}.
$$

In the above block form of the $n \times n$ matrix $\mathbf{A}$, the entry $a_{11}$ is a scalar, $\boldsymbol{a}_{12}$ is a $1 \times (n-1)$ row vector, $\boldsymbol{a}_{21}$ is an $(n-1) \times 1$ column vector, and $\mathbf{A}_{22}$ is an $(n-1) \times (n-1)$ matrix.

Comparing the left- and right-hand side entries of the above block matrix equation we see that:

$$
\begin{aligned}
a_{11} &= u_{11} \\
\boldsymbol{a}_{12} &= \boldsymbol{u}_{12} \\
\boldsymbol{a}_{21} &= u_{11} \boldsymbol{\ell}_{21} \\
\mathbf{A}_{22} &= \boldsymbol{\ell}_{21} \boldsymbol{u}_{12} + \mathbf{L}_{22} \mathbf{U}_{22}.
\end{aligned}
$$

These four equations can be rearranged to solve for the components of the $\mathbf{L}$ and $\mathbf{U}$ matrices as:

$$
\begin{aligned}
u_{11} &= a_{11} \\
\boldsymbol{u}_{12} &= \boldsymbol{a}_{12} \\
\boldsymbol{\ell}_{21} &= \frac{1}{u_{11}} \boldsymbol{a}_{21} \\
\mathbf{L}_{22} \mathbf{U}_{22} &= \underbrace{\mathbf{A}_{22} - \boldsymbol{a}_{21} (a_{11})^{-1} \boldsymbol{a}_{12}}_{\text{Schur complement } \mathbf{S}_{22}}.
\end{aligned}
$$

The first three equations above can be immediately evaluated to give the first row and column of $\mathbf{L}$ and $\mathbf{U}$. The last equation can then have its right-hand side evaluated, which gives the Schur complement $\mathbf{S}_{22}$ of $\mathbf{A}$. We thus have the equation $\mathbf{L}_{22} \mathbf{U}_{22} = \mathbf{S}_{22}$, which is an $(n-1) \times (n-1)$ LU decomposition problem which we can recursively solve.
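
To make one level of the recursion concrete, the sketch below applies these four formulas once to the $3 \times 3$ matrix from the LU decomposition example earlier in these notes. The variable names are chosen here for illustration only.

```python
import numpy as np

A = np.array([[1.0, 2.0, 2.0],
              [4.0, 4.0, 2.0],
              [4.0, 6.0, 4.0]])

u11 = A[0, 0]             # scalar pivot: u11 = a11
u12 = A[0, 1:].copy()     # rest of the first row of U: u12 = a12
l21 = A[1:, 0] / u11      # first column of L below the diagonal: a21 / u11

# Schur complement: the (n-1) x (n-1) matrix still to be factored
S22 = A[1:, 1:] - np.outer(l21, u12)
print(u12, l21)
print(S22)
```

Here `l21` is $(4, 4)$, matching the first column of $\mathbf{L}$ in that example, and `S22` has rows $(-4, -6)$ and $(-2, -4)$. Repeating the step on `S22` gives the pivot $-4$, the multiplier $-2 / (-4) = 0.5$ and the final entry $-4 - 0.5 \cdot (-6) = -1$, which are the remaining entries of $\mathbf{L}$ and $\mathbf{U}$.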

The code for the recursive leading-row-column LU algorithm to find ${\bf L}$ and ${\bf U}$ for ${\bf A} = {\bf L U}$ is:

import numpy as np
def lu_decomp(A):
    """(L, U) = lu_decomp(A) is the LU decomposition A = L U
       A is any matrix
       L will be a lower-triangular matrix with 1 on the diagonal, the same shape as A
       U will be an upper-triangular matrix, the same shape as A
    """
    n = A.shape[0]
    if n == 1:
        L = np.array([[1]])
        U = A.copy()
        return (L, U)

    A11 = A[0,0]
    A12 = A[0,1:]
    A21 = A[1:,0]
    A22 = A[1:,1:]

    L11 = 1
    U11 = A11

    L12 = np.zeros(n-1)
    U12 = A12.copy()

    L21 = A21.copy() / U11
    U21 = np.zeros(n-1)

    S22 = A22 - np.outer(L21, U12)
    (L22, U22) = lu_decomp(S22)

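    # L21 and U21 are 1-D, so reshape them to columns before assembling the blocks
    L = np.block([[L11, L12], [L21[:, np.newaxis], L22]])
    U = np.block([[U11, U12], [U21[:, np.newaxis], U22]])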
    return (L, U)

The number of operations for the recursive leading-row-column LU decomposition algorithm is $\mathcal{O}(n^3)$ as $n \to \infty$.
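
To see where the cubic cost comes from, the same Schur-complement update can be written as a loop. This is a minimal sketch under the assumption that every pivot is nonzero; the name `lu_decomp_iterative` is hypothetical and not from these notes. Step $k$ updates an $(n-k-1) \times (n-k-1)$ trailing block, and summing that quadratic work over all $k$ gives the $\mathcal{O}(n^3)$ total.

```python
import numpy as np

def lu_decomp_iterative(A):
    """Loop-based LU without pivoting (illustrative; assumes nonzero pivots)."""
    U = np.array(A, dtype=float)   # working copy that is reduced to U
    n = U.shape[0]
    L = np.eye(n)
    for k in range(n - 1):
        # Column k of L: entries below the pivot divided by the pivot.
        L[k+1:, k] = U[k+1:, k] / U[k, k]
        # Schur-complement update of the trailing block (quadratic work per step).
        U[k+1:, k+1:] -= np.outer(L[k+1:, k], U[k, k+1:])
        U[k+1:, k] = 0.0
    return L, U

A = np.array([[2.0, 1.0, 1.0],
              [4.0, 3.0, 3.0],
              [8.0, 7.0, 9.0]])
L, U = lu_decomp_iterative(A)
residual = np.max(np.abs(L @ U - A))   # should be at rounding-error level
```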

Solving linear systems using LU decomposition

We can put the above sections together to produce an algorithm for solving the system ${\bf A x} = {\bf b}$, where we first compute the LU decomposition of ${\bf A}$ and then use forward and backward substitution to solve for ${\bf x}$.

The properties of this algorithm are:

The algorithm may fail, even if ${\bf A}$ is invertible.
The number of operations in the algorithm is $\mathcal{O}(n^3)$ as $n \to \infty$.

The code for the linear solver using LU decomposition is:

import numpy as np
def linear_solve_without_pivoting(A, b):
    """x = linear_solve_without_pivoting(A, b) is the solution to A x = b (computed without pivoting)
       A is any matrix
       b is a vector of the same leading dimension as A
       x will be a vector of the same leading dimension as A
    """
    (L, U) = lu_decomp(A)
    x = lu_solve(L, U, b)
    return x

Pivoting

The LU decomposition can fail when the top-left entry in the matrix ${\bf A}$ is zero or very small compared to other entries. Pivoting is a strategy to mitigate this problem by rearranging the rows and/or columns of ${\bf A}$ to put a larger element in the top-left position.
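
The effect of a very small pivot can be seen on a $2 \times 2$ system. The sketch below is an illustration only: the helper `solve_2x2_no_pivot` and the test matrix are assumptions made for this example. It solves the same system twice, once using the tiny top-left entry as the pivot and once after swapping the two rows. The exact solution is very close to $(1, 1)$.

```python
import numpy as np

def solve_2x2_no_pivot(A, b):
    """Solve a 2 x 2 system by LU without pivoting (illustrative only)."""
    l21 = A[1, 0] / A[0, 0]              # multiplier, divides by the pivot
    u22 = A[1, 1] - l21 * A[0, 1]        # 1 x 1 Schur complement
    y2 = b[1] - l21 * b[0]               # forward substitution (y1 = b[0])
    x2 = y2 / u22                        # back substitution
    x1 = (b[0] - A[0, 1] * x2) / A[0, 0]
    return np.array([x1, x2])

A = np.array([[1e-20, 1.0],
              [1.0,   1.0]])
b = np.array([1.0, 2.0])

x_no_pivot = solve_2x2_no_pivot(A, b)            # uses the pivot 1e-20 as-is
x_pivot = solve_2x2_no_pivot(A[::-1], b[::-1])   # same system with rows swapped
```

In double precision the huge multiplier $10^{20}$ swamps the other entries, so `x_no_pivot` loses the first component of the solution, while the row-swapped solve recovers it. With an exactly zero top-left entry the first division would be a division by zero.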

There are many different pivoting algorithms. The most common of these are full pivoting, partial pivoting, and scaled partial pivoting. We will only discuss partial pivoting in detail.

1) Partial pivoting only rearranges the rows of ${\bf A}$ and leaves the columns fixed.

2) Full pivoting rearranges both rows and columns.

3) Scaled partial pivoting approximates full pivoting without actually rearranging columns.

LU decomposition with partial pivoting

The LU decomposition with partial pivoting (LUP) of an $n \times n$ matrix ${\bf A}$ is the triple of matrices ${\bf L}$, ${\bf U}$, and ${\bf P}$ such that:

${\bf P A} = {\bf L U}$
${\bf L}$ is an $n \times n$ lower-triangular matrix with all diagonal entries equal to 1.
${\bf U}$ is an $n \times n$ upper-triangular matrix.
${\bf P}$ is an $n \times n$ permutation matrix.

The properties of the LUP decomposition are:

The permutation matrix ${\bf P}$ acts to permute the rows of ${\bf A}$. This attempts to put large entries in the top-left position of ${\bf A}$ and each sub-matrix in the recursion, to avoid needing to divide by a small or zero element.
The LUP decomposition always exists for a matrix ${\bf A}$.
The LUP decomposition of a matrix ${\bf A}$ is not unique.
The LUP decomposition provides a more robust method of solving linear systems than LU decomposition without pivoting, and it is approximately the same cost.
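
Non-uniqueness is easy to check on a small case. The matrix and the two hand-computed factorizations below are assumptions chosen for illustration, not taken from these notes; both satisfy ${\bf P A} = {\bf L U}$ with a unit lower-triangular ${\bf L}$.

```python
import numpy as np

A = np.array([[1.0, 1.0],
              [2.0, 1.0]])

# Choice 1: no row exchange (P is the identity).
P_a = np.eye(2)
L_a = np.array([[1.0, 0.0], [2.0, 1.0]])
U_a = np.array([[1.0, 1.0], [0.0, -1.0]])

# Choice 2: swap the rows, which is what partial pivoting would select.
P_b = np.array([[0.0, 1.0], [1.0, 0.0]])
L_b = np.array([[1.0, 0.0], [0.5, 1.0]])
U_b = np.array([[2.0, 1.0], [0.0, 0.5]])

ok_a = np.allclose(P_a @ A, L_a @ U_a)
ok_b = np.allclose(P_b @ A, L_b @ U_b)
```

Only the second choice keeps the multiplier in ${\bf L}$ at most 1 in magnitude, which is the property partial pivoting aims for.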

Solving LUP decomposition linear systems

Knowing the LUP decomposition for a matrix ${\bf A}$ allows us to solve the linear system ${\bf A x} = {\bf b}$ by first applying ${\bf P}$ and then using the LU solver. In equations we start by taking ${\bf A x} = {\bf b}$ and multiplying both sides by ${\bf P}$, giving

$$
\begin{aligned}
{\bf A x} &= {\bf b} \\
{\bf P A x} &= {\bf P b} \\
{\bf L U x} &= {\bf P b}.
\end{aligned}
$$

The code for the LUP solve algorithm to solve the linear system ${\bf L U x} = {\bf P b}$ is:

import numpy as np
def lup_solve(L, U, P, b):
    """x = lup_solve(L, U, P, b) is the solution to L U x = P b
       L must be a lower-triangular matrix
       U must be an upper-triangular matrix of the same shape as L
       P must be a permutation matrix of the same shape as L
       b must be a vector of the same leading dimension as L
    """
    z = np.dot(P, b)
    x = lu_solve(L, U, z)
    return x

The number of operations for the LUP solve algorithm is $\mathcal{O}(n^2)$ as $n \to \infty$.
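
The product ${\bf P b}$ does not need a dense matrix multiplication. As a sketch, assuming the permutation is also available as an index vector (the name `perm` is hypothetical and does not appear in the code above), the same reordering can be done by indexing:

```python
import numpy as np

perm = np.array([2, 0, 1])     # row k of P has its single 1 in column perm[k]
P = np.eye(3)[perm]            # dense permutation matrix built from perm
b = np.array([10.0, 20.0, 30.0])

z_matrix = P @ b               # dense matrix-vector product, O(n^2) work
z_index = b[perm]              # plain reordering of the entries, O(n) work
```

Either way the overall cost of the solve stays $\mathcal{O}(n^2)$, because the two triangular solves dominate.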

The LUP decomposition algorithm

Just as there are different LU decomposition algorithms, there are also different algorithms to find an LUP decomposition. Here we use the recursive leading-row-column LUP algorithm.

This algorithm is a recursive method for finding ${\bf L}$, ${\bf U}$, and ${\bf P}$ so that ${\bf P A} = {\bf L U}$. It consists of the following steps.

1) First choose $i$ so that row $i$ in ${\bf A}$ has the largest absolute first entry. That is, $|A_{i1}| \geq |A_{j1}|$ for all $j$. Let ${\bf P}_1$ be the permutation matrix that pivots (shifts) row $i$ to the first row, and leaves all other rows in order. We can explicitly write ${\bf P}_1$ as

$$
{\bf P}_1 = \begin{bmatrix} {\bf 0}_{1 \times (i-1)} & 1 & {\bf 0}_{1 \times (n-i)} \\ {\bf I}_{(i-1) \times (i-1)} & {\bf 0} & {\bf 0} \\ {\bf 0} & {\bf 0} & {\bf I}_{(n-i) \times (n-i)} \end{bmatrix}.
$$

2) Write $\bar{\bf A}$ to denote the pivoted ${\bf A}$ matrix, so $\bar{\bf A} = {\bf P}_1 {\bf A}$.

3) Let ${\bf P}_2$ be a permutation matrix that leaves the first row where it is, but permutes all other rows. We can write ${\bf P}_2$ as ${\bf P}_2 = \begin{bmatrix} 1 & {\bf 0} \\ {\bf 0} & {\bf P}_{22} \end{bmatrix}$, where ${\bf P}_{22}$ is an $(n-1) \times (n-1)$ permutation matrix.

4) Factorize the (unknown) full permutation matrix ${\bf P}$ as the product of ${\bf P}_2$ and ${\bf P}_1$, so ${\bf P} = {\bf P}_2 {\bf P}_1$. This means that ${\bf P A} = {\bf P}_2 {\bf P}_1 {\bf A} = {\bf P}_2 \bar{\bf A}$, which first shifts row $i$ of ${\bf A}$ to the top, and then permutes the remaining rows. This is a completely general permutation matrix ${\bf P}$, but this factorization is key to enabling a recursive algorithm.

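5) Using the factorization ${\bf P} = {\bf P}_2 {\bf P}_1$, now write the LUP factorization ${\bf P}_2 \bar{\bf A} = {\bf L U}$ in block form as

$$
\begin{bmatrix} 1 & {\bf 0} \\ {\bf 0} & {\bf P}_{22} \end{bmatrix} \begin{bmatrix} \bar{a}_{11} & \bar{\boldsymbol a}_{12} \\ \bar{\boldsymbol a}_{21} & \bar{\bf A}_{22} \end{bmatrix} = \begin{bmatrix} 1 & {\bf 0} \\ {\boldsymbol \ell}_{21} & {\bf L}_{22} \end{bmatrix} \begin{bmatrix} u_{11} & {\boldsymbol u}_{12} \\ {\bf 0} & {\bf U}_{22} \end{bmatrix}.
$$

6) Equating the entries in the above matrices gives the equations

$$
\begin{aligned}
\bar{a}_{11} &= u_{11} \\
\bar{\boldsymbol a}_{12} &= {\boldsymbol u}_{12} \\
{\bf P}_{22} \bar{\boldsymbol a}_{21} &= u_{11} {\boldsymbol \ell}_{21} \\
{\bf P}_{22} \bar{\bf A}_{22} &= {\boldsymbol \ell}_{21} {\boldsymbol u}_{12} + {\bf L}_{22} {\bf U}_{22}.
\end{aligned}
$$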

7) Substituting the first three equations above into the last one and rearranging gives

$$
{\bf P}_{22} \underbrace{\left( \bar{\bf A}_{22} - \bar{\boldsymbol a}_{21} (\bar{a}_{11})^{-1} \bar{\boldsymbol a}_{12} \right)}_{\text{Schur complement } {\bf S}_{22}} = {\bf L}_{22} {\bf U}_{22}.
$$

8) Recurse to find the LUP decomposition of ${\bf S}_{22}$, resulting in ${\bf L}_{22}$, ${\bf U}_{22}$, and ${\bf P}_{22}$ that satisfy the above equation.

9) Solve for the first rows and columns of ${\bf L}$ and ${\bf U}$ with the above equations to give

$$
\begin{aligned}
u_{11} &= \bar{a}_{11} \\
{\boldsymbol u}_{12} &= \bar{\boldsymbol a}_{12} \\
{\boldsymbol \ell}_{21} &= \frac{1}{\bar{a}_{11}} {\bf P}_{22} \bar{\boldsymbol a}_{21}.
\end{aligned}
$$

10) Finally, reconstruct the full matrices ${\bf L}$, ${\bf U}$, and ${\bf P}$ from the component parts.

In code the recursive leading-row-column LUP algorithm for finding the LU decomposition of ${\bf A}$ with partial pivoting is:

import numpy as np
def lup_decomp(A):
    """(L, U, P) = lup_decomp(A) is the LUP decomposition P A = L U
       A is any matrix
       L will be a lower-triangular matrix with 1 on the diagonal, the same shape as A
       U will be an upper-triangular matrix, the same shape as A
       P will be a permutation matrix, the same shape as A
    """
    n = A.shape[0]
    if n == 1:
        L = np.array([[1]])
        U = A.copy()
        P = np.array([[1]])
        return (L, U, P)

    # pivot on the entry of largest absolute value in the first column (0-based row index)
    i = np.argmax(np.abs(A[:,0]))
    A_bar = np.vstack([A[i,:], A[:i,:], A[(i+1):,:]])

    A_bar11 = A_bar[0,0]
    A_bar12 = A_bar[0,1:]
    A_bar21 = A_bar[1:,0]
    A_bar22 = A_bar[1:,1:]

    # Schur complement: the rank-one update needs an outer product, not an inner product
    S22 = A_bar22 - np.outer(A_bar21, A_bar12) / A_bar11

    (L22, U22, P22) = lup_decomp(S22)

    L11 = 1
    U11 = A_bar11

    L12 = np.zeros(n-1)
    U12 = A_bar12.copy()

    L21 = np.dot(P22, A_bar21) / A_bar11
    U21 = np.zeros(n-1)

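    # L21 and U21 are 1-D, so reshape them to columns before assembling the blocks
    L = np.block([[L11, L12], [L21[:, np.newaxis], L22]])
    U = np.block([[U11, U12], [U21[:, np.newaxis], U22]])
    # P = P2 P1: row 0 selects original row i (0-based); the remaining rows
    # are P22 with a zero column inserted at column i
    P = np.zeros((n, n))
    P[0, i] = 1
    P[1:, :i] = P22[:, :i]
    P[1:, (i+1):] = P22[:, i:]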
    return (L, U, P)
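
For comparison, partial pivoting can also be written as a loop that tracks the row order in an index vector. This is a sketch under stated assumptions rather than the algorithm of these notes: the name `lup_decomp_iterative` is hypothetical, and it swaps two rows at each step instead of shifting the pivot row to the top, so its ${\bf P}$ may differ from the recursive algorithm's. Both are valid because the LUP decomposition is not unique.

```python
import numpy as np

def lup_decomp_iterative(A):
    """Loop-based LUP sketch: returns L, U, and a dense P with P A = L U."""
    U = np.array(A, dtype=float)              # working copy that is reduced to U
    n = U.shape[0]
    L = np.eye(n)
    perm = np.arange(n)                       # row order, tracked as indices
    for k in range(n - 1):
        # Pick the row at or below k with the largest |entry| in column k.
        p = k + np.argmax(np.abs(U[k:, k]))
        if U[p, k] == 0.0:
            continue                          # nothing to eliminate in this column
        if p != k:
            U[[k, p], :] = U[[p, k], :]       # swap rows of the working matrix
            L[[k, p], :k] = L[[p, k], :k]     # swap the multipliers computed so far
            perm[[k, p]] = perm[[p, k]]
        L[k+1:, k] = U[k+1:, k] / U[k, k]
        U[k+1:, k+1:] -= np.outer(L[k+1:, k], U[k, k+1:])
        U[k+1:, k] = 0.0
    P = np.eye(n)[perm]                       # dense permutation matrix from perm
    return L, U, P

A = np.array([[0.0, 1.0, 2.0],
              [2.0, 1.0, 0.0],
              [4.0, 4.0, 5.0]])
L, U, P = lup_decomp_iterative(A)
residual = np.max(np.abs(P @ A - L @ U))      # should be at rounding-error level
```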

The properties of the recursive leading-row-column LUP decomposition algorithm are:

The computational complexity (number of operations) of the algorithm is $\mathcal{O}(n^3)$ as $n \to \infty$.
The last step in the code that computes ${\bf P}$ does not do so by constructing and multiplying ${\bf P}_2$ and ${\bf P}_1$. This is because this would be an $\mathcal{O}(n^3)$ step, making the whole algorithm $\mathcal{O}(n^4)$. Instead we take advantage of the special structure of ${\bf P}_2$ and ${\bf P}_1$ to compute ${\bf P}$ with $\mathcal{O}(n^2)$ work.
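
The equivalence of the two constructions can be checked directly. In this sketch the size `n`, the 0-based pivot row `i`, and the particular `P22` are arbitrary assumptions chosen for illustration; the dense product ${\bf P}_2 {\bf P}_1$ is compared with an assembly that only copies blocks of ${\bf P}_{22}$.

```python
import numpy as np

n, i = 4, 2                                  # hypothetical size and pivot row (0-based)
P22 = np.eye(n - 1)[[1, 2, 0]]               # an arbitrary (n-1) x (n-1) permutation

# Dense construction: build P1 and P2 explicitly and multiply them.
order = [i] + [r for r in range(n) if r != i]
P1 = np.eye(n)[order]                        # shifts row i to the top, keeps the rest in order
P2 = np.zeros((n, n))
P2[0, 0] = 1.0
P2[1:, 1:] = P22
P_dense = P2 @ P1                            # cubic-cost matrix product

# Structured construction: copy blocks of P22 around a zero column at index i.
P_fast = np.zeros((n, n))
P_fast[0, i] = 1.0
P_fast[1:, :i] = P22[:, :i]
P_fast[1:, i+1:] = P22[:, i:]
```

The structured version touches each entry of the result once, which is the quadratic cost referred to above.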

Solving linear systems using LUP decomposition

Just as with the plain LU decomposition, we can use LUP decomposition to solve the linear system ${\bf A x} = {\bf b}$. The resulting method is the linear solver using LUP decomposition.

The properties of this algorithm are:

The algorithm may fail. In particular, if ${\bf A}$ is singular (or singular in finite precision), ${\bf U}$ will have a zero on its diagonal, so back substitution would divide by zero.
The number of operations in the algorithm is $\mathcal{O}(n^3)$ as $n \to \infty$.

The code for the linear solver using LUP decomposition is:

import numpy as np
def linear_solve(A, b):
    """x = linear_solve(A, b) is the solution to A x = b (computed with partial pivoting)
       A is any matrix
       b is a vector of the same leading dimension as A
       x will be a vector of the same leading dimension as A
    """
    (L, U, P) = lup_decomp(A)
    x = lup_solve(L, U, P, b)
    return x
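
In practice a library routine would normally be used instead of hand-written code. As a brief sketch, NumPy's `np.linalg.solve` calls a LAPACK routine based on LU with partial pivoting; the matrices below are assumptions chosen for illustration. The second call shows the failure mode for an exactly singular matrix.

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [2.0, 1.0]])
b = np.array([1.0, 3.0])
x = np.linalg.solve(A, b)            # works even though the top-left entry is zero

A_singular = np.array([[1.0, 2.0],
                       [2.0, 4.0]])  # second row is twice the first
try:
    np.linalg.solve(A_singular, b)
    failed = False
except np.linalg.LinAlgError:
    failed = True                    # NumPy reports the singular matrix as an error
```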

Example: matrix for which LUP decomposition succeeds but LU decomposition fails

Recall our example of a matrix which has no LU decomposition:

$$
{\bf A} = \begin{bmatrix} 0 & 1 \\ 2 & 1 \end{bmatrix}.
$$

To find the LUP decomposition of ${\bf A}$, we first write the permutation matrix ${\bf P}$ that shifts the second row to the top, so that the top-left entry has the largest possible magnitude. This gives

$$
{\bf P A} = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} 0 & 1 \\ 2 & 1 \end{bmatrix} = \begin{bmatrix} 2 & 1 \\ 0 & 1 \end{bmatrix}.
$$

The pivoted matrix is already upper triangular, so ${\bf L} = {\bf I}$ and ${\bf U} = {\bf P A}$.
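
The row exchange for this matrix can be checked numerically. This is a minimal sketch assuming NumPy; the choice ${\bf L} = {\bf I}$ relies on the observation that the pivoted matrix is already upper triangular.

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [2.0, 1.0]])
P = np.array([[0.0, 1.0],
              [1.0, 0.0]])     # shifts the second row to the top

PA = P @ A                     # pivoted matrix
L = np.eye(2)                  # no elimination needed, so L is the identity
U = PA.copy()                  # the pivoted matrix itself serves as U
```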

Review Questions

Given a factorization ${\bf P A} = {\bf L U}$, how would you solve the system ${\bf A x} = {\bf b}$?
Understand the process of solving a triangular system. Solve an example triangular system.
Recognize and understand Python code implementing forward substitution, back substitution, and LU factorization.
When does an LU factorization exist?
When does an LUP factorization exist?
What special properties do ${\bf P}$, ${\bf L}$, and ${\bf U}$ have?
Can we find an LUP factorization of a singular matrix?
What happens if we try to solve a system ${\bf A x} = {\bf b}$ with a singular matrix ${\bf A}$?
Compute the LU factorization of a small matrix by hand.
Why do we use pivoting when solving linear systems?
How do we choose a pivot element?
What effect does a given permutation matrix have when multiplied by another matrix?
What is the cost of matrix-matrix multiplication?
What is the cost of computing an LU or LUP factorization?
What is the cost of forward or back substitution?
What is the cost of solving ${\bf A x} = {\bf b}$ for a general matrix?
What is the cost of solving ${\bf A x} = {\bf b}$ for a triangular matrix?
What is the cost of solving ${\bf A x} = {\bf b}_i$ with the same matrix ${\bf A}$ and several right-hand side vectors ${\bf b}_i$?
Given a process that takes time $\mathcal{O}(n^k)$, what happens to the runtime if we double the input size (i.e. double $n$)? What if we triple the input size?

ChangeLog

2018-02-28 Erin Carrier ecarrie2@illinois.edu: fix error in ludecomp() code
2018-02-22 Erin Carrier ecarrie2@illinois.edu: update properties for solving using LUP
2018-01-14 Erin Carrier ecarrie2@illinois.edu: removes demo links
2017-11-02 John Doherty jjdoher2@illinois.edu: fixed typo in back substitution
2017-11-02 Arun Lakshmanan lakshma2@illinois.edu: minor fix in lup_solve(), add changelog
2017-10-25 Nathan Bowman nlbowma2@illinois.edu: added review questions
2017-10-23 Erin Carrier ecarrie2@illinois.edu: fix links
2017-10-20 Matthew West mwest@illinois.edu: minor fix in back_sub()
2017-10-19 Nathan Bowman nlbowma2@illinois.edu: minor existence of LUP
2017-10-17 Luke Olson lukeo@illinois.edu: update links
2017-10-17 Erin Carrier ecarrie2@illinois.edu: fixes
2017-10-16 Matthew West mwest@illinois.edu: first complete draft