Run this tutorial

Click here to run this tutorial on mybinder.org:

Please note that starting the notebook server may take a couple of minutes.

Tutorial: Linear time-invariant systems¶

In this tutorial, we discuss finite-dimensional, continuous-time, linear time-invariant (LTI) systems of the form

\[\begin{split}\begin{align} E \dot{x}(t) & = A x(t) + B u(t), \\ y(t) & = C x(t) + D u(t). \end{align}\end{split}\]

where \(u\) is the input, \(x\) the state, and \(y\) the output of the system, and \(A, B, C, D, E\) are matrices of appropriate dimensions (more details can be found in [Ant05]). In pyMOR, these models are captured by LTIModels, which contain the matrices \(A, B, C, D, E\) as Operators. We start by building an LTIModel and then demonstrate some of its properties, using a discretized heat equation as the example.

We focus on a non-parametric example, but parametric LTI systems can be handled similarly by constructing \(A, B, C, D, E\) as parametric Operators and passing parameter values via the mu argument to the methods of the LTIModel.

Note

Discrete-time LTI systems can be constructed by passing positive values for the sampling_time to any constructor of an LTIModel.

Building a model¶

We consider the following one-dimensional heat equation over \((0, 1)\) with two inputs \(u_1, u_2\) and three outputs \(y_1, y_2, y_2\):

\[\begin{split}\begin{align} \partial_t T(\xi, t) & = \partial_{\xi \xi} T(\xi, t) + u_1(t), & 0 < \xi < 1,\ t > 0, \\ -\partial_\xi T(0, t) & = -T(0, t) + u_2(t), & t > 0, \\ \partial_\xi T(1, t) & = -T(1, t), & t > 0, \\ y_1(t) & = T(0, t), & t > 0, \\ y_2(t) & = T(0.5, t), & t > 0, \\ y_3(t) & = T(1, t), & t > 0. \end{align}\end{split}\]

There are many ways of building an LTIModel. Here, we show how to build one from custom matrices, instead of using a discretizer as in Tutorial: Using pyMOR’s discretization toolkit (and the to_lti method of InstationaryModel to obtain an LTIModel). In particular, we will use the from_matrices method of LTIModel, which instantiates an LTIModel from NumPy or SciPy matrices.

First, we do the necessary imports and some matplotlib style choices.

import matplotlib.pyplot as plt
import numpy as np
import scipy.sparse as sps
from pymor.models.iosys import LTIModel

plt.rcParams['axes.grid'] = True

Next, we can assemble the matrices based on a centered finite difference approximation using standard methods of NumPy and SciPy.

import numpy as np
import scipy.sparse as sps

k = 50
n = 2 * k + 1

E = sps.eye(n, format='lil')
E[0, 0] = E[-1, -1] = 0.5
E = E.tocsc()

d0 = n * [-2 * (n - 1)**2]
d1 = (n - 1) * [(n - 1)**2]
A = sps.diags([d1, d0, d1], [-1, 0, 1], format='lil')
A[0, 0] = A[-1, -1] = -n * (n - 1)
A = A.tocsc()

B = np.zeros((n, 2))
B[:, 0] = 1
B[0, 0] = B[-1, 0] = 0.5
B[0, 1] = n - 1

C = np.zeros((3, n))
C[0, 0] = C[1, k] = C[2, -1] = 1

Then, we can create an LTIModel from NumPy and SciPy matrices A, B, C, E.

fom = LTIModel.from_matrices(A, B, C, E=E)

We can take a look at the internal representation of the LTIModel fom.

fom

LTIModel(
    NumpyMatrixOperator(<101x101 sparse, 301 nnz>, source_id='STATE', range_id='STATE'),
    NumpyMatrixOperator(<101x2 dense>, range_id='STATE'),
    NumpyMatrixOperator(<3x101 dense>, source_id='STATE'),
    D=ZeroOperator(NumpyVectorSpace(3), NumpyVectorSpace(2)),
    E=NumpyMatrixOperator(<101x101 sparse, 101 nnz>, source_id='STATE', range_id='STATE'),
    presets={})

From this, we see that the matrices were wrapped in NumpyMatrixOperators, while the default value was chosen for the \(D\) matrix (ZeroOperator). The operators in an LTIModel can be accessed via its attributes, e.g., fom.A is the Operator representing the \(A\) matrix.

We can also see some basic information from fom’s string representation

print(fom)

LTIModel
    class: LTIModel
    number of equations: 101
    number of inputs:    2
    number of outputs:   3
    continuous-time
    linear time-invariant
    solution_space:  NumpyVectorSpace(101, id='STATE')

which gives the dimensions of the underlying system more directly, together with some of its properties.

Transfer function evaluation¶

The transfer function \(H\) is the function such that \(Y(s) = H(s) U(s)\), where \(U\) and \(Y\) are respectively the Laplace transforms of the input \(u\) and the output \(y\), assuming zero initial condition (\(x(0) = 0\)). The expression for \(H\) can be found by applying the Laplace transform to the system equations to obtain

\[\begin{split}\begin{align} s E X(s) & = A X(s) + B U(s), \\ Y(s) & = C X(s) + D U(s). \end{align}\end{split}\]

using that \(s X(s)\) is the Laplace transform of \(\dot{x}(t)\). Eliminating \(X(s)\) leads to

\[Y(s) = \left( C (s E - A)^{-1} B + D \right) U(s),\]

i.e., \(H(s) = C (s E - A)^{-1} B + D\). Note that \(H\) is a matrix-valued rational function (each component is a rational function).

The transfer function of a given LTIModel is stored as the attribute transfer_function. It can be evaluated using its eval_tf method. The result is a NumPy array.

print(fom.transfer_function.eval_tf(0))
print(fom.transfer_function.eval_tf(1))
print(fom.transfer_function.eval_tf(1j))

[[0.5        0.66666667]
 [0.625      0.5       ]
 [0.5        0.33333333]]
[[0.31606222 0.4999973 ]
 [0.39347047 0.30326476]
 [0.31606222 0.18394049]]
[[0.37281526-0.21715839j 0.55742226-0.1956196j ]
 [0.46483929-0.27336178j 0.36331911-0.23241964j]
 [0.37281526-0.21715839j 0.22541935-0.17719566j]]

Similarly, the derivative of the transfer function can be computed using the eval_dtf method. The result is again a NumPy array.

print(fom.transfer_function.eval_dtf(0))
print(fom.transfer_function.eval_dtf(1))
print(fom.transfer_function.eval_dtf(1j))

[[-0.2916625  -0.25926111]
 [-0.36718437 -0.3125    ]
 [-0.2916625  -0.24073889]]
[[-0.11600655 -0.10808607]
 [-0.14598636 -0.12374206]
 [-0.11600655 -0.09196959]]
[[-0.10621989+0.18938184j -0.10083093+0.16301232j]
 [-0.13365757+0.23848281j -0.11317824+0.20350967j]
 [-0.10621989+0.18938184j -0.08260248+0.16036596j]]

To evaluate the transfer function over a sequence of points on the imaginary axis, the freq_resp method can be used. A typical use case is plotting the transfer function, which is discussed in the next section.

Magnitude and Bode plots¶

It is known that if the input is chosen as \(u(t) = a e^{\xi t} \sin(\omega t + \varphi) e_j\) (where \(e_j\) is the \(j\)-th canonical vector), then

\[\lim_{t \to \infty} \left( y_i(t) - a \lvert H_{ij}(\xi + \boldsymbol{\imath} \omega) \rvert e^{\xi t} \sin(\omega t + \varphi + \arg(H_{ij}(\xi + \boldsymbol{\imath} \omega))) \right) = 0.\]

In words, if the input is a pure exponential, the frequency is preserved in the output, the amplitude is multiplied by the amplitude of the transfer function, and the phase is shifted by the argument of the transfer function. In particular, if the input is sinusiodal, i.e., \(\xi = 0\), then the output is also sinusiodal.

It is of interest to plot the transfer function over the imaginary axis to visualize how the LTI system responds to each frequency in the input. Since the transfer function is complex-valued (and matrix-valued), there are multiple ways to plot it.

One way is the “magnitude plot”, a visualization of the mapping \(\omega \mapsto \lVert H(\boldsymbol{\imath} \omega) \rVert\), using the mag_plot method.

w = np.logspace(-2, 8, 300)
_ = fom.transfer_function.mag_plot(w)

_images/9437fc31c5b286261c6315498ae222c04ed06f0ab0b97e01f53ac0caceb7c82f.png

Note that mag_plot computes the Frobenius norm of \(H(\boldsymbol{\imath} \omega)\) by default, just as scipy.linalg.norm. Likewise, the choice of the norm \(\lVert \cdot \rVert\) can be controlled using the ord parameter.

Another visualization is the Bode plot, which shows the magnitude and phase of each component of the transfer function. More specifically, \(\omega \mapsto \lvert H_{ij}(\boldsymbol{\imath} \omega) \rvert\) is in subplot \((2 i - 1, j)\) and \(\omega \mapsto \arg(H_{ij}(\boldsymbol{\imath} \omega))\) is in subplot \((2 i, j)\).

_ = fom.transfer_function.bode_plot(w)

_images/a565fd03dd23ff85e6fd7c1b5855b4a5df4d8a09d7a1e041bbafaf20147847c7.png

System poles¶

The poles of an LTI system are the poles of its transfer function. From the form of the transfer function it follows that the poles are eigenvalues of \(E^{-1} A\), assuming that \(E\) is invertible. Conversely, the eigenvalues of \(E^{-1} A\) are the poles of the system in the generic case (more precisely, if the system is minimal, i.e., controllable and observable; see [Ant05]).

The poles of an LTIModel can be obtained using its poles method (assuming the system is minimal).

poles = fom.poles()
fig, ax = plt.subplots()
ax.plot(poles.real, poles.imag, '.')
_ = ax.set_title('Poles')

_images/ebbe1330be80e25123f20eca730180f9912b512dde9e1634f77337841d67f8c6.png

Note

The poles method uses a dense eigenvalue solver, which is applicable only up to medium-sized problems.

System Gramians¶

The controllability and observability Gramians of an asymptotically stable system with invertible \(E\) are respectively

\[\begin{split}\begin{align*} P & = \int_0^\infty e^{t E^{-1} A} E^{-1} B B^{\operatorname{T}} E^{-\!\operatorname{T}} e^{t A^{\operatorname{T}} E^{-\!\operatorname{T}}} \operatorname{d}\!t, \text{ and} \\ E^{\operatorname{T}} Q E & = \int_0^\infty e^{t A^{\operatorname{T}} E^{-\!\operatorname{T}}} C^{\operatorname{T}} C e^{t E^{-1} A} \operatorname{d}\!t. \end{align*}\end{split}\]

From this, it is clear that \(P\) and \(Q\) are symmetric positive semidefinite. Furthermore, it can be shown that \(P\) and \(Q\) are solutions to Lyapunov equation

\[\begin{split}\begin{align*} A P E^{\operatorname{T}} + E P A^{\operatorname{T}} + B B^{\operatorname{T}} & = 0, \\ A^{\operatorname{T}} Q E + E^{\operatorname{T}} Q A + C^{\operatorname{T}} C & = 0. \end{align*}\end{split}\]

The Gramians can be used to quantify how much does the input influence the state (controllability) and state the output (observability). This is used to motivate the balanced truncation method (see Tutorial: Reducing an LTI system using balanced truncation). Also, they can be used to compute the \(\mathcal{H}_2\) norm (see below).

To find the “Gramians” \(P\) and \(Q\) of an LTIModel, the gramian method can be used. Although solutions to Lyapunov equations are generally dense matrices, they can be often be very well approximated by a low-rank matrix. With gramian, it is possible to compute the dense solution or only the low-rank Cholesky factor. For example, the following computes the low-rank Cholesky factor of the controllability Gramian as a VectorArray:

fom.gramian('c_lrcf')

NumpyVectorArray(
    NumpyVectorSpace(101, id='STATE'),
    [[-9.67050759e-01 -9.49603788e-01 -9.37818152e-01 ... -5.08356138e-01
      -5.03442167e-01 -4.98481264e-01]
     [-1.13133419e+00 -9.59580542e-01 -8.43810516e-01 ...  1.94699113e-01
       1.92880555e-01  1.91000976e-01]
     [ 9.08538963e-01  4.58145972e-01  2.24375612e-01 ...  9.25753046e-02
       9.17614570e-02  9.08841977e-02]
     ...
     [-4.48720858e-17  1.95466800e-15 -3.87392910e-14 ... -1.84179333e-10
       3.10420349e-11 -2.87160762e-12]
     [ 3.97648476e-18 -1.92190365e-16  4.26024322e-15 ... -2.24465928e-11
       1.75879252e-12 -8.24288506e-14]
     [-1.85127015e-17  7.70480426e-16 -1.45373940e-14 ...  1.65200046e-10
      -3.71100064e-11  4.41382006e-12]],
    _len=101)

Hankel singular values¶

The Hankel singular values of an LTI system are \(\sigma_i = \sqrt{\lambda_i(E^{\operatorname{T}} Q E P)}\), where \(\lambda_i\) is the \(i\)-th eigenvalue.

Plotting the Hankel singular values shows us how well an LTI system can be approximated by a reduced-order model. The hsv method can be used to compute them.

hsv = fom.hsv()
fig, ax = plt.subplots()
ax.semilogy(range(1, len(hsv) + 1), hsv, '.-')
_ = ax.set_title('Hankel singular values')

_images/eaa6b5b8b271e3c420d90289ec4d9c2f7eb736282c10d4e0fa2e4b9579405254.png

As expected for a heat equation, the Hankel singular values decay rapidly.

System norms¶

There are various system norms, used for quantifying the sensitivity of system’s outputs to its inputs. pyMOR currently has methods for computing: the \(\mathcal{H}_2\) norm, the \(\mathcal{H}_\infty\) norm, and the Hankel (semi)norm.

The \(\mathcal{H}_2\) norm is (if \(E\) is invertible, \(E^{-1} A\) has eigenvalues in the open left half plane, and \(D\) is zero)

\[\lVert H \rVert_{\mathcal{H}_2} = \left( \frac{1}{2 \pi} \int_{-\infty}^{\infty} \lVert H(\boldsymbol{\imath} \omega) \rVert_{\operatorname{F}}^2 \operatorname{d}\!\omega \right)^{\frac{1}{2}}.\]

It can be shown that

\[\lVert y \rVert_{\mathcal{L}_\infty} \leqslant \lVert H \rVert_{\mathcal{H}_2} \lVert u \rVert_{\mathcal{L}_2}.\]

Additionally, for systems with a single input or a single output (i.e., \(u(t) \in \mathbb{R}\) or \(y(t) \in \mathbb{R}\)),

\[\lVert H \rVert_{\mathcal{H}_2} = \sup_{u \neq 0} \frac{\lVert y \rVert_{\mathcal{L}_\infty}}{\lVert u \rVert_{\mathcal{L}_2}}.\]

The computation of the \(\mathcal{H}_2\) norm is based on the system Gramians

\[\lVert H \rVert_{\mathcal{H}_2}^2 = \operatorname{tr}\!\left(C P C^{\operatorname{T}}\right) = \operatorname{tr}\!\left(B^{\operatorname{T}} Q B\right).\]

The h2_norm method of an LTIModel can be used to compute it.

fom.h2_norm()

2.0377751533822073

The \(\mathcal{H}_\infty\) norm is (if \(E\) is invertible and \(E^{-1} A\) has eigenvalues in the open left half plane)

\[\lVert H \rVert_{\mathcal{H}_\infty} = \sup_{\omega \in \mathbb{R}} \lVert H(\boldsymbol{\imath} \omega) \rVert_2.\]

It is always true that

\[\lVert H \rVert_{\mathcal{H}_\infty} = \sup_{u \neq 0} \frac{\lVert y \rVert_{\mathcal{L}_2}}{\lVert u \rVert_{\mathcal{L}_2}},\]

and, in particular,

\[\lVert y \rVert_{\mathcal{L}_2} \leqslant \lVert H \rVert_{\mathcal{H}_\infty} \lVert u \rVert_{\mathcal{L}_2}.\]

The hinf_norm method uses a dense solver from Slycot to compute the \(\mathcal{H}_\infty\) norm.

fom.hinf_norm()

1.2890705455704852

The Hankel norm is (if \(E\) is invertible and \(E^{-1} A\) has eigenvalues in the open left half plane)

\[\lVert H \rVert_{\operatorname{H}} = \sigma_1,\]

i.e., the largest Hankel singular value. Since it is independent of \(D\), the “Hankel norm” is only a seminorm in general.

It can be shown that the Hankel norm is the norm of the Hankel operator \(\mathcal{H} \colon \mathcal{L}_2(-\infty, 0) \to \mathcal{L}_2(0, \infty)\) mapping past inputs \(u_-\) to future outputs \(y_+\)

\[y_+(t) = \mathcal{H}(u_-)(t) = \int_{-\infty}^0 h(t - \tau) u_-(\tau) \operatorname{d}\!\tau,\]

where \(h\) is the impulse response \(h(t) = C e^{t E^{-1} A} E^{-1} B + D \delta(t)\) (i.e., \(H\) is the Laplace transform of \(h\)). Thus,

\[\lVert H \rVert_{\operatorname{H}} = \sup_{u_- \neq 0} \frac{\lVert y_+ \rVert_{\mathcal{L}_2}}{\lVert u_- \rVert_{\mathcal{L}_2}},\]

The computation of the Hankel norm in hankel_norm relies on the hsv method.

fom.hankel_norm()

0.6221361786735345

Download the code: tutorial_lti_systems.md, tutorial_lti_systems.ipynb.