A priori error estimates for optimal control problems governed by the transient Stokes equations and subject to state constraints pointwise in time

Abstract

In this paper, we consider a state constrained optimal control problem governed by the transient Stokes equations. The state constraint is given by an |$L^{2}$| functional in space, which is required to fulfill a pointwise bound in time. The discretization scheme for the Stokes equations consists of inf-sup stable finite elements in space and a discontinuous Galerkin method in time, for which we have recently established best approximation type error estimates. Using these error estimates, for the discrete control problem we derive error estimates and as a by-product we show an improved regularity for the optimal control. We complement our theoretical analysis with numerical results.

Keywords: Stokes equations, instationary, optimal control, state constraints, error estimates

1. Introduction

In this paper, we consider the following optimal control problem

$$ \begin{align}& \text{Minimize} \ J({\textbf{q}},{\textbf{u}}):= \frac{1}{2} \int_{0}^{T}\lVert{\textbf{u}}(t) - {{\textbf{u}}}_{d}(t)\rVert^{2}_{L^{2}(\varOmega)}\ \text{d}t + \frac{\alpha}{2} \int_{0}^{T} \lVert{\textbf{q}}(t)\rVert^{2}_{L^{2}(\varOmega)}\ \text{d}t\end{align} $$

(1.1a)

subject to the state equation

$$ \begin{align}& \begin{aligned} \partial_{t} {\textbf{u}} - \varDelta{\textbf{u}} + \nabla p &= {\textbf{q}} &\quad &\text{in}\ I \times \varOmega,\\ \nabla\cdot{\textbf{u}} &=0 &\quad &\text{in}\ \ I \times \varOmega,\\{\textbf{u}} &= {\textbf{0}} &\quad &\text{on}\ I \times \partial \varOmega,\\{\textbf{u}}(0) & = {\textbf{0}} &\quad &\text{in}\ \ \varOmega, \end{aligned}\end{align} $$

(1.1b)

control constraints

$$ \begin{align}& {\textbf{q}}_{a} \le{\textbf{q}}(t,x) \le{\textbf{q}}_{b} \quad \text{for almost all}\ \; (t,x) \in I\times \varOmega\end{align} $$

(1.1c)

and state constraints

$$ \begin{align}& \int_\varOmega{\textbf{u}}(t,x)\cdot \mathbf{w}(x) \ \text{d}x \le \beta \quad \text{for all} \ t \in \bar{I}.\end{align} $$

(1.1d)

Here we assume that |$\varOmega \subset{\mathbb{R}}^{d}$|⁠, |$d=2,3$|⁠, is a convex polygonal or polyhedral domain and |$I= (0,T]$| is a bounded time interval. In the objective function, |${\textbf{u}}_{d} \in L^{2}(I\times \varOmega )^{d}$| represents the desired state and |$\alpha> 0$| is the regularization parameter. The control constraints are given by the constant vectors |${\textbf{q}}_{a},{\textbf{q}}_{b}\in{({\mathbb{R}} \cup \{\pm \infty \})^{d}}$| and satisfy |${\textbf{q}}_{a} < {\textbf{q}}_{b}$|⁠. In the state constraint, the constant scalar |$\beta $| satisfies |$\beta> 0$| and |$\mathbf{w}(x)$| is a given function in |$L^{2}(\varOmega )^{d}$|⁠. Note that for ease of presentation, we consider an optimal control problem with homogeneous initial data |${\textbf{u}}(0) = {\textbf{0}}$|⁠, while all results also extend to the inhomogeneous case |${\textbf{u}}(0) = {\textbf{u}}_{0}$|⁠. The main result of this paper states that the error between the optimal control |$\bar{{\textbf{q}}}$| for the continuous problem and the optimal solution |$\bar{{\textbf{q}}}_{\sigma }$| of the discretized problem satisfies

$$ \begin{align*} & \lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)} \le \frac{C}{\sqrt{\alpha}} \, \ln{\frac{T}{k}} (k^{\frac{1}{2}}+h), \end{align*} $$

and is presented in Theorem 6.6. A similar optimal control problem subject to the heat equation was considered in Meidner et al. (2011), where a comparable error estimate was derived. The authors of Ludovici & Wollner (2015); Ludovici et al. (2018) discuss error estimates for parabolic problems with purely time-dependent controls and impose constraints on spatial averages of either function values or gradients of the state at every point in time. Error estimates for state constrained parabolic problems, with state constraints applied pointwise in time and space, can be found in Deckelnick & Hinze (2011); Gong & Hinze (2013); Christof & Vexler (2021).

The optimal control of flow phenomena subject to state constraints is a very active research topic, and there have been numerous contributions to the field, see, e.g., De Los Reyes & Kunisch (2006); De Los Reyes & Griesse (2008); De Los Reyes & Yousept (2009) for optimal control of the stationary Navier–Stokes equations and Fattorini & Sritharan (1998); Wang (2002a,b); Wang & Wang (2003); Liu (2010) for the transient Navier–Stokes equations, subject to general state constraints of the form |${\textbf{u}} \in \mathcal C$|. Note that the above references only contain the analysis of the continuous problems and some numerical results, but no derivation of error estimates. In De Los Reyes et al. (2008), error estimates for an optimal control problem subject to the stationary Stokes equations with pointwise state constraints are shown. Let us also specifically mention John & Wachsmuth (2009), where an optimal control problem of the stationary Navier–Stokes equations was considered, and a constraint was put onto the drag functional |$\int _{\partial \varOmega } (\partial _{n} {\textbf{u}} - p \mathbf{n}) \mathbf{e}_{d} \ \text{d}s$|, for some given direction of interest specified by the unit vector |$\mathbf{e}_{d}$|. The setting of our work, constraining an |$L^{2}(\varOmega )$| functional pointwise in time, can be seen as a step towards discussing transient problems with drag/lift constraints at every point in time.

The rest of the paper is structured as follows. In Section 2, we introduce the notation and present some analysis of the transient Stokes problem used in this paper. We then proceed to discuss the optimal control problem, including well-posedness and optimality conditions, in Section 3. Depending on the regularity of available data, we discuss regularity and structural properties of the optimal solution. We introduce the discretization of the transient Stokes problem in Section 4 and recollect some important error estimates. This allows us to discuss the discrete formulations of the optimal control problem. First, in Section 5, we present the analysis and error estimates for a problem with variational discretization, cf. Hinze (2005); Deckelnick & Hinze (2011), where only the state equation is discretized, but the control is not. Subsequently, we discretize the control by piecewise constant functions in space and time, and present the analysis of the fully discrete problem in Section 6, which contains the main result of this work, Theorem 6.6. We conclude our work by using the derived error estimates to obtain improved regularity for the optimal control in Section 7 and by presenting numerical results in Section 8.

2. Notation and preliminary results

We will use the standard notation for the Lebesgue and Sobolev spaces over the spatial domain |$\varOmega $|. The pressure space is

$$ \begin{align*}& L^{2}_{0}(\varOmega):= \left\{p \in L^{2}(\varOmega) \colon \int_\varOmega p \ \text{d}x = 0\right\}. \end{align*} $$

Throughout the paper, vector valued quantities and spaces will be indicated by boldface letters. For a Banach space |$X$| and |$1 \le p \le \infty $|, we denote by |$L^{p}(I;X)$| the Bochner space of |$X$|-valued functions over |$I$| whose |$X$|-norm is |$p$|-integrable with respect to time. If |$X$| is reflexive and |$1 \le p < \infty $|, there holds the following isomorphism, see Hytönen et al. (2016, Corollary 1.3.22):

$$ \begin{align}& L^{p}(I;X)^{*} \cong L^{q}(I;X^{*}), \qquad \text{where}\ \frac{1}{p} + \frac{1}{q} = 1.\end{align} $$

(2.1)

Note that the range includes the value |$p=1$| and not |$p= \infty $|⁠. The dual space of |$C(\bar I;L^{2}(\varOmega )^{d})$| is isomorphic to the space of regular |$L^{2}(\varOmega )^{d}$|-valued Borel measures, and we denote it by |$\mathcal M(\bar I;L^{2}(\varOmega )^{d}) \cong (C(\bar I;{L^{2}(\varOmega )^{d}}))^{*}$|⁠. Similarly for scalar regular Borel measures of |$\bar I$|⁠, we use the notation |$\mathcal M(\bar I) \cong (C(\bar I))^{*}$|⁠. To denote the vector valued spaces of divergence free functions, having various levels of regularity, we use the following notation:

$$ \begin{align*} \mathbf{V}_{0} :=&\ \{{\textbf{v}} \in L^{2}(\varOmega)^{d}: \ \nabla \cdot{\textbf{v}} = 0 \land{\textbf{v}} \cdot \mathbf{n} = 0 \text{ on}\ \partial \varOmega\},\\\mathbf{V}:=&\ \{{\textbf{v}}\in H^{1}_{0}(\varOmega)^{d}: \ \nabla\cdot{\textbf{v}}=0 \},\\ \mathbf{V}_{2}:=& \ H^{2}(\varOmega)^{d} \cap \mathbf{V}, \end{align*} $$

where by |${\textbf{v}}\cdot \mathbf{n}$| we denote the generalized normal trace. We denote by |$\mathbf{V}^{*}$| the topological dual space of |$\mathbf{V}$|⁠, and define

$$ \begin{align} \mathbf{W} &:= L^{2}(I;\mathbf{V})\cap H^{1}(I;\mathbf{V}^{*})\hookrightarrow C(\bar{I};{\mathbf{V}_{0}}), \notag\\ \mathbf{X} &:= \{{\textbf{v}} \in L^{2}(I;\mathbf{V}): \partial_{t} {\textbf{v}} \in L^{2}(I;\mathbf{V}^{*}) + L^{1}(I;\mathbf{V}_{0}) \text{ and}\ {\textbf{v}}(0)={\textbf{0}}\}, \notag \\ \mathbf{Y} &:= L^{2}(I;\mathbf{V}^{*}) + L^{1}(I;\mathbf{V}_{0}) \quad \Rightarrow \quad \mathbf{Y}^{*} \cong L^{2}(I;\mathbf{V}) \cap L^\infty(I;\mathbf{V}_{0}) \quad \text{ due to }\ ({2.1}).\notag\end{align} $$

We denote by |$\left (\cdot ,\cdot \right )_{\varOmega }$| and |$\left (\cdot ,\cdot \right )_{I\times \varOmega }$| the inner products of |$L^{2}(\varOmega )^{d}$| and |$L^{2}(I;L^{2}(\varOmega )^{d})$|⁠, respectively, and by |$\left \langle \cdot ,\cdot \right \rangle _{I\times \varOmega }$| the duality pairing between |$L^{2}(I;\mathbf{V})$| and |$L^{2}(I;\mathbf{V}^{*})$|⁠. We introduce the Stokes operator |$A$|⁠, defined by

$$ \begin{align*}& A \colon D(A) \subset \mathbf{V}_{0} \to \mathbf{V}_{0}, \ \left(A {\textbf{u}},{\textbf{v}}\right)_{\varOmega} = \left(\nabla{\textbf{u}},\nabla{\textbf{v}}\right)_{\varOmega}. \end{align*} $$

The |$H^{2}$| regularity results of Kellogg & Osborn (1976); Dauge (1989) show that |$D(A) = \mathbf{V}_{2}$|⁠. As |$A$| is a positive, selfadjoint operator, fractional powers |$A^{s}$| are well defined. Of special importance is |$A^{\frac{1}{2}}$| which is an isometric isomorphism

$$ \begin{align*}& A^{\frac{1}{2}} \colon D(A^{\frac{1}{2}}) = \mathbf{V} \to \mathbf{V}_{0}, \end{align*} $$

as it holds |$\|A^{\frac{1}{2}} {\textbf{u}}\|_{L^{2}(\varOmega )}^{2} = \left (A {\textbf{u}},{\textbf{u}}\right )_{\varOmega } = \left (\nabla{\textbf{u}},\nabla{\textbf{u}}\right )_{\varOmega } = \|\nabla{\textbf{u}}\|_{L^{2}(\varOmega )}^{2}$|⁠. For the proof of |$D(A^{\frac{1}{2}}) = \mathbf{V}$|⁠, see (Sohr, 2001, Ch. III, Lemma 2.2.1). By its definition, we can extend |$A$| to an operator (denoted by the same symbol) |$A \colon \mathbf{V} \to \mathbf{V}^{*}$| yielding another isometric isomorphism between those spaces. Lastly, as

$$ \begin{align*}& \|A^{\frac{1}{2}}{\textbf{u}}\|_{\mathbf{V}^{*}} = \sup_{{\textbf{v}} \in \mathbf{V}} \dfrac{\left\langle A^{\frac{1}{2}} {\textbf{u}},{\textbf{v}}\right\rangle_{\varOmega}}{\|{\textbf{v}}\|_{\mathbf{V}}} = \sup_{{\textbf{w}} \in \mathbf{V}_{0}} \dfrac{\left\langle A^{\frac{1}{2}} {\textbf{u}}, A^{-\frac{1}{2}}{\textbf{w}}\right\rangle_{\varOmega}}{\|A^{-\frac{1}{2}} {\textbf{w}}\|_{\mathbf{V}}} = \sup_{{\textbf{w}} \in \mathbf{V}_{0}} \dfrac{\left({\textbf{u}}, {\textbf{w}}\right)_{\varOmega}}{\|A^{-\frac{1}{2}} {\textbf{w}}\|_{\mathbf{V}}} = \sup_{{\textbf{w}} \in \mathbf{V}_{0}} \dfrac{\left({\textbf{u}}, {\textbf{w}}\right)_{\varOmega}}{\|{\textbf{w}}\|_{\mathbf{V}_{0}}} = \|{\textbf{u}}\|_{\mathbf{V}_{0}}, \end{align*} $$

there holds the isometric isomorphism

$$ \begin{align}& A^{\frac{1}{2}} \colon \mathbf{V}_{0} \to \mathbf{V}^{*}.\end{align} $$

(2.2)
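The isometry properties above have a simple finite-dimensional analogue that can be checked numerically. The following sketch is only an illustration under assumptions made here: the Stokes operator is replaced by an arbitrary symmetric positive definite matrix `A` (not a discretization used in this paper), the Euclidean norm plays the role of the |$\mathbf{V}_{0}$|-norm, |$\|A^{1/2}\cdot \|$| that of the |$\mathbf{V}$|-norm and |$\|A^{-1/2}\cdot \|$| that of the |$\mathbf{V}^{*}$|-norm. The helper name `sqrt_spd` is hypothetical.

```python
import numpy as np

def sqrt_spd(A):
    """Return A^{1/2} and A^{-1/2} of a symmetric positive definite matrix."""
    lam, Q = np.linalg.eigh(A)
    return (Q * np.sqrt(lam)) @ Q.T, (Q / np.sqrt(lam)) @ Q.T

rng = np.random.default_rng(0)
n = 6
B = rng.standard_normal((n, n))
A = B @ B.T + n * np.eye(n)   # hypothetical SPD stand-in for the Stokes operator
A_half, A_minus_half = sqrt_spd(A)
u = rng.standard_normal(n)

# Analogue of ||A^{1/2} u||^2 = (A u, u)
lhs_V = np.linalg.norm(A_half @ u) ** 2
rhs_V = u @ (A @ u)

# Analogue of (2.2): the dual norm of f = A^{1/2} u is ||A^{-1/2} f|| = ||u||
dual_norm = np.linalg.norm(A_minus_half @ (A_half @ u))
```

In this matrix setting both identities hold up to rounding, which mirrors the chain of equalities used to derive (2.2).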

Using the above defined function spaces, the weak formulation of the state equation (1.1b) for a given |${\textbf{q}}\in L^{1}(I;L^{2}(\varOmega )^{d}) + L^{2}(I;\mathbf{V}^{*})$| reads as follows: Find |${\textbf{u}}\in{\mathbf{X}}$| such that

$$ \begin{align}& \begin{aligned} \left\langle\partial_{t} {\textbf{u}},{\textbf{v}}\right\rangle_{\varOmega}+\left(\nabla{\textbf{u}},\nabla{\textbf{v}}\right)_{\varOmega} &= \left({\textbf{q}},{\textbf{v}}\right)_{\varOmega}\quad{\text{for all}\ {\textbf{v}} \in \mathbf{V}, \ \text{a.e. in}\ I}. \end{aligned}\end{align} $$

(2.3)

For the above weak formulation, there holds the following result.

Theorem 2.1.

For |${\textbf{q}} \in L^{1}(I;{L^{2}(\varOmega )^{d}}) + L^{2}(I;\mathbf{V}^{*})$|⁠, there exists a unique solution |${\textbf{u}} \in L^{2}(I;\mathbf{V}) \cap C(\bar I;\mathbf{V}_{0})$| solving (2.3) and the following estimate holds:

$$ \begin{align*}& \|{\textbf{u}}\|_{L^{2}(I;\mathbf{V})} + \|{\textbf{u}}\|_{L^\infty(I;L^{2}(\varOmega))} \le C \|{\textbf{q}}\|_{L^{1}(I;\mathbf{V}_{0}) + L^{2}(I;\mathbf{V}^{*})}. \end{align*} $$

If |${\textbf{q}} \in L^{2}(I;{L^{2}(\varOmega )^{d}})$|⁠, then |${\textbf{u}} \in L^{2}(I;{\mathbf{V}_{2}}) \cap H^{1}(I;{\mathbf{V}_{0}}) \hookrightarrow C(\bar I;\mathbf{V})$|⁠, and there holds

$$ \begin{align*}& \|{\textbf{u}}\|_{L^{2}(I;H^{2}(\varOmega))} + \|{\textbf{u}}\|_{L^\infty(I;\mathbf{V})} + \|\partial_{t} {\textbf{u}}\|_{L^{2}(I\times \varOmega)} \le C \|{\textbf{q}}\|_{L^{2}(I\times \varOmega)}. \end{align*} $$

Proof.

The first part of this theorem is proven in Temam (1977, Chapter III, Theorem 1.1) and the remark on page 179 therein. The |$H^{2}$| regularity part can be shown as in Temam (1977, Chapter III, Proposition 1.2) using the |$H^{2}$| regularity result for the stationary Stokes equations in convex polygonal/polyhedral domains, proven in Dauge (1989, Theorems 5.5, 6.3); see also Kellogg & Osborn (1976, Theorem 2).

It is a classical result that the Stokes operator in the Hilbert space setting exhibits maximal parabolic regularity, i.e.

$$ \begin{align}& {\textbf{q}} \in L^{p}(I;\mathbf{V}_{0}), 1<p<\infty \quad \Rightarrow \quad{\textbf{u}} \text{ solving } ({2.3}) \ \mathrm{satisfies}\ \partial_{t} {\textbf{u}}, A{\textbf{u}} \in L^{p}(I;\mathbf{V}_{0}),\end{align} $$

(2.4)

see Behringer et al. (2023, Proposition 2.6), holding on fairly general domains, e.g. Lipschitz domains. If |$\varOmega $| is convex, the |$H^{2}$| regularity results further imply that |${\textbf{u}} \in L^{p}(I;\mathbf{V}_{2})$|. Most often, maximal parabolic regularity is treated in an |$L^{2}$| or |$L^{q}$| setting in space, but it can also be extended to settings of weaker spatial regularity. The property (2.2) combined with Auscher et al. (2015, Lemma 11.4) yields that maximal parabolic regularity also holds in |$\mathbf{V}^{*}$|, i.e.

$$ \begin{align}& {\textbf{q}} \in L^{p}(I;\mathbf{V}^{*}), 1 < p < \infty \quad \Rightarrow \quad{\textbf{u}} \text{ solving } ({2.3})\ \mathrm{satisfies}\ \partial_{t} {\textbf{u}} \in L^{p}(I;\mathbf{V}^{*}), {\textbf{u}} \in L^{p}(I;\mathbf{V}).\end{align} $$

(2.5)

If the right-hand side |${\textbf{q}}$| is regular enough, there exists an associated pressure to the weak solution of (2.3), formulated in divergence free spaces. The regularity of the pressure depends on the regularity of the right-hand side and the velocity component of the solution. There holds the following result, see Behringer et al. (2023, Theorem 2.10, Corollary 2.11).

Proposition 2.2.

Let |${\textbf{q}} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$1< s < \infty $| and let |${\textbf{u}} \in L^{2}(I;\mathbf{V}) \cap{C(\bar I;\mathbf{V}_{0})}$| be the weak solution to (2.3). Then there exists a unique |$p \in L^{s}(I;L^{2}_{0}(\varOmega ))$| such that

$$ \begin{align}& \partial_{t} {\textbf{u}} - \varDelta{\textbf{u}} + \nabla p = {\textbf{q}},\end{align} $$

(2.6)

which is to be understood as an identity in |$L^{s}(I;H^{-1}(\varOmega )^{d})$|⁠. On convex domains |$\varOmega $|⁠, the pressure satisfies |$p \in L^{s}(I;H^{1}(\varOmega ))$|⁠.

3. Continuous optimal control problem

We now introduce the control to state mapping.

Theorem 3.1.

Let |$S\colon{{\textbf{q}}}\mapsto{{\textbf{u}}}$| denote the solution operator for the state equation (2.3). Then |$S$| is a bounded linear operator between the following spaces:

|$S\colon L^{2}(I;\mathbf{V}^{*})\to \mathbf{W} \hookrightarrow C(\bar{I};L^{2}(\varOmega )^{d})$|
|$S\colon L^{1}(I;\mathbf{V}_{0})\to C(\bar{I};L^{2}(\varOmega )^{d})$|
|$S\colon L^\infty (I;L^{2}(\varOmega )^{d})\to W^{1,s}(I;L^{2}(\varOmega )^{d})\cap L^{s}(I;H^{2}(\varOmega )^{d})$|⁠, |$1 \le s<\infty $|⁠.

Proof.

The first two claims are direct consequences of Theorem 2.1. Note that the regularity |$\partial _{t} {\textbf{u}} \in L^{2}(I;\mathbf{V}^{*})$| can only be obtained by bootstrapping if the right-hand side is in |$L^{2}(I;\mathbf{V}^{*})$|. The third claim is obtained by using the maximal parabolic regularity and the |$H^{2}$| regularity of the stationary Stokes problem on convex polygonal/polyhedral domains, see Dauge (1989, Theorem 5.5, Theorem 6.3).

To abbreviate the notation, we will frequently use |${\textbf{u}}({\textbf{q}}):= S({\textbf{q}})$|⁠. Since |$S$| is a bounded linear operator between the spaces introduced in the previous theorem, it is Fréchet differentiable, and its directional derivative in direction |${\mathbf{\delta }\mathbf{q}}$| satisfies

$$ \begin{align*}& S^{\prime}({\textbf{q}})({\mathbf{\delta}\mathbf{q}}) = S({\mathbf{\delta}\mathbf{q}}), \end{align*} $$

i.e. is independent of |${\textbf{q}}$|⁠.
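The statement that the derivative of |$S$| does not depend on the linearization point can be illustrated on a small linear model problem. The sketch below is not the discretization analyzed in this paper: it assumes an implicit Euler scheme for |$u^{\prime} + Au = q$|, |$u(0)=0$|, with a hypothetical symmetric positive definite matrix `A` in place of the Stokes operator, and the function name `solve_state` is chosen here for illustration.

```python
import numpy as np

def solve_state(A, q, k):
    """Implicit Euler for u' + A u = q, u(0) = 0.

    q has shape (M, n): one right-hand side per time step of length k.
    Returns u with shape (M, n), the values at t_1, ..., t_M.
    """
    M, n = q.shape
    step = np.eye(n) + k * A
    u = np.zeros((M, n))
    prev = np.zeros(n)
    for m in range(M):
        prev = np.linalg.solve(step, prev + k * q[m])
        u[m] = prev
    return u

rng = np.random.default_rng(1)
n, M, T = 5, 40, 1.0
k = T / M
B = rng.standard_normal((n, n))
A = B @ B.T + np.eye(n)          # hypothetical SPD stand-in for the Stokes operator
q = rng.standard_normal((M, n))
dq = rng.standard_normal((M, n))

# Difference quotient of the solution map at q in direction dq
eps = 0.37
quotient = (solve_state(A, q + eps * dq, k) - solve_state(A, q, k)) / eps
# Solution map applied to the direction itself
derivative = solve_state(A, dq, k)
```

Because the model solution map is linear, `quotient` and `derivative` agree up to rounding for any step size `eps` and any base point `q`, which is the discrete counterpart of |$S^{\prime}({\textbf{q}})({\mathbf{\delta }\mathbf{q}}) = S({\mathbf{\delta }\mathbf{q}})$|.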

Remark 3.2.

The operator |$S$| is linear and coincides with its Fréchet derivative, due to our choice to work with homogeneous initial data in the state equation (1.1b). The results presented in this work, however, also hold true for inhomogeneous initial data. In that case, the assumption |$\beta>0$| on the parameter |$\beta $| of the state constraint has to be generalized to |$\beta> \left ({\textbf{u}}_{0},{\textbf{w}}\right )_{\varOmega }$|.

Let us next give a characterization of the adjoint operator |$S^{*}$|⁠. Theorem 2.1, together with Boyer & Fabrie (2013, Proposition V.1.3), yields that for any |${\textbf{q}} \in{\mathbf{Y}}$|⁠, there exists a unique solution |${\textbf{u}} \in \mathbf{X}$| such that

$$ \begin{align}& a({\textbf{u}},{\textbf{v}}):= \left\langle\partial_{t} {\textbf{u}},{\textbf{v}}\right\rangle_{I\times\varOmega} + \left(\nabla{\textbf{u}},\nabla{\textbf{v}}\right)_{I\times\varOmega} = \left\langle{\textbf{q}},{\textbf{v}}\right\rangle_{I\times\varOmega} \qquad \forall{\textbf{v}} \in{\mathbf{Y}^{*}},\end{align} $$

(3.1)

and the two formulations (2.3) and (3.1) are equivalent. The solution operator |$S$| is the inverse of the operator |$T\colon \mathbf{X} \to \mathbf{Y}$|⁠, |$\langle T {\textbf{u}},{\textbf{v}}\rangle _{\mathbf{Y}\times \mathbf{Y}^{*}} = a({\textbf{u}},{\textbf{v}})$|⁠. As |$T$| is invertible, so is |$T^{*}$| and its inverse is precisely |$S^{*}$|⁠. With this construction, the adjoint operator |$S^{*}: \mathbf{g} \mapsto{\textbf{z}}$| corresponds to the following weak formulation: given |$\mathbf{g} \in \mathbf{X}^{*}$|⁠, find |${\textbf{z}} \in{\mathbf{Y}^{*}}$| satisfying

$$ \begin{align}& \left\langle\partial_{t} {\textbf{v}},{\textbf{z}}\right\rangle_{I\times\varOmega} + \left(\nabla{\textbf{v}},\nabla{\textbf{z}}\right)_{I\times\varOmega} = \left(\mathbf{g},{\textbf{v}}\right)_{I\times\varOmega} \quad \forall{\textbf{v}} \in \mathbf{X}.\end{align} $$

(3.2)
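The duality between the forward problem (3.1) and the backward problem (3.2) has a direct time-discrete analogue. The following sketch assumes an implicit Euler scheme and a hypothetical matrix `A` in place of the Stokes operator, and it only treats function-valued data `g` (no measures). The names `solve_forward` and `solve_adjoint` are illustrative. It compares the discrete pairings |$(S{\textbf{q}},\mathbf{g})_{I\times \varOmega }$| and |$({\textbf{q}},S^{*}\mathbf{g})_{I\times \varOmega }$|.

```python
import numpy as np

def solve_forward(A, q, k):
    """Implicit Euler for u' + A u = q with u(0) = 0, marching forward in time."""
    M, n = q.shape
    step = np.eye(n) + k * A
    u, prev = np.zeros((M, n)), np.zeros(n)
    for m in range(M):
        prev = np.linalg.solve(step, prev + k * q[m])
        u[m] = prev
    return u

def solve_adjoint(A, g, k):
    """Transposed scheme for -z' + A z = g with z(T) = 0, marching backward."""
    M, n = g.shape
    step = (np.eye(n) + k * A).T
    z, nxt = np.zeros((M, n)), np.zeros(n)
    for m in reversed(range(M)):
        nxt = np.linalg.solve(step, nxt + k * g[m])
        z[m] = nxt
    return z

rng = np.random.default_rng(3)
n, M, T = 4, 30, 1.0
k = T / M
B = rng.standard_normal((n, n))
A = B @ B.T + np.eye(n)          # hypothetical SPD stand-in for the Stokes operator
q = rng.standard_normal((M, n))
g = rng.standard_normal((M, n))

# Discrete space-time inner products (S q, g) and (q, S* g)
pairing_state = k * np.sum(solve_forward(A, q, k) * g)
pairing_adjoint = k * np.sum(q * solve_adjoint(A, g, k))
```

The two pairings coincide up to rounding because the backward sweep is exactly the transpose of the forward time-stepping matrix.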

We now discuss the regularity of the adjoint operator.

Corollary 3.3.

Let |$S^{*}$| denote the adjoint operator to the solution operator introduced in Theorem 3.1. Then it satisfies

$$ \begin{align*}& S^{*}\colon \mathcal M(\bar I;L^{2}(\varOmega)^{d}) \to L^{2}(I;\mathbf{V})\cap L^\infty(I;\mathbf{V}_{0}). \end{align*} $$

Proof.

This is a direct consequence of the definition of the adjoint operator, Theorem 3.1, and the isomorphism |$(L^{1}(I;\mathbf{V}_{0}))^{*} \cong L^\infty (I;\mathbf{V}_{0})$|⁠, due to (2.1).

Due to the linearity of the adjoint operator, it again holds

$$ \begin{align*}& S^{*}({\mathbf{\delta}\mathbf{q}}) = (S^{\prime}({\textbf{q}}))^{*}({\mathbf{\delta}\mathbf{q}}). \end{align*} $$

For convenience, for the weight |$\mathbf{w}\in L^{2}(\varOmega )^{d}$|⁠, we define the functional |$G_{\mathbf{w}}\colon L^{2}(\varOmega )^{d}\to \mathbb{R}$| by

$$ \begin{align}& G_{\mathbf{w}}({\textbf{v}}):=({\textbf{v}},\mathbf{w})_{\varOmega}.\end{align} $$

(3.3)

For time-dependent functions |${\textbf{v}} \colon I\to L^{2}(\varOmega )^{d}$|⁠, the application of |$G_{\mathbf{w}}$| is defined by

$$ \begin{align*} & G_{\textbf{w}}({\textbf{v}})(t):=G_{\textbf{w}}({\textbf{v}}(t)). \end{align*} $$

Using the functional |$G_{\mathbf{w}}$|⁠, the state constraint (1.1d) can be compactly rewritten as

$$ \begin{align}& G_{\mathbf{w}}({\textbf{u}})\le \beta\quad \text{in}\ \ \bar{I}.\end{align} $$

(3.4)

Remark 3.4.

Due to the continuous embedding |$\mathbf{W}\hookrightarrow C(\bar{I};L^{2}(\varOmega )^{d})$|⁠, we have |$G_{\mathbf{w}}({\textbf{v}})(\cdot )\in C(\bar{I})$| for any |${\textbf{v}}\in \mathbf{W}$|⁠. Thus we can understand |$G_{\mathbf{w}}$| as a linear, continuous operator from |$\mathbf{W}$| to |$C(\bar I)$|⁠.
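In computations, the functional |$G_{\mathbf{w}}$| is evaluated on sampled data. The sketch below is a hypothetical illustration, not the discretization of this paper: it assumes snapshots of the state on cells with given volumes and a simple weighted sum as quadrature, and it checks the constraint (3.4) only at the sampled times, whereas (3.4) is required for all |$t \in \bar{I}$|. The names `G_w` and `is_feasible` are chosen here.

```python
import numpy as np

def G_w(u, w, cell_volumes):
    """Approximate G_w(u)(t_m) = (u(t_m), w)_Omega by a weighted sum.

    u: (M, N, d) snapshots of a vector field at M times and N cells,
    w: (N, d) weight field, cell_volumes: (N,) quadrature weights.
    """
    return np.einsum("mnd,nd,n->m", u, w, cell_volumes)

def is_feasible(u, w, cell_volumes, beta):
    """Check the constraint G_w(u)(t_m) <= beta at all sampled times."""
    return bool(np.max(G_w(u, w, cell_volumes)) <= beta)

# Toy data: unit square split into 4 cells, u(t) = (t, 0), w = (1, 1)
times = np.array([0.0, 0.5, 1.0])
cell_volumes = np.full(4, 0.25)
w = np.ones((4, 2))
u = np.zeros((3, 4, 2))
u[:, :, 0] = times[:, None]

values = G_w(u, w, cell_volumes)   # equals the sampled times for this toy data
```

For this toy data `values` reproduces `times`, so `is_feasible(u, w, cell_volumes, beta=0.75)` is false while `beta=1.5` gives a feasible state at the sampled times.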

To write the optimal control problem (1.1a)–(1.1d) in reduced form, we define |$\mathcal{G}:=G_{\mathbf{w}}\circ S$| and the closed convex set |$\mathcal{K}\subset C(\bar{I})$| by

$$ \begin{align*} & \mathcal{K}:=\{v\in C(\bar{I}): \ v\le \beta\ \text{in}\ \ \bar{I}\}. \end{align*} $$

Using the above definitions, the reduced form reads

$$ \begin{align}& \text{Minimize} \ j({\textbf{q}}):=J({\textbf{q}},S({\textbf{q}})) \quad \text{for} \quad{\textbf{q}}\in \mathbf{Q}_{ad}\quad \text{subject to}\quad \mathcal{G}({\textbf{q}})\in \mathcal{K},\end{align} $$

(3.5)

where the admissible set is given by

$$ \begin{align*} & \mathbf{Q}_{ad}=\{ {\textbf{q}}\in L^2(I; L^2(\varOmega)^d): \quad{\textbf{q}}_a \le{\textbf{q}}(t,x) \le{\textbf{q}}_b \quad \text{a.e. in}\ I\times \varOmega\}. \end{align*} $$

We define the projection operator onto the feasible set by

$$ \begin{align*}& P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{q}}):= \min\{{\textbf{q}}_{b},\max\{{\textbf{q}},{\textbf{q}}_{a}\}\}, \end{align*} $$

which we understand componentwise and pointwise for every |$(t,x) \in I \times \varOmega $|⁠. Throughout the paper, we will work under the following assumption.

Assumption 1 (Slater condition).

There exists |$\tilde{{\textbf{q}}}\in \mathbf{Q}_{ad}$| such that |$G_{\mathbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))<\beta $| for all |$t \in \bar I$|⁠, where |${\textbf{u}}(\tilde{{\textbf{q}}})$| is the solution of the weak transient Stokes problem (2.3) for this particular control |$\tilde{{\textbf{q}}}$|⁠.

Remark 3.5.

As the homogeneous initial data in our setting necessitate the choice |$\beta> 0$|⁠, if the control constraints |${\textbf{q}}_{a}, {\textbf{q}}_{b}$| admit the control |$\tilde{\textbf{q}} \equiv{\textbf{0}}$|⁠, the existence of such a Slater point is immediately given. In that case, the unique solution to the state equation with right-hand side |$\tilde{\textbf{q}} \equiv{\textbf{0}}$| is |${\textbf{u}}(\tilde{\textbf{q}}) \equiv{\textbf{0}}$| which trivially satisfies |$G_{\textbf{w}}({\textbf{u}}(\tilde{\textbf{q}})) = 0 < \beta $|⁠.

Theorem 3.6.

Under Assumption 1, there exists a unique optimal control |$\bar{\textbf{q}} \in L^{2}(I;L^{2}(\varOmega )^{d})$| with unique associated state |$\bar{\textbf{u}}$|⁠, solving the optimal control problem (1.1a)–(1.1d).

Proof.

Assumption 1 yields the existence of a feasible |$\tilde{\textbf{q}}$| such that the associated state |$\tilde{\textbf{u}} = {\textbf{u}}(\tilde{\textbf{q}})$| satisfies the state constraint. Let us define

$$ \begin{align*}& \mathcal J:= \inf \{J({\textbf{q}},{\textbf{u}})\colon{\textbf{q}} \text{ and} \ {\textbf{u}}\ \text{ satisfy}\ (\text{1.1b})\text{--}(\text{1.1d}) \} \ge 0, \end{align*} $$

and let |$\{{\textbf{q}}_{n}\}$| and |$\{{\textbf{u}}_{n}:= {\textbf{u}}({\textbf{q}}_{n})\}$| denote sequences of feasible controls with associated states, such that |$J({\textbf{q}}_{n},{\textbf{u}}_{n}) \to \mathcal J$| as |$n\to \infty $|⁠. As it holds |$J({\textbf{q}}_{n},{\textbf{u}}_{n}) \le{\mathcal{J}+1}$| for large enough |$n$|⁠, there holds a bound |$\|{\textbf{q}}_{n}\|_{L^{2}(I\times \varOmega )} \le C$| for all |$n$|⁠. From Theorem 3.1, we obtain |$\|{\textbf{u}}_{n}\|_{L^{2}(I;H^{2}(\varOmega ))} + \|{\textbf{u}}_{n}\|_{H^{1}(I;L^{2}(\varOmega ))} \le C$|⁠. We can thus take a subsequence, denoted by the same index, such that

$$ \begin{align*}& {\textbf{q}}_{n} \rightharpoonup \hat{\textbf{q}} \text{ in}\ L^{2}(I;L^{2} (\varOmega)^{d}), \quad{\textbf{u}}_{n} \rightharpoonup \hat{\textbf{u}} \text{ in}\ L^{2}(I;H^{2}(\varOmega)^{d}), \quad \partial_{t} {\textbf{u}}_{n} \rightharpoonup \partial_{t} \hat{\textbf{u}} \text{ in}\ L^{2}(I;L^{2} (\varOmega)^{d}). \end{align*} $$

These allow us to pass to the limit in the weak form of the state equation, showing that |$\hat{\textbf{u}} = {\textbf{u}}(\hat{\textbf{q}})$|. Furthermore, as |$L^{2}(I;H^{2}(\varOmega )^{d})\cap H^{1}(I;L^{2}(\varOmega )^{d}) \hookrightarrow C({\bar I};H^{s}(\varOmega )^{d})$| compactly, for |$s<1$|, see (Simon 1986, Corollary 8), by taking another subsequence, we obtain |${\textbf{u}}_{n} \to \hat{\textbf{u}}$| in |$C({\bar I};L^{2}(\varOmega )^{d})$|. As |$G_{\textbf{w}}({\textbf{u}}_{n}) \le \beta $| for all |$t \in \bar I$|, this shows |$G_{\textbf{w}}(\hat{\textbf{u}}) \le \beta $| for all |$t \in \bar I$|. Lastly, due to the weak lower semicontinuity of the norms, it holds

$$ \begin{align*}& J(\hat{\textbf{q}},\hat{\textbf{u}}) \le \liminf_{n\to \infty} J({\textbf{q}}_{n},{\textbf{u}}_{n}) = \mathcal J, \end{align*} $$

which shows that |$\bar{\textbf{q}} = \hat{\textbf{q}}$| is indeed a minimizer with associated state |$\bar{\textbf{u}} = \hat{\textbf{u}}$|⁠. Using uniform convexity of the squared |$L^{2}$| norms and linearity of the state equation gives the uniqueness.

Theorem 3.7 (First order optimality system).

Let Assumption 1 be fulfilled. Then a control |$\bar{{\textbf{q}}}\in \mathbf{Q}_{ad}$| with associated state |$\bar{{\textbf{u}}}={\textbf{u}}(\bar{{\textbf{q}}})$| is the optimal solution to the problem (1.1a)–(1.1d) if and only if there exist an adjoint state |$\bar{{\textbf{z}}}\in L^{2}(I;\mathbf{V})\cap L^\infty (I;\mathbf{V}_{0})$| and a Lagrange multiplier |$\bar{\mu }\in (C(\bar I))^{*}$| that satisfy the following conditions.

State equation

$$ \begin{align}& \begin{aligned} \partial_{t} \bar{{\textbf{u}}} - \varDelta \bar{{\textbf{u}}} + \nabla \bar{p} &= \bar{{\textbf{q}}} &\quad &\text{in}\ I \times \varOmega,\\ \nabla\cdot \bar{{\textbf{u}}} &=0 &\quad &\text{in}\ I \times \varOmega,\\ \bar{{\textbf{u}}} &= {\textbf{0}} &\quad &\text{on}\ I \times \partial \varOmega,\\ \bar{{\textbf{u}}}(0) & = {\textbf{0}}&\quad &\text{in}\ \varOmega. \end{aligned}\end{align} $$

(3.6a)

State constraint and complementarity conditions

$$ \begin{align}& G_{\textbf{w}}(\bar{{\textbf{u}}})\le \beta, \quad \bar{\mu}\geq 0\quad\text{and}\quad \langle \bar{\mu},\beta-G_{\textbf{w}} (\bar{\textbf{u}}) \rangle =0.\end{align} $$

(3.6b)

Adjoint equation

$$ \begin{align}& \begin{aligned} -\partial_{t} \bar{{\textbf{z}}} - \varDelta \bar{{\textbf{z}}} + \nabla \bar{r} &= \bar{{\textbf{u}}}-{\textbf{u}}_{d}+{\bar{\mu}}{\textbf{w}} &\quad &\text{in}\ I \times \varOmega,\\ \nabla\cdot \bar{{\textbf{z}}} &=0 &\quad &\text{in}\ I \times \varOmega,\\ \bar{{\textbf{z}}} &= {\textbf{0}} &\quad &\text{on}\ I \times \partial \varOmega,\\ \bar{{\textbf{z}}}(T) & = \bar \mu(\{T\}) {\textbf{w}} &\quad &\text{in}\ \varOmega. \end{aligned}\end{align} $$

(3.6c)

Variational inequality

$$ \begin{align}& \left(\alpha\bar{{\textbf{q}}}+\bar{{\textbf{z}}}, \delta{\textbf{q}}-\bar{{\textbf{q}}}\right)\geq 0 \quad \forall \delta{\textbf{q}}\in \mathbf{Q}_{ad} \ \Leftrightarrow \ \bar{{\textbf{q}}} =P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}\left(-\frac{1}{\alpha}\bar{{\textbf{z}}}\right).\end{align} $$

(3.6d)

The state (3.6a) and adjoint (3.6c) equations should be understood in the weak sense (2.3) and (3.2), respectively.

Proof.

We derive the optimality conditions from the reduced problem (3.5). By the generalized KKT condition (cf. Casas (1993, Theorem 5.2)) and the Slater condition from Assumption 1, i.e., the existence of |$\tilde{{\textbf{q}}}\in \mathbf{Q}_{ad}$| such that |$\mathcal{G}(\tilde{{\textbf{q}}})\in \operatorname{int}\mathcal{K}$|⁠, the optimality of |$\bar{{\textbf{q}}}$| is equivalent to the existence of a Lagrange multiplier |$\bar{\mu }\in C(\bar{I})^{*}$| and the adjoint state

$$ \begin{align}& \bar{{\textbf{z}}}= S^{\prime}(\bar{{\textbf{q}}})^{*}(S(\bar{{\textbf{q}}})-{\textbf{u}}_{d})+\mathcal{G}^{\prime}(\bar{{\textbf{q}}})^{*}\bar{\mu}\end{align} $$

(3.7)

satisfying

$$ \begin{align*} & \left(\alpha\bar{{\textbf{q}}}+\bar{{\textbf{z}}}, \delta{\textbf{q}}-\bar{{\textbf{q}}}\right)\geq 0 \quad \forall \delta{\textbf{q}}\in \mathbf{Q}_{ad} \quad \text{and}\quad \langle v -\mathcal{G}(\bar{{\textbf{q}}}),\bar{\mu}\rangle\le 0\quad \forall v\in \mathcal{K}. \end{align*} $$

By definition and linearity of the involved operators, we can write |$\mathcal{G}^{\prime}(\bar{\textbf{q}})^{*} = S^{*} \circ G_{\mathbf w}^{*}$|⁠, where |$G_{\mathbf w}^{*}\colon C(\bar I)^{*} \to C(\bar I; L^{2}(\varOmega )^{d})^{*}$|⁠, |$G_{\mathbf w}^{*}(\mu ) = \mu{\textbf{w}}$|⁠. Thus it holds |$\bar{\textbf{z}} = S^{*}(\bar{\textbf{u}} - {\textbf{u}}_{d} + \bar \mu{\textbf{w}})$|⁠, which gives the proposed regularity of |$\bar{\textbf{z}}$| as a consequence of Corollary 3.3. To complete the proof, we point to the following equivalence:

$$ \begin{align*} & (\mathcal{G}(\bar{\textbf{q}}) \in \mathcal K) \land (\langle v-\mathcal{G}(\bar{{\textbf{q}}}),\bar{\mu}\rangle\le 0\quad \forall v\in \mathcal{K}) \ \Leftrightarrow \ (G_{{\textbf{w}}}(\bar{\textbf{u}}) \le \beta) \land (\bar{\mu}\geq 0) \land (\langle \beta-G_{{\textbf{w}}}(\bar{{\textbf{u}}}),\bar{\mu}\rangle= 0), \end{align*} $$

which is exactly (3.6b).
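The equivalence in (3.6d) between the variational inequality and the projection formula can be checked pointwise on sampled data. The following sketch is an illustration only: the bounds, the value of |$\alpha $| and the random samples standing in for the adjoint state are hypothetical, and the function name `project` is chosen here.

```python
import numpy as np

def project(q, q_a, q_b):
    """Componentwise projection min{q_b, max{q, q_a}}; bounds may be +-inf."""
    return np.minimum(q_b, np.maximum(q, q_a))

alpha = 0.1
q_a = np.array([-1.0, -np.inf])   # hypothetical bounds, one per component
q_b = np.array([2.0, 0.5])
rng = np.random.default_rng(2)
z = rng.standard_normal((50, 2))  # hypothetical adjoint samples at 50 points (t, x)

# Candidate optimal control from the projection formula
q_bar = project(-z / alpha, q_a, q_b)

# Admissible test controls, obtained by projecting random fields
dq = project(10 * rng.standard_normal((200, 50, 2)), q_a, q_b)

# Pointwise integrand of the variational inequality
vi = np.sum((alpha * q_bar + z) * (dq - q_bar), axis=-1)
```

All entries of `vi` are nonnegative up to rounding. Since the inequality holds pointwise in this sampled setting, it also holds after summation, which corresponds to the integral form in (3.6d).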

From the first order optimality system (3.6a)–(3.6d) we can derive the following regularity results.

Theorem 3.8.

Let |$\bar{{\textbf{q}}}$| denote the optimal control for the problem (1.1a)–(1.1d). Then the following regularity holds:

$$ \begin{align*} & \bar{{\textbf{q}}}\in L^2(I;H^1(\varOmega)^d)\cap L^\infty(I;L^2(\varOmega)^d). \end{align*} $$

If additionally, |${\textbf{q}}_{a},{\textbf{q}}_{b} \in{\mathbb{R}}^{d}$|⁠, i.e. are finite, it holds |$\bar{{\textbf{q}}}\in{L^\infty (I \times \varOmega )^{d}}$|⁠.

Proof.

Since |$\mathbf{w}\in L^{2}(\varOmega )^{d}$| and |$\bar{\mu }\in (C(\bar{I}))^{*}$|⁠, we have that |$\bar{\mu }{\textbf{w}}\in ( C(\bar{I};L^{2}(\varOmega )^{d}))^{*}$|⁠. By Corollary 3.3, it holds |$\bar{\textbf{z}} = S^{*}(\bar{\textbf{u}} - {\textbf{u}}_{d} + \bar \mu{\textbf{w}}) \in L^{2}(I;\mathbf{V})\cap L^\infty (I;L^{2}(\varOmega )^{d})$|⁠. Using that

$$ \begin{align*} & \bar{{\textbf{q}}}= P_{[{\textbf{q}}_a,{\textbf{q}}_b]} \left(-\frac{1}{\alpha}\bar{{\textbf{z}}}\right), \end{align*} $$

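and that |${\textbf{q}}_{a}, {\textbf{q}}_{b}$| are constant, so that the pointwise projection preserves the regularity of |$\bar{\textbf{z}}$| stated above, we obtain the first claim. If the bounds are finite, |$\bar{\textbf{q}}$| is in addition bounded pointwise by |${\textbf{q}}_{a}$| and |${\textbf{q}}_{b}$|, which gives the second claim.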

The above result shows the regularity available for the optimal control |$\bar{\textbf{q}}$| without any additional assumptions. If one assumes higher regularity of the data, especially of the weight |${\textbf{w}}$| of the state constraint, we can show improved regularity, and in some cases even a structural result, that the Lagrange multiplier |$\bar \mu $| contains no Dirac contributions. We first treat a general adjoint problem with measure valued right-hand side in Theorem 3.9, before showing regularity of the optimal variables in Theorem 3.10 and the corollaries thereafter.

Theorem 3.9.

Let |$\mu \in C(\bar I)^{*}$| and |${\textbf{w}} \in L^{2}(\varOmega )^{d}$| be given, and let |${\textbf{z}} \in L^{2}(I;\mathbf{V}) \cap L^\infty (I;\mathbf{V}_{0})$| be the weak solution to

$$ \begin{align}& \left\langle\partial_{t} {\textbf{v}}, {\textbf{z}}\right\rangle_{I\times\varOmega} + \left(\nabla{\textbf{v}},\nabla{\textbf{z}}\right)_{I\times\varOmega} = \left\langle{\textbf{v}},{\textbf{w}} \mu\right\rangle_{I\times\varOmega} \quad \text{for all}\ {\textbf{v}} \in \mathbf{X}.\end{align} $$

(3.8)

Then, if additionally |${\textbf{w}} \in \mathbf{V}$|⁠, it holds |${\textbf{z}} \in L^{s}(I;\mathbf{V}) \cap \text{BV}(I;\mathbf{V}^{*})$| for all |$1 \le s < \infty $|⁠. Moreover, if |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, it holds |${\textbf{z}} \in L^{s}(I;\mathbf{V}_{2}) \cap \text{BV}(I;\mathbf{V}_{0})$| for all |$1 \le s < \infty $|⁠.

Proof.

Throughout this proof, let |${\textbf{w}}$| satisfy at least |${\textbf{w}} \in \mathbf{V}$|⁠. Using Appell et al. (2014, Theorem 4.31), there exists a normalized function of bounded variation |$\tilde \mu \in \text{NBV}(\bar I)$|⁠, such that the application of |$\mu \in C(\bar I)^{*}$| to any |$\xi \in C(\bar I)$| can be expressed as a Riemann–Stieltjes integral

$$ \begin{align*}& \left\langle\xi,\mu\right\rangle_{I} = \int_{I} \xi(t) \ \text{d} \tilde \mu(t). \end{align*} $$

Slightly modifying Appell et al. (2014, Definition 1.2), we can normalize |$\tilde \mu $| such that it is of bounded variation, right continuous and satisfies |$\tilde \mu (T) = 0$|⁠. If |$\xi \in C^{1}(\bar I)$|⁠, due to Appell et al. (2014, Proposition 4.24, Theorem 4.17), there holds

$$ \begin{align*}& \left\langle\xi, \mu\right\rangle_{I} = - \int_{I} \tilde \mu(t)\ \text{d} \xi(t) + \tilde \mu(T) \xi(T) - \tilde \mu(0) \xi(0) = - \int_{I} \tilde \mu(t) \partial_{t} \xi(t) \ \text{d} t -\tilde \mu(0) \xi(0) \end{align*} $$

where the last integral can be understood equivalently in the Riemann or Lebesgue sense. Working with the Lebesgue integral allows us to pass to the limit |$C^{1}(\bar I) \ni \xi _{n} \to \xi \in W^{1,1}(I) \hookrightarrow C(\bar I)$|, showing that

$$ \begin{align*}& \left\langle\xi, \mu\right\rangle_{I} = - \left(\tilde \mu, \partial_{t} \xi\right)_{I} - \tilde \mu(0) \xi(0) \qquad \forall \xi \in W^{1,1}(I). \end{align*} $$
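As an illustration of this formula (a worked example added for orientation, under the assumption that |$\mu $| is a single Dirac measure), consider |$\mu = \delta _{t_{0}}$| with |$0 < t_{0} < T$|. A right continuous function of bounded variation with |$\tilde \mu (T) = 0$| representing it is |$\tilde \mu = -\chi _{[0,t_{0})}$|, and for |$\xi \in W^{1,1}(I)$| one can check directly that

$$ \begin{align*}& - \left(\tilde \mu, \partial_{t} \xi\right)_{I} - \tilde \mu(0) \xi(0) = \int_{0}^{t_{0}} \partial_{t} \xi(t) \ \text{d} t + \xi(0) = \xi(t_{0}) = \left\langle\xi, \delta_{t_{0}}\right\rangle_{I}. \end{align*} $$

In this example |$\tilde \mu $| is monotonically increasing and its only jump, located at |$t_{0}$|, has height one, which is the mass of the Dirac measure.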

As due to the definition of |$\mathbf{X}$|⁠, it holds |$\partial _{t} {\textbf{v}} \in L^{1}(I;\mathbf{V}_{0}) + L^{2}(I;\mathbf{V}^{*})$| and |${\textbf{v}}(0) = {\textbf{0}}$| for all |${\textbf{v}} \in \mathbf{X}$|⁠, and since |${\textbf{w}} \in \mathbf{V}$|⁠, we have |$\left \langle{\textbf{v}},{\textbf{w}}\right \rangle _{\varOmega } \in W^{1,1}(I)$|⁠. Thus we obtain

$$ \begin{align}& \left\langle{\textbf{v}},{\textbf{w}} \mu\right\rangle_{I\times\varOmega} = - \left\langle\tilde \mu(t) {\textbf{w}}, \partial_{t} {\textbf{v}}\right\rangle_{I\times\varOmega}.\end{align} $$

(3.9)

Let us define |$\hat{\textbf{z}} = {\textbf{z}} + \tilde \mu{\textbf{w}}$|⁠. Then adding (3.8), (3.9) and the identity |$\left (\nabla{\textbf{v}}, \tilde \mu \nabla{\textbf{w}}\right )_{I\times \varOmega } = \left \langle \tilde \mu A {\textbf{w}},{\textbf{v}}\right \rangle _{I\times \varOmega }$| yields

$$ \begin{align*}& \left\langle\partial_{t} {\textbf{v}},\hat{\textbf{z}}\right\rangle_{I\times\varOmega} + \left(\nabla{\textbf{v}},\nabla \hat{\textbf{z}}\right)_{I\times\varOmega} = \left\langle\tilde \mu A {\textbf{w}},{\textbf{v}}\right\rangle_{I\times\varOmega} \quad \forall{\textbf{v}} \in \mathbf{X}. \end{align*} $$

As |$\tilde \mu \in L^\infty (I)$| and |$\tilde \mu (T) = 0$|, |$\hat{\textbf{z}}$| satisfies a backwards in time Stokes equation with right-hand side |$\tilde \mu A {\textbf{w}}$|. Depending on the regularity of |${\textbf{w}}$|, we obtain the following:

Case 1: |${\textbf{w}} \in \mathbf{V}$|⁠. Here |$\tilde \mu{\textbf{w}} \in \text{BV}(I;\mathbf{V}) \hookrightarrow L^\infty (I;\mathbf{V})$| and thus |$\tilde \mu A {\textbf{w}} \in L^\infty (I;\mathbf{V}^{*})$|⁠. Using the maximal parabolic regularity in |$\mathbf{V}^{*}$| (2.5), this yields

$$ \begin{align*}& \hat{\textbf{z}} \in W^{1,s}(I;\mathbf{V}^{*}) \cap L^{s}(I;\mathbf{V}) \quad \text{for any}\ 1 \le s < \infty. \end{align*} $$

Considering the special cases |$s=1$| and |$s=2$|⁠, we obtain

$$ \begin{align*}& \hat{\textbf{z}} \in W^{1,1}(I;\mathbf{V}^{*}) \hookrightarrow \text{BV}(I;\mathbf{V}^{*}) \quad \text{and} \quad \hat{\textbf{z}} \in H^{1}(I;\mathbf{V}^{*}) \cap L^{2}(I;\mathbf{V}) \hookrightarrow C(\bar I;\mathbf{V}_{0}). \end{align*} $$

As a consequence |$ {\textbf{z}} = \hat{\textbf{z}} - \tilde \mu{\textbf{w}} \in L^{s}(I;\mathbf{V}) \cap \text{BV}(I;\mathbf{V}^{*})$| for any |$1 \le s < \infty $|⁠.

Case 2: |${\textbf{w}} \in \mathbf{V}_{2}$|⁠. Here |$\tilde \mu{\textbf{w}} \in \text{BV}(I;\mathbf{V}_{2}) \hookrightarrow L^\infty (I;\mathbf{V}_{2})$| and thus |$\tilde \mu A {\textbf{w}} \in L^\infty (I;\mathbf{V}_{0})$|⁠. Using the maximal parabolic regularity in |$\mathbf{V}_{0}$| (2.4), this yields

$$ \begin{align*}& \hat{\textbf{z}} \in W^{1,s}(I;\mathbf{V}_{0}) \cap L^{s}(I;\mathbf{V}_{2}) \quad \text{for any}\ 1 \le s < \infty. \end{align*} $$

Considering the special cases |$s=1$| and |$s=2$|⁠, we obtain

$$ \begin{align*}& \hat{\textbf{z}} \in W^{1,1}(I;\mathbf{V}_{0}) \hookrightarrow \text{BV}(I;\mathbf{V}_{0}) \quad \text{and} \quad \hat{\textbf{z}} \in H^{1}(I;\mathbf{V}_{0}) \cap L^{2}(I;\mathbf{V}_{2}) \hookrightarrow C(\bar I;\mathbf{V}). \end{align*} $$

As a consequence |$ {\textbf{z}} = \hat{\textbf{z}} - \tilde \mu{\textbf{w}} \in L^{s}(I;\mathbf{V}_{2}) \cap \text{BV}(I;\mathbf{V}_{0})$| for any |$1 \le s < \infty $|⁠.

Since in the above proof we have only shown continuity of |$\hat{\textbf{z}}$|, while |$\tilde \mu $| can in general have jumps as it is a |$\text{BV}$| function, we cannot deduce continuity of the whole solution |${\textbf{z}}$| without further information. For the adjoint equation (3.6c) this translates to the question of continuity in time of |$\bar{\textbf{z}}$| and, by (3.6d), also of |$\bar{\textbf{q}}$|. If, in addition to the arguments of the previous theorem, we exploit the information from the optimality system, it turns out that we can at least show continuity in time of |$\bar{\textbf{q}}$|. As the adjoint state in the optimality system has the additional term |$\bar{\textbf{u}} - {\textbf{u}}_{d}$| on the right-hand side, depending on the available regularity of |${\textbf{u}}_{d}$| in time, we might lose some of the time regularity that we just derived.

Theorem 3.10.

Let |$(\bar{\textbf{q}},\bar{\textbf{u}},\bar{\textbf{z}},\bar \mu )$| satisfy the first order necessary optimality conditions (3.6a)–(3.6d). Let additionally |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s \in [2,\infty )$| and |${\textbf{w}} \in \mathbf{V}$|⁠. Then |$\bar{\textbf{z}} \in L^{s}(I;\mathbf{V}) \cap \text{BV}(I;\mathbf{V}^{*})$| and |$\bar{\textbf{q}} \in L^{s}(I;H^{1}(\varOmega )^{d}) \cap C(\bar I;L^{2}(\varOmega )^{d})$|⁠. If further |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, then |$\bar{\textbf{z}} \in L^{s}(I;\mathbf{V}_{2}) \cap \text{BV}(I;\mathbf{V}_{0})$| and |$\bar{\textbf{q}} \in L^{s}(I;W^{1,\infty }(\varOmega )^{d}) \cap C(\bar I;H^{1}(\varOmega )^{d})$|⁠.

Proof.

As in the proof of the previous theorem, we introduce the |$\text{NBV}$| function |$\tilde \mu $| satisfying |$\left \langle \xi ,\bar \mu \right \rangle _{I} = - \left (\tilde \mu ,\partial _{t} \xi \right )_{I} - \tilde \mu (0)\xi (0)$| for all |$\xi \in W^{1,1}(I)$|⁠, and introduce |$\hat{\textbf{z}} = \bar{\textbf{z}} + \tilde \mu{\textbf{w}}$|⁠. This now satisfies

$$ \begin{align*}& \left\langle\partial_{t} {\textbf{v}},\hat{\textbf{z}}\right\rangle_{I\times\varOmega} + \left(\nabla{\textbf{v}},\nabla \hat{\textbf{z}}\right)_{I\times\varOmega} = \left(\bar{\textbf{u}} - {\textbf{u}}_{d},{\textbf{v}}\right)_{I\times\varOmega} + \left\langle\tilde \mu A {\textbf{w}},{\textbf{v}}\right\rangle_{I\times\varOmega} \quad \forall{\textbf{v}} \in \mathbf{X}. \end{align*} $$

The regularity |$\bar{\textbf{u}} \in L^\infty (I;\mathbf{V}_{0})$|⁠, the assumption |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| and the previous arguments again yield that

$$ \begin{align*} {\textbf{w}} \in \mathbf{V} &\Rightarrow \hat{\textbf{z}} \in W^{1,s}(I;\mathbf{V}^{*}) \cap L^{s}(I;\mathbf{V}) \hookrightarrow C(\bar I;\mathbf{V}_{0}) \cap \text{BV}( I;\mathbf{V}^{*}) \ \text{ and} \ \bar{\textbf{z}} \in L^{s}(I;\mathbf{V}),\\{\textbf{w}} \in \mathbf{V}_{2} &\Rightarrow \hat{\textbf{z}} \in W^{1,s}(I;\mathbf{V}_{0}) \cap L^{s}(I;\mathbf{V}_{2}) \hookrightarrow C(\bar I;\mathbf{V}) \cap \text{BV}( I;\mathbf{V}_{0}) \ \text{ and} \ \bar{\textbf{z}} \in L^{s}(I;\mathbf{V}_{2}). \end{align*} $$

As in Theorem 3.9, this shows the |$\text{BV}$| regularity in time of |$\bar{\textbf{z}}$|⁠. To show the regularity of |$\bar{\textbf{q}}$|⁠, we shall make use of the optimality conditions. The optimal control satisfies

$$ \begin{align}& \bar{\textbf{q}} = P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{z}}_\alpha) \quad \text{where} \quad{\textbf{z}}_\alpha:= - \frac{1}{\alpha}\bar{\textbf{z}} =- \frac{1}{\alpha} \hat{\textbf{z}} + \frac{1}{\alpha}\tilde \mu{\textbf{w}} \in C({\bar I};\mathbf{V}_{0}) + \text{BV}(I;\mathbf{V}).\end{align} $$

(3.10)

The available regularity of |$\bar{\textbf{z}}$| immediately gives the claimed |$L^{s}$| regularity in time of |$\bar{\textbf{q}}$|⁠. Note that due to the application of |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}$|⁠, |$\bar{\textbf{q}}$| is in general not divergence free anymore and exhibits at most |$W^{1,\infty }(\varOmega )$| regularity in space, even if |$\bar{\textbf{z}}$| is smoother. We now turn towards showing continuity in time of |$\bar{\textbf{q}}$|⁠. Let us denote by |$[\cdot ]$| the jump function w.r.t. time, i.e. |$[\varphi ](t) = \varphi (t+) - \varphi (t-)$|⁠. Since functions in |$\text{BV}(I;X)$| for any Banach space |$X$| possess well-defined onesided limits, see Heida et al. (2019, Propositions 2.1 & 2.2), we obtain from (3.10) that |${\textbf{z}}_\alpha (t \pm ) \in \mathbf{V}_{0}$| is well defined for any |$t \in I$|⁠. Hence, by continuity of |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}\colon L^{2}(\varOmega )^{d} \to L^{2}(\varOmega )^{d}$| we also have |$[\bar{\textbf{q}}] = [P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{z}}_\alpha )] \in L^{2}(\varOmega )^{d}$| for all |$t \in I$|⁠. By distinguishing the different cases, it is straightforward to verify that for any |${\textbf{v}} \in C(\bar I;L^{2}(\varOmega )^{d}) + \text{BV}(I;L^{2}(\varOmega )^{d})$| it holds

$$ \begin{align*}& 0 \le \left([P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{v}})],[P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{v}})]\right)_{\varOmega} \le \left([P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{v}})],[{\textbf{v}}]\right)_{\varOmega} \le \left([{\textbf{v}}],[{\textbf{v}}]\right)_{\varOmega} \quad \text{for all}\ t \in I. \end{align*} $$

Applying this chain of inequalities to |${\textbf{v}} = {\textbf{z}}_\alpha $|⁠, using |$\bar{\textbf{q}} = P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{z}}_\alpha )$| and the continuity in time of |$\hat{\textbf{z}}$|⁠, we obtain

$$ \begin{align}& 0 \le \|[\bar{\textbf{q}}]\|_{L^{2}(\varOmega)}^{2} \le \left([\bar{\textbf{q}}],[{\textbf{z}}_\alpha]\right)_{\varOmega} = \frac{[\tilde \mu]}{\alpha} \left([\bar{\textbf{q}}],{\textbf{w}}\right)_{\varOmega}.\end{align} $$

(3.11)

For all |$t \in I$| satisfying |$[\tilde \mu ](t) = 0$|⁠, this immediately shows |$[\bar{\textbf{q}}](t) = {\textbf{0}}$|⁠. Thus let us assume now that there exists |$t^{*} \in I$| with |$[\tilde \mu ](t^{*}) \neq 0$|⁠. As |$\bar \mu \ge 0$|⁠, there holds for |$t_{1} < t_{2}$| due to Appell et al. (2014, Theorem 4.17, Proposition 4.24)

$$ \begin{align*}& 0 \le \left\langle\chi_{[t_{1},t_{2}]},\bar \mu\right\rangle_{I} = \int_{[t_{1},t_{2}]} 1 \text{d} \tilde \mu(t) = - \int_{[t_{1},t_{2}]} \tilde \mu \ \text{d} 1(t) + \tilde \mu(t_{2}) - \tilde \mu(t_{1}) = \tilde \mu(t_{2}) - \tilde \mu(t_{1}), \end{align*} $$

i.e. |$\tilde \mu $| is monotonically increasing and therefore, it holds |$[\tilde \mu ] \ge 0$|⁠. Moreover it holds

$$ \begin{align}& \begin{aligned} \bar \mu(\{t^{*}\}) & = \lim_{\varepsilon \to 0} \bar \mu((t^{*}-\varepsilon,t^{*}+\varepsilon)) = \lim_{\varepsilon \to 0} \left\langle\chi_{(t^{*}-\varepsilon,t^{*}+\varepsilon)},\bar \mu\right\rangle_{I} = \lim_{\varepsilon \to 0} \int_{(t^{*}-\varepsilon,t^{*}+\varepsilon)} 1 \text{d} \tilde \mu\\ & = \lim_{\varepsilon \to 0}- \int_{(t^{*}-\varepsilon,t^{*}+\varepsilon)} \tilde \mu \ \text{d} 1 + \tilde \mu(t^{*} + \varepsilon) - \tilde \mu(t^{*} - \varepsilon) = [\tilde \mu](t^{*}). \end{aligned}\end{align} $$

(3.12)

Thus if |$[\tilde \mu ](t^{*}) \neq 0$|⁠, then |$[\tilde \mu ](t^{*})> 0$| and |$t^{*} \in \operatorname{supp}(\bar \mu )$|⁠, yielding that the state constraint is active in |$t^{*}$|⁠, i.e. |$G_{\textbf{w}}(\bar{\textbf{u}}(t^{*})) = \beta $|⁠. As |${\textbf{w}} \in \mathbf{V}$|⁠, due to (2.3), it holds

$$ \begin{align*}& \left(\bar{\textbf{q}}, {\textbf{w}}\right)_{\varOmega} - \left(\nabla \bar{\textbf{u}},\nabla{\textbf{w}}\right)_{\varOmega} = \left(\partial_{t} \bar{\textbf{u}},{\textbf{w}}\right)_{\varOmega} = \partial_{t} G_{\textbf{w}}(\bar{\textbf{u}}) \quad \text{a.e. in}\ I. \end{align*} $$

Since |$\bar{\textbf{q}} \in C(\bar I;{L^{2}(\varOmega )^{d}}) + \text{BV}(I;{H^{1}(\varOmega )^{d}})$| and |$\nabla \bar{\textbf{u}} \in C(\bar I; \mathbf{V})$|, this identity shows that |$\partial _{t} G_{\textbf{w}}(\bar{\textbf{u}}) \in L^{2}(I)$| has a representative with well-defined one-sided limits, which we shall denote by the same symbol, and it holds

$$ \begin{align}& [\partial_{t} G_{\textbf{w}}(\bar{\textbf{u}})](t^{*}) = \left([\bar{\textbf{q}}](t^{*}),{\textbf{w}}\right)_{\varOmega} \ge \frac{\alpha}{[\tilde \mu](t^{*})}\|[\bar{\textbf{q}}](t^{*})\|_{L^{2}(\varOmega)}^{2} \ge 0,\end{align} $$

(3.13)

where the last inequality holds due to (3.11) and |$[\tilde \mu ](t^{*})> 0$|⁠. Using |$G_{\textbf{w}}({\textbf{u}})(t) = \int _{0}^{t} \partial _{t} G_{\textbf{w}}({\textbf{u}})(s) \ \text{d} s$|⁠, it is straightforward to check that

$$ \begin{align*}& \lim_{h \to 0+} \dfrac{G_{\textbf{w}}(\bar{\textbf{u}})(t\pm h)-G_{\textbf{w}}(\bar{\textbf{u}})(t)}{h} = \lim_{h\to 0+} \frac{1}{h} \int_{t}^{t\pm h} \partial_{t} G_{\textbf{w}}(\bar{\textbf{u}})(s) \ \text{d} s = \pm \partial_{t} G_{\textbf{w}}(\bar{\textbf{u}})(t\pm), \end{align*} $$

i.e. the onesided limits of |$\partial _{t} G_{\textbf{w}}(\bar{\textbf{u}})$| correspond to the directional derivatives of |$G_{\textbf{w}}(\bar{\textbf{u}})$|⁠. As |$G_{\textbf{w}}(\bar{\textbf{u}})(t^{*}) = \beta $|⁠, |$t^{*}$| is a local maximum of |$G_{\textbf{w}}(\bar{\textbf{u}})$|⁠, yielding

$$ \begin{align*}& \pm \partial_{t} G_{\textbf{w}}(\bar{\textbf{u}})(t^{*} \pm) \le 0 \Rightarrow [\partial_{t} G_{\textbf{w}}(\bar{\textbf{u}})](t^{*}) \le 0. \end{align*} $$

Combining this with (3.13) yields |$\|[\bar{\textbf{q}}](t^{*})\|_{L^{2}(\varOmega )}^{2} = 0$|⁠, which shows |$\bar{\textbf{q}} \in C({\bar I};L^{2}(\varOmega )^{d})$|⁠. If additionally |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, with the same arguments we can show |$\bar{\textbf{q}} \in C({\bar I};H^{1}(\varOmega )^{d})$|⁠.
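The chain of inequalities for the jump of the projection, which leads to (3.11), can be checked numerically in a simple setting. The following sketch assumes scalar bounds, randomly generated one-sided limits at sample points and uniform quadrature weights. All names are hypothetical, and the check is only an illustration, not a substitute for the case distinction in the proof.

```python
import numpy as np

rng = np.random.default_rng(0)
q_a, q_b = -1.0, 1.0

def proj(v):
    """Pointwise projection onto [q_a, q_b]."""
    return np.clip(v, q_a, q_b)

# Hypothetical one-sided limits v(t-) and v(t+) at 1000 sample points.
v_minus = rng.normal(scale=2.0, size=1000)
v_plus = rng.normal(scale=2.0, size=1000)

jump_v = v_plus - v_minus
jump_p = proj(v_plus) - proj(v_minus)

# Discrete analogues of the three inner products in the chain.
pp = np.mean(jump_p * jump_p)
pv = np.mean(jump_p * jump_v)
vv = np.mean(jump_v * jump_v)
```

The quantities `pp`, `pv` and `vv` are then ordered as in the chain, which reflects that the projection is monotone and has Lipschitz constant one.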

Remark 3.11.

In the proof of the above theorem, we can under some circumstances also show continuity in time of |$\bar{\textbf{z}}$|. In fact, the only obstacle preventing this result is the application of the projection operator onto the control constraints |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}(\cdot )$|. It is in general possible that at some point in time |$\hat t$|, |$\bar{\textbf{z}}$| is discontinuous. In such cases, the spatial support of the jump is contained in the set of points |$x \in \varOmega $|, where |$-\frac{1}{\alpha } \bar{\textbf{z}}(\hat t \pm )$| lie outside the set of admissible controls |$[{\textbf{q}}_{a},{\textbf{q}}_{b}]$|. As the jump of |$\bar{\textbf{z}}$| is a scalar multiple of |${\textbf{w}}$|, this gives a compatibility condition on |$\operatorname{supp}({\textbf{w}})$| and the set where the control constraints are active. As the active set of the control constraints cannot be known a priori, such a condition on |$\bar{\textbf{q}}$| is not straightforward to verify. Instead we present in the next two corollaries two sets of assumptions on the data, where we can obtain improved regularity of |$\bar{\textbf{z}}$| and |$\bar \mu $|.

Corollary 3.12.

Let |$(\bar{\textbf{q}},\bar{\textbf{u}},\bar{\textbf{z}},\bar \mu )$| satisfy the first order necessary optimality conditions (3.6a)–(3.6d). Let further |${\textbf{q}}_{a,i} = -\infty $|⁠, |${\textbf{q}}_{b,i} = + \infty $| for |$i=1,...,d$|⁠, and |${\textbf{w}} \in \mathbf{V}$|⁠. Then |$\bar{\textbf{z}} \in C({\bar I};\mathbf{V}_{0})$| and |$\bar \mu (\{t\}) = 0$| for all |$t \in I$|⁠, i.e. |$\bar \mu $| does not contain any Dirac contributions. If additionally |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, then |$\bar{\textbf{z}} \in C({\bar I};\mathbf{V})$|⁠.

Proof.

This is a direct consequence of the results of Theorem 3.10, as in this case, the identity |$\bar{\textbf{q}} = - \frac{1}{\alpha } \bar{\textbf{z}}$| holds.

Corollary 3.13.

Let |$(\bar{\textbf{q}},\bar{\textbf{u}},\bar{\textbf{z}},\bar \mu )$| satisfy the first order necessary optimality conditions (3.6a)–(3.6d). Let further |${\textbf{q}}_{a} < \textbf{0} < {\textbf{q}}_{b}$|⁠, |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s> 2$|⁠, and |${\textbf{w}} \in \mathbf{V}_{2}$| satisfy

$$ \begin{align*}& \exists x^{*} \in \partial \varOmega \text{ s.th.}\ \forall \varepsilon>0 \text{ it holds} \operatorname{supp}({\textbf{w}}) \cap B_{\varepsilon}(x^{*}) \neq \emptyset. \end{align*} $$

Then |$\bar{\textbf{z}} \in C({\bar I};\mathbf{V})$| and |$\bar \mu (\{t\}) = 0$| for all |$t \in I$|⁠, i.e. |$\bar \mu $| does not contain any Dirac contributions.

Proof.

As |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s>2$|⁠, as in the beginning of the proof of Theorem 3.10, we obtain

$$ \begin{align*}& \hat{\textbf{z}} \in W^{1,s}(I;\mathbf{V}_{0}) \cap L^{s}(I;\mathbf{V}_{2}) \hookrightarrow C({\bar I};C(\bar \varOmega)), \end{align*} $$

where the last embedding holds due to Simon (1986, Corollary 8). As further |${\textbf{w}} \in \mathbf{V}_{2} \hookrightarrow C(\bar \varOmega )$|⁠, for any |$t^{*} \in I$| it thus holds |$\bar{\textbf{z}}(t^{*} +),\bar{\textbf{z}}(t^{*}-) \in C(\bar \varOmega )$|⁠. As |$\bar{\textbf{z}}(t^{*} +)(x^{*}) = \bar{\textbf{z}}(t^{*}-)(x^{*}) = {\textbf{0}}$|⁠, and |${\textbf{q}}_{a} < {\textbf{0}} < {\textbf{q}}_{b}$|⁠, there exists |$\delta> 0$|⁠, such that

$$ \begin{align*}& - \frac{1}{\alpha} \bar{\textbf{z}}(t^{*}+)(x), - \frac{1}{\alpha} \bar{\textbf{z}}(t^{*}-)(x) \in ({\textbf{q}}_{a},{\textbf{q}}_{b}) \quad \text{for all}\ x \in B_\delta(x^{*})\cap \bar \varOmega. \end{align*} $$

Thus |$[\bar{\textbf{q}}](t^{*})|_{B_\delta (x^{*})} = [- \frac{1}{\alpha } \bar{\textbf{z}}](t^{*})|_{B_\delta (x^{*})}$|⁠. As |$\operatorname{supp}({\textbf{w}}) \cap B_\delta (x^{*}) \neq \emptyset $|⁠, there exists an open subset |$\omega \subset B_\delta (x^{*})$|⁠, such that |${\textbf{w}}(x) \neq 0$| for all |$x \in \omega $|⁠. In the end, we obtain, using the continuity in time of |$\bar{\textbf{q}}$| shown in Theorem 3.10:

$$ \begin{align*}& 0 = \|[\bar{\textbf{q}}](t^{*})\|_{L^{2}(\varOmega)}^{2} \ge \int_{\omega} [\bar{\textbf{q}}](t^{*})^{2} \text{d} x = \int_\omega \left(\left[ - \frac{1}{\alpha} \bar{\textbf{z}}\right](t^{*})\right)^{2} \text{d} x = \int_\omega \frac{[\tilde \mu](t^{*})^{2}}{\alpha^{2}} |{\textbf{w}}|^{2} \text{d} x \ge 0. \end{align*} $$

As |$|{\textbf{w}}|> 0$| for all |$x \in \omega $|⁠, this shows |$[\tilde \mu ](t^{*}) = 0$|⁠. As |$t^{*}$| was arbitrary, this concludes the proof.

4. Finite element approximation of the state equation

4.1 Spatial discretization

Let |$\{{\mathcal{T}}_{h}\}$| be a family of triangulations of |$\bar \varOmega $|⁠, consisting of closed simplices, where we denote by |$h$| the maximum mesh-size. Let |$\mathbf{X}_{h} \subset H^{1}_{0}(\varOmega )^{d}$| and |$M_{h} \subset L^{2}_{0}(\varOmega )$| be a pair of compatible finite element spaces, i.e., there holds a uniform discrete inf-sup condition,

$$ \begin{align}&\mathop{\mathrm{sup}}\limits_{{\mathbf{{v}}}_h \in{\mathbf{X}}_h}\frac{\left(q_{h},\nabla \cdot{\mathbf{{v}}}_h\right)_{\varOmega}} {{\|}{\nabla{\mathbf{{v}}}_h}{{\|}_{L^2(\varOmega)}}} \geq \gamma{\|}{q_h}{\|}_{L^2(\varOmega)} \quad \forall q_h \in M_h,\end{align} $$

(4.1)

with a constant |$\gamma>0$| independent of |$h$|. We shall work under the assumption that the discrete spaces have the following approximation properties.

Assumption 2.

There exist interpolation operators |$i_{h}\colon H^{2}(\varOmega )^{d} \cap H^{1}_{0}(\varOmega )^{d} \to \mathbf{X}_{h}$| and |$r_{h}\colon L^{2}(\varOmega ) \to M_{h}$|⁠, such that

$$ \begin{align*} \|\nabla({\textbf{v}} - i_{h} {\textbf{v}})\|_{L^{2}(\varOmega)} &\le c h \|\nabla^{2} {\textbf{v}}\|_{L^{2}(\varOmega)} && \forall{\textbf{v}} \in H^{2}(\varOmega)^{d} \cap H^{1}_{0}(\varOmega)^{d},\\ \|q - r_{h} q\|_{L^{2}(\varOmega)} &\le c h \|\nabla q\|_{L^{2}(\varOmega)} && \forall q \in H^{1}(\varOmega). \end{align*} $$

This assumption is valid, for example, for Taylor–Hood and MINI finite elements on shape regular meshes, see Behringer et al. (2023, Assumption 7.2). We define the space of discretely divergence-free vector fields |$\mathbf{V}_{h}$| as

$$ \begin{align}& \mathbf{V}_{h} = \{ {\textbf{v}}_{h} \in \mathbf{X}_{h} \colon (\nabla \cdot{\textbf{v}}_{h}, q_{h}) =0 \quad \forall q_{h} \in M_{h}\}.\end{align} $$

(4.2)
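In matrix terms, the inf-sup condition (4.1) and the space (4.2) can be illustrated as follows. The sketch assumes that a discrete divergence matrix `B` is given and, as a simplification, measures coefficient vectors in Euclidean norms instead of the |$H^{1}$| seminorm and the |$L^{2}$| norm. In an actual finite element code the stiffness and mass matrices would enter. Under this assumption and for `B` of full row rank, the inf-sup constant is the smallest singular value of `B`, and the discretely divergence-free vectors form the kernel of `B`. The function name and the random test matrix are hypothetical.

```python
import numpy as np

def divergence_free_basis(B, tol=1e-12):
    """Orthonormal basis of ker(B) and smallest nonzero singular value of B."""
    U, s, Vt = np.linalg.svd(B, full_matrices=True)
    rank = int(np.sum(s > tol * s[0]))
    Z = Vt[rank:].T          # columns span {v : B v = 0}
    return Z, s[rank - 1]

# Hypothetical 3 x 6 "divergence matrix" (random, not from a real mesh).
rng = np.random.default_rng(1)
B = rng.normal(size=(3, 6))
Z, gamma = divergence_free_basis(B)
```

Writing a velocity vector as `Z @ c` with free coefficients `c` corresponds to working in the discretely divergence-free space, where the pressure no longer appears as an unknown.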

While on a computational level, especially for the examples presented in Section 8, we work with a discrete velocity-pressure formulation, in our theoretical analysis we will always use the equivalent formulation in discretely divergence free spaces, in order to shorten notation. One exception is the following stationary Stokes problem of finding for some given |${\textbf{f}} \in H^{-1}(\varOmega )^{d}$| a solution |$({\textbf{u}},p) \in H^{1}_{0}(\varOmega )^{d} \times L^{2}_{0}(\varOmega )$| to

$$ \begin{align}& \left(\nabla{\textbf{u}},\nabla{\textbf{v}}\right)_{\varOmega} - \left(\nabla \cdot{\textbf{v}},p\right)_{\varOmega} + \left(\nabla \cdot{\textbf{u}},q\right)_{\varOmega} = \left({\textbf{f}},{\textbf{v}}\right)_{\varOmega} \quad \forall ({\textbf{v}},q) \in H^{1}_{0}(\varOmega)^{d} \times L^{2}_{0}(\varOmega).\end{align} $$

(4.3)

Its discrete approximation in velocity-pressure formulation reads: find |$({\textbf{u}}_{h},p_{h}) \in \mathbf{X}_{h} \times M_{h}$| satisfying

$$ \begin{align}& \left(\nabla{\textbf{u}}_{h},\nabla{\textbf{v}}_{h}\right)_{\varOmega} - \left(\nabla \cdot{\textbf{v}}_{h},p_{h}\right)_{\varOmega} + \left(\nabla \cdot{\textbf{u}}_{h},q_{h}\right)_{\varOmega} = \left({\textbf{f}},{\textbf{v}}_{h}\right)_{\varOmega} \quad \forall ({\textbf{v}}_{h},q_{h}) \in \mathbf{X}_{h} \times M_{h}.\end{align} $$

(4.4)

The above discrete system can be interpreted as a Stokes Ritz projection: given |$({\textbf{u}},p) \in H^{1}_{0}(\varOmega )^{d} \times L^{2}_{0}(\varOmega )$|⁠, find |$(R_{h}^{S}({\textbf{u}},p),R_{h}^{S,p}({\textbf{u}},p)):=({\textbf{u}}_{h},p_{h}) \in \mathbf{X}_{h} \times M_{h}$|⁠, satisfying

$$ \begin{align}& \left(\nabla ({\textbf{u}}-{\textbf{u}}_{h}),\nabla{\textbf{v}}_{h}\right)_{\varOmega} - \left(\nabla \cdot{\textbf{v}}_{h},(p-p_{h})\right)_{\varOmega} + \left(\nabla \cdot ({\textbf{u}}-{\textbf{u}}_{h}),q_{h}\right)_{\varOmega} = 0 \quad \forall ({\textbf{v}}_{h},q_{h}) \in \mathbf{X}_{h} \times M_{h}.\end{align} $$

(4.5)

Note that, if |${\textbf{u}} \in \mathbf{V}$|⁠, then it holds |$R_{h}^{S}({\textbf{u}},p) \in \mathbf{V}_{h}$|⁠. Further, the Stokes Ritz projection satisfies the following stability, see Boffi et al. (2013, Theorem 5.2.1).

$$ \begin{align*}& \|\nabla R_{h}^{S}({\textbf{u}},p)\|_{L^{2}(\varOmega)} + \|R_{h}^{S,p}({\textbf{u}},p)\|_{L^{2}(\varOmega)} \le C (\|\nabla{\textbf{u}}\|_{L^{2}(\varOmega)} + \|p\|_{L^{2}(\varOmega)}). \end{align*} $$

Let us recall the following error estimates for the stationary discrete Stokes problem: see Boffi et al. (2013, Theorem 5.25), John (2016, Theorems 4.21, 4.25, 4.28), Ern & Guermond (2021, Theorems 53.17 & 53.19) or Girault & Raviart (1986, Chapter II, Theorems 1.8 & 1.9).

Theorem 4.1.

Let |$({\textbf{u}},p)$| and |$({\textbf{u}}_{h},p_{h})$| denote the solutions to the continuous and discrete stationary Stokes problems (4.3) and (4.4), respectively. Then there holds the estimate

$$ \begin{align*}& \|{\textbf{u}} - {\textbf{u}}_{h}\|_{H^{1}(\varOmega)} + \|p - p_{h}\|_{L^{2}(\varOmega)} \le C h (\|{\textbf{u}}\|_{H^{2}(\varOmega)} + \|p\|_{H^{1}(\varOmega)}). \end{align*} $$

If |$\varOmega $| is convex, there further holds the estimate

$$ \begin{align*}& \|{\textbf{u}} - {\textbf{u}}_{h}\|_{L^{2}(\varOmega)} \le C h (\|{\textbf{u}} -{\textbf{u}}_{h}\|_{H^{1}(\varOmega)} + \|p-p_{h}\|_{L^{2}(\varOmega)}). \end{align*} $$

Note that while Boffi et al. (2013, Theorem 5.5.6) contains formal results on how to derive error estimates for |$p$| also in a weaker norm, e.g. |$H^{-1}(\varOmega )$|, the argument requires |$H^{2}$| regularity results for the compressible Stokes equations. As the corresponding results in Kellogg & Osborn (1976) and Dauge (1989) require an additional decay condition for the compressibility data, the derivation of weaker error estimates for the pressure becomes complicated.

4.2 Temporal discretization: the discontinuous Galerkin method

In this section, we introduce the discontinuous Galerkin method for the time discretization of the transient Stokes equations; a similar method was considered, e.g., in Chrysafinos & Walkington (2010). For that, we partition |$I = (0,T]$| into subintervals |$I_{m} = (t_{m-1},t_{m}]$| of length |${k}_{m} = t_{m} - t_{m-1}$|, where |$0= t_{0}<t_{1}<\dots <t_{M-1}<t_{M} = T$|. The maximal and minimal time steps are denoted by |${k}=\max _{m} {k}_{m}$| and |${k}_{\min }=\min _{m}{k}_{m}$|, respectively. The time partition fulfills the following assumptions:

There are constants |$C,{\theta }>0$| independent of |${k}$| such that

$$ \begin{align}& {k}_{\min} \geq C {k}^{{\theta}}.\end{align} $$

(4.6)

There is a constant |$\kappa>0$| independent of |${k}$| such that for all |$m=1, 2, \dots , M-1$|

$$ \begin{align}& \kappa^{-1} \leq \frac{{k}_{m}}{{k}_{m+1}} \leq \kappa.\end{align} $$

(4.7)

It holds |${k} \leq \frac{T}{4}$|.
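For a concrete time mesh, the quantities entering these three assumptions are easy to compute. The following sketch uses a hypothetical helper `partition_constants` and a hypothetical quadratically graded mesh. Note that the assumptions concern constants that are uniform over a family of partitions, so evaluating them for a single mesh only gives the values that such constants would have to dominate.

```python
import numpy as np

def partition_constants(t, theta):
    """Return (k, k_min, C, kappa) for nodes 0 = t_0 < ... < t_M = T."""
    steps = np.diff(t)                    # k_m = t_m - t_{m-1}
    k, k_min = steps.max(), steps.min()
    C = k_min / k**theta                  # largest C with k_min >= C k^theta
    ratios = steps[:-1] / steps[1:]       # k_m / k_{m+1}
    kappa = max(ratios.max(), (1.0 / ratios).max())
    return k, k_min, C, kappa

# Hypothetical graded mesh on (0, 1], refined towards t = 0.
T = 1.0
t = T * np.linspace(0.0, 1.0, 17) ** 2
k, k_min, C, kappa = partition_constants(t, theta=2.0)
third_condition = k <= T / 4
```

For this mesh the step sizes are (2m-1)/256, so the smallest admissible ratio bound is 3 and the largest step 31/256 satisfies the third condition.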

For a given Banach space |$\mathcal{B}$|⁠, we define the semidiscrete space |$X_{k}^{0}(\mathcal{B})$| of piecewise constant functions in time as

$$ \begin{align}& X_{k}^{0}(\mathcal{B}) = \{ {\textbf{v}}_{k} \in L^{2}(I; \mathcal{B}) \colon{\textbf{v}}_{k}\vert_{I_{m}} \equiv{\textbf{v}}_{m} \text{ for some}\ {\textbf{v}}_{m} \in \mathcal{B}, m = 1,2, \dots, M\}.\end{align} $$

(4.8)

We use the following standard notation for a function |${\textbf{u}} \in X_{k}^{0}(\mathcal{B})$| to denote one-sided limits and jumps at the time nodes:

$$ \begin{align}& {\textbf{u}}_{m}^{+} = \lim_{\varepsilon \rightarrow 0^{+}}{\textbf{u}}(t_{m}+\varepsilon), \quad{\textbf{u}}_{m}^{-} = \lim_{\varepsilon \rightarrow 0^{+}}{\textbf{u}}(t_{m}-\varepsilon),\quad [{\textbf{u}}]_{m} = {\textbf{u}}_{m}^{+}-{\textbf{u}}_{m}^{-}.\end{align} $$

(4.9)

We define the bilinear form |${B}(\cdot ,\cdot )$| by

$$ \begin{align*} & {B}({\textbf{u}},{\textbf{v}}) = {\sum_{m=1}^{M} (\partial_{t}{\textbf{u}}, {\textbf{v}} )_{I_{m}\times \varOmega}} + (\nabla{\textbf{u}}, \nabla{\textbf{v}})_{I\times \varOmega} + \sum_{m=2}^{M} ([{\textbf{u}}]_{m-1},{\textbf{v}}^{+}_{m-1})_{\varOmega} + ({\textbf{u}}_{0}^{+},{\textbf{v}}_{0}^{+})_{\varOmega}. \end{align*} $$

With this bilinear form, we define the fully discrete approximation for the transient Stokes problem on the discretely divergence free space |$X_{k}^{0}(\mathbf{V}_{h})$|⁠:

$$ \begin{align}& {\textbf{u}}_{kh} \in X_{k}^{0}(\mathbf{V}_{h}) \;:\; {B}({\textbf{u}}_{kh},{\textbf{v}}_{kh}) = ({\textbf{q}},{\textbf{v}}_{kh})_{I \times \varOmega} \quad \forall{\textbf{v}}_{kh} \in X_{k}^{0}(\mathbf{V}_{h}).\end{align} $$

(4.10)
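For piecewise constant functions in time, the time derivative terms in the bilinear form vanish on each subinterval, and testing (4.10) with functions supported on a single interval shows that the scheme amounts to an implicit Euler type step in which the right-hand side is integrated over that interval. The following sketch illustrates this for a scalar model problem u' + lam*u = q with u(0) = 0, where `lam` is a hypothetical stand-in for a single mode of the discrete Stokes operator and `q` is constant in time. It is meant as an illustration under these assumptions, not as the implementation used for the numerical results.

```python
import numpy as np

def dg0_scalar(lam, q, t):
    """dG(0) for u' + lam*u = q, u(0) = 0, with q constant in time.

    On I_m the scheme reads (1 + k_m*lam) u_m = u_{m-1} + k_m*q.
    """
    u = np.zeros(len(t) - 1)
    u_prev = 0.0                          # homogeneous initial value
    for m, k_m in enumerate(np.diff(t)):
        u[m] = (u_prev + k_m * q) / (1.0 + k_m * lam)
        u_prev = u[m]
    return u

def max_error(M, lam=2.0, q=1.0, T=1.0):
    """Maximal error at the right endpoints t_m on a uniform mesh."""
    t = np.linspace(0.0, T, M + 1)
    exact = q / lam * (1.0 - np.exp(-lam * t[1:]))
    return np.max(np.abs(dg0_scalar(lam, q, t) - exact))

errors = [max_error(M) for M in (20, 40, 80)]
rates = [np.log2(errors[i] / errors[i + 1]) for i in range(2)]
```

Halving the time step roughly halves the error in this model problem, in line with the first order in time that is expected for piecewise constant approximations.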

The unique solution to this system is stable, as the following theorem summarizes:

Theorem 4.2.

Let |${\textbf{q}} \in L^{2}(I;H^{-1}(\varOmega )^{d})+L^{1}(I;L^{2}(\varOmega )^{d})$|⁠. Then there exists a unique solution |${\textbf{u}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$| of (4.10), satisfying

$$ \begin{align*}& \|{\textbf{u}}_{kh}\|_{L^{2}(I;H^{1}(\varOmega))} + \|{\textbf{u}}_{kh}\|_{L^\infty(I;L^{2}(\varOmega))} \le C \|{\textbf{q}}\|_{L^{2}(I;H^{-1}(\varOmega))+L^{1}(I;L^{2}(\varOmega))}, \end{align*} $$

where the constant |$C>0$| is independent of |$k,h$|⁠.

Proof.

As (4.10) poses a square system of linear equations in finite dimensions, it suffices to show uniqueness. This is a standard argument, making use of a discrete Gronwall Lemma, see e.g. Vexler & Wagner (2024, Theorem 4.13) for a proof focusing especially on the |$L^{1}(I;L^{2}(\varOmega )^{d})$| right-hand side case.

Remark 4.3.

Rearranging terms in the definition of the bilinear form gives the following dual representation of |$B(\cdot ,\cdot )$|

$$ \begin{align*}& B({\textbf{u}},{\textbf{v}}) = {- \sum_{m=1}^{M} ({\textbf{u}}, \partial_{t} {\textbf{v}} )_{I_{m}\times \varOmega}} + (\nabla{\textbf{u}}, \nabla{\textbf{v}})_{I\times \varOmega} - \sum_{m=1}^{M-1} ({\textbf{u}}_{m}^{-},[{\textbf{v}}]_{m})_{\varOmega} + ({\textbf{u}}_{M}^{-},{\textbf{v}}_{M}^{-})_{\varOmega}. \end{align*} $$

With the same arguments as above, for given |$\mathbf{g} \in L^{2}(I;H^{-1}(\varOmega )^{d}) + L^{1}(I;L^{2}(\varOmega )^{d})$|⁠, solutions |${\textbf{z}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$| to the discrete dual equation

$$ \begin{align*}& B({\textbf{v}}_{kh},{\textbf{z}}_{kh}) = \left(\mathbf{g},{\textbf{v}}_{kh}\right)_{I\times\varOmega} \quad \forall{\textbf{v}}_{kh} \in X_{k}^{0}(\mathbf{V}_{h}) \end{align*} $$

exist, are unique and satisfy the stability

$$ \begin{align*}& \|{\textbf{z}}_{kh}\|_{L^{2}(I;H^{1}(\varOmega))} + \|{\textbf{z}}_{kh}\|_{L^\infty(I;L^{2}(\varOmega))} \le C \|\mathbf{g}\|_{L^{2}(I;H^{-1}(\varOmega))+L^{1}(I;L^{2}(\varOmega))}. \end{align*} $$

4.3 Best approximation type fully discrete error estimate for the Stokes problem in |$L^\infty (I;L^{2}(\varOmega )^{d})$| norm

In our recent paper Behringer et al. (2023), we have established a best approximation type error estimate for the Stokes problem in the |$L^\infty (I;L^{2}(\varOmega )^{d})$| norm. From this more general result, we obtain in the case of homogeneous initial data the following result, see Behringer et al. (2023, Corollary 6.4).

Theorem 4.4.

Let |${\textbf{q}} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s>1$| and let |${\textbf{u}}\in \mathbf{W}$| be the weak solution to (2.3) with associated pressure |$p$| in the sense of (2.6). Let |${\textbf{u}}_{kh}\in X_{k}^{0}(\mathbf{V}_{h})$| be the fully discrete Galerkin solution to (4.10). Then there exists a constant |$C$| independent of |$k$| and |$h$|⁠, such that for any |$\chi \in X_{k}^{0}(\mathbf{V}_{h})$| there holds

$$ \begin{align*} & \|{\textbf{u}}-{\textbf{u}}_{kh}\|_{L^\infty(I;L^{2}(\varOmega))} \le C\ell_{k} \left(\|{\textbf{u}}-\chi\|_{L^\infty(I;L^{2}(\varOmega))}+\|{\textbf{u}}-R^{S}_{h}({\textbf{u}},p)\|_{L^\infty(I;L^{2}(\varOmega))}\right), \end{align*} $$

where |$\ell _{k}=\ln{\frac{T}{k}}$| and |$R^{S}_{h}({\textbf{u}},p)$| is the stationary finite element Stokes projection introduced in (4.5).

Using the error estimates for the stationary Stokes Ritz projection of Theorem 4.1, in Behringer et al. (2023, Theorem 7.4) the following estimate in terms of explicit orders of convergence was shown.

Corollary 4.5.

If, in addition to the assumptions of Theorem 4.4, the domain |$\varOmega $| is convex and |${\textbf{q}} \in L^\infty (I;L^{2}(\varOmega )^{d})$|, then there exists a constant |$C$| independent of |$k$| and |$h$| such that

$$ \begin{align*} & \|{\textbf{u}}-{\textbf{u}}_{kh}\|_{L^\infty(I;L^2(\varOmega))}\le C\ell_k^2(h^2+k)\|{\textbf{q}}\|_{L^\infty(I;L^2(\varOmega))}. \end{align*} $$

The above results are valid in the |$L^{2}(I; L^{2}(\varOmega )^{d})$| norm as well. However, using energy and duality arguments, it is possible to show the corresponding results without logarithmic factors and under weaker regularity assumptions on the data (cf. Leykekhman & Vexler (2024, Theorems 11 & 13)).

Theorem 4.6.

Let |${\textbf{q}} \in L^{2}(I;L^{2}(\varOmega )^{d})$| and let |${\textbf{u}}\in \mathbf{W}$| be the weak solution to (2.3) with associated pressure |$p$| in the sense of (2.6). Let |${\textbf{u}}_{kh}\in X_{k}^{0}(\mathbf{V}_{h})$| be the fully discrete Galerkin solution to (4.10). Then there exists a constant |$C$|⁠, independent of |$k$| and |$h$|⁠, such that for any |$\chi \in X_{k}^{0}(\mathbf{V}_{h})$|⁠, there holds

$$ \begin{align}& \|{\textbf{u}}-{\textbf{u}}_{kh}\|_{L^{2}(I; L^{2}(\varOmega))}\le C \left(\|{\textbf{u}}-\chi\|_{L^{2}(I; L^{2}(\varOmega))}+\|{\textbf{u}}-R^{S}_{h}({\textbf{u}},p)\|_{L^{2}(I; L^{2}(\varOmega))}+\|{\textbf{u}}-\pi_{k}{\textbf{u}}\|_{L^{2}(I; L^{2}(\varOmega))}\right),\end{align} $$

(4.11a)

and

$$ \begin{align}& \begin{aligned} \|\nabla({\textbf{u}}-{\textbf{u}}_{kh})\|_{L^{2}(I; L^{2}(\varOmega))} \le C \bigg(\|\nabla({\textbf{u}}-\chi)\|_{L^{2}(I; L^{2}(\varOmega))}&+\|\nabla({\textbf{u}}-R^{S}_{h}({\textbf{u}},p))\|_{L^{2}(I; L^{2}(\varOmega))}\\ &+\|\nabla({\textbf{u}}-\pi_{k}{\textbf{u}})\|_{L^{2}(I; L^{2}(\varOmega))}\bigg), \end{aligned}\end{align} $$

(4.11b)

where |$R^{S}_{h}({\textbf{u}},p)$| is the stationary finite element Stokes projection defined in (4.5) and |$\pi _{k}$| is the time projection onto |$X^{0}_{k}(\mathbf{V})$|⁠, with |$\pi _{k} v|_{I_{m}} = v(t_{m}^{-})$| for |$m=1,2,\dots ,M$|⁠.
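The action of |$\pi _{k}$| can be illustrated by a short Python sketch. It is a minimal illustration under simplifying assumptions: the function is scalar-valued and continuous in time (so that |$v(t_{m}^{-})=v(t_{m})$|), the intervals are taken as |$I_{m}=(t_{m-1},t_{m}]$|, and the name `pi_k` as well as the sample data are hypothetical.

```python
from typing import Callable, List, Sequence

def pi_k(v: Callable[[float], float], nodes: Sequence[float]) -> List[float]:
    """Values of the piecewise constant function pi_k v on the intervals I_m.

    For a continuous v the left limit v(t_m^-) equals v(t_m), so entry m-1 of
    the returned list is the constant value of pi_k v on I_m = (t_{m-1}, t_m].
    """
    if any(b <= a for a, b in zip(nodes, nodes[1:])):
        raise ValueError("time nodes must be strictly increasing")
    return [v(t_m) for t_m in nodes[1:]]

# v(t) = t^2 on the partition 0 < 0.5 < 1, sampled at the right endpoints t_1, t_2.
values = pi_k(lambda t: t * t, [0.0, 0.5, 1.0])
```

Each entry of `values` is the value that |$\pi _{k} v$| takes on the whole corresponding time interval.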

Corollary 4.7.

If in addition to assumptions of Theorem 4.6, the domain |$\varOmega $| is convex, then there exists a constant |$C$| independent of |$k$| and |$h$| such that

$$ \begin{align*} & \|{\textbf{u}}-{\textbf{u}}_{kh}\|_{L^2(I\times \varOmega)}\le C(h^2+k)\|{\textbf{q}}\|_{L^2(I\times \varOmega)}. \end{align*} $$
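To get a feeling for the influence of the logarithmic factor, the following Python sketch evaluates the quantities |$h^{2}+k$| from Corollary 4.7 and |$\ell _{k}^{2}(h^{2}+k)$| from Corollary 4.5 for given discretization parameters. The generic constant |$C$| and the norm of the data are omitted, and the function names, the coupling |$k=h^{2}$| and the sample values are illustrative assumptions only.

```python
import math

def log_factor(k: float, T: float = 1.0) -> float:
    """Return ell_k = ln(T / k) for a time step 0 < k < T."""
    if not 0.0 < k < T:
        raise ValueError("expected 0 < k < T")
    return math.log(T / k)

def rate_l2(h: float, k: float) -> float:
    """h^2 + k, the log-free rate of Corollary 4.7 (constant and data norm omitted)."""
    return h ** 2 + k

def rate_linf_l2(h: float, k: float, T: float = 1.0) -> float:
    """ell_k^2 * (h^2 + k), the rate of Corollary 4.5 (constant and data norm omitted)."""
    return log_factor(k, T) ** 2 * rate_l2(h, k)

# Hypothetical coupling k = h^2 on two meshes.
for h in (0.1, 0.05):
    k = h ** 2
    print(f"h={h}: L2 rate {rate_l2(h, k):.3e}, Linf(L2) rate {rate_linf_l2(h, k):.3e}")
```

With this coupling, halving |$h$| reduces |$h^{2}+k$| by a factor of four, whereas the growth of |$\ell _{k}$| makes the reduction of |$\ell _{k}^{2}(h^{2}+k)$| smaller on such coarse meshes.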

5. Variational discretization of the optimal control problem

In this section, we consider the optimal control problem subject to the fully discretized Stokes equations. We use a variational discretization for the controls, i.e., we do not yet fix a finite-dimensional approximation of the control space, cf. Hinze (2005); Deckelnick & Hinze (2011). The problem reads

$$ \begin{align}& \text{Minimize} \; J({\textbf{q}}_{kh},{\textbf{u}}_{kh}) = \frac{1}{2} \lVert{\textbf{u}}_{kh} - {\textbf{u}}_{d}\rVert^{2}_{L^{2}(I\times \varOmega)}+ \frac{\alpha}{2} \lVert{\textbf{q}}_{kh}\rVert^{2}_{L^{2}(I\times \varOmega)}\end{align} $$

(5.1a)

over all |${\textbf{q}}_{kh} \in \mathbf{Q}_{ad}$|⁠, |${\textbf{u}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$|⁠, subject to

$$ \begin{align} & B({\textbf{u}}_{kh},{\textbf{v}}_{kh}) = ( {\textbf{q}}_{kh}, {\textbf{v}}_{kh})_{I\times\varOmega} \quad \text{for all}\quad{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h}) \end{align} $$

(5.1b)

$$ \begin{align} & G_{\textbf{w}}({\textbf{u}}_{kh})\mid_{I_{m}}\le \beta\quad\text{for}\quad m=1,2,\dots,M. \end{align} $$

(5.1c)

Following the structure of Section 3 and using Theorem 4.2, we introduce the discrete analog to the control state map,

$$ \begin{align}& S_{kh}\colon L^{2}(I;H^{-1}(\varOmega)^{d}) + L^{1}(I;L^{2}(\varOmega)^{d}) \to X^{0}_{k}(\mathbf{V}_{h}),\quad{\textbf{q}} \mapsto{\textbf{u}}_{kh} = {\textbf{u}}_{kh}({\textbf{q}}) \text{ solving } ({4.10}).\end{align} $$

(5.2)

We describe the finitely many state constraints by means of the continuous linear operator |$\mathcal{G}_{kh}\colon \mathbf{Q}_{ad} \to{\mathbb{R}}^{M}$| with |$(\mathcal{G}_{kh}({\textbf{q}}))_{m}:=G_{\mathbf{w}}\circ S_{kh} ({\textbf{q}}) \mid _{I_{m}}$| for |$m=1,2,\dots ,M$|. Using the set

$$ \begin{align*} & \mathcal{K}_{kh}:=\{{\textbf{v}} \in\mathbb{R}^M\ \mid \ {\textbf{v}} _m\le \beta,\ m=1,2,\dots,M \}, \end{align*} $$

we can rewrite the problem (5.1a)–(5.1c) in the reduced form:

$$ \begin{align}& \text{Minimize} \ j_{kh}({\textbf{q}}_{kh}):=J({\textbf{q}}_{kh},S_{kh}({\textbf{q}}_{kh})) \quad \text{over} \quad{\textbf{q}}_{kh}\in \mathbf{Q}_{ad}\quad \text{subject to}\quad \mathcal{G}_{kh}({\textbf{q}}_{kh})\in \mathcal{K}_{kh}.\end{align} $$

(5.3)

Before discussing wellposedness and optimality conditions of this discrete problem, we shall show that the Slater assumption on the continuous level carries over to the discrete problem. As we achieve this with the finite element error estimates presented in Section 4, we need to impose a rather weak coupling condition between |$k$| and |$h$|⁠, allowing us to deduce convergence in the result of Corollary 4.5. Throughout the remainder of this work, we thus work under the following assumption.

Assumption 3.

There exists a function |$\varPhi \colon (0,1) \to (0,\infty )$| with |$\lim _{h\to 0} \varPhi (h) = 0$|⁠, such that the discretization parameters |$k$| and |$h$| satisfy

$$ \begin{align*}& \left|\ln\left(\frac{T}{k}\right)\right| h \le \varPhi(h). \end{align*} $$

Remark 5.1.

This assumption is satisfied, e.g., if there exists a constant |$C> 0$| such that |$\left |\ln \left (\frac{T}{k}\right )\right | h |\ln h| \le C$|, since then |$\left |\ln \left (\frac{T}{k}\right )\right | h \le C |\ln h|^{-1} \to 0$| as |$h \to 0$|. As the choice of the term |$|\ln h|$| in such a condition can be made arbitrarily weak, we have chosen to work under the more general formulation of Assumption 3.
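How weak the coupling of Assumption 3 is can be seen from a small numerical experiment. The Python sketch below evaluates the left-hand side |$\left |\ln \left (\frac{T}{k}\right )\right | h$| for the hypothetical coupling |$k=h^{2}$| with |$T=1$|, in which case it equals |$2h|\ln h|$|; the function name and the sample mesh sizes are assumptions made for illustration.

```python
import math

def coupling_quantity(h: float, k: float, T: float = 1.0) -> float:
    """Return |ln(T / k)| * h, the left-hand side of Assumption 3."""
    if h <= 0.0 or k <= 0.0 or T <= 0.0:
        raise ValueError("h, k and T must be positive")
    return abs(math.log(T / k)) * h

# Hypothetical coupling k = h^2 with T = 1: the quantity equals 2 * h * |ln h|.
samples = [coupling_quantity(h, h ** 2) for h in (0.1, 0.01, 0.001)]
```

For this coupling, one admissible choice would be |$\varPhi (h) = 2h|\ln h|$|, and the computed samples decrease as |$h$| decreases.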

Lemma 5.2.

There exists |$h_{0}> 0$| such that for any |$h\le h_{0}$| and |$k$| satisfying Assumption 3, the Slater point |$\tilde{\textbf{q}} \in \mathbf{Q}_{ad}$| from Assumption 1 satisfies the following discrete Slater condition:

$$ \begin{align*}& G_{\textbf{w}}({\textbf{u}}_{kh}(\tilde{\mathbf{q}})) < \beta{\qquad \text{for all}\ t \in \bar I.} \end{align*} $$

Proof.

Since |$G_{\mathbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))<\beta $| in |$\bar I$| by the Slater condition of Assumption 1, there exists |$\delta>0$| such that |$G_{\mathbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))\le \beta -\delta $| in |$\bar I$|. For arbitrary |$\hat{\textbf{q}} \in C^\infty (I\times \varOmega )^{d}$|, the triangle inequality yields

$$ \begin{align*}& \|{\textbf{u}}(\tilde{\textbf{q}}) - {\textbf{u}}_{kh}(\tilde{\textbf{q}})\|_{L^\infty(I;L^{2}(\varOmega))} \!\le \|{\textbf{u}}(\tilde{\textbf{q}}) - {\textbf{u}}_{kh}(\tilde{\textbf{q}}) \!-\! {\textbf{u}}(\hat{\textbf{q}}) + {\textbf{u}}_{kh}(\hat{\textbf{q}})\|_{L^\infty(I;L^{2}(\varOmega))}\!+ \|{\textbf{u}}(\hat{\textbf{q}}) - {\textbf{u}}_{kh}(\hat{\textbf{q}})\|_{L^\infty(I;L^{2}(\varOmega))}. \end{align*} $$

Using the continuous and fully discrete stability results of the state equations, presented in Theorem 2.1 and Theorem 4.2, as well as the error estimate Corollary 4.5 for the problem with right-hand side |$\hat{\textbf{q}}$|⁠, we obtain

$$ \begin{align*}& \|{\textbf{u}}(\tilde{\textbf{q}}) - {\textbf{u}}_{kh}(\tilde{\textbf{q}})\|_{L^\infty(I;L^{2}(\varOmega))} \le C \|\tilde{\textbf{q}} - \hat{\textbf{q}}\|_{L^{2}(I\times \varOmega)} + C \ell_{k}^{2} (k + h^{2}) \|\hat{\textbf{q}}\|_{L^\infty(I;L^{2}(\varOmega))}. \end{align*} $$

For any |$\varepsilon > 0$|, due to the density of |$C^\infty (I\times \varOmega )^{d}$| in |${L^{2}(I;L^{2}(\varOmega )^{d})}$|, we can find |$\hat{\textbf{q}}_\varepsilon $| such that |$C \|\tilde{\textbf{q}} - \hat{\textbf{q}}_\varepsilon \|_{L^{2}(I \times \varOmega )} < \frac{\varepsilon }{2}$|. Moreover, for |$h\le h_{0}$| sufficiently small and |$k$| satisfying Assumption 3, it also holds |$C \ell _{k}^{2} (k + h^{2}) \|\hat{\textbf{q}}_\varepsilon \|_{L^\infty (I;L^{2}(\varOmega ))} < \frac{\varepsilon }{2}$|. Thus in total |$\|{\textbf{u}}(\tilde{\textbf{q}}) - {\textbf{u}}_{kh}(\tilde{\textbf{q}})\|_{L^\infty (I;L^{2}(\varOmega ))} < \varepsilon $|. Choosing |$\varepsilon $| small enough such that |$\varepsilon \|{\textbf{w}}\|_{L^{2}(\varOmega )} < \delta$|, we obtain

$$ \begin{align}& G_{\textbf{w}}({\textbf{u}}_{kh}(\tilde{{\textbf{q}}}))=G_{\textbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))+G_{\textbf{w}}({\textbf{u}}_{kh}(\tilde{{\textbf{q}}})-{\textbf{u}}(\tilde{{\textbf{q}}})) \le \beta-\delta+\|{\textbf{w}}\|_{L^{2}(\varOmega)}\|{\textbf{u}}(\tilde{{\textbf{q}}})-{\textbf{u}}_{kh}(\tilde{{\textbf{q}}})\|_{L^\infty(I;L^{2}(\varOmega))}<\beta.\end{align} $$

(5.4)

Theorem 5.3.

Let |$k$| and |$h$| satisfy Assumption 3, and let |$h$| be small enough. Then there exists a unique solution |$(\bar{\textbf{q}}_{kh},\bar{\textbf{u}}_{kh})$| to the optimal control problem (5.1a)–(5.1c).

Proof.

As Lemma 5.2 shows feasibility of |$(\tilde{\textbf{q}},{\textbf{u}}_{kh}(\tilde{\textbf{q}}))$| under the given assumptions, the existence proof follows the same steps as the one of Theorem 3.6 on the continuous level.

Theorem 5.4 (Discrete first order optimality system).

A control |$\bar{{\textbf{q}}}_{kh}\in \mathbf{Q}_{ad}$| and the associated state |$\bar{{\textbf{u}}}_{kh}={\textbf{u}}_{kh}(\bar{{\textbf{q}}}_{kh}) \in X^{0}_{k}(\mathbf{V}_{h})$| is the optimal solution to the problem (5.1a)–(5.1c) if and only if there exists an adjoint state |$\bar{{\textbf{z}}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$| and a Lagrange multiplier |$\bar{\mu }_{kh}\in L^{1}(I)$| that satisfy: Discrete state equation

$$ \begin{align}& B(\bar{{\textbf{u}}}_{kh},{\textbf{v}}_{kh}) = ( \bar{{\textbf{q}}}_{kh}, {\textbf{v}}_{kh})_{I\times\varOmega}, \quad \forall{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h}).\end{align} $$

(5.5a)

Discrete state constraint and complementarity conditions

$$ \begin{align}& G_{\textbf{w}}(\bar{\textbf{u}}_{kh})|_{I_{m}} \le \beta, \ m=1,...,M, \quad \bar{\mu}_{kh}\geq 0 \quad\text{and}\quad \langle \bar{\mu}_{kh},\beta-G_{\textbf{w}}(\bar{\textbf{u}}_{kh}) \rangle=0.\end{align} $$

(5.5b)

Discrete adjoint equation

$$ \begin{align}& B({\textbf{v}}_{kh},\bar{{\textbf{z}}}_{kh}) = ( \bar{{\textbf{u}}}_{kh}-{\textbf{u}}_{d}+{\bar{\mu}_{kh}}{\textbf{w}}, {\textbf{v}}_{kh})_{I\times\varOmega} \quad \forall{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h}).\end{align} $$

(5.5c)

Discrete variational inequality

$$ \begin{align}& \left(\alpha\bar{{\textbf{q}}}_{kh}+\bar{{\textbf{z}}}_{kh}, \delta{\textbf{q}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}\geq 0 \quad \forall \delta{\textbf{q}}\in \mathbf{Q}_{ad} \ \Leftrightarrow \ \bar{{\textbf{q}}}_{kh} =P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}\left(-\frac{1}{\alpha}\bar{{\textbf{z}}}_{kh}\right).\end{align} $$

(5.5d)

Furthermore, there exist |$\bar \mu _{kh}^{m} \in{\mathbb{R}}_{\ge 0}$|⁠, |$m=1,2,\dots ,M$|⁠, such that the discrete Lagrange multiplier |$\bar{\mu }_{kh}\in L^{1}(I)$| satisfies the expression

$$ \begin{align}& \bar \mu_{kh} = \sum_{m=1}^{M} \frac{\bar \mu_{kh}^{m}}{k_{m}} \chi_{I_{m}},\end{align} $$

(5.6)

where |$\chi _{I_{m}}$| denotes the characteristic function of the interval |$I_{m}$|⁠.

Proof.

In Lemma 5.2, we have shown that under Assumption 3 and for small enough |$h$|, there holds |$\mathcal{G}_{kh}(\tilde{{\textbf{q}}})\in \operatorname{int}(\mathcal{K}_{kh})$|. Similarly to the proof of Theorem 3.7, we obtain that the optimality of |$\bar{{\textbf{q}}}_{kh}$| is equivalent to the existence of a Lagrange multiplier |$(\bar{\mu }_{kh}^{m})_{m=1}^{M}\in \mathbb{R}^{M}_{\ge 0}$| and an adjoint state |$\bar{{\textbf{z}}}_{kh}\in X^{0}_{k}(\mathbf{V}_{h})$| satisfying (5.5b), (5.5c) and (5.5d). Finally, by the construction given in (5.6), |$\bar{\mu }_{kh}$| is an element of |$L^{1}(I)$|.

Remark 5.5.

Notice that from the definition (5.6) and using that |$\bar \mu _{kh} \ge 0$|⁠, it holds

$$ \begin{align*} & \|\bar \mu_{kh}\|_{L^1(I)} = \|\bar{\mu}_{kh}\|_{C(\bar{I})^*}=\langle \bar{\mu}_{kh},1 \rangle = \sum_{m=1}^M \bar \mu_{kh}^m. \end{align*} $$
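The identity of Remark 5.5 can be checked directly from the representation (5.6). The following Python sketch builds the piecewise constant values |$\bar \mu _{kh}^{m}/k_{m}$| from given coefficients and integrates their absolute value over |$I$|; the function names, the coefficients and the step sizes are hypothetical and serve only as an illustration.

```python
from typing import List, Sequence

def multiplier_values(mu: Sequence[float], steps: Sequence[float]) -> List[float]:
    """Constant values mu^m / k_m of the multiplier on each interval I_m, cf. (5.6)."""
    if len(mu) != len(steps):
        raise ValueError("need one coefficient per time interval")
    if any(c < 0.0 for c in mu) or any(k_m <= 0.0 for k_m in steps):
        raise ValueError("need mu^m >= 0 and k_m > 0")
    return [c / k_m for c, k_m in zip(mu, steps)]

def l1_norm(values: Sequence[float], steps: Sequence[float]) -> float:
    """L^1(I) norm of a piecewise constant function with the given values and step sizes."""
    return sum(abs(val) * k_m for val, k_m in zip(values, steps))

mu = [0.0, 2.0, 0.5]       # hypothetical coefficients mu^m >= 0
steps = [0.25, 0.5, 0.25]  # hypothetical step sizes k_m
norm = l1_norm(multiplier_values(mu, steps), steps)
```

The scaling by |$1/k_{m}$| in (5.6) cancels against the interval length in the integral, so `norm` coincides with the sum of the coefficients.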

Remark 5.6.

We would like to point out that, although the state |$\bar{{\textbf{u}}}_{kh}$| and the adjoint |$\bar{{\textbf{z}}}_{kh}$| are fully discrete, the corresponding control |$\bar{{\textbf{q}}}_{kh}\in \mathbf{Q}_{ad}$| is, by (5.5d), piecewise constant in time but not necessarily piecewise polynomial in space with respect to the given mesh, due to the projection onto |$[{\textbf{q}}_{a},{\textbf{q}}_{b}]$|.
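The effect of the projection formula in (5.5d) can be sketched in a few lines of Python. The example treats a single scalar component with constant bounds and applies |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}(-\frac{1}{\alpha }\,\cdot \,)$| to sampled adjoint values; the function name, the bounds and the samples are hypothetical.

```python
from typing import List, Sequence

def project_control(z: Sequence[float], alpha: float, q_a: float, q_b: float) -> List[float]:
    """Apply q = P_[q_a, q_b](-z / alpha) entrywise to sampled adjoint values z."""
    if alpha <= 0.0 or q_a > q_b:
        raise ValueError("need alpha > 0 and q_a <= q_b")
    return [min(max(-z_i / alpha, q_a), q_b) for z_i in z]

# Hypothetical adjoint samples; the bounds become active at both ends of the list.
q = project_control([-2.0, -1.0, 0.5, 1.0, 2.0], alpha=1.0, q_a=-1.5, q_b=0.5)
```

Where a bound is active the control is constant, and the switching points between active and inactive regions are determined by the adjoint values rather than by the mesh, which is why the control need not be piecewise polynomial on the given triangulation.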

With the optimality conditions established, we now show the following stability of optimal solutions to the discrete problem subject to different discretization levels.

Lemma 5.7.

Under Assumption 3 and for |$h$| small enough, there exists |$C>0$| independent of |$k$|⁠, |$h$|⁠, such that the optimal control |$\bar{{\textbf{q}}}_{kh}\in \mathbf{Q}_{ad}$| of the variationally discretized problem (5.1a)–(5.1c), together with its corresponding state |$\bar{{\textbf{u}}}_{kh}\in X^{0}_{k}(\mathbf{V}_{h})$| and corresponding multiplier |$\bar{\mu }_{kh}\in L^{1}(\bar I)$| satisfy the bound

$$ \begin{align*} & \|\bar{\textbf{q}}_{kh}\|_{L^\infty(I;L^2(\varOmega))}+ \|\bar{{\textbf{u}}}_{kh}\|_{L^2(I\times \varOmega)}+\|\bar{\mu}_{kh}\|_{L^1(\bar I)}\le C. \end{align*} $$

Proof.

Due to the feasibility of |$\tilde{\textbf{q}}$|⁠, shown in Lemma 5.2, it holds

$$ \begin{align*} & \begin{aligned} J(\bar{{\textbf{q}}}_{kh},\bar{{\textbf{u}}}_{kh})\le J(\tilde{{\textbf{q}}},{\textbf{u}}_{kh}(\tilde{{\textbf{q}}})) &=\frac{1}{2} \lVert{\textbf{u}}_{kh}(\tilde{{\textbf{q}}}) - {\textbf{u}}_d\rVert^2_{L^2(I\times \varOmega)} + \frac{\alpha}{2} \lVert\tilde{{\textbf{q}}}\rVert^2_{L^2(I\times \varOmega)} \le C(T,\tilde{{\textbf{q}}}), \end{aligned} \end{align*} $$

where due to Theorem 4.2, this bound is independent of |$k$| and |$h$|⁠. This results in

$$ \begin{align}& \|\bar{{\textbf{q}}}_{kh}\|_{L^{2}(I\times \varOmega)}+\|\bar{{\textbf{u}}}_{kh}\|_{L^{2}(I\times \varOmega)}\le C.\end{align} $$

(5.7)

Let us define |${\textbf{p}}=\frac{1}{2}\bar{{\textbf{q}}}+\frac{1}{2}\tilde{{\textbf{q}}}$|, where |$\bar{{\textbf{q}}}$| denotes the optimal control of the continuous problem. Since |$\mathbf{Q}_{ad}$| is convex, it holds |${\textbf{p}}\in \mathbf{Q}_{ad}$|, and thus by (5.5d) it holds |$ \left (\alpha \bar{{\textbf{q}}}_{kh}+\bar{{\textbf{z}}}_{kh}, {\textbf{p}}-\bar{{\textbf{q}}}_{kh}\right )_{I\times \varOmega }\geq 0. $| This yields

$$ \begin{align}& \begin{aligned} 0 &\le \alpha\left(\bar{{\textbf{q}}}_{kh}, {\textbf{p}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}+\left(\bar{{\textbf{z}}}_{kh}, {\textbf{p}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}\\ &=\alpha\left(\bar{{\textbf{q}}}_{kh}, {\textbf{p}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}+B({\textbf{u}}_{kh}({\textbf{p}})-\bar{{\textbf{u}}}_{kh},\bar{{\textbf{z}}}_{kh})\\ &=\alpha\left(\bar{{\textbf{q}}}_{kh}, {\textbf{p}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}+({\textbf{u}}_{kh}({\textbf{p}})-\bar{{\textbf{u}}}_{kh},\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_{d})_{I\times \varOmega}+\langle \bar{\mu}_{kh},G_{\textbf{w}}({{\textbf{u}}}_{kh}({\textbf{p}}))-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}) \rangle, \end{aligned}\end{align} $$

(5.8)

where we can bound the first two terms by (5.7) and obtain

$$ \begin{align}& 0 \le C + \langle \bar{\mu}_{kh},G_{\textbf{w}}({{\textbf{u}}}_{kh}({\textbf{p}}))-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}) \rangle.\end{align} $$

(5.9)

For |${\textbf{p}}$|⁠, using Assumption 3, we can follow a similar argument as in the proof of Lemma 5.2, in order to obtain

$$ \begin{align*} & \begin{aligned} G_{\textbf{w}}({\textbf{u}}_{kh}({{\textbf{p}}}))&=G_{\textbf{w}}({\textbf{u}}_{kh}({{\textbf{p}}})-{\textbf{u}}({{\textbf{p}}}))+G_{\textbf{w}}({\textbf{u}}({{\textbf{p}}}))\\ &=G_{\textbf{w}}({\textbf{u}}_{kh}({{\textbf{p}}})-{\textbf{u}}({{\textbf{p}}}))+\frac{1}{2}G_{\textbf{w}}(\bar{{\textbf{u}}})+\frac{1}{2}G_{\textbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))\\ &\le \| {\textbf{w}}\|_{L^2(\varOmega)}\|{\textbf{u}}_{kh}({{\textbf{p}}})-{\textbf{u}}({{\textbf{p}}})\|_{L^\infty(I; L^2(\varOmega))}+\frac{1}{2}G_{\textbf{w}}(\bar{{\textbf{u}}})+\frac{1}{2}G_{\textbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))\\ &\le\frac{1}{4}\delta+ \frac{1}{2}\beta+\frac{1}{2}\beta-\frac{1}{2}\delta =\beta-\frac{1}{4}\delta. \end{aligned} \end{align*} $$

Inserting this into (5.9) yields together with |$\bar \mu _{kh} \ge 0$| and the complementarity conditions (5.5b):

$$ \begin{align*} & 0 \le C+\langle \bar{\mu}_{kh},\beta-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}) \rangle-\frac{\delta}{4}\langle \bar{\mu}_{kh},1 \rangle =C-\frac{\delta}{4}\langle \bar{\mu}_{kh},1 \rangle. \end{align*} $$

Thus, again using |$\bar{\mu }_{kh}\geq 0$| and Remark 5.5 results in

$$ \begin{align}& \|\bar{\mu}_{kh}\|_{L^{1}(\bar I)}=\langle \bar{\mu}_{kh},1 \rangle\le C.\end{align} $$

(5.10)

Combining (5.7) and (5.10) with Remark 4.3 yields the boundedness of |$\bar{\textbf{z}}_{kh}$| in |$L^\infty (I;L^{2}(\varOmega )^{d})$|⁠. By the representation |$\bar{\textbf{q}}_{kh} = P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]} \left ( - \frac{1}{\alpha } \bar{\textbf{z}}_{kh} \right )$|⁠, this shows |$\|\bar{\textbf{q}}_{kh}\|_{L^\infty (I;L^{2}(\varOmega ))} \le C$|⁠, which concludes the proof.

Theorem 5.8.

Let Assumption 3 hold and let |$h$| be sufficiently small. Let |$(\bar{\textbf{q}},\bar{\textbf{u}})$| and |$(\bar{\textbf{q}}_{kh},\bar{\textbf{u}}_{kh})$| be the unique solutions to the continuous and variationally discretized optimal control problems (1.1a)–(1.1d) and (5.1a)–(5.1c), respectively. Then there exists a constant |$C>0$|⁠, such that it holds

$$ \begin{align*} & \sqrt{\alpha}\lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{kh}\rVert_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{kh}\rVert_{L^{2}(I\times \varOmega)} \le C \, \ell_{k} (k^{\frac{1}{2}}+h), \quad \ell_{k} = \ln{\frac{T}{k}}. \end{align*} $$

Proof.

Choosing |$\delta{\textbf{q}}=\bar{{\textbf{q}}}_{kh}$| in (3.6d) and |$\delta{\textbf{q}}=\bar{{\textbf{q}}}$| in (5.5d) results in

$$ \begin{align}& \left(\alpha\bar{{\textbf{q}}}+\bar{{\textbf{z}}}, \bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}}\right)_{I\times \varOmega}\geq 0\quad \text{and}\quad \left(\alpha\bar{{\textbf{q}}}_{kh}+\bar{{\textbf{z}}}_{kh}, \bar{{\textbf{q}}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}\geq 0.\end{align} $$

(5.11)

Adding these two inequalities results in

$$ \begin{align}& \alpha\|\bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}}\|^{2}_{L^{2}(I\times \varOmega)}\le (\bar{{\textbf{z}}},\bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}})_{I\times \varOmega}+ (\bar{{\textbf{z}}}_{kh},\bar{{\textbf{q}}}-\bar{{\textbf{q}}}_{kh})_{I\times \varOmega}=:I_{1}+I_{2}.\end{align} $$

(5.12)
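For completeness, the step from (5.11) to (5.12) can be written out as follows. It only expands the sum of the two inequalities in (5.11) and collects the terms containing |$\alpha $|; no further assumptions enter.

$$ \begin{align*} & \begin{aligned} 0 &\le \left(\alpha\bar{{\textbf{q}}}+\bar{{\textbf{z}}}, \bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}}\right)_{I\times \varOmega}+\left(\alpha\bar{{\textbf{q}}}_{kh}+\bar{{\textbf{z}}}_{kh}, \bar{{\textbf{q}}}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}\\ &= -\alpha\|\bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}}\|^{2}_{L^{2}(I\times \varOmega)}+(\bar{{\textbf{z}}},\bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}})_{I\times \varOmega}+(\bar{{\textbf{z}}}_{kh},\bar{{\textbf{q}}}-\bar{{\textbf{q}}}_{kh})_{I\times \varOmega}. \end{aligned} \end{align*} $$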

We estimate the two terms separately.

Estimate for |$I_{1}$|⁠. Using the weak formulations (2.3) and (3.2) of the continuous state and adjoint equations (3.6a) & (3.6c), respectively, we have

$$ \begin{align*} & \begin{aligned} I_1 &= \left\langle\partial_t ({\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}),\bar{\textbf{z}}\right\rangle_{I\times\varOmega} +\left(\nabla ({\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}),\nabla \bar{\textbf{z}}\right)_{I\times\varOmega}\\ &= (\bar{{\textbf{u}}}-{\textbf{u}}_d,{\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}})_{I \times \varOmega}+\langle G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}),\bar{\mu}\rangle. \end{aligned} \end{align*} $$

Introducing the pointwise projection onto the state constraint

$$ \begin{align}& P_\beta v= P_{\beta}v(t):= \min\{v(t),\beta\},\quad \forall t\in \bar{I},\end{align} $$

(5.13)

the last term can be estimated as

$$ \begin{align*} & \begin{aligned} \langle G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}),\bar{\mu}\rangle &=\langle G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})),\bar{\mu}\rangle +\langle P_{\beta}G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-G_{\textbf{w}}(\bar{{\textbf{u}}}),\bar{\mu}\rangle\\ &\le \langle G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})),\bar{\mu}\rangle +\langle{\beta} -G_{\textbf{w}}(\bar{{\textbf{u}}}),\bar{\mu}\rangle, \end{aligned} \end{align*} $$

where we have used that due to |$\bar \mu \ge 0$|⁠, it holds |$\langle P_{\beta } G_{\textbf{w}}({\textbf{u}}(\bar{\textbf{q}}_{kh})),\bar \mu \rangle \le \langle{\beta },\bar \mu \rangle $|⁠. Using the complementarity condition (3.6b), it holds |$\langle{\beta } -G_{\mathbf{w}}(\bar{{\textbf{u}}}),\bar{\mu }\rangle = 0$|⁠, hence we have

$$ \begin{align*}& \langle G_{\mathbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}),\bar{\mu}\rangle \le \langle G_{\mathbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-P_{\beta}G_{\mathbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})),\bar{\mu}\rangle. \end{align*} $$

Now using that

$$ \begin{align}& |P_{\beta}v-P_{\beta}u|\le |v-u|,\end{align} $$

(5.14)

by the triangle inequality and using that |$G_{\mathbf{w}}(\bar{{\textbf{u}}}_{kh})\le{\beta }$|⁠, we obtain

$$ \begin{align*} & \begin{aligned} \langle G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh})),\bar{\mu}\rangle \le & \| G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))\|_{L^\infty(I)}\|\bar{\mu}\|_{C(\bar{I})^*}\\ \le & \| G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh})\|_{L^\infty(I)}\|\bar{\mu}\|_{C(\bar{I})^*}\\ &+\| P_{\beta} G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh})-P_{\beta}G_{\textbf{w}}({\textbf{u}}(\bar{{\textbf{q}}}_{kh}))\|_{L^\infty(I)}\|\bar{\mu}\|_{C(\bar{I})^*}\\ \le & 2\|\textbf{w}\|_{L^2(\varOmega)}\| {\textbf{u}}_{kh}(\bar{{\textbf{q}}}_{kh})-{\textbf{u}}(\bar{{\textbf{q}}}_{kh})\|_{L^\infty(I ;L^2(\varOmega))}\|\bar{\mu}\|_{C(\bar{I})^*}\\ \le & C \ell_k^2(h^2+k)\|\bar{{\textbf{q}}}_{kh}\|_{L^\infty(I;L^2(\varOmega))}\|\textbf{w}\|_{L^2(\varOmega)}\|\bar{\mu}\|_{C(\bar{I})^*}\\ \le & C \ell_k^2(h^2+k), \end{aligned} \end{align*} $$

where in the last two steps we used Corollary 4.5 and Lemma 5.7. Thus,

$$ \begin{align*} & I_1 \le C \ell_k^2(h^2+k)+(\bar{{\textbf{u}}}-{\textbf{u}}_d,{\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}})_{I \times \varOmega}. \end{align*} $$
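The two properties of the cut-off |$P_{\beta }$| used in this estimate, namely |$P_{\beta } v \le \beta $| and the nonexpansiveness (5.14), are elementary and can be checked on sample values. The Python sketch below takes the cut-off in the form |$v \mapsto \min \{v,\beta \}$|, which is the form for which |$P_{\beta } v = v$| whenever |$v \le \beta $|; the function name and the sample pairs are illustrative assumptions.

```python
def p_beta(v: float, beta: float) -> float:
    """Cut-off of a value at the bound beta, taken here as min(v, beta)."""
    return min(v, beta)

beta = 1.0
pairs = [(-2.0, 0.5), (0.5, 3.0), (2.0, 4.0), (1.0, 1.0)]  # hypothetical sample values

# Check |P_beta v - P_beta u| <= |v - u| on the sample pairs, cf. (5.14).
nonexpansive = all(abs(p_beta(v, beta) - p_beta(u, beta)) <= abs(v - u) for v, u in pairs)

# Check that the cut-off never exceeds beta.
bounded = all(p_beta(v, beta) <= beta for pair in pairs for v in pair)
```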

Estimate for |$I_{2}$|⁠. Similarly, using the fully discrete state and adjoint equations (5.5a) and (5.5c), respectively, we have

$$ \begin{align*} & \begin{aligned} I_2 &= B({\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh},\bar{{\textbf{z}}}_{kh})\\ &= (\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}+\langle G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle. \end{aligned} \end{align*} $$

Using the projection |$P_{\beta }$| defined in (5.13), the last term in |$I_{2}$| can be estimated as

$$ \begin{align*} & \begin{aligned} \langle G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle &=\langle G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})),\bar{\mu}_{kh}\rangle +\langle P_{\beta}G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle\\ &\le \langle G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})),\bar{\mu}_{kh}\rangle+\langle{\beta} -G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle, \end{aligned} \end{align*} $$

where we have used that due to |$\bar \mu _{kh} \ge 0$|⁠, it holds |$\langle P_{\beta } G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{\textbf{q}})),\bar \mu _{kh}\rangle \le \langle{\beta },\bar \mu _{kh}\rangle $|⁠. Using the complementarity condition (5.5b), it holds |$\langle{\beta } - G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu }_{kh}\rangle = 0$|⁠, hence we have

$$ \begin{align*}& \langle G_{\mathbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle \le \langle G_{\mathbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-P_{\beta}G_{\mathbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})),\bar{\mu}_{kh}\rangle. \end{align*} $$

Using (5.14), the triangle inequality and the fact that |$G_{\mathbf{w}}(\bar{{\textbf{u}}})\le{\beta }$|, we obtain

$$ \begin{align*} & \begin{aligned} \langle G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}})),\bar{\mu}_{kh}\rangle \le & \| G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-P_{\beta}G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))\|_{L^\infty(I)}\|\bar{\mu}_{kh}\|_{L^1(\bar I)}\\ \le & \| G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))-G_{\textbf{w}}(\bar{{\textbf{u}}})\|_{L^\infty(I)}\|\bar{\mu}_{kh}\|_{L^1(\bar I)}\\ &+\| P_{\beta} G_{\textbf{w}}(\bar{{\textbf{u}}})-P_{\beta}G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{{\textbf{q}}}))\|_{L^\infty(I)}\|\bar{\mu}_{kh}\|_{L^1(\bar I)}\\ \le & 2\|\textbf{w}\|_{L^2(\varOmega)}\| {\textbf{u}}(\bar{{\textbf{q}}})-{\textbf{u}}_{kh}(\bar{{\textbf{q}}})\|_{L^\infty(I ;L^2(\varOmega))}\|\bar{\mu}_{kh}\|_{L^1(\bar I)}\\ \le & C \ell_k^2(h^2+k)\|\bar{{\textbf{q}}}\|_{L^\infty(I;L^2(\varOmega))}\|\textbf{w}\|_{L^2(\varOmega)}\|\bar{\mu}_{kh}\|_{L^1(\bar I)}\\ \le & C \ell_k^2(h^2+k), \end{aligned} \end{align*} $$

where in the last two steps we used Corollary 4.5 and Lemma 5.7. Thus,

$$ \begin{align*} & I_2 \le C \ell_k^2(h^2+k)+(\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}. \end{align*} $$

Combining the estimates for |$I_{1}$| and |$I_{2}$| and using that

$$ \begin{align*} & \begin{aligned} (\bar{{\textbf{u}}}-{\textbf{u}}_d,{\textbf{u}}(\bar{{\textbf{q}}}_{kh})&-\bar{{\textbf{u}}})_{I \times \varOmega}+(\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}\\ =&(\bar{{\textbf{u}}}-{\textbf{u}}_d,\bar{{\textbf{u}}}_{kh}-\bar{{\textbf{u}}})_{I \times \varOmega}+(\bar{{\textbf{u}}}-{\textbf{u}}_d,{\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}\\ &+(\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}})_{I \times \varOmega}+(\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,\bar{{\textbf{u}}}-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}\\ =&-\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{kh}\rVert^2_{L^2(I\times \varOmega)}+(\bar{{\textbf{u}}}-{\textbf{u}}_d,{\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}+(\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}})_{I \times \varOmega} \end{aligned} \end{align*} $$

by using Corollary 4.7, we obtain

$$ \begin{align*} & \begin{aligned} \alpha & \lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{kh}\rVert^2_{L^2(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{kh}\rVert^2_{L^2(I\times \varOmega)} \\ &\le C\ell^2_k (k+h^2)+(\bar{{\textbf{u}}}-{\textbf{u}}_d,{\textbf{u}}(\bar{{\textbf{q}}}_{kh})-\bar{{\textbf{u}}}_{kh})_{I \times \varOmega}+(\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d,{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-\bar{{\textbf{u}}})_{I \times \varOmega}\\ &\le C\ell^2_k (k+h^2)+\|\bar{{\textbf{u}}}-{\textbf{u}}_d\|_{L^2(I\times \varOmega)}\|{\textbf{u}}(\bar{{\textbf{q}}}_{kh})-{\textbf{u}}_{kh}(\bar{{\textbf{q}}}_{kh})\|_{L^2(I\times \varOmega)}\\ &\quad +\|\bar{{\textbf{u}}}_{kh}-{\textbf{u}}_d\|_{L^2(I\times \varOmega)}\|{\textbf{u}}_{kh}(\bar{{\textbf{q}}})-{\textbf{u}}(\bar{{\textbf{q}}})\|_{L^2(I\times \varOmega)}\\ &\le C\ell^2_k (k+h^2)\left[1+\left(\|\bar{{\textbf{u}}}\|_{L^2(I\times \varOmega)}+\|\bar{{\textbf{u}}}_{kh}\|_{L^2(I\times \varOmega)}+\|{\textbf{u}}_d\|_{L^2(I\times \varOmega)}\right)\left(\|\bar{{\textbf{q}}}_{kh}\|_{L^2(I\times \varOmega)}+\|\bar{{\textbf{q}}}\|_{L^2(I\times \varOmega)}\right)\right]\\ &\le C\ell^2_k (k+h^2), \end{aligned} \end{align*} $$

where in the last step, we used the boundedness of |$\|\bar{{\textbf{u}}}_{kh}\|_{L^{2}(I\times \varOmega )}$|⁠, |$\|\bar{{\textbf{u}}}\|_{L^{2}(I\times \varOmega )}$|⁠, |$\|\bar{{\textbf{q}}}\|_{L^{2}(I\times \varOmega )}$|⁠, and |$\|\bar{{\textbf{q}}}_{kh}\|_{L^{2}(I\times \varOmega )}$| from Theorem 3.8 and Lemma 5.7.
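The rate stated in Theorem 5.8 follows from this bound on the squared errors by taking square roots. Spelled out, with a generic constant |$C$| that may change from one inequality to the next, the elementary inequalities |$a+b\le \sqrt{2}\,(a^{2}+b^{2})^{\frac{1}{2}}$| and |$(k+h^{2})^{\frac{1}{2}}\le k^{\frac{1}{2}}+h$| give

$$ \begin{align*} & \sqrt{\alpha}\lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{kh}\rVert_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{kh}\rVert_{L^{2}(I\times \varOmega)} \le \sqrt{2}\left(\alpha\lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{kh}\rVert^{2}_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{kh}\rVert^{2}_{L^{2}(I\times \varOmega)}\right)^{\frac{1}{2}} \le C\ell_{k}(k+h^{2})^{\frac{1}{2}} \le C\ell_{k}(k^{\frac{1}{2}}+h). \end{align*} $$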

6. Full discretization of the optimal control problem

We discretize the control by piecewise constant functions on the same partition as the fully discrete approximation of the state and adjoint variables. We set

$$ \begin{align}& \mathbf{Q}_{0} = \{ {\textbf{q}}\in L^{2}(I;L^{2}(\varOmega)^{d}):\ {\textbf{q}} \mid_{I_{m}\times K}\in \mathbb{P}_{0}(I_{m}; \mathbb{P}_{0}(K)^{d}), \ m=1,2,\dots,M, \ K\in{\mathcal{T}}_{h} \}.\end{align} $$

(6.1)

We also define the corresponding admissible set

$$ \begin{align*} & \mathbf{Q}_{0,ad}:= \mathbf{Q}_0\cap \mathbf{Q}_{ad}. \end{align*} $$

We introduce the projection |$\pi _{d}\colon L^{2}(I;L^{2}(\varOmega )^{d}) \to \mathbf{Q}_{0}$|⁠, defined by

$$ \begin{align}& \left({\textbf{q}} - \pi_{d} {\textbf{q}},\mathbf{r}\right)_{I\times\varOmega} = 0 \qquad \forall \mathbf{r} \in \mathbf{Q}_{0},\end{align} $$

(6.2)

which by definition is stable in |$L^{2}(I\times \varOmega )$|⁠, i.e., satisfies

$$ \begin{align}& \|\pi_{d} {\textbf{q}}\|_{L^{2}(I\times \varOmega)} \le \|{\textbf{q}}\|_{L^{2}(I\times \varOmega)}.\end{align} $$

(6.3)

Note that this projection satisfies the explicit formula

$$ \begin{align*}& \pi_{d} {\textbf{q}}|_{I_{m} \times K} = \frac{1}{k_{m} |K|}\int_{I_{m} \times K} {\textbf{q}}(t,x) \ \text{d}(t,x) \qquad \text{for all}\ m=1,...,M, \ K \in \mathcal T_{h}. \end{align*} $$

Hence it is straightforward to check that this |$L^{2}$| projection onto piecewise constants is stable in |$L^\infty (I \times \varOmega )^{d}$| and |$L^\infty (I;L^{2}(\varOmega )^{d})$| and there holds

$$ \begin{align}& \|\pi_{d} {\textbf{q}}\|_{L^\infty(I \times \varOmega)} \le \|{\textbf{q}}\|_{L^\infty(I \times \varOmega)} \quad \text{and} \quad \|\pi_{d} {\textbf{q}}\|_{L^\infty(I;L^{2}(\varOmega))} \le \|{\textbf{q}}\|_{L^\infty(I;L^{2}(\varOmega))}.\end{align} $$

(6.4)
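The stability properties (6.3) and (6.4) can be observed in a discrete analogue of the explicit formula for |$\pi _{d}$|. In the Python sketch below, a scalar function that is piecewise constant on a uniform fine space-time grid (rows for time, columns for a one-dimensional space variable) is averaged over uniform coarse cells. The grid layout, the function names and the sample data are assumptions made for illustration, not the setting of the paper.

```python
import math
from typing import List

Grid = List[List[float]]

def block_average(q: Grid, bt: int, bx: int) -> Grid:
    """Replace each bt-by-bx block of a uniform fine grid by its mean value."""
    nt, nx = len(q), len(q[0])
    if nt % bt or nx % bx:
        raise ValueError("block sizes must divide the grid dimensions")
    out = [[0.0] * nx for _ in range(nt)]
    for i0 in range(0, nt, bt):
        for j0 in range(0, nx, bx):
            cells = [(i, j) for i in range(i0, i0 + bt) for j in range(j0, j0 + bx)]
            mean = sum(q[i][j] for i, j in cells) / len(cells)
            for i, j in cells:
                out[i][j] = mean
    return out

def l2_norm(q: Grid) -> float:
    """Discrete L^2 norm on a uniform grid, up to the constant cell-size factor."""
    return math.sqrt(sum(x * x for row in q for x in row))

def max_norm(q: Grid) -> float:
    """Maximum absolute value over all fine cells."""
    return max(abs(x) for row in q for x in row)

# Hypothetical oscillating sample data on a 4-by-4 fine grid, averaged over 2-by-2 cells.
q = [[math.sin(3.0 * i) * math.cos(2.0 * j) for j in range(4)] for i in range(4)]
pq = block_average(q, 2, 2)
```

Averaging preserves the integral over each coarse cell and, by Jensen's inequality, does not increase either of the two norms, which mirrors the estimates above.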

Further, we have |$\pi _{d} (\mathbf{Q}_{ad}) \subset \mathbf{Q}_{0,ad}$|⁠. We can now formulate the fully discrete optimal control problem, which reads

$$ \begin{align}& \text{Minimize} \; J({\textbf{q}}_{\sigma},{\textbf{u}}_{\sigma}) = \frac{1}{2} \lVert{\textbf{u}}_{\sigma} - {\textbf{u}}_{d}\rVert^{2}_{L^{2}(I\times \varOmega)} + \frac{\alpha}{2} \lVert{\textbf{q}}_{\sigma}\rVert^{2}_{L^{2}(I\times \varOmega)}\end{align} $$

(6.5a)

subject to |$({\textbf{q}}_{\sigma },{\textbf{u}}_{\sigma }) \in \mathbf{Q}_{0,ad}\times X^{0}_{k}(\mathbf{V}_{h})$|⁠, satisfying

$$ \begin{align}& B({\textbf{u}}_{\sigma},{\textbf{v}}_{kh}) = ( {\textbf{q}}_{\sigma}, {\textbf{v}}_{kh})_{I\times\varOmega} \quad \text{for all}\quad{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})\end{align} $$

(6.5b)

and

$$ \begin{align}& G_{\textbf{w}}({\textbf{u}}_{\sigma})\mid_{I_{m}}\le \beta\quad\text{for}\quad m=1,2,\dots,M.\end{align} $$

(6.5c)

The following lemma guarantees that also for the fully discrete optimal control problem, there exist feasible controls such that the associated fully discrete state strictly satisfies the state constraint.

Lemma 6.1.

Let Assumption 3 be satisfied and let |$h$| be sufficiently small. Then the projection |$\pi _{d} \tilde{\textbf{q}} \in \mathbf{Q}_{0,ad}$| of the Slater point |$\tilde{\textbf{q}} \in \mathbf{Q}_{ad}$| from Assumption 1 satisfies the following discrete Slater condition:

$$ \begin{align*}& G_{\textbf{w}}({\textbf{u}}_{kh}(\pi_{d} \tilde{\mathbf{q}})) < \beta{\qquad \text{for all}\ t \in \bar I}. \end{align*} $$

Proof.

From Lemma 5.2, we know that there exists |$\delta> 0$| such that |$G_{\textbf{w}}({\textbf{u}}_{kh}(\tilde{\mathbf{q}})) < \beta - \delta$|. As the discrete solution operator |$S_{kh}$| is linear and continuous as a mapping |$L^{2}(I;L^{2}(\varOmega )^{d}) \to L^\infty (I;L^{2}(\varOmega )^{d})$|, we have

$$ \begin{align*}& G_{\textbf{w}}({\textbf{u}}_{kh}(\pi_{d} \tilde{\textbf{q}})) = G_{\textbf{w}}({\textbf{u}}_{kh}(\pi_{d} \tilde{\textbf{q}} - \tilde{\textbf{q}})) + G_{\textbf{w}}({\textbf{u}}_{kh}(\tilde{\textbf{q}})) < C\|\pi_{d} \tilde{\textbf{q}} - \tilde{\textbf{q}}\|_{L^{2}(I\times \varOmega)} + \beta -\delta. \end{align*} $$

As Assumption 3 guarantees |$k \to 0$| as |$h \to 0$|⁠, it holds |$\|\pi _{d} \tilde{\textbf{q}} - \tilde{\textbf{q}}\|_{L^{2}(I; L^{2}(\varOmega ))} \to 0$| for |$h \to 0$|⁠. This implies that for |$h$| small enough, we have |$C\|\pi _{d} \tilde{\textbf{q}} - \tilde{\textbf{q}}\|_{L^{2}(I\times \varOmega )} \le \delta $| and as a consequence |$G_{\textbf{w}}({\textbf{u}}_{kh}(\pi _{d} \tilde{\textbf{q}})) < \beta $|⁠.

Theorem 6.2.

Let Assumption 3 be satisfied and let |$h$| be sufficiently small. Then there exists a unique solution |$(\bar{\textbf{q}}_\sigma , \bar{\textbf{u}}_\sigma )$| to the fully discrete optimal control problem (6.5a)–(6.5c).

Proof.

As Lemma 6.1 shows feasibility of |$(\pi _{d} \tilde{\textbf{q}},{\textbf{u}}_{kh}(\pi _{d} \tilde{\textbf{q}}))$| under the given assumptions, the existence proof follows the same steps as the one of Theorem 3.6 on the continuous level.

Similar to Section 5, we can rewrite the problem (6.5a)–(6.5c) in the reduced form

$$ \begin{align}& \text{Minimize} \ j_{kh}({\textbf{q}}_\sigma)=J({\textbf{q}}_\sigma,S_{kh}({\textbf{q}}_\sigma)) \quad \text{over} \quad{\textbf{q}}_\sigma\in \mathbf{Q}_{0,ad}\quad \text{subject to}\quad \mathcal{G}_{kh}({\textbf{q}}_\sigma)\in \mathcal{K}_{kh}.\end{align} $$

(6.6)

Note that compared to the variationally discretized optimal control problem (5.3), only the control space has changed.

Theorem 6.3 (First order optimality conditions for discretized controls).

A control |$\bar{{\textbf{q}}}_{\sigma }\in \mathbf{Q}_{0,ad}$| and the associated state |$\bar{{\textbf{u}}}_{\sigma }={\textbf{u}}_{kh}(\bar{{\textbf{q}}}_{\sigma }) \in X^{0}_{k}(\mathbf{V}_{h})$| is the optimal solution to the problem (6.5a)–(6.5c) if and only if there exists an adjoint state |$\bar{{\textbf{z}}}_{\sigma } \in X^{0}_{k}(\mathbf{V}_{h})$| and a Lagrange multiplier |$\bar{\mu }_{\sigma }\in L^{1}(I)$| that satisfy: Discrete state equation

$$ \begin{align}& B(\bar{{\textbf{u}}}_{\sigma},{\textbf{v}}_{kh}) = ( \bar{{\textbf{q}}}_{\sigma}, {\textbf{v}}_{kh})_{I\times\varOmega}, \quad \forall{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h});\end{align} $$

(6.7a)

Discrete state constraint and complementarity conditions

$$ \begin{align}& G_{\textbf{w}}(\bar{\textbf{u}}_\sigma)|_{I_{m}} \le \beta, \ m=1,...,M, \quad \bar{\mu}_{\sigma}\geq 0 \quad\text{and}\quad \langle \bar{\mu}_{\sigma},\beta-G_{\textbf{w}}(\bar{\textbf{u}}_\sigma) \rangle=0;\end{align} $$

(6.7b)

Discrete adjoint equation

$$ \begin{align}& B({\textbf{v}}_{kh},\bar{{\textbf{z}}}_{\sigma}) = ( \bar{{\textbf{u}}}_{\sigma}-{\textbf{u}}_{d}+{\bar{\mu}_{\sigma}}{\textbf{w}}, {\textbf{v}}_{kh})_{I\times\varOmega} \quad \forall{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h});\end{align} $$

(6.7c)

Discrete variational inequality

$$ \begin{align}& \left(\alpha\bar{{\textbf{q}}}_{\sigma}+\bar{{\textbf{z}}}_{\sigma}, \delta{\textbf{q}}-\bar{{\textbf{q}}}_{\sigma}\right)_{I\times \varOmega}\geq 0 \quad \forall \delta{\textbf{q}}\in \mathbf{Q}_{0,ad} \ \Leftrightarrow \ \bar{{\textbf{q}}}_{\sigma} =P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}\left(-\frac{1}{\alpha}\bar{{\textbf{z}}}_{\sigma}\right).\end{align} $$

(6.7d)

Furthermore, there exist |$\bar \mu _{\sigma }^{m} \in{\mathbb{R}}_{\ge 0}$|⁠, |$m=1,2,\dots ,M$|⁠, such that the discrete Lagrange multiplier |$\bar{\mu }_{\sigma }\in L^{1}(I)$| satisfies the expression

$$ \begin{align}& \bar \mu_{\sigma} = \sum_{m=1}^{M} \frac{\bar \mu_{\sigma}^{m}}{k_{m}} \chi_{I_{m}},\end{align} $$

(6.8)

where |$\chi _{I_{m}}$| denotes the characteristic function of the interval |$I_{m}$|⁠.

Proof.

The proof is almost identical to the proof of Theorem 5.4.

Again, due to |$\bar \mu _\sigma \ge 0$|, it holds |$\|\bar \mu _\sigma \|_{L^{1}(I)} = \|\bar \mu _\sigma \|_{(C(\bar I))^{*}} = \langle \bar \mu _\sigma , 1 \rangle = \sum _{m=1}^{M} \bar \mu _\sigma ^{m}$|.
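The last equality can be spelled out in one line. The following worked step uses only the representation (6.8) and assumes, as the notation of the time partition suggests, that |$k_{m}$| is the length of |$I_{m}$|:

$$ \begin{align*}& \langle \bar \mu_{\sigma}, 1 \rangle = \int_{I} \bar \mu_{\sigma}(t)\ \text{d}t = \sum_{m=1}^{M} \frac{\bar \mu_{\sigma}^{m}}{k_{m}} \int_{I_{m}} \text{d}t = \sum_{m=1}^{M} \bar \mu_{\sigma}^{m}. \end{align*} $$

In the same way, for any function |$g$| that is constant on each |$I_{m}$|, such as |$G_{\textbf{w}}(\bar{\textbf{u}}_\sigma )$|, the pairing reduces to the finite sum |$\langle \bar \mu _\sigma , g \rangle = \sum _{m=1}^{M} \bar \mu _\sigma ^{m}\, g|_{I_{m}}$|.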

Lemma 6.4.

Let Assumption 3 be satisfied and let |$h$| be small enough. Then there exists a constant |$C>0$| independent of |$k$| and |$h$|, such that the fully discrete optimal control |$\bar{{\textbf{q}}}_{\sigma }\in \mathbf{Q}_{0,ad}$|, solving (6.5a)–(6.5c), together with its corresponding state |$\bar{{\textbf{u}}}_{\sigma }\in X^{0}_{k}(\mathbf{V}_{h})$| and corresponding multiplier |$\bar{\mu }_{\sigma }\in L^{1}(I)$| satisfies the bound

$$ \begin{align*} & \|\bar{{\textbf{q}}}_{\sigma}\|_{L^\infty(I;L^2(\varOmega))} +\|\bar{{\textbf{u}}}_{\sigma}\|_{L^2(I\times \varOmega)}+\|\bar{\mu}_{\sigma}\|_{(C(\bar{I}))^*}\le C. \end{align*} $$

Proof.

By Lemma 6.1 under the given assumptions, the fully discrete control |$\pi _{d} \tilde{\textbf{q}}$| is feasible, and thus it holds

$$ \begin{align*} & \begin{aligned} J(\bar{{\textbf{q}}}_{\sigma},\bar{{\textbf{u}}}_{\sigma}) &\le J(\pi_d\tilde{{\textbf{q}}},{\textbf{u}}_{kh}(\pi_d\tilde{{\textbf{q}}}))\\ &=\frac{1}{2} \lVert{\textbf{u}}_{kh}(\pi_d\tilde{{\textbf{q}}}) - {\textbf{u}}_d\rVert^2_{L^2(I\times \varOmega)} + \frac{\alpha}{2} \lVert\pi_d\tilde{{\textbf{q}}}\rVert^2_{L^2(I\times \varOmega)}\\ &\le \lVert{\textbf{u}}_{kh}(\pi_d \tilde{{\textbf{q}}})\rVert^2_{L^2(I\times \varOmega)} + \lVert{\textbf{u}}_d\rVert^2_{L^2(I\times \varOmega)}+\frac{\alpha}{2}\lVert\tilde{{\textbf{q}}}\rVert^2_{L^2(I\times \varOmega)} \\ &\le C(T,\tilde{{\textbf{q}}},{\textbf{u}}_d), \end{aligned} \end{align*} $$

where in the last step, we have used the discrete stability result for |${\textbf{u}}_{kh}$| from Theorem 4.2 and the stability of |$\pi _{d}$| from (6.3). As a result we obtain |$ \|\bar{{\textbf{q}}}_{\sigma }\|_{L^{2}(I\times \varOmega )}+\|\bar{{\textbf{u}}}_{\sigma }\|_{L^{2}(I\times \varOmega )}\le C. $| The proof of |$\|\bar \mu _\sigma \|_{(C(\bar I))^{*}} \le C$| and |$\|\bar{\textbf{q}}_\sigma \|_{L^\infty (I;L^{2}(\varOmega ))} \le C$| is then accomplished by following the same steps as the proof of Lemma 5.7, making use of |$G_{\textbf{w}}({\textbf{u}}_{kh}({\pi _{d}{\textbf{p}}}))\le \beta -\frac{1}{4}\delta $| for |$h$| sufficiently small.

Theorem 6.5.

Let Assumption 3 be satisfied and let |$h$| be small enough. Let |$(\bar{\textbf{q}}_{kh},\bar{\textbf{u}}_{kh})$| and |$(\bar{\textbf{q}}_\sigma ,\bar{\textbf{u}}_\sigma )$| denote the optimal solutions of the variationally discretized optimal control problem (5.1a)–(5.1c) and the fully discretized optimal control problem (6.5a)–(6.5c). Then there exists a constant |$C>0$| such that it holds

$$ \begin{align*} & \alpha\lVert\bar{{\textbf{q}}}_{kh} - \bar{{\textbf{q}}}_{\sigma}\rVert^{2}_{L^{2}(I\times \varOmega)} +\lVert\bar{{\textbf{u}}}_{kh} - \bar{{\textbf{u}}}_{\sigma}\rVert^{2}_{L^{2}(I\times \varOmega)} \le Ch^{2}. \end{align*} $$

Proof.

Choosing |$\delta{\textbf{q}}=\bar{{\textbf{q}}}_{\sigma } \in \mathbf{Q}_{ad}$| in (5.5d) and |$\delta{\textbf{q}}=\pi _{d}\bar{{\textbf{q}}}_{kh} \in \mathbf{Q}_{0,ad}$| in (6.7d) results in

$$ \begin{align}& \left(\alpha\bar{{\textbf{q}}}_{kh}+\bar{{\textbf{z}}}_{kh}, \bar{{\textbf{q}}}_{\sigma}-\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}\geq 0\quad \text{and}\quad \left(-\alpha\bar{{\textbf{q}}}_{\sigma}-\bar{{\textbf{z}}}_{\sigma}, \bar{{\textbf{q}}}_{\sigma}-\pi_{d}\bar{{\textbf{q}}}_{kh}\right)_{I\times \varOmega}\geq 0.\end{align} $$

(6.9)

Adding these two inequalities, we obtain

$$ \begin{align}& \alpha\|\bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}}_\sigma\|^{2}_{L^{2}(I\times \varOmega)}\le (\bar{{\textbf{z}}}_{kh}- \bar{{\textbf{z}}}_{\sigma},\bar{{\textbf{q}}}_{\sigma}-\bar{{\textbf{q}}}_{kh})_{I\times \varOmega}+ (\alpha\bar{{\textbf{q}}}_{\sigma}+\bar{{\textbf{z}}}_{\sigma}, \pi_{d}\bar{{\textbf{q}}}_{kh}-\bar{{\textbf{q}}}_{kh})_{I\times \varOmega}:=I_{1}+I_{2}.\end{align} $$

(6.10)

We estimate the two terms separately.

Estimate for |$I_{1}$|. Using the discrete state equations (5.5a) and (6.7a) and the corresponding adjoint equations (5.5c) and (6.7c), respectively, we have

$$ \begin{align*} & \begin{aligned} I_1 &= B(\bar{{\textbf{u}}}_{\sigma}-\bar{{\textbf{u}}}_{kh},\bar{{\textbf{z}}}_{kh}- \bar{{\textbf{z}}}_{\sigma})\\ &= (\bar{{\textbf{u}}}_{\sigma}-\bar{{\textbf{u}}}_{kh},\bar{{\textbf{u}}}_{kh}-\bar{{\textbf{u}}}_{\sigma})_{I \times \varOmega}+\langle G_{\textbf{w}}(\bar{{\textbf{u}}}_{\sigma})-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle-\langle G_{\textbf{w}}(\bar{{\textbf{u}}}_{\sigma})-G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{\sigma}\rangle\\ &\le -\lVert\bar{{\textbf{u}}}_{kh} - \bar{{\textbf{u}}}_{\sigma}\rVert^2_{L^2(I\times \varOmega)}+\langle \beta -G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu}_{kh}\rangle+\langle \beta -G_{\textbf{w}}(\bar{{\textbf{u}}}_{\sigma}),\bar{\mu}_{\sigma}\rangle\\ &= -\lVert\bar{{\textbf{u}}}_{kh} - \bar{{\textbf{u}}}_{\sigma}\rVert^2_{L^2(I\times \varOmega)}. \end{aligned} \end{align*} $$

Estimate for |$I_{2}$|. By the Cauchy–Schwarz inequality and properties of the |$L^{2}$|-projection, we obtain

$$ \begin{align*}& \begin{aligned} I_{2} &= \left(\alpha \bar{\textbf{q}}_\sigma + \bar{\textbf{z}}_\sigma,\pi_{d}\bar{\textbf{q}}_{kh} - \bar{\textbf{q}}_{kh}\right)_{I\times\varOmega}\\ & = \left(\bar{\textbf{z}}_\sigma - \pi_{d} \bar{\textbf{z}}_\sigma,\pi_{d}\bar{\textbf{q}}_{kh} - \bar{\textbf{q}}_{kh}\right)_{I\times\varOmega}\\ & \le C h^{2} \|\nabla \bar{\textbf{z}}_\sigma\|_{L^{2}(I\times \varOmega)} \|\nabla \bar{\textbf{q}}_{kh}\|_{L^{2}(I\times \varOmega)}. \end{aligned} \end{align*} $$

Using that |$\| \nabla \bar{{\textbf{q}}}_{kh}\|_{L^{2}(I\times \varOmega )}\le \alpha ^{-1}\| \nabla \bar{{\textbf{z}}}_{kh}\|_{L^{2}(I\times \varOmega )}$|⁠, which holds due to the projection formula (5.5d) and the stability of |$P_{ad}$| in |$H^{1}(\varOmega )$|⁠, see Kinderlehrer & Stampacchia (2000, Theorem A.1), Attouch et al. (2014, Theorem 5.8.2), and the stability of solutions to the fully discrete dual problem, pointed out in Remark 4.3, gives

$$ \begin{align*} & I_2\le Ch^2\left(\|\bar{{\textbf{u}}}_{\sigma} - {\textbf{u}}_{d}\|_{L^2(I\times \varOmega)}+\|\bar{\mu}_{\sigma}\|_{L^1(I)}\|{\textbf{w}}\|_{L^2(\varOmega)}\right)\left(\|\bar{{\textbf{u}}}_{kh} - {\textbf{u}}_{d}\|_{L^2(I\times \varOmega)}+\|\bar{\mu}_{kh}\|_{L^1(I)}\|{\textbf{w}}\|_{L^2(\varOmega)}\right). \end{align*} $$

Now the boundedness of |$\|\bar{{\textbf{u}}}_{\sigma }\|_{L^{2}(I\times \varOmega )}$|, |$\|\bar{{\textbf{u}}}_{kh}\|_{L^{2}(I\times \varOmega )}$|, |$\|\bar{\mu }_{\sigma }\|_{L^{1}(I)}$|, and |$\|\bar{\mu }_{kh}\|_{L^{1}(I)}$| from Lemmas 5.7 and 6.4 finishes the proof.
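For the reader's convenience, the two steps in the estimate of |$I_{2}$| can be unpacked as follows. This is a sketch under two assumptions that are not restated in this section: that |$\pi _{d}$| is the |$L^{2}(I\times \varOmega )$|-orthogonal projection onto the linear discrete control space |$\mathbf{Q}_{0}$|, and that it satisfies a first order bound |$\|{\textbf{v}} - \pi _{d}{\textbf{v}}\|_{L^{2}(I\times \varOmega )} \le Ch\|\nabla{\textbf{v}}\|_{L^{2}(I\times \varOmega )}$| for functions that are piecewise constant in time on the given partition of |$I$|. Since |$\alpha \bar{\textbf{q}}_\sigma + \pi _{d}\bar{\textbf{z}}_\sigma \in \mathbf{Q}_{0}$| and |$\pi _{d}\bar{\textbf{q}}_{kh} - \bar{\textbf{q}}_{kh}$| is orthogonal to |$\mathbf{Q}_{0}$|, we then have

$$ \begin{align*}& \begin{aligned} \left(\alpha \bar{\textbf{q}}_\sigma + \pi_{d} \bar{\textbf{z}}_\sigma,\pi_{d}\bar{\textbf{q}}_{kh} - \bar{\textbf{q}}_{kh}\right)_{I\times\varOmega} &= 0,\\ \left(\bar{\textbf{z}}_\sigma - \pi_{d} \bar{\textbf{z}}_\sigma,\pi_{d}\bar{\textbf{q}}_{kh} - \bar{\textbf{q}}_{kh}\right)_{I\times\varOmega} &\le \|\bar{\textbf{z}}_\sigma - \pi_{d} \bar{\textbf{z}}_\sigma\|_{L^{2}(I\times \varOmega)} \|\pi_{d}\bar{\textbf{q}}_{kh} - \bar{\textbf{q}}_{kh}\|_{L^{2}(I\times \varOmega)}\\ &\le C h \|\nabla \bar{\textbf{z}}_\sigma\|_{L^{2}(I\times \varOmega)} \cdot C h \|\nabla \bar{\textbf{q}}_{kh}\|_{L^{2}(I\times \varOmega)}. \end{aligned} \end{align*} $$

The first identity explains why |$\alpha \bar{\textbf{q}}_\sigma + \bar{\textbf{z}}_\sigma $| may be replaced by |$\bar{\textbf{z}}_\sigma - \pi _{d}\bar{\textbf{z}}_\sigma $| in |$I_{2}$|, and the product of the two first order bounds yields the factor |$h^{2}$|.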

With this last error estimate, our main result now directly follows from Theorem 5.8 and Theorem 6.5.

Theorem 6.6 (Error estimate for the control).

Let Assumption 3 be satisfied and let |$h$| be small enough. Let |$\bar{\textbf{q}} \in \mathbf{Q}_{ad}$| and |$\bar{\textbf{q}}_\sigma \in \mathbf{Q}_{0,ad}$| be the solutions to the continuous and fully discrete optimal control problems (1.1a)–(1.1d) and (6.5a)–(6.5c), with associated states |$\bar{\textbf{u}}$| and |$\bar{\textbf{u}}_\sigma $|. Then there exists a constant |$C>0$| such that

$$ \begin{align*} & \sqrt{\alpha}\lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)} \le C \, \ell_{k} (k^{\frac{1}{2}}+h), \quad \ell_{k} = \ln{\frac{T}{k}}. \end{align*} $$
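The estimate can be traced back to the triangle inequality. The following sketch assumes that Theorem 5.8, whose statement is not repeated in this section, bounds the error of the variationally discretized problem by |$C \ell _{k}(k^{\frac{1}{2}}+h)$| in the same norms. Using |$a+b \le \sqrt{2}(a^{2}+b^{2})^{\frac{1}{2}}$| together with Theorem 6.5 for the second bracket gives

$$ \begin{align*}& \begin{aligned} \sqrt{\alpha}\lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)} &\le \left(\sqrt{\alpha}\lVert\bar{{\textbf{q}}} - \bar{{\textbf{q}}}_{kh}\rVert_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}} - \bar{{\textbf{u}}}_{kh}\rVert_{L^{2}(I\times \varOmega)}\right)\\ &\quad + \left(\sqrt{\alpha}\lVert\bar{{\textbf{q}}}_{kh} - \bar{{\textbf{q}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)}+\lVert\bar{{\textbf{u}}}_{kh} - \bar{{\textbf{u}}}_{\sigma}\rVert_{L^{2}(I\times \varOmega)}\right)\\ &\le C \, \ell_{k} (k^{\frac{1}{2}}+h) + C h. \end{aligned} \end{align*} $$

The term |$Ch$| is absorbed into the first one as soon as |$\ell _{k} \ge 1$|, i.e., for |$k \le T/e$|.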

Remark 6.7.

All of our main results presented in Theorems 5.8, 6.5 and 6.6 do not require any additional regularity assumptions on the optimal control |$\bar{\textbf{q}}$|⁠, but work precisely with the regularity that is obtained from the optimality conditions. Furthermore, the techniques used in proving these results allow us to avoid strong coupling conditions on |$k$| and |$h$|⁠. We only require the product |$\ell _{k} h$| to converge to zero, in order to obtain a Slater condition for the discrete problems, which can be guaranteed by a very mild coupling condition as in Assumption 3.

7. Improved regularity

Despite having no a priori smoothness regularity for |$\bar{{\textbf{z}}}$| and as a result for |$\bar{{\textbf{q}}}$|⁠, our main result shows almost |$k^{\frac{1}{2}}$| convergence rate for the error of the optimal control. Similarly to Meidner et al. (2011), we can use this result to establish improved regularity for the optimal control.

Theorem 7.1.

Let |$\bar{{\textbf{q}}}\in \mathbf{Q}_{ad}$| be the optimal solution to (1.1a)–(1.1d). Then,

$$ \begin{align*} & \bar{{\textbf{q}}}\in L^2(I;H^1(\varOmega)^d)\cap H^s(I;L^2(\varOmega)^d),\quad \forall s<\frac{1}{2}. \end{align*} $$

Additionally, if |${\textbf{q}}_{a},{\textbf{q}}_{b} \in{\mathbb{R}}^{d}$|⁠, i.e., are finite, it holds |$\bar{\textbf{q}} \in{L^\infty (I \times \varOmega )^{d}}$|⁠.

Proof.

The proof is identical to the proof of Theorem 7.1 in Meidner et al. (2011).

In light of Theorem 3.10 and the imbedding |$\text{BV}(I) \hookrightarrow H^{s}(I)$| for all |$s < \frac{1}{2}$|, this result is particularly interesting, as it holds without additional regularity assumptions on the weight |${\textbf{w}}$|. Note also that the regularity properties of Theorem 3.10 and its corollaries were not used in the derivation of our error estimates. It remains an open question how these results can be used to derive improved error estimates for the optimal control problem directly.

8. Numerical results

In the following, we present three numerical examples, studying the orders of convergence for the optimal control problem. We first present an example for smooth data. These results can be compared to the numerical example in Meidner et al. (2011), where the weight function |$w(x_{1},x_{2})=\sin{(\pi x_{1})}\sin{(\pi x_{2})}$| on the unit square was used. As this weight is in |$H^{1}_{0}(\varOmega )\cap H^{2}(\varOmega )$|, almost first order convergence in time was observed, which we observe again for the Stokes optimal control problem. In the second example, we consider data of lower regularity, leading to a reduced order of convergence. Lastly, we consider an example with simultaneous control and state constraints. Throughout this section, we will analyze the errors of the full discretization |$\bar{\textbf{q}} - \bar{\textbf{q}}_{\sigma }$|. For details regarding the implementation of a variational discretization in the presence of control constraints, we refer to Hinze (2005), Hinze et al. (2009, Chapter 3.2.5) and the references therein.

8.1 Solution algorithm

Equation (6.6) can be written in matrix notation as

$$ \begin{align}& \min_{{\textbf{q}}_\sigma \in{\mathbb{R}}^{N}} \frac{1}{2} (S_{kh} M_{kh}^{uq} {\textbf{q}}_\sigma - {\textbf{u}}_{d,kh})^{T} M_{kh}^{uu} (S_{kh} M_{kh}^{uq} {\textbf{q}}_\sigma -{\textbf{u}}_{d,kh}) + \frac{1}{2} \alpha{\textbf{q}}_\sigma^{T} M_{kh}^{qq} {\textbf{q}}_\sigma \qquad \text{s.t.} \quad W_{kh} S_{kh} {M_{kh}^{uq}} {\textbf{q}}_\sigma \le \beta,\end{align} $$

(8.1)

where |$M_{kh}^{uu}$| is the mass matrix of |$X^{0}_{k}(\mathbf{V}_{h})$|⁠, |$M_{kh}^{qq}$| is the mass matrix of |$\mathbf{Q}_{0}$|⁠, |$M_{kh}^{uq}$| is the mass matrix encoding inner products of |$X^{0}_{k}(\mathbf{V}_{h})$| and |$\mathbf{Q}_{0}$| functions, |${\textbf{u}}_{d,kh}$| is the |$L^{2}$| projection of |${\textbf{u}}_{d}$| onto |$X^{0}_{k}(\mathbf{V}_{h})$| and |$W_{kh}$| is the matrix that maps each |${\textbf{u}}_{kh}$| to the vector |$(G_{\textbf{w}}({\textbf{u}}_{kh})|_{I_{m}})_{m=1,..,M}$|⁠. Note that with a slight abuse of notation, we write |$S_{kh}$| for the control to state mapping as well as the matrix representing this mapping. The optimality conditions for this problem read

$$ \begin{align*} \alpha M_{kh}^{qq} {\textbf{q}}_\sigma + (M_{kh}^{uq})^{T} S_{kh}^{T} M_{kh}^{uu} (S_{kh} M_{kh}^{uq} {\textbf{q}}_\sigma - {\textbf{u}}_{d,kh}) + (M_{kh}^{uq})^{T} S_{kh}^{T} W_{kh}^{T} \mu_\sigma &= 0,\\ W_{kh} S_{kh} M_{kh}^{uq} {\textbf{q}}_\sigma \le \beta, \quad \mu_\sigma \ge 0, \quad \mu_\sigma^{T} (W_{kh} S_{kh} M_{kh}^{uq} {\textbf{q}}_\sigma - \beta) &=0. \end{align*} $$

We solve the above problem with a primal-dual-active-set strategy (PDAS), during which, for each iteration of the active set |$\mathcal A_{n} \subset \{1,\dots ,M\}$|⁠, we solve a symmetric saddle point system

$$ \begin{align*}& \begin{pmatrix} \alpha M_{kh}^{qq} + (M_{kh}^{uq})^{T} S_{kh}^{T} M_{kh}^{uu} S_{kh} M_{kh}^{uq} & (M_{kh}^{uq})^{T} S_{kh}^{T} W_{kh}^{T} O_{n}\\ O_{n}^{T} W_{kh} S_{kh} M_{kh}^{uq} & 0 \end{pmatrix} \begin{pmatrix} {\textbf{q}}_\sigma^{n} \\ \mu_\sigma^{n} \end{pmatrix} = \begin{pmatrix} (M_{kh}^{uq})^{T} S_{kh}^{T} M_{kh}^{uu} {\textbf{u}}_{d,kh}\\ \beta \cdot \mathbb{1}_{|\mathcal A_{n}|} \end{pmatrix}, \end{align*} $$

where |$O_{n} \in{\mathbb{R}}^{M\times |\mathcal A_{n}|}$| is the matrix satisfying |$O_{n}^{T} \mu _\sigma = (\mu _\sigma )_{\mathcal A_{n}}$|, i.e., |$O_{n}^{T}$| selects the active indices. We solve the linear system with MINRES, using the block diagonal preconditioner

$$ \begin{align*}& P_{kh} = \begin{pmatrix} \alpha M_{kh}^{qq} & 0 \\ 0 & - \frac{1}{\alpha} W_{kh}S_{kh}M_{kh}^{uq} (M_{kh}^{qq})^{-1} (M_{kh}^{uq})^{T} S_{kh}^{T} W_{kh}^{T} \end{pmatrix}. \end{align*} $$
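The active set iteration described above can be illustrated on a small dense problem. The following is a minimal sketch, not the implementation used for the experiments: the names `pdas`, `H`, `f`, `A` and the parameter `c` are hypothetical, with `H` standing for the reduced Hessian |$\alpha M_{kh}^{qq} + (M_{kh}^{uq})^{T} S_{kh}^{T} M_{kh}^{uu} S_{kh} M_{kh}^{uq}$|, `f` for the right-hand side |$(M_{kh}^{uq})^{T} S_{kh}^{T} M_{kh}^{uu} {\textbf{u}}_{d,kh}$| and `A` for |$W_{kh} S_{kh} M_{kh}^{uq}$|. A dense direct solve replaces the preconditioned MINRES iteration, and the active set prediction rule is the standard one for PDAS methods, which we assume here.

```python
import numpy as np

def pdas(H, f, A, beta, c=1.0, max_iter=50):
    """Primal-dual active set sketch for
       min 0.5*q^T H q - f^T q   subject to   A q <= beta  (componentwise)."""
    n, m = H.shape[0], A.shape[0]
    q = np.linalg.solve(H, f)          # unconstrained minimizer as initial guess
    mu = np.zeros(m)
    active_old = None
    for _ in range(max_iter):
        # Predict the active set from the current primal-dual pair.
        active = np.flatnonzero(mu + c * (A @ q - beta) > 0)
        if active_old is not None and np.array_equal(active, active_old):
            return q, mu, active       # active set repeated: KKT point found
        A_act = A[active, :]           # rows of A restricted to the active set
        k = active.size
        # Symmetric saddle point system [[H, A_act^T], [A_act, 0]].
        K = np.zeros((n + k, n + k))
        K[:n, :n] = H
        K[:n, n:] = A_act.T
        K[n:, :n] = A_act
        rhs = np.concatenate([f, np.full(k, beta)])
        sol = np.linalg.solve(K, rhs)  # dense solve; stands in for MINRES
        q = sol[:n]
        mu = np.zeros(m)
        mu[active] = sol[n:]           # multiplier vanishes on inactive indices
        active_old = active
    raise RuntimeError("PDAS did not converge within max_iter iterations")

# Tiny usage example: minimize 0.5*|q|^2 - 2*q_1 subject to q_1 <= 1.
q_opt, mu_opt, act = pdas(np.eye(2), np.array([2.0, 0.0]),
                          np.array([[1.0, 0.0]]), beta=1.0)
```

In the usage example the unconstrained minimizer violates the constraint, so the constraint becomes active and the multiplier is positive.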

Note that due to the choice of control discretization, |$M_{kh}^{qq}$| is a diagonal matrix, and if the partition of |$I$| is uniform and all mesh elements of |$\varOmega $| have the same volume, |$M_{kh}^{qq}$| is a multiple of the identity matrix. Further note that if the partition of |$I$| is uniform, the matrix |$S_{kh}^{T} W_{kh}^{T}$| has the structure

$$ \begin{align*}& \left(\begin{array}{ccccc} {\textbf{z}}_{M} & {\textbf{z}}_{M-1} &... & {\textbf{z}}_{2} & {\textbf{z}}_{1}\\ 0 & {\textbf{z}}_{M} &... & {\textbf{z}}_{3} & {\textbf{z}}_{2} \\ \vdots & \vdots & \ddots & \vdots & \vdots\\ 0 & 0 &... & {\textbf{z}}_{M} & {\textbf{z}}_{M-1}\\ 0 & 0 &... & 0 & {\textbf{z}}_{M} \end{array}\right) \in{\mathbb{R}}^{ M\dim(\mathbf{V}_{h}) \times M}, \end{align*} $$

where each vector |${\textbf{z}}_{m} \in{\mathbb{R}}^{\dim (\mathbf{V}_{h})}$|, |$m=1,...,M$|, corresponds to the degrees of freedom of |${\textbf{z}}_{kh}|_{I_{m}}$| for the solution |${\textbf{z}}_{kh}$| to

$$ \begin{align*}& B({\textbf{v}}_{kh},{\textbf{z}}_{kh}) = \left({\textbf{w}} \chi_{I_{M}},{\textbf{v}}_{kh}\right)_{I\times\varOmega} \quad \text{for all} \quad{\textbf{v}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h}). \end{align*} $$

Hence, to assemble this matrix, only one discrete adjoint problem has to be solved, and a decomposition of the preconditioning matrix |$P_{kh}$| can be computed in advance and reused in every iteration of the PDAS algorithm. The finite element problems were solved in FEniCS Version 2019 (Logg et al., 2012), using the MINI element in space.
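The block Toeplitz structure shown above means the whole matrix can be filled from the |$M$| vectors |${\textbf{z}}_{1},\dots ,{\textbf{z}}_{M}$| of a single adjoint solve. The following is a small sketch of that assembly step only. The function name `assemble_adjoint_toeplitz` and the array layout are assumptions made for illustration, and the adjoint solve itself is not shown.

```python
import numpy as np

def assemble_adjoint_toeplitz(Z):
    """Fill the (M*n) x M block upper triangular Toeplitz matrix.

    Z has shape (M, n); row m-1 holds the degrees of freedom z_m of the
    adjoint solution on the time interval I_m (n plays the role of dim(V_h)).
    Block (i, j) with j >= i (1-based) equals z_{M-(j-i)}; blocks below the
    diagonal are zero.
    """
    M, n = Z.shape
    T = np.zeros((M * n, M))
    for j in range(M):             # column: index of the constrained interval
        for i in range(j + 1):     # block row: only i <= j is nonzero
            T[i * n:(i + 1) * n, j] = Z[M - 1 - (j - i)]
    return T

# Usage with M = 3 intervals and n = 2 spatial degrees of freedom.
Z_demo = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])
T_demo = assemble_adjoint_toeplitz(Z_demo)
```

The last column of `T_demo` stacks |${\textbf{z}}_{1},{\textbf{z}}_{2},{\textbf{z}}_{3}$| from top to bottom, and every other column is a shifted copy of it.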

8.2 Example 1

For this example, we consider the setting |$\varOmega = (0,1)^{2}$|⁠, |$I = (0,1]$|⁠, choose the regularization parameter |$\alpha = 1$| and set the control constraints to |${\textbf{q}}_{a} = (- \infty ,-\infty )^{T}$|⁠, |${\textbf{q}}_{b} = (+ \infty ,+\infty )^{T}$|⁠. We construct an analytic test case by considering the functions

$$ \begin{align*}& \varphi(t):= \begin{cases}{} 48 t^{2} - 128 t^{3} & t \in [0,1/4),\\ 1 & t \in [1/4,3/4],\\ 48 (1-t)^{2} - 128 (1-t)^{3} & t \in (3/4,1], \end{cases} \quad{\textbf{y}} = \frac{64 \sqrt{2}}{5\sqrt{7}}\begin{pmatrix} -\sin(\pi x_{1})^{4} \cos(\pi x_{2}) \sin(\pi x_{2})^{3}\\ \cos(\pi x_{1}) \sin(\pi x_{1})^{3} \sin(\pi x_{2})^{4} \end{pmatrix}. \end{align*} $$

Here, |${\textbf{y}}$| has been constructed in such a way that |$\nabla \cdot{\textbf{y}} = 0$| and |$\|{\textbf{y}}\|_{L^{2}(\varOmega )}=1$|. It was obtained by considering the potential |$\rho (x_{1},x_{2}) = \left (\sin (\pi x_{1})\sin (\pi x_{2})\right )^{4}$| and normalizing, up to sign, the vector field |$(\partial _{x_{2}}\rho (x_{1},x_{2}),-\partial _{x_{1}}\rho (x_{1},x_{2}))^{T}$|. We choose |$\bar{{\textbf{u}}}=\varphi (t){\textbf{y}}(x_{1},x_{2})$|, |${\textbf{w}} = {\textbf{y}}$| and |$\beta \equiv 1$|. It is then straightforward to verify that |$G_{\textbf{w}}(\bar{{\textbf{u}}}) \le \beta $| for all |$t \in I$| and |$G_{\textbf{w}}(\bar{{\textbf{u}}}) = \beta $| if and only if |$t \in [1/4,3/4]$|. We thus choose the multiplier |$\bar \mu = 10^{3} \chi _{[1/4,3/4]}(t)$|, which by construction satisfies |$\bar \mu \ge 0$| and |$\langle G_{\textbf{w}}(\bar{\textbf{u}}) -\beta ,\bar \mu \rangle = 0$|. We then proceed by choosing |$\bar p = 0$| and as a consequence set |$\bar{\textbf{q}} = \partial _{t} \bar{\textbf{u}} - \varDelta \bar{\textbf{u}}$|. We obtain |$\bar{\textbf{z}} = - \bar{\textbf{q}}$| and with the choice |$\bar r = 0$| can fix |${\textbf{u}}_{d} = \bar{\textbf{u}} + \partial _{t} \bar{\textbf{z}} + \varDelta \bar{\textbf{z}} + \bar \mu{\textbf{w}}$|, in such a way that the constructed |$(\bar{\textbf{q}},\bar{\textbf{u}})$| satisfy the first order optimality conditions for this desired state. Note that |${\textbf{y}}$| has been chosen sufficiently smooth at the boundary, such that |$\varDelta{\textbf{y}}|_{\partial \varOmega } = 0$| and thus |$\bar{\textbf{q}}|_{\partial \varOmega } = \bar{\textbf{z}}|_{\partial \varOmega } = 0$|. The calculation of the analytic solution was verified using the SageMath software. We discretize this problem with a uniform triangulation of |$\varOmega $| and a uniform partition of |$I$|.
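The stated properties of |${\textbf{y}}$| and |$\varphi $| can also be checked numerically with a few lines of standard-library Python. This is an independent sanity check, not the SageMath verification mentioned above. The second component of |${\textbf{y}}$| is taken as |$\cos (\pi x_{1})\sin (\pi x_{1})^{3}\sin (\pi x_{2})^{4}$|, which is what the potential |$\rho $| yields. The function names, the midpoint quadrature and the finite-difference step are illustrative choices.

```python
import math

C = 64 * math.sqrt(2) / (5 * math.sqrt(7))  # normalization constant from the text

def y(x1, x2):
    """Vector field y at the point (x1, x2) of the unit square."""
    s1, c1 = math.sin(math.pi * x1), math.cos(math.pi * x1)
    s2, c2 = math.sin(math.pi * x2), math.cos(math.pi * x2)
    return (-C * s1**4 * c2 * s2**3, C * c1 * s1**3 * s2**4)

def phi(t):
    """Time profile of the optimal state; equal to 1 on [1/4, 3/4]."""
    if t < 0.25:
        return 48 * t**2 - 128 * t**3
    if t <= 0.75:
        return 1.0
    return 48 * (1 - t)**2 - 128 * (1 - t)**3

def norm_squared(n=64):
    """Squared L2 norm of y over the unit square via the midpoint rule."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            a, b = y((i + 0.5) * h, (j + 0.5) * h)
            total += (a * a + b * b) * h * h
    return total

def divergence(x1, x2, eps=1e-5):
    """Central finite-difference approximation of div y at (x1, x2)."""
    d1 = (y(x1 + eps, x2)[0] - y(x1 - eps, x2)[0]) / (2 * eps)
    d2 = (y(x1, x2 + eps)[1] - y(x1, x2 - eps)[1]) / (2 * eps)
    return d1 + d2
```

Since |${\textbf{w}} = {\textbf{y}}$| has unit norm, |$G_{\textbf{w}}(\bar{\textbf{u}})(t) = \varphi (t)$|, so checking |$\varphi \le 1$| with equality on |$[1/4,3/4]$| confirms the claimed active set.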
To get more insight into the observed orders of convergence, for a sequence of discretization levels |$\{\sigma _{l}\} = \{(h_{l},k_{l})\}$|⁠, we report the empirical orders of convergence determined by

$$ \begin{align*} (\text{EOC}_{h})_{l} &:= \dfrac{\log(\|\bar{\textbf{q}} - \bar{\textbf{q}}_{\sigma_{l}}\|_{L^{2}(I\times \varOmega)}) - \log(\|\bar{\textbf{q}} - \bar{\textbf{q}}_{\sigma_{l-1}}\|_{L^{2}(I\times \varOmega)})}{\log(h_{l}) - \log(h_{l-1})}\\ (\text{EOC}_{k})_{l} &:= \dfrac{\log(\|\bar{\textbf{q}} - \bar{\textbf{q}}_{\sigma_{l}}\|_{L^{2}(I\times \varOmega)}) - \log(\|\bar{\textbf{q}} - \bar{\textbf{q}}_{\sigma_{l-1}}\|_{L^{2}(I\times \varOmega)})}{\log(k_{l}) - \log(k_{l-1})} \end{align*} $$
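Both formulas have the same form and differ only in the discretization parameter used in the denominator. A minimal sketch of the computation, with the hypothetical helper name `empirical_orders` and made-up error values that serve only to illustrate the call:

```python
import math

def empirical_orders(errors, params):
    """Empirical orders of convergence between consecutive levels.

    errors[l] is the measured error on level l and params[l] the matching
    discretization parameter (h_l for EOC_h, k_l for EOC_k).
    """
    return [
        (math.log(errors[l]) - math.log(errors[l - 1]))
        / (math.log(params[l]) - math.log(params[l - 1]))
        for l in range(1, len(errors))
    ]

# Illustrative data only: an error that halves whenever h is halved.
eoc_h = empirical_orders([0.4, 0.2, 0.1], [0.2, 0.1, 0.05])
```

An error behaving like |$h^{1}$| gives values close to 1, while an error behaving like |$k^{\frac{1}{2}}$| gives values close to 0.5.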

Figure 1(a) displays convergence with respect to the spatial discretization parameter for fixed |$k$|. The theoretical convergence order of 1 can be observed. Figure 1(b) depicts the convergence results for the time discretization parameter for different values of |$h$|. Note that for this comparison, we have chosen discretization levels in time such that the two boundary points of the active set |$[1/4,3/4]$| are midpoints of the discrete subintervals, in order to exclude superconvergence effects. Due to the structure of the discretization, the number of degrees of freedom grows rather quickly, which is why very fine discretizations are expensive. Figure 1(b) shows that if the spatial discretization parameter is chosen too large, the stagnation phase sets in rather early. If one only observes the coarse discretizations, the observed order of convergence is skewed, which led to an estimated order of convergence of 0.85 reported in Christof & Vexler (2021). As the derivation of the present example with an analytic reference solution requires the involved functions to have some precise regularity, we next propose an example with a desired state that has low regularity in time. In this case a numerical reference solution is needed to measure the errors.

Fig. 1. Numerical observation of the error |$\|\bar{\textbf{q}} - \bar{\textbf{q}}_\sigma \|_{L^{2}(I\times \varOmega )}$| for Example 1. (a) Convergence with respect to |$h$| for |$k=10^{-3}$|. (b) Convergence with respect to |$k$| for different values of |$h$|.

8.3 Example 2

In this example, we consider an optimal control problem where we specify rough data |${\textbf{u}}_{d}$| and |${\textbf{w}}$|, while keeping the domain |$\varOmega = (0,1)^{2}$|, |$I = (0,1]$| and |${\textbf{q}}_{a} = (-\infty ,-\infty )^{T}$|, |${\textbf{q}}_{b} = (+\infty ,+\infty )^{T}$|. We specify, see also Fig. 2(a),

$$ \begin{align*}& {\textbf{u}}_{d} = 5 \cdot 10^{4} \varphi(t) \begin{pmatrix} \sin(\pi x_{2}) \cos(\pi x_{2}) \sin(\pi x_{1})^{2}\\ -\sin(\pi x_{1}) \cos(\pi x_{1}) \sin(\pi x_{2})^{2} \end{pmatrix} \quad \text{where} \ \varphi(t) = \begin{cases}{} \sqrt{t-\frac{1}{5}}\cdot (\frac{2}{5}-t) & \text{if}\ t \in [ \frac{1}{5}, \frac{2}{5} ],\\ -\sqrt{t-\frac{3}{5}}\cdot (\frac{4}{5}-t) & \text{if}\ t \in [ \frac{3}{5}, \frac{4}{5} ],\\ 0 & \text{else.} \end{cases} \end{align*} $$

The weight in the state constraint is given by the following function, see also Fig. 2(b),

$$ \begin{align*}& {\textbf{w}} = \chi_{\{(x_{1}-0.5)^{2}+(x_{2}-0.5)^{2} \le 0.125\}}(x_{1},x_{2}) \cdot \begin{pmatrix} x_{2}-0.5\\ -x_{1}+0.5 \end{pmatrix} \end{align*} $$

and the scalar constraint is given by |$\beta \equiv 1$|. Moreover, we consider the regularization parameter |$\alpha = 10^{-4}$|. As no analytical optimal solution is known in this case, we estimate the errors using a numerical reference solution on a fine grid. To this end, we discretize the problem on 960 time intervals and 128 subdivisions of |$\varOmega $| in each direction. Note that due to this evaluation of the errors, we expect a faster convergence than theoretically derived, as on the finest discretization level, the error would be equal to |$0$|. In Fig. 3(a) we can again observe order 1 convergence with respect to |$h$| for fixed time discretization. In Fig. 3(b) we observe that the convergence with respect to |$k$| exhibits a rate of about |$0.6$|, which is much closer to the analytically derived |$0.5$| than the rate observed for the smooth Example 1.
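The data of this example are simple to evaluate pointwise, which makes the two sources of roughness easy to see: |$\varphi $| has square-root kinks at |$t = 1/5$| and |$t = 3/5$|, where its derivative is unbounded, and |${\textbf{w}}$| jumps across the circle of radius |$\sqrt{0.125}$| around the midpoint of |$\varOmega $|. The following sketch transcribes the formulas above; the function names `phi`, `u_d` and `w` are chosen here for illustration.

```python
import math

def phi(t):
    """Time profile of the desired state in Example 2."""
    if 0.2 <= t <= 0.4:
        return math.sqrt(t - 0.2) * (0.4 - t)
    if 0.6 <= t <= 0.8:
        return -math.sqrt(t - 0.6) * (0.8 - t)
    return 0.0

def u_d(t, x1, x2):
    """Desired state u_d(t, x) as a pair of components."""
    s1, c1 = math.sin(math.pi * x1), math.cos(math.pi * x1)
    s2, c2 = math.sin(math.pi * x2), math.cos(math.pi * x2)
    scale = 5e4 * phi(t)
    return (scale * s2 * c2 * s1**2, -scale * s1 * c1 * s2**2)

def w(x1, x2):
    """Weight of the state constraint: a rotation field cut off on a disc."""
    if (x1 - 0.5)**2 + (x2 - 0.5)**2 <= 0.125:
        return (x2 - 0.5, -x1 + 0.5)
    return (0.0, 0.0)
```

For instance, `w(0.5, 0.8)` lies inside the disc and is nonzero, while `w(0.9, 0.9)` lies outside and vanishes.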

Fig. 2. Data for Example 2. (a) Time function |$\varphi (t)$|. (b) Weight function |${\textbf{w}}$|. (c) Satisfaction of the state constraint.

Fig. 3. Numerical observation of the error |$\|\bar{\textbf{q}} - \bar{\textbf{q}}_\sigma \|_{L^{2}(I\times \varOmega )}$| for Example 2. (a) Convergence with respect to |$h$| for |$k=960^{-1}$|. (b) Convergence with respect to |$k$| for |$h=2^{-7}$|.

8.4 Example 3

In the previous examples, no control constraints were present. To highlight that the derived error estimates are indeed not influenced by the control constraints, we augment the example of Section 8.2 by a control constraint |$\bar{\textbf{q}} \le 200\cdot \mathbb{1}$|. The remaining choices of |$\varOmega , {\textbf{w}}, {\textbf{u}}_{d}, \alpha $| and |$\beta $| are kept the same. Due to the presence of the control constraint, an analytic solution |$\bar{\textbf{q}}$| to the optimal control problem is not known, and we again compare to a fine-grid solution, computed with |$k=960^{-1}$| and |$h=2^{-7}$|. The numerically observed orders of convergence in |$k$| and |$h$|, as well as the active set of the control constraint for a fixed point in time, are shown in Fig. 4. It has to be noted that while the discrete problem has a similar structure to the one without control constraint and can be solved by the same PDAS algorithm, the resulting saddle point systems can be much larger for large active sets. Efficient preconditioners for such problems are needed to study the performance of this algorithm for finer discretizations.

Fig. 4. Left: orders of convergence for an example with control constraints. Right: active sets of the discrete optimal control for |$h=2^{-5}$|, |$k=30^{-1}$| at |$t=1/3$| (grey: inactive, blue: constraint in |$x_{1}$| direction active, red: constraint in |$x_{2}$| direction active).

References

Appell, Banas & Merentes Díaz, N. J. (2014) Bounded Variation and Around. Series in Nonlinear Analysis and Applications, vol. Berlin, Boston: De Gruyter.
