Abstract

In this paper, we consider a state constrained optimal control problem governed by the transient Stokes equations. The state constraint is given by an |$L^{2}$| functional in space, which is required to fulfill a pointwise bound in time. The discretization scheme for the Stokes equations consists of inf-sup stable finite elements in space and a discontinuous Galerkin method in time, for which we have recently established best approximation type error estimates. Using these error estimates, for the discrete control problem we derive error estimates and as a by-product we show an improved regularity for the optimal control. We complement our theoretical analysis with numerical results.

1. Introduction

In this paper, we consider the following optimal control problem

(1.1a)

subject to the state equation

(1.1b)

control constraints

(1.1c)

and state constraints

(1.1d)

Here we assume that |$\varOmega \subset{\mathbb{R}}^{d}$|⁠, |$d=2,3$|⁠, is a convex polygonal or polyhedral domain and |$I= (0,T]$| is a bounded time interval. In the objective function, |${\textbf{u}}_{d} \in L^{2}(I\times \varOmega )^{d}$| represents the desired state and |$\alpha> 0$| is the regularization parameter. The control constraints are given by the constant vectors |${\textbf{q}}_{a},{\textbf{q}}_{b}\in{({\mathbb{R}} \cup \{\pm \infty \})^{d}}$| and satisfy |${\textbf{q}}_{a} < {\textbf{q}}_{b}$|⁠. In the state constraint, the constant scalar |$\beta $| satisfies |$\beta> 0$| and |$\mathbf{w}(x)$| is a given function in |$L^{2}(\varOmega )^{d}$|⁠. Note that for ease of presentation, we consider an optimal control problem with homogeneous initial data |${\textbf{u}}(0) = {\textbf{0}}$|⁠, while all results also extend to the inhomogeneous case |${\textbf{u}}(0) = {\textbf{u}}_{0}$|⁠. The main result of this paper states that the error between the optimal control |$\bar{{\textbf{q}}}$| for the continuous problem and the optimal solution |$\bar{{\textbf{q}}}_{\sigma }$| of the discretized problem satisfies

and is presented in Theorem 6.6. A similar optimal control problem subject to the heat equation was considered in Meidner et al. (2011), where a comparable error estimate was derived. The authors of Ludovici & Wollner (2015); Ludovici et al. (2018) discuss error estimates for parabolic problems with purely time-dependent controls and impose constraints on spatial averages of either function values or gradients of the state at every point in time. Error estimates for state constrained parabolic problems, with state constraints applied pointwise in time and space, can be found in Deckelnick & Hinze (2011); Gong & Hinze (2013); Christof & Vexler (2021).

The optimal control of flow phenomena subject to state constraints is a very active research topic, and there have been numerous contributions to the field, see, e.g., De Los Reyes & Kunisch (2006); De Los Reyes & Griesse (2008); De Los Reyes & Yousept (2009) for optimal control of the stationary Navier–Stokes equations and Fattorini & Sritharan (1998); Wang (2002a,b); Wang & Wang (2003); Liu (2010) for the transient Navier–Stokes equations, subject to general state constraints of the form |${\textbf{u}} \in \mathcal C$|⁠. Note that the above references only contain the analysis of the continuous problems and some numerical results, but no derivation of error estimates. In De Los Reyes et al. (2008), error estimates for an optimal control problem subject to the stationary Stokes equations with pointwise state constraints are shown. Let us also specifically mention John & Wachsmuth (2009), where an optimal control problem of the stationary Navier–Stokes equations was considered, and a constraint was put onto the drag functional |$\int _{\partial \varOmega } (\partial _{n} {\textbf{u}} - p \mathbf{n}) \mathbf{e}_{d} \ \text{d}s$|⁠, for some given direction of interest specified by the unit vector |$\mathbf{e}_{d}$|⁠. The setting of our work, constraining an |$L^{2}(\varOmega )$| functional pointwise in time, can be seen as a step towards discussing transient problems with drag/lift constraints at every point in time. The rest of the paper is structured as follows. In Section 2, we introduce the notation and present some analysis of the transient Stokes problem used in this paper. We then proceed to discuss the optimal control problem, including wellposedness and optimality conditions in Section 3. Depending on the regularity of available data, we discuss regularity and structural properties of the optimal solution. We introduce the discretization of the transient Stokes problem in Section 4 and recollect some important error estimates. This allows us to discuss the discrete formulations of the optimal control problem, where first in Section 5 we present the analysis and error estimates for a problem with variational discretization, cf., Hinze (2005); Deckelnick & Hinze (2011), where only the state equation is discretized, but the control is not. Following up this section, we discretize the control by piecewise constant functions in space and time, and present the analysis of the fully discrete problem in Section 6, which contains the main result of this work, Theorem 6.6. We conclude our work by using the derived error estimates to obtain improved regularity for the optimal control in Section 7 and presenting numerical results in Section 8.

2. Notation and preliminary results

We will use the standard notation for the Lebegue and Sobolev spaces over the spatial domain |$\varOmega $|⁠. The pressure space is

Throughout the paper, vector valued quantities and spaces will be indicated by boldface letters. We denote for a Banach space |$X$| and |$1 \le p \le \infty $|⁠, by |$L^{p}(I;X)$| the Bochner space of |$X$|-valued functions over |$I$|⁠, whose |$X$|-norm is |$p$|-integrable w.r.t time. If |$X$| is reflexive and |$1 \le p < \infty $|⁠, there holds the following isomorphism, see Hytönen et al. (2016, Corollary 1.3.22)

(2.1)

Note that the range includes the value |$p=1$| and not |$p= \infty $|⁠. The dual space of |$C(\bar I;L^{2}(\varOmega )^{d})$| is isomorphic to the space of regular |$L^{2}(\varOmega )^{d}$|-valued Borel measures, and we denote it by |$\mathcal M(\bar I;L^{2}(\varOmega )^{d}) \cong (C(\bar I;{L^{2}(\varOmega )^{d}}))^{*}$|⁠. Similarly for scalar regular Borel measures of |$\bar I$|⁠, we use the notation |$\mathcal M(\bar I) \cong (C(\bar I))^{*}$|⁠. To denote the vector valued spaces of divergence free functions, having various levels of regularity, we use the following notation:

where by |${\textbf{v}}\cdot \mathbf{n}$| we denote the generalized normal trace. We denote by |$\mathbf{V}^{*}$| the topological dual space of |$\mathbf{V}$|⁠, and define

We denote by |$\left (\cdot ,\cdot \right )_{\varOmega }$| and |$\left (\cdot ,\cdot \right )_{I\times \varOmega }$| the inner products of |$L^{2}(\varOmega )^{d}$| and |$L^{2}(I;L^{2}(\varOmega )^{d})$|⁠, respectively, and by |$\left \langle \cdot ,\cdot \right \rangle _{I\times \varOmega }$| the duality pairing between |$L^{2}(I;\mathbf{V})$| and |$L^{2}(I;\mathbf{V}^{*})$|⁠. We introduce the Stokes operator |$A$|⁠, defined by

The |$H^{2}$| regularity results of Kellogg & Osborn (1976); Dauge (1989) show that |$D(A) = \mathbf{V}_{2}$|⁠. As |$A$| is a positive, selfadjoint operator, fractional powers |$A^{s}$| are well defined. Of special importance is |$A^{\frac{1}{2}}$| which is an isometric isomorphism

as it holds |$\|A^{\frac{1}{2}} {\textbf{u}}\|_{L^{2}(\varOmega )}^{2} = \left (A {\textbf{u}},{\textbf{u}}\right )_{\varOmega } = \left (\nabla{\textbf{u}},\nabla{\textbf{u}}\right )_{\varOmega } = \|\nabla{\textbf{u}}\|_{L^{2}(\varOmega )}^{2}$|⁠. For the proof of |$D(A^{\frac{1}{2}}) = \mathbf{V}$|⁠, see (Sohr, 2001, Ch. III, Lemma 2.2.1). By its definition, we can extend |$A$| to an operator (denoted by the same symbol) |$A \colon \mathbf{V} \to \mathbf{V}^{*}$| yielding another isometric isomorphism between those spaces. Lastly, as

there holds the isometric isomorphism

(2.2)

Using the above defined function spaces, the weak formulation of the state equation (1.1b) for a given |${\textbf{q}}\in L^{1}(I;L^{2}(\varOmega )^{d}) + L^{2}(I;\mathbf{V}^{*})$| reads as follows: Find |${\textbf{u}}\in{\mathbf{X}}$| such that

(2.3)

For the above weak formulation, there holds the following result.

 

Theorem 2.1.
For |${\textbf{q}} \in L^{1}(I;{L^{2}(\varOmega )^{d}}) + L^{2}(I;\mathbf{V}^{*})$|⁠, there exists a unique solution |${\textbf{u}} \in L^{2}(I;\mathbf{V}) \cap C(\bar I;\mathbf{V}_{0})$| solving (2.3) and the following estimate holds:
If |${\textbf{q}} \in L^{2}(I;{L^{2}(\varOmega )^{d}})$|⁠, then |${\textbf{u}} \in L^{2}(I;{\mathbf{V}_{2}}) \cap H^{1}(I;{\mathbf{V}_{0}}) \hookrightarrow C(\bar I;\mathbf{V})$|⁠, and there holds

 

Proof.

The first part of this theorem is proven in Temam (1977, Chapter III, Theorem 1.1) and the remark on page 179 therein. The |$H^{2}$| regularity part can be shown as in Temam (1977, Chapter III, Proposition 1.2) using the |$H^{2}$| regularity result for the stationary Stokes equations in convex polygonal/polyhedral domains, proven in Dauge (1989, Theorems 5.5, 6.3) see also Kellogg & Osborn (1976, Theorem 2).

It is a classical result that the Stokes operator in the Hilbert space setting exhibits maximal parabolic regularity, i.e.

(2.4)

see Behringer et al. (2023, Proposition 2.6), holding on fairly general domains, e.g. Lipschitz domains. If |$\varOmega $| is convex, the |$H^{2}$| regularity results further imply that |${\textbf{u}} \in L^{p}(I;\mathbf{V}_{2})$|⁠. Most often maximal parabolic regularity is treated in |$L^{2}$| or |$L^{q}$| setting in space, but it can also be extended to settings of weaker spacial regularity. The property (2.2) combined with Auscher et al. (2015, Lemma 11.4) yields that maximal parabolic regularity also holds in |$\mathbf{V}^{*}$|⁠, i.e.

(2.5)

If the right-hand side |${\textbf{q}}$| is regular enough, there exists an associated pressure to the weak solution of (2.3), formulated in divergence free spaces. The regularity of the pressure depends on the regularity of the right-hand side and the velocity component of the solution. There holds the following result, see Behringer et al. (2023, Theorem 2.10, Corollary 2.11).

 

Proposition 2.2.
Let |${\textbf{q}} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$1< s < \infty $| and let |${\textbf{u}} \in L^{2}(I;\mathbf{V}) \cap{C(\bar I;\mathbf{V}_{0})}$| be the weak solution to (2.3). Then there exists a unique |$p \in L^{s}(I;L^{2}_{0}(\varOmega ))$| such that
(2.6)
which is to be understood as an identity in |$L^{s}(I;H^{-1}(\varOmega )^{d})$|⁠. On convex domains |$\varOmega $|⁠, the pressure satisfies |$p \in L^{s}(I;H^{1}(\varOmega ))$|⁠.

3. Continuous optimal control problem

We now introduce the control to state mapping.

 

Theorem 3.1.

Let |$S\colon{{\textbf{q}}}\mapsto{{\textbf{u}}}$| denote the solution operator for the state equation (2.3). Then |$S$| is a bounded linear operator between the following spaces:

  • |$S\colon L^{2}(I;\mathbf{V}^{*})\to \mathbf{W} \hookrightarrow C(\bar{I};L^{2}(\varOmega )^{d})$|

  • |$S\colon L^{1}(I;\mathbf{V}_{0})\to C(\bar{I};L^{2}(\varOmega )^{d})$|

  • |$S\colon L^\infty (I;L^{2}(\varOmega )^{d})\to W^{1,s}(I;L^{2}(\varOmega )^{d})\cap L^{s}(I;H^{2}(\varOmega )^{d})$|⁠, |$1 \le s<\infty $|⁠.

 

Proof.

The first two claims are the direct consequences of Theorem 2.1. Note, that the regularity |$\partial _{t} {\textbf{u}} \in L^{2}(I;\mathbf{V}^{*})$| can only be obtained by bootstrapping if the right-hand side is in |$L^{2}(I;\mathbf{V}^{*})$|⁠. The third claim is obtained by using the maximal parabolic regularity and |$H^{2}$| regularity of the stationary Stokes problem on convex polygonal/polyhedral domains, see Dauge (1989, Theorem 5.5, Theorem 6.3).

To abbreviate the notation, we will frequently use |${\textbf{u}}({\textbf{q}}):= S({\textbf{q}})$|⁠. Since |$S$| is a bounded linear operator between the spaces introduced in the previous theorem, it is Fréchet differentiable, and its directional derivative in direction |${\mathbf{\delta }\mathbf{q}}$| satisfies

i.e. is independent of |${\textbf{q}}$|⁠.

 

Remark 3.2.

The operator |$S$| is linear and coincides with its Frechét derivative, due to our choice to work with homogeneous initial data in the state equation (1.1b). The results presented in this work however also hold true in the inhomogeneous initial data case. In this case, the assumption |$\beta>0$|⁠, on the parameter |$\beta $| of the state constraint, then has to be generalized to |$\beta> \left ({\textbf{u}}_{0},{\textbf{w}}\right )_{\varOmega }$|⁠.

Let us next give a characterization of the adjoint operator |$S^{*}$|⁠. Theorem 2.1, together with Boyer & Fabrie (2013, Proposition V.1.3), yields that for any |${\textbf{q}} \in{\mathbf{Y}}$|⁠, there exists a unique solution |${\textbf{u}} \in \mathbf{X}$| such that

(3.1)

and the two formulations (2.3) and (3.1) are equivalent. The solution operator |$S$| is the inverse of the operator |$T\colon \mathbf{X} \to \mathbf{Y}$|⁠, |$\langle T {\textbf{u}},{\textbf{v}}\rangle _{\mathbf{Y}\times \mathbf{Y}^{*}} = a({\textbf{u}},{\textbf{v}})$|⁠. As |$T$| is invertible, so is |$T^{*}$| and its inverse is precisely |$S^{*}$|⁠. With this construction, the adjoint operator |$S^{*}: \mathbf{g} \mapsto{\textbf{z}}$| corresponds to the following weak formulation: given |$\mathbf{g} \in \mathbf{X}^{*}$|⁠, find |${\textbf{z}} \in{\mathbf{Y}^{*}}$| satisfying

(3.2)

We now discuss the regularity of the adjoint operator.

 

Corollary 3.3.
Let |$S^{*}$| denote the adjoint operator to the solution operator introduced in Theorem 3.1. Then it satisfies

 

Proof.

This is a direct consequence of the definition of the adjoint operator, Theorem 3.1, and the isomorphism |$(L^{1}(I;\mathbf{V}_{0}))^{*} \cong L^\infty (I;\mathbf{V}_{0})$|⁠, due to (2.1).

Due to the linearity of the adjoint operator, it again holds

For convenience, for the weight |$\mathbf{w}\in L^{2}(\varOmega )^{d}$|⁠, we define the functional |$G_{\mathbf{w}}\colon L^{2}(\varOmega )^{d}\to \mathbb{R}$| by

(3.3)

For time-dependent functions |${\textbf{v}} \colon I\to L^{2}(\varOmega )^{d}$|⁠, the application of |$G_{\mathbf{w}}$| is defined by

Using the functional |$G_{\mathbf{w}}$|⁠, the state constraint (1.1d) can be compactly rewritten as

(3.4)

 

Remark 3.4.

Due to the continuous embedding |$\mathbf{W}\hookrightarrow C(\bar{I};L^{2}(\varOmega )^{d})$|⁠, we have |$G_{\mathbf{w}}({\textbf{v}})(\cdot )\in C(\bar{I})$| for any |${\textbf{v}}\in \mathbf{W}$|⁠. Thus we can understand |$G_{\mathbf{w}}$| as a linear, continuous operator from |$\mathbf{W}$| to |$C(\bar I)$|⁠.

To write the optimal control problem (1.1a)–(1.1d) into reduced form, we define |$\mathcal{G}:=G_{\mathbf{w}}\circ S$| and the closed convex cone |$\mathcal{K}\subset C(\bar{I})$| by

Using the above definitions, the reduced form reads

(3.5)

where the admissible set is given by

We define the projection operator onto the feasible set by

which we understand componentwise and pointwise for every |$(t,x) \in I \times \varOmega $|⁠. Throughout the paper, we will work under the following assumption.

 

Assumption 1 (Slater condition).

There exists |$\tilde{{\textbf{q}}}\in \mathbf{Q}_{ad}$| such that |$G_{\mathbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))<\beta $| for all |$t \in \bar I$|⁠, where |${\textbf{u}}(\tilde{{\textbf{q}}})$| is the solution of the weak transient Stokes problem (2.3) for this particular control |$\tilde{{\textbf{q}}}$|⁠.

 

Remark 3.5.

As the homogeneous initial data in our setting necessitate the choice |$\beta> 0$|⁠, if the control constraints |${\textbf{q}}_{a}, {\textbf{q}}_{b}$| admit the control |$\tilde{\textbf{q}} \equiv{\textbf{0}}$|⁠, the existence of such a Slater point is immediately given. In that case, the unique solution to the state equation with right-hand side |$\tilde{\textbf{q}} \equiv{\textbf{0}}$| is |${\textbf{u}}(\tilde{\textbf{q}}) \equiv{\textbf{0}}$| which trivially satisfies |$G_{\textbf{w}}({\textbf{u}}(\tilde{\textbf{q}})) = 0 < \beta $|⁠.

 

Theorem 3.6.

Under Assumption 1, there exists a unique optimal control |$\bar{\textbf{q}} \in L^{2}(I;L^{2}(\varOmega )^{d})$| with unique associated state |$\bar{\textbf{u}}$|⁠, solving the optimal control problem (1.1a)–(1.1d).

 

Proof.
The Assumption 1 yields the existence of a feasible |$\tilde{\textbf{q}}$|⁠, such that the associated state |$\tilde{\textbf{u}} = {\textbf{u}}(\tilde{\textbf{q}})$| satisfies the state constraint. Let us define
and let |$\{{\textbf{q}}_{n}\}$| and |$\{{\textbf{u}}_{n}:= {\textbf{u}}({\textbf{q}}_{n})\}$| denote sequences of feasible controls with associated states, such that |$J({\textbf{q}}_{n},{\textbf{u}}_{n}) \to \mathcal J$| as |$n\to \infty $|⁠. As it holds |$J({\textbf{q}}_{n},{\textbf{u}}_{n}) \le{\mathcal{J}+1}$| for large enough |$n$|⁠, there holds a bound |$\|{\textbf{q}}_{n}\|_{L^{2}(I\times \varOmega )} \le C$| for all |$n$|⁠. From Theorem 3.1, we obtain |$\|{\textbf{u}}_{n}\|_{L^{2}(I;H^{2}(\varOmega ))} + \|{\textbf{u}}_{n}\|_{H^{1}(I;L^{2}(\varOmega ))} \le C$|⁠. We can thus take a subsequence, denoted by the same index, such that
These allow us to pass to the limit in the weak form of the state equation, showing that |$\hat{\textbf{u}} = {\textbf{u}}(\hat{\textbf{q}})$|⁠. Furthermore, as |$L^{2}(I;H^{2}(\varOmega )^{d})\cap H^{1}(I;L^{2}(\varOmega )^{d}) \hookrightarrow C({\bar I};H^{s}(\varOmega )^{d})$| compactly, for |$s<1$|⁠, see (Simon  1986, Corollary 8), by taking another subsequence, we obtain |${\textbf{u}}_{n} \to \hat{\textbf{u}}$| in |$C({\bar I};L^{2}(\varOmega )^{d})$|⁠. As |$G_{\textbf{w}}({\textbf{u}}_{n}) \le \beta $| for all |$t \in I$|⁠, this shows |$G_{\textbf{w}}(\hat{\textbf{u}}) \le \beta $| for all |$t \in I$|⁠. Lastly, it holds due to the lower semicontinuity of the norms
which shows that |$\bar{\textbf{q}} = \hat{\textbf{q}}$| is indeed a minimizer with associated state |$\bar{\textbf{u}} = \hat{\textbf{u}}$|⁠. Using uniform convexity of the squared |$L^{2}$| norms and linearity of the state equation gives the uniqueness.

 

Theorem 3.7 (First order optimality system).
Let Assumption 1 be fulfilled. Then a control |$\bar{{\textbf{q}}}\in \mathbf{Q}_{ad}$| with associated state |$\bar{{\textbf{u}}}={\textbf{u}}(\bar{{\textbf{q}}})$| is the optimal solution to the problem (1.1a)–(1.1d) if and only if and there exists an adjoint state |$\bar{{\textbf{z}}}\in L^{2}(I;\mathbf{V})\cap L^\infty (I;\mathbf{V}_{0})$| and a Lagrange multiplier |$\bar{\mu }\in (C(\bar I))^{*}$| that satisfy: State equation
(3.6a)
State constraint and complementarity conditions
(3.6b)
Adjoint equation
(3.6c)
Variational inequality
(3.6d)
The state (3.6a) and adjoint (3.6c) equations should be understood in the weak sense (2.3) and (3.2), respectively.

 

Proof.
We derive the optimality conditions from the reduced problem (3.5). By the generalized KKT condition (cf. Casas (1993, Theorem 5.2)) and the Slater condition from Assumption 1, i.e., the existence of |$\tilde{{\textbf{q}}}\in \mathbf{Q}_{ad}$| such that |$\mathcal{G}(\tilde{{\textbf{q}}})\in \operatorname{int}\mathcal{K}$|⁠, the optimality of |$\bar{{\textbf{q}}}$| is equivalent to the existence of a Lagrange multiplier |$\bar{\mu }\in C(\bar{I})^{*}$| and the adjoint state
(3.7)
satisfying
By definition and linearity of the involved operators, we can write |$\mathcal{G}^{\prime}(\bar{\textbf{q}})^{*} = S^{*} \circ G_{\mathbf w}^{*}$|⁠, where |$G_{\mathbf w}^{*}\colon C(\bar I)^{*} \to C(\bar I; L^{2}(\varOmega )^{d})^{*}$|⁠, |$G_{\mathbf w}^{*}(\mu ) = \mu{\textbf{w}}$|⁠. Thus it holds |$\bar{\textbf{z}} = S^{*}(\bar{\textbf{u}} - {\textbf{u}}_{d} + \bar \mu{\textbf{w}})$|⁠, which gives the proposed regularity of |$\bar{\textbf{z}}$| as a consequence of Corollary 3.3. To complete the proof, we point to the following equivalence:
which is exactly (3.6b).

From the first order optimality system (3.6a)–(3.6b) we can derive the following regularity results.

 

Theorem 3.8.
Let |$\bar{{\textbf{q}}}$| denote the optimal control for the problem (1.1a). Then the following regularity holds:
If additionally, |${\textbf{q}}_{a},{\textbf{q}}_{b} \in{\mathbb{R}}^{d}$|⁠, i.e. are finite, it holds |$\bar{{\textbf{q}}}\in{L^\infty (I \times \varOmega )^{d}}$|⁠.

 

Proof.
Since |$\mathbf{w}\in L^{2}(\varOmega )^{d}$| and |$\bar{\mu }\in (C(\bar{I}))^{*}$|⁠, we have that |$\bar{\mu }{\textbf{w}}\in ( C(\bar{I};L^{2}(\varOmega )^{d}))^{*}$|⁠. By Corollary 3.3, it holds |$\bar{\textbf{z}} = S^{*}(\bar{\textbf{u}} - {\textbf{u}}_{d} + \bar \mu{\textbf{w}}) \in L^{2}(I;\mathbf{V})\cap L^\infty (I;L^{2}(\varOmega )^{d})$|⁠. Using that
and |${\textbf{q}}_{a}, {\textbf{q}}_{b}$| are constant, we have the theorem.

The above result shows the regularity available for the optimal control |$\bar{\textbf{q}}$| without any additional assumptions. If one assumes higher regularity of the data, especially of the weight |${\textbf{w}}$| of the state constraint, we can show improved regularity, and in some cases even a structural result, that the Lagrange multiplier |$\bar \mu $| contains no Dirac contributions. We first treat a general adjoint problem with measure valued right-hand side in Theorem 3.9, before showing regularity of the optimal variables in Theorem 3.10 and the corollaries thereafter.

 

Theorem 3.9.
Let |$\mu \in C(\bar I)^{*}$| and |${\textbf{w}} \in L^{2}(\varOmega )^{d}$| be given, and let |${\textbf{z}} \in L^{2}(I;\mathbf{V}) \cap L^\infty (I;\mathbf{V}_{0})$| be the weak solution to
(3.8)
Then, if additionally |${\textbf{w}} \in \mathbf{V}$|⁠, it holds |${\textbf{z}} \in L^{s}(I;\mathbf{V}) \cap \text{BV}(I;\mathbf{V}^{*})$| for all |$1 \le s < \infty $|⁠. Moreover, if |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, it holds |${\textbf{z}} \in L^{s}(I;\mathbf{V}_{2}) \cap \text{BV}(I;\mathbf{V}_{0})$| for all |$1 \le s < \infty $|⁠.

 

Proof.
Throughout this proof, let |${\textbf{w}}$| satisfy at least |${\textbf{w}} \in \mathbf{V}$|⁠. Using Appell et al. (2014, Theorem 4.31), there exists a normalized function of bounded variation |$\tilde \mu \in \text{NBV}(\bar I)$|⁠, such that the application of |$\mu \in C(\bar I)^{*}$| to any |$\xi \in C(\bar I)$| can be expressed as a Riemann–Stieltjes integral
Slightly modifying Appell et al. (2014, Definition 1.2), we can normalize |$\tilde \mu $| such that it is of bounded variation, right continuous and satisfies |$\tilde \mu (T) = 0$|⁠. If |$\xi \in C^{1}(\bar I)$|⁠, due to Appell et al. (2014, Proposition 4.24, Theorem 4.17), there holds
where the last integral can be understood equivalently in the Riemann or Lebesgue sense. Working with the Lebesque integral allows us to pass to the limit |$C^{1}(\bar I) \ni \xi _{n} \to \xi \in W^{1,1}(I) \hookrightarrow C(\bar I)$|⁠, showing that
As due to the definition of |$\mathbf{X}$|⁠, it holds |$\partial _{t} {\textbf{v}} \in L^{1}(I;\mathbf{V}_{0}) + L^{2}(I;\mathbf{V}^{*})$| and |${\textbf{v}}(0) = {\textbf{0}}$| for all |${\textbf{v}} \in \mathbf{X}$|⁠, and since |${\textbf{w}} \in \mathbf{V}$|⁠, we have |$\left \langle{\textbf{v}},{\textbf{w}}\right \rangle _{\varOmega } \in W^{1,1}(I)$|⁠. Thus we obtain
(3.9)
Let us define |$\hat{\textbf{z}} = {\textbf{z}} + \tilde \mu{\textbf{w}}$|⁠. Then adding (3.8), (3.9) and the identity |$\left (\nabla{\textbf{v}}, \tilde \mu \nabla{\textbf{w}}\right )_{I\times \varOmega } = \left \langle \tilde \mu A {\textbf{w}},{\textbf{v}}\right \rangle _{I\times \varOmega }$| yields
As |$\tilde \mu \in L^\infty (I)$| and |$\tilde \mu (T) < \infty $|⁠, |$\hat{\textbf{z}}$| satisfies a backwards in time Stokes equation with right-hand side |$\tilde \mu A {\textbf{w}}$|⁠. Depending on the regularity of |${\textbf{w}}$|⁠, we obtain the following:
Case 1:  |${\textbf{w}} \in \mathbf{V}$|⁠. Here |$\tilde \mu{\textbf{w}} \in \text{BV}(I;\mathbf{V}) \hookrightarrow L^\infty (I;\mathbf{V})$| and thus |$\tilde \mu A {\textbf{w}} \in L^\infty (I;\mathbf{V}^{*})$|⁠. Using the maximal parabolic regularity in |$\mathbf{V}^{*}$| (2.5), this yields
Considering the special cases |$s=1$| and |$s=2$|⁠, we obtain
As a consequence |$ {\textbf{z}} = \hat{\textbf{z}} - \tilde \mu{\textbf{w}} \in L^{s}(I;\mathbf{V}) \cap \text{BV}(I;\mathbf{V}^{*})$| for any |$1 \le s < \infty $|⁠.
Case 2:  |${\textbf{w}} \in \mathbf{V}_{2}$|⁠. Here |$\tilde \mu{\textbf{w}} \in \text{BV}(I;\mathbf{V}_{2}) \hookrightarrow L^\infty (I;\mathbf{V}_{2})$| and thus |$\tilde \mu A {\textbf{w}} \in L^\infty (I;\mathbf{V}_{0})$|⁠. Using the maximal parabolic regularity in |$\mathbf{V}_{0}$| (2.4), this yields
Considering the special cases |$s=1$| and |$s=2$|⁠, we obtain
As a consequence |$ {\textbf{z}} = \hat{\textbf{z}} - \tilde \mu{\textbf{w}} \in L^{s}(I;\mathbf{V}_{2}) \cap \text{BV}(I;\mathbf{V}_{0})$| for any |$1 \le s < \infty $|⁠.

Since in the above proof we only have shown continuity of |$\hat{\textbf{z}}$|⁠, but |$\tilde \mu $| in general can have jumps as it is a |$\text{BV}$| function, without further information, we cannot deduce continuity of the whole solution |${\textbf{z}}$|⁠. For the adjoint equation (3.6c) this translates to the question of continuity in time of |$\bar{\textbf{z}}$| and by (3.6d) also of |$\bar{\textbf{q}}$|⁠. If in addition to the arguments of the previous theorem, we exploit the information from the optimality system, it turns out, that we can at least show continuity in time of |$\bar{\textbf{q}}$|⁠. As the adjoint state in the optimality system has another term |${\textbf{u}} - {\textbf{u}}_{d}$| on the right-hand side, depending on the available regularity of |${\textbf{u}}_{d}$| in time, we might lose some of the time regularity that we just derived.

 

Theorem 3.10.

Let |$(\bar{\textbf{q}},\bar{\textbf{u}},\bar{\textbf{z}},\bar \mu )$| satisfy the first order necessary optimality conditions (3.6a)–(3.6d). Let additionally |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s \in [2,\infty )$| and |${\textbf{w}} \in \mathbf{V}$|⁠. Then |$\bar{\textbf{z}} \in L^{s}(I;\mathbf{V}) \cap \text{BV}(I;\mathbf{V}^{*})$| and |$\bar{\textbf{q}} \in L^{s}(I;H^{1}(\varOmega )^{d}) \cap C(\bar I;L^{2}(\varOmega )^{d})$|⁠. If further |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, then |$\bar{\textbf{z}} \in L^{s}(I;\mathbf{V}_{2}) \cap \text{BV}(I;\mathbf{V}_{0})$| and |$\bar{\textbf{q}} \in L^{s}(I;W^{1,\infty }(\varOmega )^{d}) \cap C(\bar I;H^{1}(\varOmega )^{d})$|⁠.

 

Proof.
As in the proof of the previous theorem, we introduce the |$\text{NBV}$| function |$\tilde \mu $| satisfying |$\left \langle \xi ,\bar \mu \right \rangle _{I} = - \left (\tilde \mu ,\partial _{t} \xi \right )_{I} - \tilde \mu (0)\xi (0)$| for all |$\xi \in W^{1,1}(I)$|⁠, and introduce |$\hat{\textbf{z}} = \bar{\textbf{z}} + \tilde \mu{\textbf{w}}$|⁠. This now satisfies
The regularity |$\bar{\textbf{u}} \in L^\infty (I;\mathbf{V}_{0})$|⁠, the assumption |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| and the previous arguments again yield that
As in Theorem 3.9, this shows the |$\text{BV}$| regularity in time of |$\bar{\textbf{z}}$|⁠. To show the regularity of |$\bar{\textbf{q}}$|⁠, we shall make use of the optimality conditions. The optimal control satisfies
(3.10)
The available regularity of |$\bar{\textbf{z}}$| immediately gives the claimed |$L^{s}$| regularity in time of |$\bar{\textbf{q}}$|⁠. Note that due to the application of |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}$|⁠, |$\bar{\textbf{q}}$| is in general not divergence free anymore and exhibits at most |$W^{1,\infty }(\varOmega )$| regularity in space, even if |$\bar{\textbf{z}}$| is smoother. We now turn towards showing continuity in time of |$\bar{\textbf{q}}$|⁠. Let us denote by |$[\cdot ]$| the jump function w.r.t. time, i.e. |$[\varphi ](t) = \varphi (t+) - \varphi (t-)$|⁠. Since functions in |$\text{BV}(I;X)$| for any Banach space |$X$| possess well-defined onesided limits, see Heida et al. (2019, Propositions 2.1 & 2.2), we obtain from (3.10) that |${\textbf{z}}_\alpha (t \pm ) \in \mathbf{V}_{0}$| is well defined for any |$t \in I$|⁠. Hence, by continuity of |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}\colon L^{2}(\varOmega )^{d} \to L^{2}(\varOmega )^{d}$| we also have |$[\bar{\textbf{q}}] = [P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{z}}_\alpha )] \in L^{2}(\varOmega )^{d}$| for all |$t \in I$|⁠. By distinguishing the different cases, it is straightforward to verify that for any |${\textbf{v}} \in C(\bar I;L^{2}(\varOmega )^{d}) + \text{BV}(I;L^{2}(\varOmega )^{d})$| it holds
Applying this chain of inequalities to |${\textbf{v}} = {\textbf{z}}_\alpha $|⁠, using |$\bar{\textbf{q}} = P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}({\textbf{z}}_\alpha )$| and the continuity in time of |$\hat{\textbf{z}}$|⁠, we obtain
(3.11)
For all |$t \in I$| satisfying |$[\tilde \mu ](t) = 0$|⁠, this immediately shows |$[\bar{\textbf{q}}](t) = {\textbf{0}}$|⁠. Thus let us assume now that there exists |$t^{*} \in I$| with |$[\tilde \mu ](t^{*}) \neq 0$|⁠. As |$\bar \mu \ge 0$|⁠, there holds for |$t_{1} < t_{2}$| due to Appell et al. (2014, Theorem 4.17, Proposition 4.24)
i.e. |$\tilde \mu $| is monotonically increasing and therefore, it holds |$[\tilde \mu ] \ge 0$|⁠. Moreover it holds
(3.12)
Thus if |$[\tilde \mu ](t^{*}) \neq 0$|⁠, then |$[\tilde \mu ](t^{*})> 0$| and |$t^{*} \in \operatorname{supp}(\bar \mu )$|⁠, yielding that the state constraint is active in |$t^{*}$|⁠, i.e. |$G_{\textbf{w}}(\bar{\textbf{u}}(t^{*})) = \beta $|⁠. As |${\textbf{w}} \in \mathbf{V}$|⁠, due to (2.3), it holds
Since |$\bar{\textbf{q}} \in C(\bar I;{L^{2}(\varOmega )^{d}}) + \text{BV}(I;{H^{1}(\varOmega )^{d}})$| and |$\nabla \bar{\textbf{u}} \in C(\bar I; \mathbf{V})$|⁠, this identity shows that |$\partial _{t} G_{\textbf{w}}(\bar{\textbf{u}}) \in L^{2}(I)$| has representant with well defined onesided limits, which we shall denote by the same symbol, and it holds
(3.13)
where the last inequality holds due to (3.11) and |$[\tilde \mu ](t^{*})> 0$|⁠. Using |$G_{\textbf{w}}({\textbf{u}})(t) = \int _{0}^{t} \partial _{t} G_{\textbf{w}}({\textbf{u}})(s) \ \text{d} s$|⁠, it is straightforward to check that
i.e. the onesided limits of |$\partial _{t} G_{\textbf{w}}(\bar{\textbf{u}})$| correspond to the directional derivatives of |$G_{\textbf{w}}(\bar{\textbf{u}})$|⁠. As |$G_{\textbf{w}}(\bar{\textbf{u}})(t^{*}) = \beta $|⁠, |$t^{*}$| is a local maximum of |$G_{\textbf{w}}(\bar{\textbf{u}})$|⁠, yielding
Combining this with (3.13) yields |$\|[\bar{\textbf{q}}](t^{*})\|_{L^{2}(\varOmega )}^{2} = 0$|⁠, which shows |$\bar{\textbf{q}} \in C({\bar I};L^{2}(\varOmega )^{d})$|⁠. If additionally |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, with the same arguments we can show |$\bar{\textbf{q}} \in C({\bar I};H^{1}(\varOmega )^{d})$|⁠.

 

Remark 3.11.

In the proof of the above theorem, we can under some circumstances also show continuity in time of |$\bar{\textbf{z}}$|⁠. In fact, the only obstacle preventing this result is the application of the projection operator onto the control constraints |$P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]}(\cdot )$|⁠. It is in general possible that at some point in time |$\hat t$|⁠, |$\bar{\textbf{z}}$| is discontinuous. In such cases, the spacial support of the jump is contained in the set of points |$x \in \varOmega $|⁠, where |$-\frac{1}{\alpha } \bar{\textbf{z}}(\hat t \pm )$| lie outside the set of admissible controls |$[{\textbf{q}}_{a},{\textbf{q}}_{b}]$|⁠. As the jump of |$\bar{\textbf{z}}$| is a scalar multiple of |${\textbf{w}}$|⁠, this gives a compatibility condition on |$\operatorname{supp}({\textbf{w}})$| and the set where the control constraints are active. As the active set of the control constraints cannot be known a priori, such a condition on |$\bar{\textbf{q}}$| is not straightforward to verify. Instead we present in the next two corollaries two sets of assumptions on the data, where we can obtain improved regularity of |$\bar{\textbf{z}}$| and |$\bar \mu $|⁠.

 

Corollary 3.12.

Let |$(\bar{\textbf{q}},\bar{\textbf{u}},\bar{\textbf{z}},\bar \mu )$| satisfy the first order necessary optimality conditions (3.6a)–(3.6d). Let further |${\textbf{q}}_{a,i} = -\infty $|⁠, |${\textbf{q}}_{b,i} = + \infty $| for |$i=1,...,d$|⁠, and |${\textbf{w}} \in \mathbf{V}$|⁠. Then |$\bar{\textbf{z}} \in C({\bar I};\mathbf{V}_{0})$| and |$\bar \mu (\{t\}) = 0$| for all |$t \in I$|⁠, i.e. |$\bar \mu $| does not contain any Dirac contributions. If additionally |${\textbf{w}} \in \mathbf{V}_{2}$|⁠, then |$\bar{\textbf{z}} \in C({\bar I};\mathbf{V})$|⁠.

 

Proof.

This is a direct consequence of the results of Theorem 3.10, as in this case, the identity |$\bar{\textbf{q}} = - \frac{1}{\alpha } \bar{\textbf{z}}$| holds.

 

Corollary 3.13.
Let |$(\bar{\textbf{q}},\bar{\textbf{u}},\bar{\textbf{z}},\bar \mu )$| satisfy the first order necessary optimality conditions (3.6a)–(3.6d). Let further |${\textbf{q}}_{a} < \textbf{0} < {\textbf{q}}_{b}$|⁠, |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s> 2$|⁠, and |${\textbf{w}} \in \mathbf{V}_{2}$| satisfy
Then |$\bar{\textbf{z}} \in C({\bar I};\mathbf{V})$| and |$\bar \mu (\{t\}) = 0$| for all |$t \in I$|⁠, i.e. |$\bar \mu $| does not contain any Dirac contributions.

 

Proof.
As |${\textbf{u}}_{d} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s>2$|⁠, as in the beginning of the proof of Theorem 3.10, we obtain
where the last embedding holds due to Simon (1986, Corollary 8). As further |${\textbf{w}} \in \mathbf{V}_{2} \hookrightarrow C(\bar \varOmega )$|⁠, for any |$t^{*} \in I$| it thus holds |$\bar{\textbf{z}}(t^{*} +),\bar{\textbf{z}}(t^{*}-) \in C(\bar \varOmega )$|⁠. As |$\bar{\textbf{z}}(t^{*} +)(x^{*}) = \bar{\textbf{z}}(t^{*}-)(x^{*}) = {\textbf{0}}$|⁠, and |${\textbf{q}}_{a} < {\textbf{0}} < {\textbf{q}}_{b}$|⁠, there exists |$\delta> 0$|⁠, such that
Thus |$[\bar{\textbf{q}}](t^{*})|_{B_\delta (x^{*})} = [- \frac{1}{\alpha } \bar{\textbf{z}}](t^{*})|_{B_\delta (x^{*})}$|⁠. As |$\operatorname{supp}({\textbf{w}}) \cap B_\delta (x^{*}) \neq \emptyset $|⁠, there exists an open subset |$\omega \subset B_\delta (x^{*})$|⁠, such that |${\textbf{w}}(x) \neq 0$| for all |$x \in \omega $|⁠. In the end, we obtain, using the continuity in time of |$\bar{\textbf{q}}$| shown in Theorem 3.10:
As |$|{\textbf{w}}|> 0$| for all |$x \in \omega $|⁠, this shows |$[\tilde \mu ](t^{*}) = 0$|⁠. As |$t^{*}$| was arbitrary, this concludes the proof.

4. Finite element approximation of the state equation

4.1 Spatial discretization

Let |$\{{\mathcal{T}}_{h}\}$| be a family of triangulations of |$\bar \varOmega $|⁠, consisting of closed simplices, where we denote by |$h$| the maximum mesh-size. Let |$\mathbf{X}_{h} \subset H^{1}_{0}(\varOmega )^{d}$| and |$M_{h} \subset L^{2}_{0}(\varOmega )$| be a pair of compatible finite element spaces, i.e., there holds a uniform discrete inf-sup condition,

(4.1)

with a constant |$\gamma>0$| independent of |$h$|⁠. We shall work under the assumption that the discrete spaces have the following approximative properties.

 

Assumption 2.
There exist interpolation operators |$i_{h}\colon H^{2}(\varOmega )^{d} \cap H^{1}_{0}(\varOmega )^{d} \to \mathbf{X}_{h}$| and |$r_{h}\colon L^{2}(\varOmega ) \to M_{h}$|⁠, such that

This assumption is valid, for example, for Taylor–Hood and MINI finite elements on shape regular meshes, see Behringer et al. (2023, Assumption 7.2). We define the space of discretely divergence-free vector fields |$\mathbf{V}_{h}$| as

(4.2)

While on a computational level, especially for the examples presented in Section 8, we work with a discrete velocity-pressure formulation, in our theoretical analysis we will always use the equivalent formulation in discretely divergence free spaces, in order to shorten notation. One exception is the following stationary Stokes problem of finding for some given |$f \in H^{-1}(\varOmega )^{d}$| a solution |$({\textbf{u}},p) \in H^{1}_{0}(\varOmega )^{d} \times L^{2}_{0}(\varOmega )$| to

(4.3)

Its discrete approximation in velocity-pressure formulation reads: find |$({\textbf{u}}_{h},p_{h}) \in \mathbf{X}_{h} \times M_{h}$| satisfying

(4.4)

The above discrete system can be interpreted as a Stokes Ritz projection: given |$({\textbf{u}},p) \in H^{1}_{0}(\varOmega )^{d} \times L^{2}_{0}(\varOmega )$|⁠, find |$(R_{h}^{S}({\textbf{u}},p),R_{h}^{S,p}({\textbf{u}},p)):=({\textbf{u}}_{h},p_{h}) \in \mathbf{X}_{h} \times M_{h}$|⁠, satisfying

(4.5)

Note that, if |${\textbf{u}} \in \mathbf{V}$|⁠, then it holds |$R_{h}^{S}({\textbf{u}},p) \in \mathbf{V}_{h}$|⁠. Further, the Stokes Ritz projection satisfies the following stability, see Boffi et al. (2013, Theorem 5.2.1).

Let us recall the following error estimates for the stationary discrete Stokes problem: see Boffi et al. (2013, Theorem 5.25), John (2016, Theorems 4.21, 4.25, 4.28), Ern & Guermond (2021, Theorems 53.17 & 53.19) or Girault & Raviart (1986, Chapter II, Theorems 1.8 & 1.9).

 

Theorem 4.1.
Let |$({\textbf{u}},p)$| and |$({\textbf{u}}_{h},p_{h})$| denote the solutions to the continuous and discrete stationary Stokes problems (4.3) and (4.4), respectively. Then there holds the estimate
If |$\varOmega $| is convex, there further holds the estimate

Note, that while Boffi et al. (2013, Theorem 5.5.6) contains formal results, on how to derive error estimates also for |$p$| in a weaker norm, e.g. |$H^{-1}(\varOmega )$|⁠, the argument requires |$H^{2}$| regularity results for the compressible Stokes equations. As the corresponding results in Kellogg & Osborn (1976); Dauge (1989) require an additional decaying condition for the compressibility data, this makes derivation of weaker error estimates for the pressure complicated.

4.2 Temporal discretization: the discontinuous Galerkin method

In this section, we introduce the discontinuous Galerkin method for the time discretization of the transient Stokes equations, a similar method was considered, e.g., in Chrysafinos & Walkington (2010). For that, we partition |$I = (0,T]$| into subintervals |$I_{m} = (t_{m-1},t_{m}]$| of length |${k}_{m} = t_{m} - t_{m-1}$|⁠, where |$0= t_{0}<t_{1}<\dots <t_{M-1}<t_{M} = T$|⁠. The maximal and minimal time steps are denoted by |${k}=\max _{m} {k}_{m}$| and |${k}_{\min }=\min _{m}{k}_{m}$|⁠, respectively. The time partition fulfills the following assumptions:

  1. There are constants |$C,{\theta }>0$| independent of |${k}$| such that
    (4.6)
  2. There is a constant |$\kappa>0$| independent of |${k}$| such that for all

    |$m=1, 2, \dots , M-1$|  
    (4.7)
  3. It holds |${k} \leq \frac{T}{4}$|⁠.

For a given Banach space |$\mathcal{B}$|⁠, we define the semidiscrete space |$X_{k}^{0}(\mathcal{B})$| of piecewise constant functions in time as

(4.8)

We use the following standard notation for a function |${\textbf{u}} \in X_{k}^{r}(\mathcal{B})$| to denote one-sided limits and jumps at the time nodes:

(4.9)

We define the bilinear form |${B}(\cdot ,\cdot )$| by

With this bilinear form, we define the fully discrete approximation for the transient Stokes problem on the discretely divergence free space |$X_{k}^{0}(\mathbf{V}_{h})$|⁠:

(4.10)

The unique solution to this system is stable, as the following theorem summarizes:

 

Theorem 4.2.
Let |${\textbf{q}} \in L^{2}(I;H^{-1}(\varOmega )^{d})+L^{1}(I;L^{2}(\varOmega )^{d})$|⁠. Then there exists a unique solution |${\textbf{u}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$| of (4.10), satisfying
where the constant |$C>0$| is independent of |$k,h$|⁠.

 

Proof.

As (4.10) poses a square system of linear equations in finite dimensions, it suffices to show uniqueness. This is a standard argument, making use of a discrete Gronwall Lemma, see e.g. Vexler & Wagner (2024, Theorem 4.13) for a proof focusing especially on the |$L^{1}(I;L^{2}(\varOmega )^{d})$| right-hand side case.

 

Remark 4.3.
Rearranging terms in the definition of the bilinear form gives the following dual representation of |$B(\cdot ,\cdot )$|  
With the same arguments as above, for given |$\mathbf{g} \in L^{2}(I;H^{-1}(\varOmega )^{d}) + L^{1}(I;L^{2}(\varOmega )^{d})$|⁠, solutions |${\textbf{z}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$| to the discrete dual equation
exist, are unique and satisfy the stability

4.3 Best approximation type fully discrete error estimate for the Stokes problem in |$L^\infty (I;L^{2}(\varOmega )^{d})$| norm

In our recent paper Behringer et al. (2023), we have established a best approximation type error estimate for the Stokes problem in the |$L^\infty (I;L^{2}(\varOmega )^{d})$| norm. From this more general result, we obtain in the case of homogeneous initial data the following result, see Behringer et al. (2023, Corollary 6.4).

 

Theorem 4.4.
Let |${\textbf{q}} \in L^{s}(I;L^{2}(\varOmega )^{d})$| for some |$s>1$| and let |${\textbf{u}}\in \mathbf{W}$| be the weak solution to (2.3) with associated pressure |$p$| in the sense of (2.6). Let |${\textbf{u}}_{kh}\in X_{k}^{0}(\mathbf{V}_{h})$| be the fully discrete Galerkin solution to (4.10). Then there exists a constant |$C$| independent of |$k$| and |$h$|⁠, such that for any |$\chi \in X_{k}^{0}(\mathbf{V}_{h})$| there holds
where |$\ell _{k}=\ln{\frac{T}{k}}$| and |$R^{S}_{h}({\textbf{u}},p)$| is the stationary finite element Stokes projection introduced in (4.5).

Using the error estimates for the stationary Stokes Ritz projection of Theorem 4.1, in Behringer et al. (2023, Theorem 7.4) the following estimate in terms of explicit orders of convergence was shown.

 

Corollary 4.5.
If in addition to assumptions of Theorem 4.4, the domain |$\varOmega $| is convex, and |${\textbf{q}} \in L^\infty (I;L^{2}(\varOmega )^{d})$|⁠, then there exists a constant |$C$| independent of |$k$| and |$h$| such that

The above results are valid in |$L^{2}(I; L^{2}(\varOmega )^{d})$| norm as well. However, using the energy and duality arguments it is possible to show the corresponding results log-free and with less regularity assumptions on the data (cf. Leykekhman & Vexler (2024, Theorems 11 & 13)).

 

Theorem 4.6.
Let |${\textbf{q}} \in L^{2}(I;L^{2}(\varOmega )^{d})$| and let |${\textbf{u}}\in \mathbf{W}$| be the weak solution to (2.3) with associated pressure |$p$| in the sense of (2.6). Let |${\textbf{u}}_{kh}\in X_{k}^{0}(\mathbf{V}_{h})$| be the fully discrete Galerkin solution to (4.10). Then there exists a constant |$C$|⁠, independent of |$k$| and |$h$|⁠, such that for any |$\chi \in X_{k}^{0}(\mathbf{V}_{h})$|⁠, there holds
(4.11a)
and
(4.11b)
where |$R^{S}_{h}({\textbf{u}},p)$| is the stationary finite element Stokes projection defined in (4.5) and |$\pi _{k}$| is the time projection onto |$X^{0}_{k}(\mathbf{V})$|⁠, with |$\pi _{k} v|_{I_{m}} = v(t_{m}^{-})$| for |$m=1,2,\dots ,M$|⁠.

 

Corollary 4.7.
If in addition to assumptions of Theorem 4.6, the domain |$\varOmega $| is convex, then there exists a constant |$C$| independent of |$k$| and |$h$| such that

5. Variational discretization of the optimal control problem

In this section, we consider the optimal control problem subject to the fully discretized Stokes equations. We consider a variational discretization for the controls, i.e., do not fix a finite dimensional approximation of the control space yet, cf., Hinze (2005); Deckelnick & Hinze (2011). The problem reads

(5.1a)

over all |${\textbf{q}}_{kh} \in \mathbf{Q}_{ad}$|⁠, |${\textbf{u}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$|⁠, subject to

(5.1b)
(5.1c)

Following the structure of Section 3 and using Theorem 4.2, we introduce the discrete analog to the control state map,

(5.2)

The finitely many state constraints we describe with the help of the continuous linear operator |$\mathcal{G}_{kh}\colon \mathbf{Q}_{ad} \to{\mathbb{R}}^{M}$| with |$(\mathcal{G}_{kh}({\textbf{q}}))_{m}:=G_{\mathbf{w}}\circ S_{kh} ({\textbf{q}}) \mid _{I_{m}}$| for |$m=1,2,\dots ,M$|⁠. Using the set

we can rewrite the problem (5.1a)–(5.1c) in the reduced form:

(5.3)

Before discussing wellposedness and optimality conditions of this discrete problem, we shall show that the Slater assumption on the continuous level carries over to the discrete problem. As we achieve this with the finite element error estimates presented in Section 4, we need to impose a rather weak coupling condition between |$k$| and |$h$|⁠, allowing us to deduce convergence in the result of Corollary 4.5. Throughout the remainder of this work, we thus work under the following assumption.

 

Assumption 3.
There exists a function |$\varPhi \colon (0,1) \to (0,\infty )$| with |$\lim _{h\to 0} \varPhi (h) = 0$|⁠, such that the discretization parameters |$k$| and |$h$| satisfy

 

Remark 5.1.

This assumption is valid, e.g. if there exists a constant |$C> 0$| such that |$\left |\ln \left (\frac{T}{k}\right )\right | h |\ln h| \le C$|⁠. As the choice of the term |$|\ln h|$| in such a condition can be made arbitrarily weak, we have chosen to work under the more general formulation of Assumption 3.

 

Lemma 5.2.
There exists |$h_{0}> 0$| such that for any |$h\le h_{0}$| and |$k$| satisfying Assumption 3, the Slater point |$\tilde{\textbf{q}} \in \mathbf{Q}_{ad}$| from Assumption 1 satisfies the following discrete Slater condition:

 

Proof.
Using that |$G_{\mathbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))<\beta $| in |$\bar I$|⁠, by the Slater condition Assumption 1 there exists |$\delta>0$| such that |$G_{\mathbf{w}}({\textbf{u}}(\tilde{{\textbf{q}}}))\le \beta -\delta $|⁠. For arbitrary |$\hat{\textbf{q}} \in C^\infty (I\times \varOmega )^{d}$| it holds due to triangle inequality
Using the continuous and fully discrete stability results of the state equations, presented in Theorem 2.1 and Theorem 4.2, as well as the error estimate Corollary 4.5 for the problem with right-hand side |$\hat{\textbf{q}}$|⁠, we obtain
For any |$\varepsilon \!>\! 0$|⁠, due to the density of |$C^\infty (I\times \varOmega )^{d}$| in |${L^{2}(I;L^{2}(\varOmega )^{d})}$|⁠, we can find |$\hat{\textbf{q}}_\varepsilon $| such that |$C \|\tilde{\textbf{q}} - \hat{\textbf{q}}_\varepsilon \|_{L^{2}(I \times \varOmega ))} \!<\! \frac{\varepsilon }{2}$|⁠. Moreover, for |$h\le h_{0}$| sufficiently small, and |$k$| satisfying Assumption 3, it also holds |$C \ell _{k}^{2} (k + h^{2}) \|\hat{\textbf{q}}_\varepsilon \|_{L^\infty (I;L^{2}(\varOmega ))}\! <\! \frac{\varepsilon }{2}$|⁠. Thus in total |$\|{\textbf{u}}(\tilde{\textbf{q}}) - {\textbf{u}}_{kh}(\tilde{\textbf{q}})\|_{L^\infty (I;L^{2}(\varOmega ))} \!<\! \varepsilon $|⁠. Choosing |$\varepsilon $| small enough, such that |$\varepsilon \|{\textbf{w}}\|_{L^{2}(\varOmega )}\ < \delta$|⁠, we obtain
(5.4)

 

Theorem 5.3.

Let |$k$| and |$h$| satisfy Assumption 3, and let |$h$| be small enough. Then there exists a unique solution |$(\bar{\textbf{q}}_{kh},\bar{\textbf{u}}_{kh})$| to the optimal control problem (5.1a)–(5.1c).

 

Proof.

As Lemma 5.2 shows feasibility of |$(\tilde{\textbf{q}},{\textbf{u}}_{kh}(\tilde{\textbf{q}}))$| under the given assumptions, the existence proof follows the same steps as the one of Theorem 3.6 on the continuous level.

 

Theorem 5.4 (Discrete first order optimality system).
A control |$\bar{{\textbf{q}}}_{kh}\in \mathbf{Q}_{ad}$| and the associated state |$\bar{{\textbf{u}}}_{kh}={\textbf{u}}_{kh}(\bar{{\textbf{q}}}_{kh}) \in X^{0}_{k}(\mathbf{V}_{h})$| is the optimal solution to the problem (5.1a)–(5.1c) if and only if there exists an adjoint state |$\bar{{\textbf{z}}}_{kh} \in X^{0}_{k}(\mathbf{V}_{h})$| and a Lagrange multiplier |$\bar{\mu }_{kh}\in L^{1}(I)$| that satisfy: Discrete state equation
(5.5a)
Discrete state constraint and complementarity conditions
(5.5b)
Discrete adjoint equation
(5.5c)
Discrete variational inequality
(5.5d)
Furthermore, there exist |$\bar \mu _{kh}^{m} \in{\mathbb{R}}_{\ge 0}$|⁠, |$m=1,2,\dots ,M$|⁠, such that the discrete Lagrange multiplier |$\bar{\mu }_{kh}\in L^{1}(I)$| satisfies the expression
(5.6)
where |$\chi _{I_{m}}$| denotes the characteristic function of the interval |$I_{m}$|⁠.

 

Proof.

In Lemma 5.2, we have shown that under Assumption 3 and for small enough |$h$|⁠, there holds |$\mathcal{G}_{kh}(\tilde{{\textbf{q}}})\in \operatorname{int}(\mathcal{K}_{kh})$|⁠. Similarly to the proof of Theorem 3.7 we obtain that the optimality of |$\bar{q}_{kh}$| is equivalent to the existence of a Lagrange multiplier |$(\bar{\mu }_{kh}^{m})_{m=1}^{M}\in \mathbb{R}^{M}_{\ge 0}$| and the adjoint state |$\bar{{\textbf{z}}}_{kh}\in X^{0}_{k}(\mathbf{V}_{h})$| satisfying (5.5b), (5.5c) and (5.5d). Finally, by the construction given in (5.6), |$\bar{\mu }_{kh}$| is an element of |$L^{1}(I)$|⁠.

 

Remark 5.5.
Notice that from the definition (5.6) and using that |$\bar \mu _{kh} \ge 0$|⁠, it holds

 

Remark 5.6.

We would like to point out that although the state |$\bar{{\textbf{u}}}_{kh}$| and the adjoint |$\bar{{\textbf{z}}}_{kh}$| are fully discrete, the corresponding control |$\bar{{\textbf{q}}}_{kh}\in \mathbf{Q}_{ad}$| is piecewise constant in time via (5.5d), but not necessary piecewise polynomial in space with respect to the given mesh, due to the projection onto |$[{\textbf{q}}_{a},{\textbf{q}}_{b}]$|⁠.

With the optimality conditions established, we now show the following stability of optimal solutions to the discrete problem subject to different discretization levels.

 

Lemma 5.7.
Under Assumption 3 and for |$h$| small enough, there exists |$C>0$| independent of |$k$|⁠, |$h$|⁠, such that the optimal control |$\bar{{\textbf{q}}}_{kh}\in \mathbf{Q}_{ad}$| of the variationally discretized problem (5.1a)–(5.1c), together with its corresponding state |$\bar{{\textbf{u}}}_{kh}\in X^{0}_{k}(\mathbf{V}_{h})$| and corresponding multiplier |$\bar{\mu }_{kh}\in L^{1}(\bar I)$| satisfy the bound

 

Proof.
Due to the feasibility of |$\tilde{\textbf{q}}$|⁠, shown in Lemma 5.2, it holds
where due to Theorem 4.2, this bound is independent of |$k$| and |$h$|⁠. This results in
(5.7)
Let us define |${\textbf{p}}=\frac{1}{2}\bar{{\textbf{q}}}+\frac{1}{2}\tilde{{\textbf{q}}}$|⁠. By definition |${\textbf{p}}\in \mathbf{Q}_{ad}$| and thus by (5.5d) it holds |$ \left (\alpha \bar{{\textbf{q}}}_{kh}+\bar{{\textbf{z}}}_{kh}, {\textbf{p}}-\bar{{\textbf{q}}}_{kh}\right )_{I\times \varOmega }\geq 0. $| This yields
(5.8)
where we can bound the first two terms by (5.7) and obtain
(5.9)
For |${\textbf{p}}$|⁠, using Assumption 3, we can follow a similar argument as in the proof of Lemma 5.2, in order to obtain
Inserting this into (5.9) yields together with |$\bar \mu _{kh} \ge 0$| and the complementarity conditions (5.5b):
Thus, again using |$\bar{\mu }_{kh}\geq 0$| and Remark 5.5 results in
(5.10)
Combining (5.7) and (5.10) with Remark 4.3 yields the boundedness of |$\bar{\textbf{z}}_{kh}$| in |$L^\infty (I;L^{2}(\varOmega )^{d})$|⁠. By the representation |$\bar{\textbf{q}}_{kh} = P_{[{\textbf{q}}_{a},{\textbf{q}}_{b}]} \left ( - \frac{1}{\alpha } \bar{\textbf{z}}_{kh} \right )$|⁠, this shows |$\|\bar{\textbf{q}}_{kh}\|_{L^\infty (I;L^{2}(\varOmega ))} \le C$|⁠, which concludes the proof.

 

Theorem 5.8.
Let Assumption 3 hold and let |$h$| be sufficiently small. Let |$(\bar{\textbf{q}},\bar{\textbf{u}})$| and |$(\bar{\textbf{q}}_{kh},\bar{\textbf{u}}_{kh})$| be the unique solutions to the continuous and variationally discretized optimal control problems (1.1a)–(1.1d) and (5.1a)–(5.1c), respectively. Then there exists a constant |$C>0$|⁠, such that it holds

 

Proof.
Choosing |$\delta{\textbf{q}}=\bar{{\textbf{q}}}_{kh}$| in (3.6d) and |$\delta{\textbf{q}}=\bar{{\textbf{q}}}$| in (5.5d) results in
(5.11)
Adding these two inequalities results in
(5.12)
We estimate the two terms separately.
Estimate for |$I_{1}$|⁠. Using the weak formulations (2.3) and (3.2) of the continuous state and adjoint equations (3.6a) & (3.6c), respectively, we have
Introducing the pointwise projection onto the state constraint
(5.13)
the last term can be estimated as
where we have used that due to |$\bar \mu \ge 0$|⁠, it holds |$\langle P_{\beta } G_{\textbf{w}}({\textbf{u}}(\bar{\textbf{q}}_{kh})),\bar \mu \rangle \le \langle{\beta },\bar \mu \rangle $|⁠. Using the complementarity condition (3.6b), it holds |$\langle{\beta } -G_{\mathbf{w}}(\bar{{\textbf{u}}}),\bar{\mu }\rangle = 0$|⁠, hence we have
Now using that
(5.14)
by the triangle inequality and using that |$G_{\mathbf{w}}(\bar{{\textbf{u}}}_{kh})\le{\beta }$|⁠, we obtain
where in the last two steps we used Corollary 4.5 and Lemma 5.7. Thus,
Estimate for |$I_{2}$|⁠. Similarly, using the fully discrete state and adjoint equations (5.5a) and (5.5c), respectively, we have
Using the projection |$P_{\beta }$| defined in (5.13), the last term in |$I_{2}$| can be estimated as
where we have used that due to |$\bar \mu _{kh} \ge 0$|⁠, it holds |$\langle P_{\beta } G_{\textbf{w}}({\textbf{u}}_{kh}(\bar{\textbf{q}})),\bar \mu _{kh}\rangle \le \langle{\beta },\bar \mu _{kh}\rangle $|⁠. Using the complementarity condition (5.5b), it holds |$\langle{\beta } - G_{\textbf{w}}(\bar{{\textbf{u}}}_{kh}),\bar{\mu }_{kh}\rangle = 0$|⁠, hence we have
Using (5.14), the triangle inequality and using that |$G_{\mathbf{w}}(\bar{{\textbf{u}}}_{kh})\le{\beta }$|⁠, we obtain
where in the last two steps we used Corollary 4.5 and Lemma 5.7. Thus,
Combining the estimates for |$I_{1}$| and |$I_{2}$| and using that
by using Corollary 4.7, we obtain
where in the last step, we used the boundedness of |$\|\bar{{\textbf{u}}}_{kh}\|_{L^{2}(I\times \varOmega )}$|⁠, |$\|\bar{{\textbf{u}}}\|_{L^{2}(I\times \varOmega )}$|⁠, |$\|\bar{{\textbf{q}}}\|_{L^{2}(I\times \varOmega )}$|⁠, and |$\|\bar{{\textbf{q}}}_{kh}\|_{L^{2}(I\times \varOmega )}$| from Theorem 3.8 and Lemma 5.7.

6. Full discretization of the optimal control problem

We discretize the control by piecewise constant functions on the same partition as the fully discrete approximation of the state and adjoint variables. We set

(6.1)

We also define the corresponding admissible set

We introduce the projection |$\pi _{d}\colon L^{2}(I;L^{2}(\varOmega )^{d}) \to \mathbf{Q}_{0}$|⁠, defined by

(6.2)

which by definition is stable in |$L^{2}(I\times \varOmega )$|⁠, i.e., satisfies

(6.3)

Note that this projection satisfies the explicit formula

Hence it is straightforward to check that this |$L^{2}$| projection onto piecewise constants is stable in |$L^\infty (I \times \varOmega )^{d}$| and |$L^\infty (I;L^{2}(\varOmega )^{d})$| and there holds

(6.4)

Further, we have |$\pi _{d} (\mathbf{Q}_{ad}) \subset \mathbf{Q}_{0,ad}$|⁠. We can now formulate the fully discrete optimal control problem, which reads

(6.5a)

subject to |$({\textbf{q}}_{\sigma },{\textbf{u}}_{\sigma }) \in \mathbf{Q}_{0,ad}\times X^{0}_{k}(\mathbf{V}_{h})$|⁠, satisfying

(6.5b)

and

(6.5c)

The following lemma guarantees that also for the fully discrete optimal control problem, there exist feasible controls such that the associated fully discrete state strictly satisfies the state constraint.

 

Lemma 6.1.
Let Assumption 3 be satisfied and let |$h$| be sufficiently small. Then the projection |$\pi _{d} \tilde{\textbf{q}} \in \mathbf{Q}_{0,ad}$| of the Slater point |$\tilde{\textbf{q}} \in \mathbf{Q}_{ad}$| from Assumption 1 satisfies the following discrete Slater condition:

 

Proof.
From Lemma 5.2, we know that there exists |$\delta> 0$| such that |$G_{\textbf{w}}({\textbf{u}}_{kh}(\tilde{\mathbf{q}})) < \beta - \delta$|⁠. As the discrete solution operator |$S_{kh}$| is linear and continuous from |$L^{2}(I;L^{2}(\varOmega )^{d}) \to L^\infty (I;L^{2}(\varOmega )^{d})$|⁠, we have
As Assumption 3 guarantees |$k \to 0$| as |$h \to 0$|⁠, it holds |$\|\pi _{d} \tilde{\textbf{q}} - \tilde{\textbf{q}}\|_{L^{2}(I; L^{2}(\varOmega ))} \to 0$| for |$h \to 0$|⁠. This implies that for |$h$| small enough, we have |$C\|\pi _{d} \tilde{\textbf{q}} - \tilde{\textbf{q}}\|_{L^{2}(I\times \varOmega )} \le \delta $| and as a consequence |$G_{\textbf{w}}({\textbf{u}}_{kh}(\pi _{d} \tilde{\textbf{q}})) < \beta $|⁠.

 

Theorem 6.2.

Let Assumption 3 be satisfied and let |$h$| be sufficiently small. Then there exists a unique solution |$(\bar{\textbf{q}}_\sigma , \bar{\textbf{u}}_\sigma )$| to the fully discrete optimal control problem (6.5a)–(6.5c).

 

Proof.

As Lemma 6.1 shows feasibility of |$(\pi _{d} \tilde{\textbf{q}},{\textbf{u}}_{kh}(\pi _{d} \tilde{\textbf{q}}))$| under the given assumptions, the existence proof follows the same steps as the one of Theorem 3.6 on the continuous level.

Similar to Section 5, we can rewrite the problem (6.5a)–(6.5c) in the reduced form

(6.6)

Note that compared to the the variationally discretized optimal control problem (5.3), only the control space has changed.

 

Theorem 6.3 (First order optimality conditions for discretized controls).
A control |$\bar{{\textbf{q}}}_{\sigma }\in \mathbf{Q}_{0,ad}$| and the associated state |$\bar{{\textbf{u}}}_{\sigma }={\textbf{u}}_{kh}(\bar{{\textbf{q}}}_{\sigma }) \in X^{0}_{k}(\mathbf{V}_{h})$| is the optimal solution to the problem (6.5a)–(6.5c) if and only if there exists an adjoint state |$\bar{{\textbf{z}}}_{\sigma } \in X^{0}_{k}(\mathbf{V}_{h})$| and a Lagrange multiplier |$\bar{\mu }_{\sigma }\in L^{1}(I)$| that satisfy: Discrete state equation
(6.7a)
Discrete state constraint and complementarity conditions
(6.7b)
Discrete adjoint equation
(6.7c)
Discrete variational inequality
(6.7d)
Furthermore, there exist |$\bar \mu _{\sigma }^{m} \in{\mathbb{R}}_{\ge 0}$|⁠, |$m=1,2,\dots ,M$|⁠, such that the discrete Lagrange multiplier |$\bar{\mu }_{\sigma }\in L^{1}(I)$| satisfies the expression
(6.8)
where |$\chi _{I_{m}}$| denotes the characteristic function of the interval |$I_{m}$|⁠.

 

Proof.

The proof is almost identical to the proof of Theorem 5.4.

Again, due to |$\bar \mu _\sigma \ge 0$|⁠, it holds |$\|\bar \mu _\sigma \|_{L^{1}(I)} = \|\bar \mu _\sigma \|_{(C(\bar I)^{*})} = \langle \bar \mu _\sigma , 1 \rangle = \sum _{m=1}^{M} \bar \mu _\sigma ^{m}$|⁠.

 

Lemma 6.4.
Let Assumption 3 be satisfied, and |$h$| be small enough. Then there exists a constant |$C>0$| independent of |$k$| and |$h$|⁠, such that the fully discrete optimal control |$\bar{{\textbf{q}}}_{\sigma }\in \mathbf{Q}_{0,ad}$|⁠, solving (6.5a)–(6.5c), together with its corresponding state |$\bar{{\textbf{u}}}_{\sigma }\in X^{0}_{k}(\mathbf{V}_{h})$| and corresponding multiplier |$\bar{\mu }_{\sigma }\in L^{1}(\bar I)$| satisfies the bound

 

Proof.
By Lemma 6.1 under the given assumptions, the fully discrete control |$\pi _{d} \tilde{\textbf{q}}$| is feasible, and thus it holds
where in the last step, we have used the discrete stability result for |${\textbf{u}}_{kh}$| from Theorem 4.2 and the stability of |$\pi _{d}$| from (6.3). As a result we obtain |$ \|\bar{{\textbf{q}}}_{\sigma }\|_{L^{2}(I\times \varOmega )}+\|\bar{{\textbf{u}}}_{\sigma }\|_{L^{2}(I\times \varOmega )}\le C. $| The proof of |$\|\bar \mu _\sigma \|_{C(\bar I)^{*})} \le C$| and |$\|\bar{\textbf{q}}_\sigma \|_{L^\infty (I;L^{2}(\varOmega ))} \le C$| is then accomplished by following the same steps as the proof of Lemma 5.7, making use of |$G_{\textbf{w}}({\textbf{u}}_{kh}({\pi _{d}{\textbf{p}}}))\le \beta -\frac{1}{4}\delta $| for |$h$| sufficiently small.

 

Theorem 6.5.
Let Assumption 3 be satisfied and let |$h$| be small enough. Let |$(\bar{\textbf{q}}_{kh},\bar{\textbf{u}}_{kh})$| and |$(\bar{\textbf{q}}_\sigma ,\bar{\textbf{u}}_\sigma )$| denote the optimal solutions of the variationally discretized optimal control problem (5.1a)–(5.1c) and the fully discretized optimal control problem (6.5a)–(6.5c). Then there exists a constant |$C>0$| such that it holds

 

Proof.
Choosing |$\delta{\textbf{q}}=\bar{{\textbf{q}}}_{\sigma } \in \mathbf{Q}_{ad}$| in (5.5d) and |$\delta{\textbf{q}}=\pi _{d}\bar{{\textbf{q}}}_{kh} \in \mathbf{Q}_{0,ad}$| in (6.7d) results in
(6.9)
Adding these two inequalities, we obtain
(6.10)
We estimate the two terms separately.
Estimate for |$I_{1}$|⁠. Using the discrete state equations (5.5a), (6.7a) the corresponding adjoint equations (5.5c) and (6.7c), respectively, we have
By the Cauchy–Schwarz inequality and properties of the |$L^{2}$|-projection
Using that |$\| \nabla \bar{{\textbf{q}}}_{kh}\|_{L^{2}(I\times \varOmega )}\le \alpha ^{-1}\| \nabla \bar{{\textbf{z}}}_{kh}\|_{L^{2}(I\times \varOmega )}$|⁠, which holds due to the projection formula (5.5d) and the stability of |$P_{ad}$| in |$H^{1}(\varOmega )$|⁠, see Kinderlehrer & Stampacchia (2000, Theorem A.1), Attouch et al. (2014, Theorem 5.8.2), and the stability of solutions to the fully discrete dual problem, pointed out in Remark 4.3, gives
Now the boundedness of |$\|\bar{{\textbf{u}}}_{\sigma }\|_{L^{2}(I\times \varOmega )}$|⁠, |$\|\bar{{\textbf{u}}}_{kh}\|_{L^{2}(I\times \varOmega )}$|⁠, |$\|\bar{\mu }_{\sigma }\|_{L^{1}(I)}$|⁠, and |$\|\bar{\mu }_{kh}\|_{L^{1}(I)}$| from Lemmas 5.7 and 6.4 finish the proof.

With this last error estimate, our main result now directly follows from Theorem 5.8 and Theorem 6.5.

 

Theorem 6.6 (Error estimate for the control).
Let Assumption 3 be satisfied and let |$h$| be small enough. Let |$\bar{\textbf{q}} \in \mathbf{Q}_{ad}$| and |$\bar{\textbf{q}}_\sigma \in \mathbf{Q}_{0,ad}$| be the solutions to the continuous and fully discrete optimal control problems (1.1a)–(1.1d) and (6.5a)–(6.5c).

 

Remark 6.7.

All of our main results presented in Theorems 5.8, 6.5 and 6.6 do not require any additional regularity assumptions on the optimal control |$\bar{\textbf{q}}$|⁠, but work precisely with the regularity that is obtained from the optimality conditions. Furthermore, the techniques used in proving these results allow us to avoid strong coupling conditions on |$k$| and |$h$|⁠. We only require the product |$\ell _{k} h$| to converge to zero, in order to obtain a Slater condition for the discrete problems, which can be guaranteed by a very mild coupling condition as in Assumption 3.

7. Improved regularity

Despite having no a priori smoothness regularity for |$\bar{{\textbf{z}}}$| and as a result for |$\bar{{\textbf{q}}}$|⁠, our main result shows almost |$k^{\frac{1}{2}}$| convergence rate for the error of the optimal control. Similarly to Meidner et al. (2011), we can use this result to establish improved regularity for the optimal control.

 

Theorem 7.1.
Let |$\bar{{\textbf{q}}}\in \mathbf{Q}_{ad}$| be the optimal solution to (1.1a)–(1.1d). Then,
Additionally, if |${\textbf{q}}_{a},{\textbf{q}}_{b} \in{\mathbb{R}}^{d}$|⁠, i.e., are finite, it holds |$\bar{\textbf{q}} \in{L^\infty (I \times \varOmega )^{d}}$|⁠.

 

Proof.

The proof is identical to the proof of Theorem 7.1 in Meidner et al. (2011).

In light of Theorem 3.10 and the imbedding |$\text{BV}(I) \hookrightarrow H^{s}(I)$| for all |$s < \frac{1}{2}$|⁠, this result is particularly interesting, as it holds without additional regularity assumptions on the weight |${\textbf{w}}$|⁠. Note also that the regularity properties of Theorem 3.10 and its corollaries were not used in the derivation of our error estimates. It remains an open question, how these results can be used to derive improved error estimates for the optimal control problem directly.

8. Numerical results

In the following, we present three numerical examples, studying the orders of convergence for the optimal control problem. We first present an example for smooth data. These results can be compared to the numerical example in Meidner et al. (2011), where the weight function |$w(x_{1},x_{2})=\sin{(\pi x_{1})}\sin{(\pi x_{2})}$| on the unit square was used. As this weight is in |$H^{1}_{0}(\varOmega )\cap H^{2}(\varOmega )$| almost first order convergence in time was observed, which we again can see for the Stokes optimal control problem. In the second example, we study an example where less regularity of the data is available, leading to a reduced order of convergence. Lastly, we consider an example with simultaneous control and state constraints. Throughout this section, we will analyze the errors of the full discretization |$\bar{\textbf{q}} - \bar{\textbf{q}}_{\sigma }$|⁠. For details regarding the implementation of a variational discretization in the presence of control constraints, we refer to Hinze (2005), Hinze et al. (2009, Chapter 3.2.5) and the references therein.

8.1 Solution algorithm

Equation (6.6) can be written in matrix notation as

(8.1)

where |$M_{kh}^{uu}$| is the mass matrix of |$X^{0}_{k}(\mathbf{V}_{h})$|⁠, |$M_{kh}^{qq}$| is the mass matrix of |$\mathbf{Q}_{0}$|⁠, |$M_{kh}^{uq}$| is the mass matrix encoding inner products of |$X^{0}_{k}(\mathbf{V}_{h})$| and |$\mathbf{Q}_{0}$| functions, |${\textbf{u}}_{d,kh}$| is the |$L^{2}$| projection of |${\textbf{u}}_{d}$| onto |$X^{0}_{k}(\mathbf{V}_{h})$| and |$W_{kh}$| is the matrix that maps each |${\textbf{u}}_{kh}$| to the vector |$(G_{\textbf{w}}({\textbf{u}}_{kh})|_{I_{m}})_{m=1,..,M}$|⁠. Note that with a slight abuse of notation, we write |$S_{kh}$| for the control to state mapping as well as the matrix representing this mapping. The optimality conditions for this problem read

We solve the above problem with a primal-dual-active-set strategy (PDAS), during which, for each iteration of the active set |$\mathcal A_{n} \subset \{1,\dots ,M\}$|⁠, we solve a symmetric saddle point system

where |$O_{n} \in{\mathbb{R}}^{|\mathcal A_{n}|\times M}$| is the matrix satisfying |$O_{n} \mu _\sigma = (\mu _\sigma )_{\mathcal A_{n}}$|⁠, i.e., selects the active indices. We solve the linear system with MINRES, using the block diagonal preconditioner

Note that due to the choice of control discretization |$M_{kh}^{qq}$| is a diagonal matrix, and if the partition of |$I$| is uniform, and all mesh elements of |$\varOmega $| have the same volume, |$M_{kh}^{qq}$| is a multiple of the identity matrix. Further note that if the partition of |$I$| is uniform, the matrix |$S_{kh}^{T} W_{kh}^{T}$| has the structure

where each vector |${\textbf{z}}_{m} \in{\mathbb{R}}^{\dim (\mathbf{V}_{h})}$|⁠, |$m=1,...,M$| corresponds to the degrees of freedom of |${\textbf{z}}|_{I_{m}}$| for the solution |${\textbf{z}}$| to

Hence, to assemble this matrix only one discrete adjoint problem has to be solved, and a decomposition of the preconditioning matrix |$P_{kh}$| can be computed in advance and be reused in every iteration of the PDAS algorithm. The discrete solutions of the finite element problems were carried out in FEniCS Version 2019 (Logg et al., 2012), using the MINI Element in space.

8.2 Example 1

For this example, we consider the setting |$\varOmega = (0,1)^{2}$|⁠, |$I = (0,1]$|⁠, choose the regularization parameter |$\alpha = 1$| and set the control constraints to |${\textbf{q}}_{a} = (- \infty ,-\infty )^{T}$|⁠, |${\textbf{q}}_{b} = (+ \infty ,+\infty )^{T}$|⁠. We construct an analytic test case by considering the functions

Here, |${\textbf{y}}$| has been constructed in such a way that |$\nabla \cdot{\textbf{y}} = 0$| and |$\|{\textbf{y}}\|_{L^{2}(\varOmega )}=1$|⁠. It was obtained by considering the potential |$\rho (x_{1},x_{2}) = \left (\sin (\pi x_{1})\sin (\pi x_{2})\right )^{4}$|⁠, and normalizing the vector field |$(\partial _{x_{2}}\rho (x_{1},x_{2}),-\partial _{x_{1}}\rho (x_{1},x_{2}))^{T}$|⁠. We choose |$\bar{{\textbf{u}}}=\varphi (t){\textbf{y}}(x_{1},x_{2})$|⁠, |${\textbf{w}} = {\textbf{y}}$| and |$\beta \equiv 1$|⁠. It is then straightforward to verify that |$G_{\textbf{w}}(\bar{{\textbf{u}}}) \le \beta $| for all |$t \in I$| and |$G_{\textbf{w}}(\bar{{\textbf{u}}}) = \beta $| if and only if |$t \in [1/4,3/4]$|⁠. We thus choose the multiplier |$\bar \mu = 10^{3} \chi _{[1/4,3/4]}(t)$|⁠, which by construction satisfies |$\bar \mu \ge 0$| and |$\langle G_{\textbf{w}}(\bar{\textbf{u}}) -\beta ,\bar \mu \rangle = 0$|⁠. We then proceed by choosing |$\bar p = 0$| and as a consequence set |$\bar{\textbf{q}} = \partial _{t} \bar{\textbf{u}} - \varDelta \bar{\textbf{u}}$|⁠. We obtain |$\bar{\textbf{z}} = - \bar{\textbf{q}}$| and with the choice |$\bar r = 0$| can fix |${\textbf{u}}_{d} = \bar{\textbf{u}} + \partial _{t} \bar{\textbf{z}} + \varDelta \bar{\textbf{z}} + \bar \mu{\textbf{w}}$|⁠, in such a way that the constructed |$(\bar{\textbf{q}},\bar{\textbf{u}})$| satisfy the first order optimality condition for this desired state. Note that |$\vec \psi $| has been chosen sufficiently smooth at the boundary, such that |$\varDelta \vec \psi |_{\partial \varOmega } = 0$| and thus |$\bar{\textbf{q}}|_{\partial \varOmega } = \bar{\textbf{z}}|_{\partial \varOmega } = 0$|⁠. The calculation of the analytic solution was verified using the SageMath software. We discretize this problem with a uniform triangulation of |$\varOmega $| and a uniform partition of |$I$|⁠. To get more insight into the observed orders of convergence, for a sequence of discretization levels |$\{\sigma _{l}\} = \{(h_{l},k_{l})\}$|⁠, we report the empirical orders of convergence determined by

Figure 1(a) displays convergence with respect to the spacial discretization parameter for fixed |$k$|⁠. The theoretical convergence order of 1 can be observed. Figure 1(b) depicts the convergence results for the time discretization parameter for different values of |$h$|⁠. Note that for this comparison, we have choosen discretization levels in time, such that the two boundary of the active set |$[1/4,3/4]$| are midpoints of the discrete subintervals, in order to exclude superconvergence effects. Due to the structure of the discretization, the number of degrees of freedom grows rather quickly, which is why very fine discretizations are expensive. Figure 1(b) shows that if the spacial discretization parameter is chosen too large, the stagnation phase sets in rather early. If one only observes the coarse discretizations, the observed order of convergence is skewed, which led to an estimated order of convergence of 0.85 reported in Christof & Vexler (2021). As the derivation of the present example with an analytic reference solution requires the involved functions to have some precise regularity, we next propose an example with desired state that has low regularity in time. In this case a numerical reference solution is needed to measure the errors.

Numerical observation of the error $\|\bar{\textbf{q}} - \bar{\textbf{q}}_\sigma \|_{L^{2}(I\times \varOmega )}$ for Example 1. (a) Convergence with respect to $h$ for $k=10^{-3}$. (b) Convergence with respect to $k$ for different values of $h$.
Fig. 1.

Numerical observation of the error |$\|\bar{\textbf{q}} - \bar{\textbf{q}}_\sigma \|_{L^{2}(I\times \varOmega )}$| for Example 1. (a) Convergence with respect to |$h$| for |$k=10^{-3}$|⁠. (b) Convergence with respect to |$k$| for different values of |$h$|⁠.

8.3 Example 2

In this example, we consider an optimal control problem where we specify rough data |${\textbf{u}}_{d}$|⁠, and |${\textbf{w}}$|⁠, while keeping the domain |$\varOmega = (0,1)^{2}$|⁠, |$I = (0,1]$| and |${\textbf{q}}_{a} = (-\infty ,-\infty )^{T}$|⁠, |${\textbf{q}}_{b} = (+\infty ,+\infty )^{T}$|⁠. We specify, see also Fig. 2(a),

The weight in the state constraint is given by the following function, see also Fig. 2(b),

and the scalar constraint is given by |$\beta \equiv 1$|⁠. Moreover, we consider the regularization parameter |$\alpha = 10^{-4}$|⁠. As in this case, no analytical optimal solution is known, we estimate the errors using a numerical reference solution on a fine grid. To this end, we discretize the problem on 960 time intervals and 128 subdivisions of |$\varOmega $| in each direction. Note that due to this evaluation of the errors, we expect a faster convergence than theoretically derived, as on the finest discretization level, the error would equal to |$0$|⁠. In Fig. 3(a) we can again observe order 1 convergence with respect to |$h$| for fixed time discretization. In Fig. 3(b) we observe that the convergence with respect to |$k$| exhibits a rate of about |$0.6$|⁠, which is much closer to the analytically derived |$0.5$| than the rate observed for the smooth example 1.

Data for Example 2. (a) Time function $\varphi (t)$. (b) Weight function ${\textbf{w}}$. (c) Satisfaction of the state constraint.
Fig. 2.

Data for Example 2. (a) Time function |$\varphi (t)$|⁠. (b) Weight function |${\textbf{w}}$|⁠. (c) Satisfaction of the state constraint.

Numerical observation of the error $\|\bar{\textbf{q}} - \bar{\textbf{q}}_\sigma \|_{L^{2}(I\times \varOmega )}$ for Example 2. (a) Convergence with respect to $h$ for $k=960^{-1}$. (b) Convergence with respect to $k$ for $h=2^{-7}$.
Fig. 3.

Numerical observation of the error |$\|\bar{\textbf{q}} - \bar{\textbf{q}}_\sigma \|_{L^{2}(I\times \varOmega )}$| for Example 2. (a) Convergence with respect to |$h$| for |$k=960^{-1}$|⁠. (b) Convergence with respect to |$k$| for |$h=2^{-7}$|⁠.

8.4 Example 3

In the previous examples, no control constraints were present. To highlight that the derived error estimates are indeed not influenced by the control constraints, we augment the Example of Section 8.2 by a control constraint |$\bar{\textbf{q}} \le 200\cdot \mathbb{1}$|⁠. The remaining choices of |$\varOmega , {\textbf{w}}, {\textbf{u}}_{d}, \alpha $| and |$\beta $| are kept the same. Due to the presence of the control constraint, an analytic solution |$\bar{\textbf{q}}$| to the optimal control problem is not known, and we again compare to a fine-grid solution, computed with |$k=960^{-1}$| and |$h=2^{-7}$|⁠. The numerically observed orders of convergence in |$k$| and |$h$|⁠, as well as the active set of the control constraint for a fixed point in time, can be observed in Fig. 4. It has to be noted that while the discrete problem has a similar structure to the one without control constraint, and can be solved by the same PDAS algorithm, the resulting saddle point systems can be much larger for large active sets. Efficient preconditioners for such problems are needed, to study the performance of this algorithm for finer discretizations.

Left: orders of convergence for an example with control constraints. Right: active sets of the discrete optimal control for $h=2^{-5}$, $k=30^{-1}$ at $t=1/3$. (grey: inactive, blue: constraint in $x_{1}$ direction active, red: constraint in $x_{2}$ direction active).
Fig. 4.

Left: orders of convergence for an example with control constraints. Right: active sets of the discrete optimal control for |$h=2^{-5}$|⁠, |$k=30^{-1}$| at |$t=1/3$|⁠. (grey: inactive, blue: constraint in |$x_{1}$| direction active, red: constraint in |$x_{2}$| direction active).

References

Appell
,
J.
,
Banas
,
J.
&
Merentes Díaz
,
N. J.
(
2014
)
Bounded Variation and Around
. Series in Nonlinear Analysis and Applications, vol.
17
.
Berlin, Boston: De Gruyter
.

Attouch
,
H.
,
Buttazzo
,
G.
&
Michaille
,
G.
(
2014
)
Variational Analysis in Sobolev and BV Spaces: Applications to PDEs and Optimization, no. 17 in MOS-SIAM Series on Optimization
, 2nd edn.
Philadelphia
:
Society for Industrial and Applied Mathematics
.

Auscher
,
P.
,
Badr
,
N.
,
Haller-Dintelmann
,
R.
&
Rehberg
,
J.
(
2015
)
The square root problem for second-order, divergence form operators with mixed boundary conditions on L p
.
J. Evol. Equ.
,
15
,
165
208
.

Behringer
,
N.
,
Vexler
,
B.
&
Leykekhman
,
D.
(
2023
)
Fully discrete best-approximation-type estimates in|$L^\infty (I;L^2(\varOmega )^d$|⁠) for finite element discretizations of the transient Stokes equations
.
IMA J. Numer. Anal.
,
43
,
852
880
.

Boffi
,
D.
,
Brezzi
,
F.
&
Fortin
,
M.
(
2013
)
Mixed Finite Element Methods and Applications
. Springer Series in Computational Mathematics, vol.
44
.
Berlin, Heidelberg
:
Springer
.

Boyer
,
F.
&
Fabrie
,
P.
(
2013
)
Mathematical Tools for the Study of the Incompressible Navier-Stokes Equations and Related Models
. Applied Mathematical Sciences, vol.
183
.
New York, NY
:
Springer
.

Casas
,
E.
(
1993
)
Boundary control of semilinear elliptic equations with pointwise state constraints
.
SIAM J. Control Optim.
,
31
,
993
1006
.

Christof
,
C.
&
Vexler
,
B.
(
2021
)
New regularity results and finite element error estimates for a class of parabolic optimal control problems with pointwise state constraints
.
ESAIM: Control Optim. Calc. Var.
,
27
,
4
.

Chrysafinos
,
K.
&
Walkington
,
N. J.
(
2010
)
Discontinuous Galerkin approximations of the Stokes and Navier-Stokes equations
.
Math. Comp.
,
79
,
2135
2167
.

Dauge
,
M.
(
1989
)
Stationary Stokes and Navier–Stokes systems on two- or three-dimensional domains with corners. Part I. Linearized equations
.
SIAM J. Math. Anal.
,
20
,
74
97
.

De Los Reyes
,
J.
&
Griesse
,
R.
(
2008
)
State-constrained optimal control of the three-dimensional stationary Navier–Stokes equations
.
J. Math. Anal. Appl.
,
343
,
257
272
.

De Los Reyes
,
J. C.
&
Kunisch
,
K.
(
2006
)
A semi-smooth Newton method for regularized state-constrained optimal control of the Navier-Stokes equations
.
Computing
,
78
,
287
309
.

De Los Reyes
,
J. C.
,
Meyer
,
C.
&
Vexler
,
B.
(
2008
)
Finite element error analysis for state-constrained optimal control of the Stokes equations
.
Control Cybernet.
,
37
,
251
284
.

De Los Reyes
,
J. C.
&
Yousept
,
I.
(
2009
)
Regularized state-constrained boundary optimal control of the Navier–Stokes equations
.
J. Math. Anal. Appl.
,
356
,
257
279
.

Deckelnick
,
K.
&
Hinze
,
M.
(
2011
)
Variational discretization of parabolic control problems in the presence of pointwise state constraints
.
J. Comput. Math.
,
29
,
1
15
.

Ern
,
A.
&
Guermond
,
J.-L.
(
2021
)
Finite Elements II: Galerkin Approximation, Elliptic and Mixed PDEs
.Texts in Applied Mathematics, vol.
73
.
Cham
:
Springer International Publishing
.

Fattorini
,
H. O.
&
Sritharan
,
S. S.
(
1998
)
Optimal control problems with state constraints in fluid mechanics and combustion
.
Appl. Math. Optim.
,
38
,
159
192
.

Girault
,
V.
&
Raviart
,
P.-A.
(
1986
)
Finite Element Methods for Navier-Stokes Equations
.Springer Series in Computational Mathematics, vol.
5
.
Berlin, Heidelberg
:
Springer
.

Gong
,
W.
&
Hinze
,
M.
(
2013
)
Error estimates for parabolic optimal control problems with control and state constraints
.
Comput. Optim. Appl.
,
56
,
131
151
.

Heida
,
M.
,
Patterson
,
R. I. A.
&
Renger
,
D. R. M.
(
2019
)
Topologies and measures on the space of functions of bounded variation taking values in a Banach or metric space
.
J. Evol. Equ.
,
19
,
111
152
.

Hinze
,
M.
(
2005
)
A variational discretization concept in control constrained optimization: the linear-quadratic case
.
Comput. Optim. Appl.
,
30
,
45
61
.

Hinze
,
M.
,
Pinnau
,
R.
,
Ulbrich
,
M.
&
Ulbrich
,
S.
(eds) (
2009
)
Optimization with PDE Constraints, no. 23 in Mathematical Modelling: Theory and Applications
.
New York
:
Springer
.

Hytönen
,
T.
,
Van Neerven
,
J.
,
Veraar
,
M.
&
Weis
,
L.
(
2016
)
Bochner Spaces, in Analysis in Banach Spaces
.
Cham
:
Springer International Publishing
, pp.
1
66
.

John
,
C.
&
Wachsmuth
,
D.
(
2009
)
Optimal Dirichlet boundary control of stationary Navier–Stokes equations with state constraint
.
Numer. Funct. Anal. Optim.
,
30
,
1309
1338
.

John
,
V.
(
2016
)
Finite Element Methods for Incompressible Flow Problems
. Springer Series in Computational Mathematics, vol.
51
.
Cham
:
Springer International Publishing
.

Kellogg
,
R.
&
Osborn
,
J.
(
1976
)
A regularity result for the Stokes problem in a convex polygon
.
J. Funct. Anal.
,
21
,
397
431
.

Kinderlehrer
,
D.
&
Stampacchia
,
G.
(
2000
)
An Introduction to Variational Inequalities and Their Applications, no. 31 in Classics in Applied Mathematics
.
Philadelphia, PA
:
Society for Industrial and Applied Mathematics
 
unabridged republication of the 1980 text ed.

Leykekhman
,
D.
&
Vexler
,
B.
(
2024
)
L|$^2$|(I;H|$^1$|(⁠|$\varOmega $|⁠)) and L|$^2$|(I;L|$^2$|(⁠|$\varOmega $|⁠)), best approximation type error estimates for Galerkin solutions of transient Stokes problems
.
Calcolo
,
61
,
7
.

Liu
,
H.
(
2010
)
Optimal control problems with state constraint governed by Navier–Stokes equations
.
Nonlinear Anal. Theory Methods Appl.
,
73
,
3924
3939
.

Logg
,
A.
,
Mardal
,
K.
,
Wells
,
G. N.
, et al. (
2012
)
Automated Solution of Differential Equations by the Finite Element Method
.
Berlin, Heidelberg
:
Springer
.

Ludovici
,
F.
,
Neitzel
,
I.
&
Wollner
,
W.
(
2018
)
A priori error estimates for state-constrained semilinear parabolic optimal control problems
.
J. Optim. Theory Appl.
,
178
,
317
348
.

Ludovici
,
F.
&
Wollner
,
W.
(
2015
)
A priori error estimates for a finite element discretization of parabolic optimization problems with pointwise constraints in time on mean values of the gradient of the state
.
SIAM J. Control Optim.
,
53
,
745
770
.

Meidner
,
D.
,
Rannacher
,
R.
&
Vexler
,
B.
(
2011
)
A priori error estimates for finite element discretizations of parabolic optimization problems with pointwise state constraints in time
.
SIAM J. Control Optim.
,
49
,
1961
1997
.

Simon
,
J.
(
1986
)
Compact sets in the space L|$^p$|(0,T;B)
.
Ann. Mat. Pura Appl.
,
146
,
65
96
.

Sohr
,
H.
(
2001
)
The Navier-Stokes Equations
.
Basel
:
Springer Basel
.

Temam
,
R.
(
1977
)
Navier-Stokes Equations. Theory and Numerical Analysis
, Studies in Mathematics and Its Applications, vol.
2
.
Amsterdam-New York-Oxford
:
North-Holland Publishing Co
.

The Sage Developers
(
2022
)
SageMath, the Sage Mathematics Software System (Version 9.5)
 .

Vexler
,
B.
&
Wagner
,
J.
(
2024
)
Error estimates for finite element discretizations of the instationary Navier–Stokes equations
.
ESAIM Math. Model. Numer. Anal.
,
58
,
457
488
.

Wang
,
G.
(
2002a
)
Optimal controls of 3-dimensional Navier-Stokes equations with state constraints
.
SIAM J. Control Optim.
,
41
,
583
606
.

Wang
,
G.
(
2002b
)
Pontryagin maximum principle of optimal control governed by fluid dynamic systems with two point boundary state constraint
.
Nonlinear Anal.
,
51
,
509
536
.

Wang
,
G.
&
Wang
,
L.
(
2003
)
Maximum principle of state-constrained optimal control governed by fluid dynamic systems
.
Nonlinear Anal.
,
52
,
1911
1931
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.