Exact Computable Representation of Some Second-Order Cone Constrained Quadratic Programming Problems

Jin, Qingwei; Tian, Ye; Deng, Zhibin; Fang, Shu-Cherng; Xing, Wenxun

doi:10.1007/s40305-013-0009-8

Regular Paper
Published: 20 March 2013

Volume 1, pages 107–134, (2013)
Journal of the Operations Research Society of China

Abstract

Solving the quadratically constrained quadratic programming (QCQP) problem is in general NP-hard. Only a few subclasses of the QCQP problem are known to be polynomial-time solvable. Recently, the QCQP problem with a nonconvex quadratic objective function over one ball and two parallel linear constraints was proven to have an exact computable representation, which reformulates the original problem as a linear semidefinite program with additional linear and second-order cone constraints. In this paper, we provide exact computable representations for further subclasses of the QCQP problem, in particular, the subclass with one second-order cone constraint and two special linear constraints.

1 Introduction

The quadratically constrained quadratic programming (QCQP) problem can be expressed as

$$ \begin{array}{l} \inf\ x^TA_0x+2b_0^Tx+c_0\\[4pt] \mathrm{s.t.}\quad x\in\mathcal{F} \end{array} \quad \textrm{(QCQP)} $$

(1)

where the feasible domain $\mathcal{F}\triangleq\{x\in\mathbb {R}^{n}|x^{T}A_{i}x+2b_{i}^{T}x+c_{i}\leqslant 0,\ i=1,\cdots,m_{1},\allowbreak x^{T}A_{j}x+2b_{j}^{T}x+c_{j}= 0,\ j=m_{1}+1,\cdots,m_{1}+m_{2}\}$ with $A_{i}, A_{j}\in\mathcal{S}^{n}$, the space of real symmetric matrices of order $n$, $b_{i},b_{j}\in\mathbb{R}^{n}$, the $n$-dimensional real space, and $c_{i},c_{j}\in\mathbb{R}$, for $i=0,1,\cdots,m_{1}$ and $j=m_{1}+1,\cdots,m_{1}+m_{2}$. This problem has been extensively studied and proven to be NP-hard even if all of the constraints are linear (Ref. [10]). The convex QCQP problem can be reformulated as a linear second-order cone programming problem and then solved in polynomial time using interior point methods (Ref. [9]). For the nonconvex QCQP problem, only some subclasses are known to be computable. Here, “computable” means that a problem can be solved to arbitrary precision in polynomial time. In the literature, linear constraints, second-order cone constraints and semidefinite constraints are commonly used to construct an equivalent representation of a given QCQP problem. When the equivalent problem is polynomial-time solvable and the size of the representation is polynomial in the size of the original problem, we say it is a “computable representation.” Computable representations of QCQP with $\mathcal{F}$ being defined by one nonconvex quadratic inequality constraint, or by one strictly convex/concave quadratic equality constraint, or by one convex quadratic inequality and one linear inequality can be found in Sturm and Zhang [13]. Moreover, the computable representation in [13] also works for the QCQP with $\mathcal{F}$ being defined by two convex quadratic inequality constraints sharing the same Hessian matrix. Kim and Kojima [7] proposed a semidefinite representation and a second-order cone representation for QCQP problems whose matrix formulations have coefficients that are uniformly almost OD-nonpositive. (A real symmetric matrix is OD-nonpositive if its off-diagonal elements are nonpositive.) Furthermore, Ye and Zhang [14] provided a semidefinite representation for three subclasses of the QCQP problem with two quadratic constraints: (i) one of the two constraints in the SDP relaxation is not binding, (ii) the two constraints and the objective function are all in the homogeneous form, and (iii) one is an elliptic constraint and the other is a linear complementarity constraint. Recently, Burer and Anstreicher [2] showed an exact computable representation of QCQP with one elliptic constraint and two parallel linear constraints. However, the computable representation of QCQP problems with two binding elliptic constraints or one second-order cone constraint is still unknown. (Note that having a second-order cone constraint is equivalent to having one quadratic constraint and one linear constraint, not merely one quadratic constraint.)

In this paper, we will show computable representations of QCQP problems with the following feasible domains:

$\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times\mathbb {R}^{n_{2}}~|~\|x\|\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y, a_{1}+a_{2}^{T}x+a_{3}^{T}y\geqslant a_{4}\geqslant 0\}$ with $a_{1}\in\mathbb{R}$, $a_{2}\in\mathbb{R}^{n_{1}}$, $a_{3}\in\mathbb {R}^{n_{2}}$ and $a_{4}\geqslant 0$.
$\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times\mathbb {R}^{n_{2}}~|~\|x\|\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y, a_{5}\geqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y\geqslant a_{4}\geqslant 0\}$ with $a_{1}\in\mathbb{R}$, $a_{2}\in\mathbb{R}^{n_{1}}$, $a_{3}\in \mathbb{R}^{n_{2}}$ and $a_{5}>a_{4}\geqslant 0$.
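
As a reading aid, the two feasible domains can be turned into a small membership test. The sketch below is only an illustration: the function name `in_domain` and the sample data are hypothetical, and passing `a5=None` is an assumed convention for the first domain, which has no upper bound.

```python
import math

def in_domain(x, y, a1, a2, a3, a4, a5=None):
    """Check ||x|| <= s and a4 <= s (and s <= a5 if given), with s = a1 + a2.x + a3.y."""
    s = a1 + sum(p * q for p, q in zip(a2, x)) + sum(p * q for p, q in zip(a3, y))
    norm_x = math.sqrt(sum(v * v for v in x))
    if norm_x > s or s < a4:
        return False
    # The upper bound a5 only applies to the second domain.
    return a5 is None or s <= a5

# Hypothetical data with n1 = 2, n2 = 1 and s = 1 + y.
a1, a2, a3, a4 = 1.0, [0.0, 0.0], [1.0], 0.5
print(in_domain([3.0, 4.0], [4.0], a1, a2, a3, a4))        # ||x|| = 5 and s = 5
print(in_domain([3.0, 4.0], [4.0], a1, a2, a3, a4, 4.0))   # s = 5 exceeds a5 = 4
```

The first call tests the domain without an upper bound; the second adds the bound $a_{5}$ and rejects the same point.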

The above feasible domains generalize the ball constraint and the second-order cone constraint. As a corollary, one can obtain the computable representation of the widely used second-order cone constraint $c^{T}x+d\geqslant\|Ax+b\|$ with $l\leqslant c^{T}x+d\leqslant u$, in which $c\in \mathbb{R}^{n}, d, l, u\in\mathbb{R}, A\in\mathbb{R}^{m\times n}$ and $b\in\mathbb{R}^{m}$. In particular, when $\mathcal{F}=\{(x_{0},x)\in\mathbb{R}\times\mathbb{R}^{n}| \|x\|\leqslant x_{0}\} $, the computable representation derived in this paper answers the question in Proposition 8.7 of [3]. Another motivation is that, in [4], a QCQP problem is reformulated as a QCQP problem over an intersection of several second-order cones or several semidefinite constraints. However, computable representations for such problems are not known. This paper takes a first step toward handling such problems by treating the case of one second-order cone constraint.

Another advantage of our approach is the use of the second-order cone in the linear conic relaxation. In most of the literature, given a quadratic constraint, only the straightforward SDP relaxation is used. For example, a second-order cone constraint is relaxed to

where $\mathcal{S}_{+}^{n+2}$ is the set of positive semidefinite matrices of order $n+2$ and $M_{1}\bullet M_{2}$ is defined as $\mathrm {tr}(M_{1}^{T}M_{2})$, the trace of $M_{1}^{T}M_{2}$. This formulation is only a relaxation. By adding an additional constraint $y\in\mathcal{SOC}(n)$, with $\mathcal{SOC}(n)\triangleq\{(x_{0},x)\in\mathbb{R}\times\mathbb {R}^{n}| \|x\|\leqslant x_{0}\}$, a tight representation can be obtained. This advantage has been observed by several authors recently (see [2, 5, 8, 13, 14]). The new results in this paper build on that observation, and we suggest that more attention be paid to the second-order cone constraint when constructing a linear conic relaxation.

In our derivation of computable representations, we adopt the concepts of copositive cone and cone of nonnegative quadratic functions which have been extensively used in recent studies. In [13], given a nonempty set $\mathcal{F}\subset\mathbb {R}^{n}$, the copositive cone over $\mathcal{F}$ is defined by

$$ \mathcal{HD}_{\mathcal{F}}\triangleq\bigl\{M\in\mathcal{S}^n| x^TMx\geqslant 0, \forall x\in\mathcal{F}\bigr\}. $$

(2)

Its dual cone is

$$ \mathcal{HD}^*_{\mathcal{F}} = \textrm{cl~cone}\bigl\{xx^T\in \mathcal {S}^n|x\in\mathcal{F} \bigr\}, $$

(3)

where “cl” means the closure and “cone” stands for the conic hull of a set (the smallest convex cone containing the given set). The cone of nonnegative quadratic functions over $\mathcal{F}$ is defined by

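$$ \mathcal{D}_{\mathcal{F}}\triangleq \left \{ \left [ \begin{array}{c@{\quad}c} c & b^T\\ b & A \end{array} \right ] \in\mathcal{S}^{n+1} \biggm{|} x^TAx+2b^Tx+c\geqslant 0,\ \forall x\in\mathcal{F} \right \}. $$

(4)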

Its dual cone has the formulation of

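$$ \mathcal{D}_{\mathcal{F}}^* = \textrm{cl~cone} \left \{ \left [ \begin{array}{c@{\quad}c} 1 & x^T\\ x & xx^T \end{array} \right ] \in\mathcal{S}^{n+1} \biggm{|} x\in\mathcal{F} \right \}. $$

(5)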

The above four cones are all closed convex cones. They are related through the following set:

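$$ \mathcal{H}_{\mathcal{F}}\triangleq \mathrm{cl} \left \{ t{ 1\brack x }\in\mathbb{R}^{n+1} \biggm{|} x\in\mathcal{F},\ t\geqslant 0 \right \}. $$

(6)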

Sturm and Zhang [13] proved that

$$ \mathcal{D}_{\mathcal{F}}=\mathcal{HD}_{\mathcal{H}_{\mathcal{F}}}\quad \mathrm{and}\quad \mathcal{D}_{\mathcal{F}}^*=\mathcal{HD}_{\mathcal{H}_{\mathcal{F}}}^*. $$

(7)

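They also showed that the QCQP problem has the same optimal value as the following linear conic programming problem:

$$ \begin{array}{l} \inf\ \left [ \begin{array}{c@{\quad}c} c_0 & b_0^T\\ b_0& A_0 \end{array} \right ] \bullet Y\\[10pt] \mathrm{s.t.}\quad Y_{11}=1\\[4pt] \hphantom{\mathrm{s.t.}\quad} Y\in\mathcal{D}_{\mathcal{F}}^{*} \end{array} \quad \textrm{(LCoP)} $$

(8)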

and that of its dual

$$ \begin{array}{l} \sup\ \sigma\\ \mathrm{s.t.}\quad \left [ \begin{array}{c@{\quad}c} c_0 & b_0^T\\ b_0& A_0 \end{array} \right ] - \left [ \begin{array}{c@{\quad}c} \sigma& 0\\ 0 & 0 \end{array} \right ] \in\mathcal{D}_{\mathcal{F}} \\[10pt] \hphantom{\mathrm{s.t.}\quad} \sigma\in\mathbb{R} \end{array} \quad \textrm{(LCoD)} $$

(9)
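
The linear conic reformulation rests on the identity $x^TA_0x+2b_0^Tx+c_0 = M_0\bullet Z(x)$, where $M_0$ collects $c_0$, $b_0$, $A_0$ in block form and $Z(x)$ is the rank-one matrix built from $(1,x)$. A minimal numerical check of this identity is sketched below; the helper names and the sample data are illustrative assumptions, not part of the paper.

```python
def lift(x):
    """Return [[1, x^T], [x, x x^T]] as a nested list."""
    h = [1.0] + list(x)
    return [[hi * hj for hj in h] for hi in h]

def coeff_matrix(A, b, c):
    """Return [[c, b^T], [b, A]] for the quadratic x^T A x + 2 b^T x + c."""
    return [[c] + list(b)] + [[b[i]] + list(A[i]) for i in range(len(b))]

def inner(M1, M2):
    """Frobenius inner product tr(M1^T M2)."""
    return sum(p * q for r1, r2 in zip(M1, M2) for p, q in zip(r1, r2))

def quad(A, b, c, x):
    """Evaluate x^T A x + 2 b^T x + c directly."""
    n = len(x)
    xAx = sum(x[i] * A[i][j] * x[j] for i in range(n) for j in range(n))
    return xAx + 2.0 * sum(b[i] * x[i] for i in range(n)) + c

# Hypothetical nonconvex data (A0 is indefinite).
A0 = [[1.0, 2.0], [2.0, -3.0]]
b0 = [0.5, -1.0]
c0 = 4.0
x = [2.0, -1.0]
direct = quad(A0, b0, c0, x)
lifted = inner(coeff_matrix(A0, b0, c0), lift(x))
print(direct, lifted)  # both evaluate the same quadratic
```

Because the objective becomes linear in the lifted matrix, all of the nonconvexity is moved into the cone that the lifted matrix must belong to.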

Burer’s copositive representation [1] worked on formulating the set $\mathcal{D}_{\mathcal{F}}^{*}\cap\{Y|Y_{11}=1\}$ with $\mathcal{F}=\{x\in\mathbb{R}^{n}|Ax=b, x\in\{0,1\}^{n}\}$ under a key assumption of

$$x\in\bigl\{y\in\mathbb{R}^n|Ay=b,y\geqslant 0\bigr\} \quad \Longrightarrow\quad x\in \bigl\{y\in\mathbb {R}^n|0\leqslant y\leqslant 1 \bigr\}. $$

Burer [3] and Eichfelder and Povh [5] further extended the results to the case that $\mathcal{F}=\{x|Ax=b, x\in\mathcal{K}\}$ with $\mathcal{K}$ being a closed convex cone. Their results can be used to construct the corresponding $\mathcal{D}_{\mathcal{F}}^{*}$. Based on [3] and [5], Burer and Dong [4] used the cone of nonnegative quadratic functions over the Cartesian product of several second-order cone constraints to represent some QCQP problems, which has been mentioned before.

In the rest of the paper, some commonly used notation and properties of the cone of nonnegative quadratic functions are given in Sect. 2. An exact computable representation of the QCQP problem with one second-order cone constraint and two special linear constraints is provided in Sect. 3. Some concluding remarks follow in Sect. 4.

2 Notations and Properties

Given a nonempty set $\mathcal{F}\subseteq\mathbb{R}^{n}$, the cones $\mathcal{D}_{\mathcal{F}}$, $\mathcal{D}_{\mathcal{F}}^{*}$, $\mathcal {HD}_{\mathcal{F}}$, $\mathcal{HD}_{\mathcal{F}}^{*}$, and the set $\mathcal{H}_{\mathcal{F}}$ are respectively defined by (2)–(6). In this section, we first study the properties of these cones and then provide some useful tools for the proofs in Sect. 3.

2.1 Properties of $\mathcal{D}_{\mathcal{F}}$, $\mathcal {D}_{\mathcal{F}}^{*}$, $\mathcal{HD}_{\mathcal{F}}$ and $\mathcal {HD}_{\mathcal{F}}^{*}$

From [13], we have the next property.

Lemma 1

[13]

Given a nonempty set $\mathcal{F}\subseteq\mathbb{R}^{n}$, we have the following facts: (i) $\mathcal{H}_{\mathcal{F}}$ is a closed cone; (ii) $\mathcal{D}_{\mathcal{F}} = \mathcal{HD}_{\mathcal{H}_{\mathcal{F}}}$; (iii) $\mathcal{D}_{\mathcal{F}}^{*}=\mathcal{HD}_{\mathcal{H}_{\mathcal{F}}}^{*} $; (iv) $\mathcal{D}_{\mathcal{F}}$ and $\mathcal{D}_{\mathcal{F}}^{*}$ are dual to each other.

The closure operator in the definition of $\mathcal{HD}_{\mathcal {F}}^{*}$ and $\mathcal{D}_{\mathcal{F}}^{*}$ is not desirable since it may be difficult to handle in an optimization problem. In some cases, the closedness requirement is automatically fulfilled without applying the closure operator. The next two lemmas provide necessary and sufficient conditions for omitting the closure operator from the definitions of $\mathcal {HD}^{*}_{\mathcal{F}}$ and $\mathcal{D}_{\mathcal{F}}^{*}$, respectively.

Lemma 2

Given a nonempty set $\mathcal{F}\subseteq\mathbb{R}^{n}$, $\mathcal {HD}^{*}_{\mathcal{F}} = \mathrm{cone}\{xx^{T}\in\mathcal{S}^{n}|x\in\mathrm {cl}\mathcal{F}\}$ if and only if $\mathrm{cl}\{tx\in\mathbb{R}^{n}|x\in \mathcal{F}, t\geqslant 0\}=\{tx\in\mathbb{R}^{n}|x\in\mathrm{cl}\mathcal{F}, t\geqslant 0\}$.

Proof

It is clear that $\mathrm{cone}\{xx^{T}\in\mathcal{S}^{n}|x\in\mathrm{cl}\mathcal{F}\} \subseteq\mathcal{HD}^{*}_{\mathcal{F}}$ and $\{tx\in\mathbb{R}^{n}|x\in\mathrm{cl}\mathcal{F},\allowbreak t\geqslant \nobreak 0\} \subseteq \mathrm{cl}\{tx\in\mathbb{R}^{n}|x\in\mathcal{F}, t\geqslant 0\}$.

[“Only if” part] If $\mathcal{HD}^{*}_{\mathcal{F}} = \mathrm {cone}\{xx^{T}\in\mathcal{S}^{n}|x\in\mathrm{cl}\mathcal{F}\}$, then, for any $y\in\mathrm{cl}\{tx\in\mathbb{R}^{n}|x\in\mathcal{F}, t\geqslant 0\}$, we have $y=\lim_{i\rightarrow+\infty} x^{i}$ where $x^{i}\in\{tx\in \mathbb{R}^{n}|x\in\mathcal{F}, t\geqslant 0\}$. Define $Y=yy^{T}$ and $X^{i}=x^{i}(x^{i})^{T}$. Then $Y=\lim _{i\rightarrow+\infty} X^{i}\in\mathcal{HD}^{*}_{\mathcal{F}}$. From our assumption, $Y \in\mathrm{cone}\{xx^{T}\in\mathcal{S}^{n}|x\in\mathrm {cl}\mathcal{F}\}$ and the rank of $Y$ is at most 1. Therefore, $Y=\lambda \bar{x}\bar{x}^{T}$ for some $\lambda\geqslant 0$ and $\bar{x}\in\mathrm {cl}\mathcal{F}$. This means that $y=\lambda^{1\over2}\bar{x}\in\{ tx\in\mathbb{R}^{n}|x\in\mathrm{cl}\mathcal{F}, t\geqslant 0\}$. Hence $\mathrm {cl}\{tx\in\mathbb{R}^{n}|x\in\mathcal{F}, t\geqslant 0\}=\{tx\in\mathbb {R}^{n}|x\in\mathrm{cl}\mathcal{F}, t\geqslant 0\}$.

[“If” part] If $\mathrm{cl}\{tx\in\mathbb{R}^{n}|x\in\mathcal {F}, t\geqslant 0\}=\{tx\in\mathbb{R}^{n}|x\in\mathrm{cl}\mathcal{F}, t\geqslant 0\}$, then, for any $Y\in\mathcal{HD}^{*}_{\mathcal{F}}$, we have $Y=\lim_{i\rightarrow+\infty} Y^{i}$ where $Y^{i}\in\mathrm{cone}\{xx^{T}\in \mathcal{S}^{n}| x\in\mathcal{F}\}$ for all $i$. Notice that each $Y^{i}$ can be decomposed as $Y^{i}=\sum_{j=1}^{r_{i}} (\lambda^{i}_{j}x^{ij})(\lambda ^{i}_{j}x^{ij})^{T}$ with $r_{i}\leqslant {n(n+1)\over2}$, $\lambda^{i}_{j}\geqslant 0$ and $x^{ij}\in\mathcal{F}$, for all $i,j$. Let $X^{i}\in\mathbb{R}^{n\times {n(n+1)\over2} }$ be defined such that the first $r_{i}$ columns of $X^{i}$ are formed by $(\lambda^{i}_{j}x^{ij})$, $j=1,\cdots,r_{i}$, and the remaining columns are all zeros. Since $Y=\lim_{i\rightarrow+\infty} Y^{i}$ and $Y^{i}=X^{i}(X^{i})^{T}$, we have $\lim_{i\rightarrow+\infty}(X^{i}\bullet X^{i})=\lim_{i\rightarrow+\infty}\mathrm{tr}(Y^{i})=\mathrm{tr}(Y)$. Therefore, $\{X^{i}\}$ is a bounded sequence in $\mathbb {R}^{n\times{n(n+1)\over2}}$ and there exists $\bar{X}$ which is the limit of a subsequence of $\{X^{i}\}$. Hence $Y=\bar{X}\bar{X}^{T}$. Notice that each column of $\bar{X}$ is an element of $\mathrm{cl}\{tx\in \mathbb{R}^{n}|x\in\mathcal{F}, t\geqslant 0\}$. From $\mathrm{cl}\{tx\in\mathbb {R}^{n}|x\in\mathcal{F}, t\geqslant 0\}=\{tx\in\mathbb{R}^{n}|x\in\mathrm {cl}\mathcal{F}, t\geqslant 0\}$, each nonzero column of $\bar{X}$ can be written as $\lambda_{j}x^{j}$ with $\lambda_{j}\geqslant 0$ and $x^{j}\in\mathrm{cl}\mathcal{F}$. Consequently, $Y=\bar{X}\bar {X}^{T}\in\mathrm{cone}\{xx^{T}\in\mathcal{S}^{n}|x\in\mathrm{cl}\mathcal {F}\}$ and $\mathcal{HD}^{*}_{\mathcal{F}} = \mathrm{cone}\{xx^{T}\in \mathcal{S}^{n}|x\in\mathrm{cl}\mathcal{F}\}$. □

Remark 1

Given a set $\mathcal{F}\subseteq\mathbb{R}^{n}$, since $\mathcal{H}_{\mathcal{F}}$ is a closed cone, we have $\mathcal {D}_{\mathcal{F}}^{*}=\mathcal{HD}_{\mathcal{H}_{\mathcal{F}}}^{*} = \mathrm {cone}\{yy^{T}\in\mathcal{S}^{n+1}| y\in\mathcal{H}_{\mathcal{F}}\} =\mathrm{conv}\{yy^{T}\in\mathcal{S}^{n+1}| y\in\mathcal{H}_{\mathcal {F}}\}=\{\sum_{i} y^{i}(y^{i})^{T}\in\mathcal{S}^{n+1}| y^{i}\in\mathcal {H}_{\mathcal{F}}\}$. Therefore, showing $M\in\mathcal{D}_{\mathcal {F}}^{*}$ is equivalent to showing $M=\sum_{i} y^{i}(y^{i})^{T}$ for some $y^{i}\in \mathcal{H}_{\mathcal{F}}$.

Remark 2

It was noticed in [6] that Lemma 1 of [13] does not always hold. Here we provide a necessary and sufficient condition for that Lemma. One may also check that Lemma 4 and Corollary 5 of [6] can be derived from our Lemma 2.

Lemma 3

Given a nonempty set $\mathcal{F}\subseteq\mathbb{R}^{n}$, $\mathcal {D}_{\mathcal{F}}^{*}= \textrm{cone}\{{ 1\ \ x^{T} \brack x\ xx^{T} }\in\mathcal{S}^{n+1} | x\in\mathrm{cl}\mathcal{F}\}$ if and only if $\mathcal{F}$ is a bounded set.

Proof

Since $\mathcal{D}_{\mathcal{F}}^{*}=\mathcal{HD}_{\mathcal{H}_{\mathcal {F}}}^{*}$ and $\mathcal{H}_{\mathcal{F}}= \mathrm{cl}\{t{1 \brack x}|x\in\mathcal{F}, t\geqslant 0\}$, we only need to prove that $\mathcal{H}_{\mathcal{F}}=\{{t\brack x}\in \mathbb{R}^{n+1}| x/t \in\mathrm{cl}\mathcal{F}, t>0\}\cup\{0\}$ if and only if $\mathcal{F}$ is bounded. Obviously, $\{{t\brack x}\in\mathbb{R}^{n+1}| x/t \in\mathrm {cl}\mathcal{F}, t>0\}\cup\{0\}\subseteq\mathcal{H}_{\mathcal{F}}$.

[“If” part] When $\mathcal{F}$ is bounded, for any $y={ t\brack x }\in\mathcal{H}_{\mathcal{F}}$, we have $y=\lim_{i\rightarrow+\infty} y^{i}$ where $y^{i}={ t^{i}\brack x^{i} }$ with $t^{i}>0$ and ${x^{i}\over t^{i}}\in\mathcal{F}$. (i) If $t=0$, then $\lim_{i\rightarrow+\infty} t^{i}=0$. Since $\mathcal{F}$ is bounded, the sequence $\{{x^{i}\over t^{i}}\}$ is bounded. Therefore, $x=\lim_{i\rightarrow+\infty}t^{i} {x^{i}\over t^{i}} = 0$, i.e., $y=0$. (ii) If $t>0$, then, since $\{{x^{i}\over t^{i}}\}$ is bounded, there exists a $z\in\mathrm{cl}\mathcal{F}$ being the limit of a subsequence of $\{{x^{i}\over t^{i}}\}$. Hence $x=\lim_{i\rightarrow +\infty}t^{i}{x^{i}\over t^{i}} = tz$, i.e., $y\in\{{t\brack x}\in\mathbb {R}^{n+1}| x/t \in\mathrm{cl}\mathcal{F}, t>0\}$. Therefore, $\mathcal {H}_{\mathcal{F}}= \{{ t\brack x }\in\mathbb{R}^{n+1}| x/t \in\mathrm{cl}\mathcal{F}, t>0 \}\cup \{0\}$.

[“Only if” part] If $\mathcal{F}$ is unbounded, then there exists a sequence $\{z^{i}\}$ in $\mathcal{F}$ such that $\lim_{i\rightarrow+\infty}\|z^{i}\|=+\infty$. Without loss of generality, we may assume that none of these vectors is zero. Since the surface of the unit ball is closed and bounded, there exists $\bar{z}$ such that a subsequence of $\{{z^{i}\over\|z^{i}\|}\}$ converges to $\bar{z}$. We can replace $\{z^{i}\}$ by such a subsequence, i.e., we can assume that $\bar{z}=\lim _{i\rightarrow+\infty} {z^{i}\over\|z^{i}\|}\neq0$. Now define $y^{i} = { t^{i}\brack x^{i} }={ 1/\|z^{i}\|\brack z^{i}/\|z^{i}\| }$. We have $\lim_{i\rightarrow+\infty}y^{i}={ 0\brack\bar{z} }\in\mathcal{H}_{\mathcal{F}}$. However, ${ 0\brack\bar{z} }\notin \{{ t\brack x }\in\mathbb{R}^{n+1}| x/t \in\mathrm{cl}\mathcal{F}, t>0 \}\cup \{0\}$. Therefore, $\mathcal{H}_{\mathcal{F}}\neq \{{ t\brack x }\in\mathbb{R}^{n+1}| x/t \in\mathrm{cl}\mathcal{F}, t>0 \}\cup \{0\}$.

Together with Lemma 2, we have $\mathcal{D}_{\mathcal{F}}^{*}= \textrm{cone}\{{ 1\ \ x^{T} \brack x\ xx^{T} }\in\mathcal{S}^{n+1} | x\in\mathrm{cl}\mathcal{F}\}$ if and only if $\mathcal{H}_{\mathcal{F}}=\{{t\brack x}\in\mathbb{R}^{n+1}| x/t \in \mathrm{cl}\mathcal{F}, t>0\}\cup\{0\}$, which is equivalent to saying that $\mathcal{F}$ is bounded. □
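
The role of boundedness in Lemma 3 can be seen on the simplest unbounded set. The sketch below assumes the hypothetical choice $\mathcal{F}=\mathbb{R}$ (so $n=1$) and follows the construction of the “only if” part with $z^{k}=k$: the scaled points stay in the cone generated by $\{(1,x)\,|\,x\in\mathcal{F}\}$, yet they approach a nonzero point whose first coordinate is zero. Exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction

def homogenize(t, x):
    """Return t * (1, x), a point of the cone generated by {(1, x) : x in F}."""
    return (t, t * x)

# Assumed example: F = R, z_k = k in F and scaling t_k = 1/k.
points = [homogenize(Fraction(1, k), Fraction(k)) for k in (1, 10, 100, 1000)]
for t, x in points:
    # The first coordinate shrinks to 0 while the second stays equal to 1.
    print(float(t), float(x))
```

The limit $(0,1)$ belongs to the closure but cannot be written as $t(1,x)$ with $t>0$, which is exactly the obstruction the lemma describes.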

As we can see, the cone of nonnegative quadratic functions and its dual cone possess the following monotonicity properties:

Lemma 4

If $\mathcal{F}_{1}\subseteq\mathcal{F}_{2} \subseteq\mathbb{R}^{n} $, then $\mathcal{D}_{\mathcal{F}_{1}}^{*}\subseteq\mathcal{D}_{\mathcal {F}_{2}}^{*}$ and $\mathcal{D}_{\mathcal{F}_{1}}\supseteq\mathcal{D}_{\mathcal{F}_{2}}$. Moreover, for any given $\mathcal{F}\subseteq\mathbb{R}^{n}$, $\mathcal {D}_{\mathcal{F}}^{*}\subseteq\mathcal{S}_{+}^{n+1}\subseteq\mathcal {D}_{\mathcal{F}}$.

Proof

The proof follows directly from the definitions (2)–(5). □

Given a set $K$, we use $K^{*}$ to denote its dual set, which is a closed convex cone. The next lemma will be needed in the proof of Lemma 6 and in later proofs in Sect. 3.

Lemma 5

(Corollary 16.4.2 in [12])

If $K_{1},\cdots,K_{s}$ are nonempty closed convex cones in $\mathbb {R}^{n}$, then

$$\Biggl(\bigcap_{i=1}^s K_i\Biggr)^*=\mathrm{cl} \Biggl(\sum_{i=1}^sK_i^* \Biggr). $$

If there exists a point common to the relative interiors of all $K_{i}$, $i=1,\cdots,s$, then

$$\Biggl(\bigcap_{i=1}^s K_i\Biggr)^*=\Biggl(\sum _{i=1}^sK_i^*\Biggr). $$

Combining Lemma 4 and Lemma 5, we have the next result.

Lemma 6

If $\mathcal{F}=\bigcup_{i=1}^{k}\mathcal{F}_{i} \subseteq\mathbb{R}^{n}$ and each $\mathcal{F}_{i}$ is nonempty, then $\mathcal{D}_{\mathcal{F}}=\bigcap_{i=1}^{k}\mathcal{D}_{\mathcal{F}_{i}}$ and $\mathcal{D}_{\mathcal{F}}^{*}=\sum_{i=1}^{k}\mathcal{D}_{\mathcal{F}_{i}}^{*}$.

Proof

From Lemma 4, we have $\mathcal{D}_{\mathcal{F}}\subseteq \mathcal{D}_{\mathcal{F}_{i}}$ for i=1,⋯,k. Consequently, $\mathcal {D}_{\mathcal{F}}\subseteq\bigcap_{i=1}^{k}\mathcal{D}_{\mathcal{F}_{i}}$. Now if $M\in\bigcap_{i=1}^{k}\mathcal{D}_{\mathcal{F}_{i}}$, then, from $M\in \mathcal{D}_{\mathcal{F}_{i}}$, we know $M\bullet{ 1\ \ x^{T}\brack x\ xx^{T} }\geqslant 0$ for each $x\in\mathcal{F}_{i}$, which means $M\bullet{ 1\ \ x^{T}\brack x\ xx^{T} }\geqslant 0$ for all $x\in\bigcup_{i=1}^{k}\mathcal{F}_{i}=\mathcal{F}$. Therefore, $M\in\mathcal{D}_{\mathcal{F}}$ and $\mathcal{D}_{\mathcal {F}}\supseteq\bigcap_{i=1}^{k}\mathcal{D}_{\mathcal{F}_{i}}$. Consequently, $\mathcal{D}_{\mathcal{F}}=\bigcap_{i=1}^{k}\mathcal{D}_{\mathcal{F}_{i}}$.

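Notice that each $\mathcal{D}_{\mathcal{F}_{i}}$ is a closed convex cone and $\mathcal{D}_{\mathcal{F}_{i}}\supseteq\mathcal{S}^{n+1}_{+}$, $i=1,\cdots,k$. In particular, every positive definite matrix is an interior point of each $\mathcal{D}_{\mathcal{F}_{i}}$, so the relative interior condition of Lemma 5 is satisfied. From Lemma 5, we have $\sum_{i=1}^{k}\mathcal {D}_{\mathcal{F}_{i}}^{*}=(\bigcap_{i=1}^{k}\mathcal{D}_{\mathcal {F}_{i}})^{*}=(\mathcal{D}_{\mathcal{F}})^{*} =\mathcal{D}_{\mathcal{F}}^{*}$. □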
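
Lemmas 4 and 6 can be illustrated with finite sets, for which membership in $\mathcal{D}_{\mathcal{F}}$ is a finite check. The sketch below uses hypothetical data with $n=1$: the quadratic $q(x)=x^{2}-2x$ is nonnegative on $\mathcal{F}_{1}=\{0\}$ and on $\mathcal{F}_{2}=\{2\}$, hence on their union, although its coefficient matrix is not positive semidefinite. The helper names are illustrative.

```python
def q_value(M, x):
    """Evaluate (1, x) M (1, x)^T for a scalar x, i.e. the quadratic encoded by M."""
    h = (1.0, x)
    return sum(h[i] * M[i][j] * h[j] for i in range(2) for j in range(2))

def in_D(M, F):
    """Membership test for D_F when F is a finite list of scalars."""
    return all(q_value(M, x) >= 0 for x in F)

M = [[0.0, -1.0], [-1.0, 1.0]]   # encodes q(x) = x^2 - 2x
F1, F2 = [0.0], [2.0]

# M lies in D_{F1} and D_{F2}, hence in D_F for F = F1 ∪ F2.
print(in_D(M, F1), in_D(M, F2), in_D(M, F1 + F2))
# q(1) < 0 shows that M is not positive semidefinite.
print(q_value(M, 1.0))
```

This is a small instance of the strict inclusion $\mathcal{S}_{+}^{n+1}\subsetneq\mathcal{D}_{\mathcal{F}}$ that can occur when $\mathcal{F}$ is a proper subset of $\mathbb{R}^{n}$.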

When $\mathcal{F}=\mathcal{F}_{1}\times\mathbb{R}^{m}$ for some positive integer m, $\mathcal{D}_{\mathcal{F}}^{*}$ can be expressed by $\mathcal {D}_{\mathcal{F}_{1}}^{*}$ and one additional semidefinite constraint as in the next lemma.

Lemma 7

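Let $\mathcal{F}_{1}\subseteq\mathbb{R}^{n}$ be a nonempty set and let $\mathcal{F}=\mathcal{F}_{1}\times\mathbb{R}^{m}$. Then

$$ \mathcal{D}_{\mathcal{F}}^{*}= \left \{ Y= \left [ \begin{array}{c@{\quad}c} Y_1 & Y_2^T\\ Y_2 & Y_3 \end{array} \right ] \in\mathcal{S}^{1+n+m}_{+} \biggm{|} Y_1\in\mathcal{D}_{\mathcal{F}_1}^{*} \right \}. $$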

Proof

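Let

$$ \mathcal{K}\triangleq \left \{ Y= \left [ \begin{array}{c@{\quad}c} Y_1 & Y_2^T\\ Y_2 & Y_3 \end{array} \right ] \in\mathcal{S}^{1+n+m}_{+} \biggm{|} Y_1\in\mathcal{D}_{\mathcal{F}_1}^{*} \right \}. $$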

Since $\mathcal{H}_{\mathcal{F}}=\mathcal{H}_{\mathcal{F}_{1}}\times \mathbb{R}^{m}$ and

$$\mathcal{D}_{\mathcal{F}}^*= \left \{\sum_i { u^i\brack v^i } { u^i\brack v^i }^T\in\mathcal{S}^{1+n+m}_+ \biggm{|} { u^i\brack v^i }\in\mathcal{H}_{\mathcal{F}}=\mathcal{H}_{\mathcal{F}_1}\times \mathbb{R}^m \right \}, $$

we have $Y = \sum_{i} { u^{i}\brack v^{i} }{ u^{i}\brack v^{i} }^{T}={ Y_{1}\ Y_{2}^{T}\brack Y_{2}\ Y_{3} }\in\mathcal{S}^{1+n+m}_{+}$, for any $Y\in\mathcal{D}_{\mathcal {F}}^{*}$, and $Y_{1}=\sum_{i} u^{i}(u^{i})^{T}\in\mathcal{D}_{\mathcal {F}_{1}}^{*}$. Therefore, $Y\in\mathcal{K}$ and $\mathcal{D}_{\mathcal {F}}^{*}\subseteq\mathcal{K}$.

Moreover, if $Y={ Y_{1}\ Y_{2}^{T}\brack Y_{2}\ Y_{3} }\in\mathcal{K}$, then $Y\in\mathcal{S}^{1+n+m}_{+}$ and $Y_{1}\in \mathcal{D}_{\mathcal{F}_{1}}^{*}$. We can find decompositions $Y_{1}=PP^{T}=BB^{T}$, where $P\in\mathbb{R}^{(1+n)\times k}$ for some $k>0$ with each column of $P$ lying in $\mathcal{H}_{\mathcal{F}_{1}}$ and $B\in\mathbb{R}^{(1+n)\times r}$ with $r=\mathrm{rank}(Y_{1})$. Furthermore, we have $r\leqslant k$ and $P=BQ$ for some $Q\in\mathbb {R}^{r\times k}$ of full row rank. Since $Y$ is positive semidefinite, there exists $R\in\mathbb{R}^{r\times m}$ such that $Y_{2}^{T}=BR$. Hence

$$ Y= \left [ \begin{array}{c@{\quad}c} BB^T & BR\\ R^TB^T & Y_3 \end{array} \right ] = \left [ \begin{array}{c@{\quad}c} BB^T & BR\\ R^TB^T & R^TR \end{array} \right ] + \left [ \begin{array}{c@{\quad}c} 0 & 0\\ 0 & Y_3-R^TR \end{array} \right ]. $$

Notice that $Y\in\mathcal{S}^{1+n+m}_{+}$ if and only if $Y_{3}-R^{T}R\in \mathcal{S}^{m}_{+}$. (Otherwise, $\bar{z}= { -B(B^{T}B)^{-1}R\bar{v}\brack \bar{v} }\in\mathbb{R}^{ (1+n+m)}$ with $\bar{v}^{T}(Y_{3}-R^{T}R)\bar{v}<0$ disproves the positive semidefiniteness of $Y$ due to the fact that $\bar{z}^{T}Y\bar{z}=\bar{v}^{T}(Y_{3}-R^{T}R)\bar{v}<0$.) Clearly, ${ 0\ \ 0\brack 0\ \ Y_{3}-R^{T}R }\in\mathcal{D}_{\mathcal{F}}^{*}$, because $\{0\}\times\mathbb{R}^{m}\subseteq\mathcal{H}_{\mathcal{F}}$. We now prove that ${ BB^{T} \ BR\brack R^{T}B^{T}\ R^{T}R }\in\mathcal{D}_{\mathcal{F}}^{*}$. Since $BB^{T}=PP^{T}=BQQ^{T}B^{T}$ and $B$ has full column rank, we have

$$ QQ^T=I_r. $$

Let $U=R^{T}Q$, then

$$ \left [ \begin{array}{c@{\quad}c} BB^T & BR\\ R^TB^T & R^TR \end{array} \right ] = { B\brack R^T }QQ^T{ B\brack R^T }^T = { P\brack U }{ P\brack U }^T. $$

Notice that each column of ${ P\brack U }$ is in $\mathcal{H}_{\mathcal{F}}$. Hence ${ BB^{T}\ BR\brack R^{T}B^{T}\ R^{T}R }\in\mathcal{D}_{\mathcal{F}}^{*} $. This leads to $Y\in\mathcal {D}_{\mathcal{F}}^{*}$ and $\mathcal{K}\subseteq\mathcal{D}_{\mathcal {F}}^{*}$. Together with $\mathcal{D}_{\mathcal{F}}^{*}\subseteq\mathcal {K}$, we have $\mathcal{D}_{\mathcal{F}}^{*}=\mathcal{K}$. □

Burer [3] proved that when $\mathcal{F}=\{x\in\mathcal {K}\subseteq\mathbb{R}^{n}|Ax=b\}$ with $\mathcal{K}$ being a closed convex cone, then

where $\mathrm{diag}(M)$ is a vector with $[\mathrm{diag}(M)]_{i}=M_{ii}$, $i=1,\cdots,n$, and $b\circ b$ is a vector with $[b\circ b]_{i}=b_{i}^{2}$, $i=1,\cdots,m$. Here we give a more general result on $\mathcal {D}_{\mathcal{F}}^{*}$; the proof is similar to that of Burer [3].

Lemma 8

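Given $\mathcal{F}_{0}\subseteq\mathbb{R}^{n}$, $A\in\mathbb{R}^{m\times n}$ and $b\in\mathbb{R}^{m}$, if $\mathcal{F}=\{x\in\mathcal {F}_{0}~|~Ax=b\}$ is a nonempty set and $\mathcal{H}_{\mathcal {F}}=\mathcal{H}_{\mathcal{F}_{0}}\cap \{{ t\brack x }\in\mathbb{R}^{n+1}|Ax=tb \}$, then

$$ \mathcal{D}_{\mathcal{F}}^{*}= \left \{ Y= \left [ \begin{array}{c@{\quad}c} Y_{11} & Y_{21}^T\\ Y_{21} & Y_{22} \end{array} \right ] \in\mathcal{D}_{\mathcal{F}_0}^{*} \biggm{|} AY_{21}=Y_{11}b,\ \mathrm{diag}\bigl(AY_{22}A^T\bigr)=Y_{11}\,b\circ b \right \}. $$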

Proof

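Define

$$ \mathcal{G}\triangleq \left \{ Y= \left [ \begin{array}{c@{\quad}c} Y_{11} & Y_{21}^T\\ Y_{21} & Y_{22} \end{array} \right ] \in\mathcal{D}_{\mathcal{F}_0}^{*} \biggm{|} AY_{21}=Y_{11}b,\ \mathrm{diag}\bigl(AY_{22}A^T\bigr)=Y_{11}\,b\circ b \right \}. $$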

Since ${ 1\ \ x^{T}\brack x\ xx^{T} }\in\mathcal{G}$ for any $x\in\mathcal{F}$ and $\mathcal{G}$ is a closed convex cone, we have $\mathcal{D}_{\mathcal{F}}^{*}\subseteq \mathcal{G}$.

For the reverse direction, it is sufficient to show that every $Y\in \mathcal{G}$ can be represented as

$$Y=\sum_{i} y^i\bigl(y^i \bigr)^T $$

with $y^{i}\in\mathcal{H}_{\mathcal{F}}$. Observe that

$$\mathcal{D}_{\mathcal{F}_0}^*=\mathrm{cone} \bigl\{yy^T\in\mathcal {S}^{n+1}|y\in\mathcal{H}_{\mathcal{F}_0} \bigr\}= \biggl\{\sum _i y^i\bigl(y^i\bigr)^T \in\mathcal{S}^{n+1} \biggm{|} y^i\in\mathcal{H}_{\mathcal {F}_0} \biggr\}. $$

For any $Y\in\mathcal{G}$, we have

$$Y=\sum_{i} y^i\bigl(y^i \bigr)^T= \sum_i { \xi^i \brack z^i } { \xi^i\brack z^i }^T $$

with $\xi^{i}\geqslant 0$ and ${ \xi^{i}\brack z^{i} }\in\mathcal{H}_{\mathcal{F}_{0}}$. We claim that: (i) if $\xi^{i}=0$, then $z^{i}$ satisfies $Az^{i}=0$ and ${ 0\brack z^{i} }\in\mathcal{H}_{\mathcal{F}}$; (ii) if $\xi^{i}>0$, then $x^{i}=z^{i}/\xi^{i}$ satisfies $Ax^{i}=b$ and ${ \xi^{i}\brack z^{i} }\in\mathcal{H}_{\mathcal{F}}$.

Since $Y\in\mathcal{G}$, we have

$$\biggl(\sum_i \bigl(\xi^i \bigr)^2\biggr)b=\sum_i \xi^i Az^i $$

and

$$\biggl(\sum_i \bigl(\xi^i \bigr)^2\biggr)b\circ b = \sum_i \mathrm{diag}\bigl(A\bigl(z^i\bigl(z^i\bigr)^T \bigr)A^T\bigr)= \sum_i \bigl(Az^i\bigr)\circ\bigl(Az^i\bigr). $$

Consequently,

$$\biggl(\sum_i \xi^i Az^i \biggr)\circ\biggl(\sum_i \xi^i Az^i\biggr) = \biggl(\sum_i \bigl( \xi^i\bigr)^2\biggr)\sum_i \bigl(Az^i\bigr)\circ\bigl(Az^i\bigr). $$

By the Cauchy-Schwarz inequality, the equality sign holds if and only if there exists a $\delta\in\mathbb{R}^{m}$ such that $\xi^{i}\delta=Az^{i}$ for all $i$.

When $\xi^{i}=0$, we have $Az^{i}=0$. From the assumption on $\mathcal {H}_{\mathcal{F}}$, we know that ${ 0\brack z^{i} }\in\mathcal{H}_{\mathcal{F}}$ and Claim (i) holds.

When $\xi^{i}>0$, we only need to prove that $\delta=b$. Notice that

$$\biggl(\sum_j \bigl(\xi^j \bigr)^2\biggr)b=\sum_j \xi^j Az^j = \biggl(\sum_j \bigl(\xi^j\bigr)^2\biggr)\delta. $$

Since $\xi^{i}>0$, the above equation leads to $\delta=b$. This proves Claim (ii).

From Claims (i) and (ii), we have $Y\in\mathcal{D}_{\mathcal{F}}^{*}$ and $\mathcal{G}\subseteq\mathcal{D}_{\mathcal{F}}^{*} $. Together with $\mathcal{D}_{\mathcal{F}}^{*}\subseteq\mathcal{G}$, we have $\mathcal {D}_{\mathcal{F}}^{*}= \mathcal{G}$. □
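
The two identities that drive the proof of Lemma 8 are easy to confirm on a single rank-one term. The sketch below assumes hypothetical integer data: a point $x$ with $Ax=b$ is scaled to $z=\xi x$, and the code checks $\xi^{2}b=\xi Az$ and $\xi^{2}\,b\circ b=(Az)\circ(Az)$, which is the equality case $\xi\delta=Az$ with $\delta=b$.

```python
def matvec(A, v):
    """Matrix-vector product for nested lists."""
    return [sum(a * c for a, c in zip(row, v)) for row in A]

# Hypothetical data: choose x first, then define b = A x so that Ax = b holds.
A = [[1, 1], [1, -1]]
x = [1, 2]
b = matvec(A, x)

xi = 2                        # scaling factor xi > 0
z = [xi * v for v in x]       # z = xi * x, so (xi, z) = xi * (1, x)
Az = matvec(A, z)

lhs_linear = [xi * xi * bi for bi in b]      # xi^2 * b
rhs_linear = [xi * v for v in Az]            # xi * A z
lhs_diag = [xi * xi * bi * bi for bi in b]   # xi^2 * (b o b)
rhs_diag = [v * v for v in Az]               # (A z) o (A z)
print(lhs_linear == rhs_linear, lhs_diag == rhs_diag)
```

Summing such terms over a decomposition gives the two aggregate equalities used in the proof, and the Cauchy-Schwarz argument shows that, conversely, they force every term to have this form.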

Remark 3

When $\mathcal{F}_{0}$ is a closed convex cone or a closed bounded set, the assumption on $\mathcal{H}_{\mathcal{F}}$ always holds. Consequently, the study on the representation of $\mathcal{D}_{\mathcal {F}}^{*}$ can be simplified to the one of $\mathcal{D}_{\mathcal{F}_{0}}^{*}$.

Remark 4

According to Lemmas 2, 6, 7 and 8, when deriving computable representations, (i) showing $M\in\mathcal{D}_{\mathcal{F}}^{*}$ is equivalent to showing $M=\sum_{i} y^{i}(y^{i})^{T}$ for some $y^{i}\in\mathcal {H}_{\mathcal{F}}$; (ii) if $\mathcal{F}$ is the union of several sets, we can treat them separately; (iii) we can focus on the set without linear equality constraints (under certain conditions) and without free variables. These properties will simplify the proof of the computable representation.

2.2 Some Useful Results

In this subsection, we introduce some results used in the proofs in Sect. 3.

First, three observations can be made: (i) Given a nonempty set $\mathcal{F}\subseteq\mathbb{R}^{n}$ and a closed convex cone $\mathcal {K}\subseteq\mathcal{S}^{n+1}_{+}$, if $\mathcal{D}_{\mathcal {F}}^{*}\subseteq\mathcal{K}$, then in order to prove $\mathcal{D}_{\mathcal {F}}^{*}=\mathcal{K}$, we only need to prove that $\mathcal{K}'\triangleq \mathcal{K}\cap\{Y\in\mathcal{S}^{n+1}| \mathrm{tr}(Y)\leqslant 1\} \subset\mathcal{D}_{\mathcal{F}}^{*}$. (ii) Given $Y=Y^{1}+Y^{2}$ with $Y,Y^{1},Y^{2}\neq0$, $Y\in\mathcal{K}'$ and $Y^{1}, Y^{2}\in\mathcal{K}\subseteq\mathcal{S}^{n+1}_{+}$, $Y$ can be written as the convex combination $Y= {\mathrm {tr}(Y^{1})\over\mathrm{tr}(Y)} {Z^{1}}+{\mathrm{tr}(Y^{2})\over\mathrm {tr}(Y)} {Z^{2}}$ with $Z^{1}={\mathrm{tr}(Y)\over\mathrm{tr}(Y^{1})} Y^{1}\in \mathcal{K}'$ and $Z^{2}={\mathrm{tr}(Y)\over\mathrm{tr}(Y^{2})}Y^{2}\in \mathcal{K}'$. (iii) Since $\mathcal{K}'$ is a bounded closed convex set, the task of proving $\mathcal{K}'\subset\mathcal{D}_{\mathcal {F}}^{*}$ can be reduced to proving that every extreme point of $\mathcal {K}'$ is contained in $\mathcal{D}_{\mathcal{F}}^{*}$.
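
Observation (ii) is a trace-rescaling argument, and it can be verified on a small example. The sketch below uses hypothetical $2\times2$ positive semidefinite matrices with exact rational entries; the helper names are illustrative only.

```python
from fractions import Fraction as Fr

def trace(M):
    return sum(M[i][i] for i in range(len(M)))

def scale(c, M):
    return [[c * v for v in row] for row in M]

def add(M1, M2):
    return [[p + q for p, q in zip(r1, r2)] for r1, r2 in zip(M1, M2)]

# Hypothetical nonzero PSD pieces with tr(Y1) + tr(Y2) <= 1.
Y1 = [[Fr(1, 5), Fr(0)], [Fr(0), Fr(0)]]
Y2 = [[Fr(1, 20), Fr(1, 20)], [Fr(1, 20), Fr(1, 20)]]
Y = add(Y1, Y2)

# Rescale each piece to the trace of Y and record the convex weights.
w1, w2 = trace(Y1) / trace(Y), trace(Y2) / trace(Y)
Z1 = scale(trace(Y) / trace(Y1), Y1)
Z2 = scale(trace(Y) / trace(Y2), Y2)

print(w1 + w2)                                   # weights sum to one
print(trace(Z1) == trace(Y) == trace(Z2))        # rescaled pieces keep tr <= 1
print(add(scale(w1, Z1), scale(w2, Z2)) == Y)    # Y is their convex combination
```

Since both rescaled pieces stay in $\mathcal{K}'$, an extreme point of $\mathcal{K}'$ admits such a splitting only when the two pieces are proportional to it.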

The next lemma characterizes the property of the extreme points for an SDP feasible set.

Lemma 9

[11]

Consider an SDP feasible set: for some integer $p>0$, $A^{ij}\in \mathcal{S}^{n_{j}}$ and $b_{i}\in\mathbb{R}$, $i=1,\cdots,m$, $j=1,\cdots,p$, let

$$F\triangleq \Biggl\{\bigl(X^1,\cdots, X^p\bigr)\in \mathcal{S}_+^{n_1}\times\cdots \times\mathcal{S}_+^{n_p} \biggm{|} \sum_{j=1}^p A^{ij}\bullet X^j=b_i, i= 1,\cdots, m \Biggr\}. $$

If $(X^{1},\cdots,X^{p})$ is an extreme point of $F$ and $r_{j}=\mathrm{rank}(X^{j})$, then $\sum_{j=1}^{p} r_{j}(r_{j}+1)\leqslant 2m$.
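
The rank bound in Lemma 9 is simple to evaluate. The sketch below, with illustrative function names, checks the inequality for a given rank profile and, for a single block ($p=1$), returns the largest rank an extreme point can have.

```python
def satisfies_rank_bound(ranks, m):
    """Check sum_j r_j (r_j + 1) <= 2 m for ranks (r_1, ..., r_p) and m equality constraints."""
    return sum(r * (r + 1) for r in ranks) <= 2 * m

def max_extreme_rank(m):
    """Largest r with r (r + 1) <= 2 m, i.e. the bound for a single block (p = 1)."""
    r = 0
    while (r + 1) * (r + 2) <= 2 * m:
        r += 1
    return r

for m in (1, 2, 3, 5, 6):
    print(m, max_extreme_rank(m))
print(satisfies_rank_bound([1, 1], 2), satisfies_rank_bound([2, 1], 3))
```

With few linear equality constraints the extreme points are therefore forced to have low rank, which is what makes rank-one decompositions of extreme points tractable.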

In order to investigate the second-order cone constraint through the above lemma, we need its equivalent SDP representation.

Lemma 10

[2]

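Given $z_{0}\in\mathbb{R}$ and $z\in\mathbb{R}^{n}$, let

$$ \mathrm{Arrow}(z_0,z)\triangleq \left [ \begin{array}{c@{\quad}c} z_0 & z^T\\ z & z_0I_n \end{array} \right ] $$

and $r=\mathrm{rank}(\mathrm{Arrow}(z_{0},z))$. Then $\|z\|\leqslant z_{0}$ if and only if $\mathrm{Arrow}(z_{0},z)\in\mathcal{S}^{n+1}_{+}$. In addition, if $\|z\|\leqslant z_{0}$, then one of the following three cases holds: (i) $(z_{0},z)=0$ and $r=0$; (ii) $\|z\|=z_{0}>0$ and $r=n$; (iii) $\|z\|<z_{0}$ and $r=n+1$.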
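
The three rank cases of Lemma 10 can be checked on small instances. The sketch below assumes the usual arrow matrix, with $z_{0}$ in the corner and on the diagonal and $z$ in the first row and column, and computes ranks exactly by Gaussian elimination over the rationals; the function names are illustrative.

```python
from fractions import Fraction

def arrow(z0, z):
    """Build [[z0, z^T], [z, z0 * I]] with exact rational entries (assumed arrow form)."""
    n = len(z)
    rows = [[Fraction(z0)] + [Fraction(v) for v in z]]
    for i in range(n):
        rows.append([Fraction(z[i])] + [Fraction(z0) if j == i else Fraction(0) for j in range(n)])
    return rows

def rank(M):
    """Rank by Gaussian elimination; exact because the entries are Fractions."""
    M = [row[:] for row in M]
    r = 0
    for col in range(len(M[0])):
        pivot = next((i for i in range(r, len(M)) if M[i][col] != 0), None)
        if pivot is None:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, len(M)):
            f = M[i][col] / M[r][col]
            M[i] = [p - f * q for p, q in zip(M[i], M[r])]
        r += 1
    return r

# n = 2 and ||(3, 4)|| = 5.
print(rank(arrow(0, [0, 0])))   # case (i): zero vector
print(rank(arrow(5, [3, 4])))   # case (ii): boundary of the cone
print(rank(arrow(6, [3, 4])))   # case (iii): interior of the cone
```

The boundary case loses exactly one rank, which is the fact combined with the rank bound of Lemma 9 in the later proofs.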

The next result about rank-one decomposition will be used repeatedly in later proofs.

Lemma 11

Let $X\in\mathcal{S}^{n}_{+}$ be a nonzero matrix and rank(X)=r. For any vector $a\in\mathbb{R}^{n}$, if Xa≠0, then $X'=X-{Xaa^{T}X\over a^{T}Xa}\in\mathcal{S}^{n}_{+}$ and rank(X′)=r−1.

Proof

Let $X=Y^{T}Y$. The first claim can be proved by noticing that $(u^{T}Xu)(a^{T}Xa) = \|Yu\|^{2}\|Ya\|^{2} \geqslant ((Yu)^{T}(Ya))^{2} = (u^{T}Xa)^{2}$, for any $u\in\mathbb{R}^{n}$.

Obviously, rank(X′)⩾r−1. The second claim can be proved by noticing that (i) any u in the null space of X is also in the null space of X′; (ii) a is in the null space of X′ but not in the null space of X. □
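
The rank-reduction step of Lemma 11 can be carried out by hand on a $2\times2$ example. The sketch below uses hypothetical data with exact rational arithmetic; `reduce_rank` is an illustrative name for the map $X\mapsto X-Xaa^{T}X/(a^{T}Xa)$.

```python
from fractions import Fraction

def reduce_rank(X, a):
    """Return X - (X a)(X a)^T / (a^T X a); requires X a != 0."""
    n = len(X)
    Xa = [sum(X[i][j] * a[j] for j in range(n)) for i in range(n)]
    if all(v == 0 for v in Xa):
        raise ValueError("X a must be nonzero")
    aXa = sum(a[i] * Xa[i] for i in range(n))
    return [[X[i][j] - Xa[i] * Xa[j] / aXa for j in range(n)] for i in range(n)]

# Hypothetical positive definite X (rank 2) and direction a.
X = [[Fraction(2), Fraction(1)], [Fraction(1), Fraction(2)]]
a = [Fraction(1), Fraction(0)]
Xp = reduce_rank(X, a)

print(Xp)                                                       # the reduced matrix X'
print([sum(Xp[i][j] * a[j] for j in range(2)) for i in range(2)])  # X' a
```

Here $X'a=0$ while $Xa\neq0$, so the null space grows by the direction $a$ and the rank drops from 2 to 1, in line with the lemma.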

3 QCQP with One Second-Order Cone Constraint

In this section, we focus on the exact computable representation of the QCQP problem whose domain is defined by one second-order cone constraint and some special linear constraints.

Our first result deals with the QCQP problem whose domain is specified by one second-order cone constraint and one special linear constraint.

Theorem 1

Given a nonempty set $\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times \mathbb{R}^{n_{2}}~|~\|x\|\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y, a_{1}+a_{2}^{T}x+a_{3}^{T}y\geqslant a_{4}\geqslant 0\}$ with $a_{1}\in\mathbb{R}$, $a_{2}\in\mathbb{R}^{n_{1}}$, $a_{3}\in \mathbb{R}^{n_{2}}$ and $a_{4}\geqslant 0$, let

Then we have

Moreover, the corresponding problems of QCQP and LCoP defined in (1) and (8), respectively, have the same optimal value.

If there exists $(\bar{x},\bar{y})\in\mathbb{R}^{n_{1}}\times\mathbb {R}^{n_{2}}$ such that $\|\bar{x}\|< a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}$ and $a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}>a_{4}$, then $\mathcal{D}_{\mathcal{F}}$ can be simplified as

$$\mathcal{D}_{\mathcal{F}} = \left \{M\in\mathcal{S}^{1+n_1+n_2}\left | \begin{array}{l}M-\lambda_1 C_3 - \lambda_2 (e_1b^T+be_1^T) - (e_1\psi_1^TC_2^T + C_2\psi_1 e_1^T )\\[4pt] \quad {}- (b\psi_2^TC_2^T + C_2\psi_2 b^T )\in\mathcal {S}^{1+n_1+n_2}_+,\\[4pt] \lambda_1,\lambda_2\geqslant 0, \psi_1,\psi_2\in\mathcal{SOC}(n_1) \end{array} \right .\right \}. $$

Moreover, the corresponding dual problem LCoD (as defined in (9))

(10)

attains the same optimal value as that of the original problem QCQP.

Proof

Define

It is clear that $\mathcal{D}_{\mathcal{F}}^{*}\subseteq\mathcal{K}$.

To prove $\mathcal{K}\subseteq\mathcal{D}_{\mathcal{F}}^{*}$, it is sufficient to show that all the extreme points of $\mathcal {K}'\triangleq\mathcal{K}\cap\{U\in\mathcal{S}^{1+n_{1}+n_{2}}~|~\mathrm {tr\ }U\leqslant 1\}$ belong to $\mathcal{D}_{\mathcal{F}}^{*}$. In other words, for each nonzero extreme point of $\mathcal{K}'$, we need to find a rank-one decomposition with all elements falling in $\mathcal {H}_{\mathcal{F}}$.

We first prove that

$$\begin{array}{@{}l} \bigl\{(t,x,y)\in\mathbb{R}_+\times\mathbb{R}^{n_1}\times\mathbb {R}^{n_2}~|~\|x\|\leqslant a_1t+a_2^Tx+a_3^Ty, \nonumber\\[5pt] \quad a_1t+a_2^Tx+a_3^Ty \geqslant a_4t,t\geqslant 0\bigr\}\subseteq\mathcal{H}_{\mathcal{F}}. \end{array} $$

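If $t>0$, then $(x/t,y/t)\in\mathcal{F}$ and hence $(t,x,y)\in\mathcal{H}_{\mathcal{F}}$. Otherwise, if $t=0$, since $\mathcal{F}$ is not empty, there exists ${ \bar{x}\brack \bar{y} }\in\mathcal{F}$. One can verify that $({1\over k},{1\over k}\bar{x}+x,{1\over k}\bar{y}+y)\in\mathcal{H}_{\mathcal{F}}$ for every positive integer $k$. When $k$ goes to infinity, its limit $(0,x,y)\in\mathcal{H}_{\mathcal{F}}$. Therefore, the above inclusion holds true.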

Next, we let $U^{0}$ be a nonzero extreme point of $\mathcal{K}'$ and consider the following five cases for a complete proof: (i) $\chi=0$; (ii) $\chi>0$, $a^{T}Ue_{1}=\|C_{1}Ue_{1}\|$ and $a^{T}Ub=\|C_{1}Ub\|$; (iii) $\chi>0$, $a^{T}Ue_{1}>\|C_{1}Ue_{1}\|$ and $a^{T}Ub=\|C_{1}Ub\|$; (iv) $\chi>0$, $a^{T}Ue_{1}=\|C_{1}Ue_{1}\|$ and $a^{T}Ub>\|C_{1}Ub\|$; (v) $\chi>0$, $a^{T}Ue_{1}>\|C_{1}Ue_{1}\|$ and $a^{T}Ub>\|C_{1}Ub\|$.

For case (i): It is clear that the corresponding . Furthermore, since U ⁰ is an extreme point of $\mathcal {K}'$, the corresponding matrix must be an extreme point of

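We will discuss three subcases: ${ a_{2}\brack a_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }=\|X^{0}b_{2}+(W^{0})^{T}b_{3}\|=0$, ${ a_{2}\brack a_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }=\|X^{0}b_{2}+(W^{0})^{T}b_{3}\|>0$ and ${ a_{2}\brack a_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }>\|X^{0}b_{2}+(W^{0})^{T}b_{3}\|$.

When ${ a_{2}\brack a_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }=\|X^{0}b_{2}+(W^{0})^{T}b_{3}\|=0$, from Proposition 3 of [13], we can always find a rank-one decomposition $Z^{0}=\sum_{i}z^{i}(z^{i})^{T}$ satisfying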

Since Z ⁰ is positive semidefinite and ${ b_{2}\brack b_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }=0$, we have $(z^{i})^{T}{ b_{2}\brack b_{3} }=0$ for all i. One can verify that $Z^{0} = \sum_{i} {(z^{i})^{T}z^{i}\over \mathrm{tr\ } Z^{0}} [{\mathrm{tr\ } Z^{0}\over(z^{i})^{T}z^{i}} (z^{i}(z^{i})^{T})]$ and ${\mathrm{tr\ } Z^{0}\over(z^{i})^{T}z^{i}} (z^{i}(z^{i})^{T})\in\mathcal{L}$ for all i. From the fact that Z ⁰ is an extreme point of $\mathcal {L}$, then $Z^{0} = {\mathrm{tr\ } Z^{0}\over(z^{i})^{T}z^{i}} (z^{i}(z^{i})^{T})$ for all i, i.e., rank(Z ⁰)=1. Let Z ⁰=z ⁰(z ⁰)^T, then and . Notice that

Consequently, , i.e., $U^{0}\in\mathcal{D}_{\mathcal{F}}^{*}$.

When ${ a_{2}\brack a_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }=\|X^{0}b_{2}+(W^{0})^{T}b_{3}\|>0$, let $z\triangleq Z^{0}{ b_{2}\brack b_{3} }$. Noticing that

we know V≜Z ⁰−λzz ^T is positive semidefinite for some λ>0. We can rewrite the above equation as

$$Z^0 = {\mathrm{tr\ } V\over\mathrm{tr\ }Z^0}\biggl({\mathrm{tr\ } Z^0\over \mathrm{tr\ }V}V \biggr)+ {\lambda z^Tz\over\mathrm{tr\ }Z^0}\biggl({\mathrm{tr\ } Z^0\over z^Tz}zz^T \biggr). $$

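Let $Z^{1}\triangleq{\mathrm{tr\ } Z^{0}\over\mathrm{tr\ }V}V$ and $Z^{2}\triangleq{\mathrm{tr\ } Z^{0}\over z^{T}z}zz^{T}$. Then $\mathrm{tr\ } Z^{1}=\mathrm{tr\ } Z^{2}=\mathrm{tr\ } Z^{0}\leqslant 1$. One can verify that: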

Since $z=Z^{0}{ b_{2}\brack b_{3} }={ X^{0}b_{2}+(W^{0})^{T}b_{3}\brack W^{0}b_{2}+Y^{0}b_{3} }$ and $Z^{2}{b_{2}\brack b_{3}} ={\mathrm{tr}~Z^{0}\over z^{T}z}z^{T}{b_{2}\brack b_{3}}z ={X^{2}b_{2}+(W^{2})^{T}b_{3}\brack W^{2}b_{2}+Y^{2}b_{3}}$, we have

and

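Therefore, $Z^{1}$ and $Z^{2}$ are both in $\mathcal{L}$. Since $Z^{0}$ is an extreme point of $\mathcal{L}$, we have $Z^{0}=Z^{1}=Z^{2}$, i.e., $Z^{0}={\mathrm{tr\ } Z^{0}\over z^{T}z}zz^{T}$. Let $u^{0}\triangleq\sqrt{{\mathrm{tr\ } Z^{0}\over z^{T}z}}{ 0\brack z }$, then $U^{0}=u^{0}(u^{0})^{T}$. Notice that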

Consequently, we have $u^{0}\in\mathcal{H}_{\mathcal{F}}$ and $U^{0}\in \mathcal{D}_{\mathcal{F}}^{*}$.

When ${ a_{2}\brack a_{3} }^{T}Z^{0}{ b_{2}\brack b_{3} }>\|X^{0}b_{2}+(W^{0})^{T}b_{3}\|$, we know that $(Z^{0}, S^{0}, s^{0}_{1}, s^{0}_{2})$, where

is an extreme point of

From Lemma 9, let $r_{Z}\triangleq\mathrm{rank}(Z^{0}), r_{S}\triangleq\mathrm{rank}(S^{0}), r_{1}\triangleq\mathrm{rank}(s^{0}_{1}), r_{2}\triangleq\mathrm{rank}(s^{0}_{2})$. Then

$$r_Z(r_Z+1)+ r_S(r_S+1)+r_1(r_1+1)+r_2(r_2+1) \leqslant 4+(n_1+1) (n_1+2). $$

Since r _S=n ₁+1, by Lemma 10, we have r _Z=1 and $Z^{0}={ x'\brack y' }{ x'\brack y' }^{T}$ with $a_{2}^{T}x'+a_{3}^{T}y'\geqslant \|x'\|$. Consequently, . Noticing that , we have and $U^{0}\in\mathcal{D}_{\mathcal{F}}^{*}$.
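The rank count behind $r_{Z}=1$ can be spelled out. The following is a short worked step that takes the displayed rank bound at face value and uses only $r_{S}=n_{1}+1$ and $Z^{0}\neq0$ (which holds because $U^{0}\neq0$ and $\chi=0$). Substituting $r_{S}(r_{S}+1)=(n_{1}+1)(n_{1}+2)$ into the bound gives

$$r_Z(r_Z+1)+r_1(r_1+1)+r_2(r_2+1)\leqslant 4. $$

Since $r_{Z}(r_{Z}+1)\geqslant 6$ whenever $r_{Z}\geqslant 2$, this forces $r_{Z}\leqslant 1$, and $Z^{0}\neq0$ then gives $r_{Z}=1$.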

Therefore, we have shown that our claim holds for case (i).

For case (ii): Since χ>0, we have U ⁰ e ₁≠0. Define $U^{1}\triangleq {U^{0}e_{1}e_{1}^{T}U^{0}\over e_{1}^{T}U^{0}e_{1}}$ and U ²≜U ⁰−U ¹. From Lemma 11, we know U ² is positive semidefinite.

When U ² a=0, we have $0=U^{2}a=U^{0}a-{e_{1}^{T}U^{0}a\over e_{1}^{T}U^{0}e_{1}}U^{0}e_{1}$, i.e., U ⁰ a and U ⁰ e ₁ are linearly dependent. Therefore, U ⁰ b=U ⁰ a−a ₄ U ⁰ e ₁ is also linearly dependent on U ⁰ e ₁. Rewrite

$$U^0={\mathrm{tr\ } U^1\over\mathrm{tr\ } U^0}\biggl({\mathrm{tr\ } U^0\over \mathrm{tr\ } U^1}U^1 \biggr)+{\mathrm{tr\ } U^2\over\mathrm{tr\ } U^0}\biggl({\mathrm{tr\ } U^0\over\mathrm{tr\ } U^2}U^2 \biggr) $$

and one can verify that ${\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}U^{1}\in\mathcal{K}'$. To see ${\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{2}}U^{2}\in\mathcal{K}'$, we only need to show $U^{2}\in\mathcal{K}$. From U ² a=U ² e ₁=0, we have a ^T U ² e ₁⩾∥C ₁ U ² e ₁∥ and b ^T U ² e ₁⩾0. Notice that

$$U^2\bullet\bigl(aa^T-C_1^TC_1 \bigr)=\bigl(U^0-U^1\bigr)\bullet\bigl(aa^T-C_1^TC_1 \bigr)=U^0\bullet \bigl(aa^T-C_1^TC_1 \bigr)\geqslant 0. $$

From U ² a=0, we have $\mathrm{tr\ }(C_{1}U^{2}C_{1}^{T})=0$. Consequently, C ₁ U ²=0, a ^T U ² b=0 and C ₁ U ² b=0. Hence $U^{2}\in\mathcal{K}$ and ${\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{2}}U^{2}\in\mathcal{K}'$. Since U ⁰ is an extreme point in $\mathcal{K}'$, either U ²=0 or $U^{0}={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}U^{1}={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{2}}U^{2}$. Noticing that U ¹ e ₁=U ⁰ e ₁≠0=U ² e ₁, we must have U ²=0 and $U^{0}=U^{1}={U^{0}e_{1}e_{1}^{T}U^{0}\over e_{1}^{T}U^{0}e_{1}}$. From $U^{0}\in\mathcal{K}$, one can verify that ${U^{0}e_{1}\over\sqrt{e_{1}^{T}U^{0}e_{1}}}$ is in $\mathcal{H}_{\mathcal{F}}$ and, therefore, $U^{0}\in\mathcal{D}_{\mathcal{F}}^{*}$.

When U ² a≠0, we have a ^T U ² a≠0. Let $U^{3}\triangleq {U^{2}aa^{T}U^{2}\over a^{T}U^{2}a}$ and U ⁴≜U ²−U ³=U ⁰−U ¹−U ³. From Lemma 11, we know U ³ and U ⁴ are both positive semidefinite.

From $\mathrm{tr\ }(C_{1}U^{4}C_{1}^{T})\geqslant 0$, we have

The above inequality indicates that $\mathrm{tr}(C_{1}U^{0}C_{1}^{T})=a^{T}U^{0}a$ if and only if $\mathrm{tr}(C_{1}U^{4}C_{1}^{T})=0$ and ${ a^{T}U^{0}e_{1}\brack C_{1}U^{0}e_{1} }$ and ${ a^{T}U^{0}a\brack C_{1}U^{0}a }$ are linearly dependent. Consequently, when $\mathrm{tr\ }(C_{1}U^{0}C_{1}^{T})=a^{T}U^{0}a$, we know that ${ a^{T}U^{0}e_{1}\brack C_{1}U^{0}e_{1} }$ and ${ a^{T}U^{0}b\brack C_{1}U^{0}b }$ are linearly dependent. Notice that

$$a^TU^2b=a^TU^2a-a_4a^TU^2e_1=a^TU^2a>0 $$

and ${ a^{T}U^{2}b\brack C_{1}U^{2}b }={ a^{T}U^{0}b\brack C_{1}U^{0}b }-{e_{1}^{T}U^{0}b\over e_{1}^{T}U^{0}e_{1}}{ a^{T}U^{0}e_{1}\brack C_{1}U^{0}e_{1} }$. Hence a ^T U ² b=∥C ₁ U ² b∥. Then we can easily verify that U ¹ and U ² are both in $\mathcal{K}$. From

$$U^0={\mathrm{tr\ } U^1\over\mathrm{tr\ } U^0}\biggl({\mathrm{tr\ } U^0\over \mathrm{tr\ } U^1}U^1 \biggr)+{\mathrm{tr\ } U^2\over\mathrm{tr\ } U^0}\biggl({\mathrm{tr\ } U^0\over\mathrm{tr\ } U^2}U^2 \biggr), $$

we know ${\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}U^{1}$ and ${\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{2}}U^{2}$ are both in $\mathcal{K}'$. Therefore, $U^{0}=U^{1}={U^{0}e_{1}e_{1}^{T}U^{0}\over e_{1}^{T}U^{0}e_{1}}$. From $U^{0}\in\mathcal{K}$, one can verify that ${U^{0}e_{1}\over\sqrt{e_{1}^{T}U^{0}e_{1}}}\in\mathcal {H}_{\mathcal{F}}$ and $U^{0}\in\mathcal{D}_{\mathcal{F}}^{*}$.

Hence we have shown that $U^{0}\in\mathcal{D}_{\mathcal{F}}^{*}$ in case (ii).

For case (iii): We let U ¹≜λU ⁰ bb ^T U ⁰ and U ²≜U ⁰−U ¹ with λ>0 being a sufficiently small number. One can easily check that $U^{1}\in\mathcal{K}$. When λ is small enough, U ² is positive semidefinite. From a ^T U ⁰ e ₁>∥C ₁ U ⁰ e ₁∥, we know that ${ a^{T}U^{0}e_{1}\brack C_{1}U^{0}e_{1} }$ is an interior point of $\mathcal{SOC}(n_{1})$. Therefore, ${ a^{T}U^{2}e_{1}\brack C_{1}U^{2}e_{1} }={ a^{T}U^{0}e_{1}\brack C_{1}U^{0}e_{1} }-\lambda(b^{T}U^{0}e_{1}){ a^{T}U^{0}b\brack C_{1}U^{0}b }$ is also in $\mathcal{SOC}(n_{1})$ when λ is small enough, i.e., a ^T U ² e ₁⩾∥C ₁ U ² e ₁∥. We can also see that

$$\begin{array}{l} U^2\bullet\bigl(aa^T-C_1^TC_1\bigr)=\bigl(U^0-U^1\bigr)\bullet\bigl(aa^T-C_1^TC_1\bigr)=U^0\bullet \bigl(aa^T-C_1^TC_1\bigr)\geqslant 0,\\[4pt] b^TU^2e_1=b^TU^0e_1-\lambda\bigl(b^TU^0b\bigr)b^TU^0e_1\geqslant 0,\quad \textrm{and }\\[4pt] a^TU^2b=\bigl(1-\lambda\bigl(b^TU^0b\bigr)\bigr)a^TU^0b\geqslant \bigl\|\bigl(1-\lambda\bigl(b^TU^0b\bigr)\bigr)C_1U^0b\bigr\| =\bigl\|C_1U^2b\bigr\|. \end{array} $$

Therefore, we have $U^{2}\in\mathcal{K}$. Since $U^{0}$ is an extreme point of $\mathcal{K}'$, we know $U^{0}={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}U^{1}={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{2}}U^{2}$. However, $a^{T}U^{0}e_{1} ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}a^{T}U^{1}e_{1} ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}\lambda(b^{T}U^{0}e_{1})(a^{T}U^{0}b) ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}\lambda(b^{T}U^{0}e_{1})\|C_{1}U^{0}b\| ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}\|C_{1}U^{1}e_{1}\| =\|C_{1}U^{0}e_{1}\|$, which contradicts $a^{T}U^{0}e_{1}>\|C_{1}U^{0}e_{1}\|$. This shows that no extreme point of $\mathcal{K}'$ exists in case (iii).
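For the positive semidefiniteness of $U^{2}=U^{0}-\lambda U^{0}bb^{T}U^{0}$ in case (iii), the phrase "sufficiently small" can be made explicit. This is a worked step added for illustration, using only the Cauchy-Schwarz inequality from the proof of Lemma 11. For any vector $v$ we have $(v^{T}U^{0}b)^{2}\leqslant (v^{T}U^{0}v)(b^{T}U^{0}b)$, and hence

$$v^TU^2v=v^TU^0v-\lambda\bigl(v^TU^0b\bigr)^2\geqslant \bigl(1-\lambda b^TU^0b\bigr)v^TU^0v. $$

So $U^{2}$ is positive semidefinite whenever $0<\lambda\leqslant 1/(b^{T}U^{0}b)$ (and for every $\lambda>0$ if $b^{T}U^{0}b=0$). The same range keeps the factor $1-\lambda(b^{T}U^{0}b)$ nonnegative; the second-order cone condition on $a^{T}U^{2}e_{1}$ and $C_{1}U^{2}e_{1}$ may require a smaller $\lambda$.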

For case (iv): The proof is similar to that of case (iii).

For case (v): Let

Since U ⁰ is an extreme point of $\mathcal{K}'$, we can see the corresponding $(U^{0}, S_{1}^{0}, S_{2}^{0}, s^{0}_{1},\allowbreak s^{0}_{2}, s^{0}_{3})$ is an extreme point of $\mathcal{L}''$. From Lemma 9, let $r_{U}\triangleq\mathrm{rank}(U^{0}), r_{S_{1}}\triangleq\mathrm{rank}(S_{1}^{0}), r_{S_{2}}\triangleq\mathrm{rank}(S_{2}^{0}), r_{1}\triangleq\mathrm{rank}(s_{1}^{0}), r_{2}\triangleq\mathrm{rank}(s_{2}^{0})$ and $r_{3}\triangleq\mathrm{rank}(s_{3}^{0})$, then we have

Based on the assumption for (v), we have a ^T U ⁰ e ₁>∥C ₁ U ⁰ e ₁∥ and a ^T U ⁰ b>∥C ₁ U ⁰ b∥. Lemma 10 implies that $r_{S_{1}}=r_{S_{2}}=1+n_{1}$. Then the above inequality becomes

$$r_U(r_U+1)+r_1(r_1+1)+r_2(r_2+1)+r_3(r_3+1) \leqslant 6. $$

If r _U=1, then one can easily verify that $U^{0}\in\mathcal {D}_{\mathcal{F}}^{*}$. If r _U=2, we show that U ⁰ cannot be an extreme point of $\mathcal{K}'$. In this situation, r ₁=r ₂=r ₃=0, i.e., $s_{1}^{0}=s_{2}^{0}=s_{3}^{0}=0$. From a ^T U ⁰ e ₁>0 and a ^T U ⁰ b>0, we have U ⁰ e ₁≠0 and U ⁰ b≠0. Define $U^{1}\triangleq {U^{0}e_{1}e_{1}^{T}U^{0}\over e_{1}^{T}U^{0}e_{1}}$ and U ²≜U ⁰−U ¹. From Lemma 11, U ² is positive semidefinite and rank(U ²)=1. Since $s_{2}^{0}=b^{T}U^{0}e_{1}=0$, we have U ² b=U ⁰ b≠0. Therefore, $U^{2}={U^{2}bb^{T}U^{2}\over b^{T}U^{2}b}={{U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}}$. This means $U^{0}={U^{0}e_{1}e_{1}^{T}U^{0}\over e_{1}^{T}U^{0}e_{1}}+{U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}$. Notice that U ⁰ b and U ⁰ e ₁ are linearly independent. (Otherwise, 0≠b ^T U ⁰ b=τb ^T U ⁰ e ₁=0 for some τ≠0, which causes a contradiction.) One can further verify that ${\mathrm{tr\ }U^{0}\over \mathrm{tr\ } U^{1}}U^{1}$ and ${\mathrm{tr\ }U^{0}\over\mathrm{tr\ } U^{2}}U^{2}$ are all in $\mathcal{K}'$ and U ⁰ is the convex combination of these two distinct points which means U ⁰ cannot be an extreme point of $\mathcal{K}'$.
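The rank bookkeeping in case (v) can be enumerated mechanically. The sketch below is illustrative only: the function name and the search limit `max_rank_U` are assumptions, and each scalar slack is treated as a matrix of order one with rank 0 or 1. It lists every tuple $(r_{U},r_{1},r_{2},r_{3})$ with $r_{U}\geqslant 1$ that satisfies the reduced bound of 6.

```python
from itertools import product

def feasible_ranks(bound=6, max_rank_U=4):
    """List (r_U, r_1, r_2, r_3), r_U >= 1 and r_i in {0, 1}, meeting the rank bound."""
    out = []
    for r_U, r1, r2, r3 in product(range(1, max_rank_U + 1), (0, 1), (0, 1), (0, 1)):
        total = sum(r * (r + 1) for r in (r_U, r1, r2, r3))
        if total <= bound:
            out.append((r_U, r1, r2, r3))
    return out

ranks = feasible_ranks()
rank_two_options = [t for t in ranks if t[0] == 2]
```

The enumeration confirms the two situations treated in the proof: either $r_{U}=1$, or $r_{U}=2$ with all three scalar slacks equal to zero.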

From the discussion of the above five cases, we have $\mathcal {K}\subseteq\mathcal{D}_{\mathcal{F}}^{*}$ and hence $\mathcal{K}= \mathcal{D}_{\mathcal{F}}^{*}$.

We now prove the dual part. Notice that

From Lemma 5, its dual is

Then it follows from Sturm and Zhang [13] that QCQP, LCoP and LCoD all have the same optimal value.

We now prove the second half of the theorem. If there is $(\bar{x},\bar {y})\in\mathbb{R}^{n_{1}}\times\mathbb{R}^{n_{2}}$ such that $\|\bar{x}\| <a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}$ and $a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}> a_{4}$, then let $\bar{u}\triangleq(1,\bar{x}^{T},\bar{y}^{T})^{T}$ and $\bar{U}\triangleq\bar{u}\bar{u}^{T}$. In this way, $\bar {U}\in\mathcal{S}^{1+n_{1}+n_{2}}_{+}$, $C_{2}\bar{U}e_{1}\in\mathrm {int}\mathcal{SOC}(n_{1})$, $\bar{U}\bullet C_{3}>0$, $b^{T}\bar{U}e_{1}>0$ and $C_{2}\bar{U}b\in\mathrm{int}\mathcal{SOC}(n_{1})$. Let $U'\triangleq\bar {U}+\tau I_{1+n_{1}+n_{2}}$, $\tau>0$. When $\tau$ is sufficiently small, we know $U'$ is an interior point of $\mathcal{D}_{\mathcal{F}}^{*}$. Therefore, using Lemma 5, the closure can be removed from $\mathcal{D}_{\mathcal{F}}$ and the remaining claims follow. □

Remark 5

From the above proof, we see that an optimal extreme solution of the problem LCoP can lead to an optimal solution of the original problem QCQP through the explicit rank-one decomposition. Hence we have an exact solvable representation of the QCQP problem whose domain is defined by one second-order cone constraint and one special linear constraint.
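In the simplest situation, where the optimal extreme solution $U$ is itself rank one with $\chi=e_{1}^{T}Ue_{1}>0$, the recovery step reduces to the normalization $u=Ue_{1}/\sqrt{e_{1}^{T}Ue_{1}}$ used in the proof. The following is a minimal sketch under that assumption; the function name `recover_point` and the layout $u=(t,x,y)$ with the homogenizing coordinate first are illustrative assumptions, and higher-rank solutions would need the case-by-case decomposition of the proof instead.

```python
import numpy as np

def recover_point(U, n1):
    """Given U = u u^T with u = (t, x, y) and t > 0, return (x/t, y/t)."""
    chi = U[0, 0]                          # chi = e1^T U e1 = t^2
    if chi <= 0:
        raise ValueError("chi = 0: U encodes a direction, not a point")
    u = U[:, 0] / np.sqrt(chi)             # u = U e1 / sqrt(e1^T U e1)
    x = u[1:1 + n1] / u[0]
    y = u[1 + n1:] / u[0]
    return x, y

x_true = np.array([0.3, -0.4])
y_true = np.array([2.0])
u = 0.5 * np.concatenate(([1.0], x_true, y_true))   # arbitrary positive scaling, t = 0.5
x_rec, y_rec = recover_point(np.outer(u, u), n1=2)
```

The scaling $t$ cancels, so the recovered pair does not depend on how the rank-one factor is normalized.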

When a ₄=0, Theorem 1 can be simplified as follows.

Corollary 1

Given a nonempty set $\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times \mathbb{R}^{n_{2}}~|~\|x\|\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y\}$ with $a_{1}\in\mathbb {R}$, $a_{2}\in\mathbb{R}^{n_{1}}$, $a_{3}\in\mathbb{R}^{n_{2}}$ and a ₄=0. Let

Then we have

Moreover, the corresponding QCQP and LCoP have the same optimal value.

If there exists $(\bar{x},\bar{y})\in\mathbb{R}^{n_{1}}\times\mathbb {R}^{n_{2}}$ such that $\|\bar{x}\|< a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}$, then we have

Moreover, the corresponding problem LCoD

(11)

attains the same optimal value as that of the original QCQP.

Proof

It is sufficient to show that the three constraints in $\mathcal {D}_{\mathcal{F}}^{*}$ of this corollary imply the five constraints in the $\mathcal{D}_{\mathcal{F}}^{*}$ of Theorem 1 when a ₄=0. Let b≜a−a ₄ e ₁=a. From $U\bullet(aa^{T}-C_{1}^{T}C_{1})\geqslant 0$ and

we have $(a^{T}Ua)^{2}\geqslant (a^{T}Ua)\mathrm{tr}(C_{1}UC_{1}^{T})\geqslant \mathrm{tr} (C_{1}Uaa^{T}UC_{1}^{T})=\|C_{1}Ua\|^{2}$. This shows a ^T Ub⩾∥C ₁ Ub∥. The constraint of b ^T Ue ₁⩾0 is obvious. Therefore, all the five constraints in the $\mathcal{D}_{\mathcal{F}}^{*}$ of Theorem 1 are satisfied. □
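The implication used in this proof can be sanity-checked numerically. The sketch below assumes NumPy, takes $a$ to be the stacked vector $(a_{1},a_{2},a_{3})$ and takes $C_{1}=[0\ I_{n_{1}}\ 0]$ (consistent with $\mathrm{tr}(C_{1}UC_{1}^{T})=\mathrm{tr\ }X$ in the proofs, but the defining display is not reproduced here, so both are assumptions). The numerical data are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(1)
n1, n2 = 3, 2
n = 1 + n1 + n2
a = np.array([2.0, 1.0, -1.0, 0.5, 1.5, -0.5])     # assumed layout a = (a_1, a_2, a_3)
C1 = np.hstack([np.zeros((n1, 1)), np.eye(n1), np.zeros((n1, n2))])  # selects the x-block

accepted, worst_gap = 0, np.inf
for _ in range(500):
    G = rng.standard_normal((n, n))
    U = G @ G.T                                     # random PSD matrix
    if a @ U @ a < np.trace(C1 @ U @ C1.T):         # keep only U with U . (aa^T - C1^T C1) >= 0
        continue
    accepted += 1
    gap = a @ U @ a - np.linalg.norm(C1 @ U @ a)    # claimed to be nonnegative
    worst_gap = min(worst_gap, gap)
```

Every accepted sample satisfies $a^{T}Ua\geqslant\|C_{1}Ua\|$ up to rounding, which is the inequality $a^{T}Ub\geqslant\|C_{1}Ub\|$ with $b=a$.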

Notice that the domain $\mathcal{F}$ defined in Theorem 1 is an unbounded set. The next theorem provides an exact computable representation of the QCQP problem whose domain consists of one second-order cone constraint with both lower and upper bounds.

Theorem 2

Given a nonempty set $\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times \mathbb{R}^{n_{2}}~|~\|x\|\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y, a_{5}\geqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y\geqslant a_{4}\geqslant 0\}$ with $a_{1}\in\mathbb{R}$, $a_{2}\in\mathbb {R}^{n_{1}}$, $a_{3}\in\mathbb{R}^{n_{2}}$ and a ₅>a ₄⩾0. Let

Then we have

Moreover, the corresponding QCQP and LCoP have the same optimal value.

If there is $(\bar{x},\bar{y})\in\mathbb{R}^{n_{1}}\times\mathbb {R}^{n_{2}}$ such that $\|\bar{x}\|< a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}$ and $a_{5}>a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}>a_{4}$, then

$$\mathcal{D}_{\mathcal{F}} = \left \{M\in\mathcal{S}^{1+n_1+n_2}\left | \begin{array}{@{}l@{}}M-\lambda_1 C_3 - \lambda_2 (e_1b^T+be_1^T)-\lambda _3(e_1\bar{b}^T+\bar{b}e_1^T)\\[5pt] \quad {}- (b\psi_1^TC_2^T + C_2\psi_1 b^T )\\[5pt] \quad {}- (\bar{b}\psi_2^TC_2^T + C_2\psi_2 \bar{b}^T )\in\mathcal {S}^{1+n_1+n_2}_+, \\[5pt] \lambda_1,\lambda_2,\lambda_3\geqslant 0, \psi_1,\psi_2\in\mathcal{SOC}(n_1) \end{array} \right .\right \}. $$

Moreover, the corresponding problem LCoD

(12)

attains the same optimal value as that of the original QCQP.

Proof

Define

It is clear that $\mathcal{D}_{\mathcal{F}}^{*}\subseteq\mathcal{K}$.

To show $\mathcal{K}\subseteq\mathcal{D}_{\mathcal{F}}^{*}$, it is sufficient to prove that all the extreme points of $\mathcal {K}'\triangleq\mathcal{K}\cap\{U\in\mathcal{S}^{1+n_{1}+n_{2}}~|~\mathrm {tr\ }U\leqslant 1\}$ belong to $\mathcal{D}_{\mathcal{F}}^{*}$. In other words, for each nonzero extreme point of $\mathcal{K}'$, we can find a rank-one decomposition with all elements being in $\mathcal{H}_{\mathcal{F}}$.

We first prove that

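If $t>0$, then $(x/t,y/t)\in\mathcal{F}$ and $(t,x,y)\in\mathcal{H}_{\mathcal{F}}$. If $t=0$, then $x=0$ and $a_{3}^{T}y=0$. Since $\mathcal{F}$ is nonempty, there exists ${ \bar{x}\brack \bar{y} }\in\mathcal{F}$. One can verify that $({1\over k},{1\over k}\bar{x},{1\over k}\bar{y}+y)\in\mathcal{H}_{\mathcal{F}}$ for every positive integer $k$. When $k$ goes to infinity, its limit $(0,0,y)\in\mathcal{H}_{\mathcal{F}}$. Therefore, the above inclusion holds true.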

Next, we need to consider five cases for a complete proof: (i) $\chi=0$; (ii) $\chi>0$, $a^{T}Ub=\|C_{1}Ub\|$ and $a^{T}U\bar{b}=\|C_{1}U\bar{b}\|$; (iii) $\chi>0$, $a^{T}Ub>\|C_{1}Ub\|$ and $a^{T}U\bar{b}=\|C_{1}U\bar{b}\|$; (iv) $\chi>0$, $a^{T}Ub=\|C_{1}Ub\|$ and $a^{T}U\bar{b}>\|C_{1}U\bar{b}\|$; (v) $\chi>0$, $a^{T}Ub>\|C_{1}Ub\|$ and $a^{T}U\bar{b}>\|C_{1}U\bar{b}\|$. Let $U^{0}$ be a nonzero extreme point of $\mathcal{K}'$.

For case (i): Corresponding to U ⁰≠0, we have (x ⁰,y ⁰)=0. Therefore, $a^{T}U^{0}e_{1}=b^{T}U^{0}e_{1}=\bar{b}^{T}U^{0}e_{1}=0$. From $a^{T}U^{0}\bar{b}\geqslant 0$ and $a^{T}U^{0}\bar{b}=a_{5}a^{T}U^{0}e_{1}-a^{T}U^{0}a=-a^{T}U^{0}a\leqslant 0$, we know U ⁰ a=0. Consequently, $U^{0}b=U^{0}\bar{b}=0$. From $U^{0}\bullet (aa^{T}-C_{1}^{T}C_{1})=-\mathrm{tr }(C_{1}U^{0}C_{1}^{T})=-\mathrm{tr\ }X^{0}\leqslant 0$ we have X ⁰=0 and W ⁰=0. Furthermore, since U ⁰ is an extreme point of $\mathcal{K}'$, the matrix Y ⁰ must be the extreme point of the set

$$\mathcal{L}\triangleq \bigl\{Y\in\mathcal{S}^{n_2}_+\vert \mathrm{tr\ }Y\leqslant 1, a_3^TYa_3=0\bigr\} $$

and it is a rank-one matrix, i.e., $Y^{0}=y^{0}(y^{0})^{T}$ for some $y^{0}\in \mathbb{R}^{n_{2}}$ with $a_{3}^{T}y^{0}=0$. Let $u^{0}\triangleq(0,0^{T},(y^{0})^{T})^{T}\in\mathbb{R}^{1+n_{1}+n_{2}}$, then we have $U^{0}=u^{0}(u^{0})^{T}$. Notice that $u^{0}\in\mathcal {H}_{\mathcal{F}}$ and $U^{0}\in\mathcal{D}_{\mathcal{F}}^{*}$.

For case (ii): When $U^{0}b=0$, we have $e_{1}^{T}U^{0}\bar {b}=e_{1}^{T}U^{0}((a_{5}-a_{4})e_{1}-b)>0$, which means that $U^{0}\bar{b}\neq0$. Define $U^{1}\triangleq{U^{0}\bar{b}\bar{b}^{T}U^{0}\over\bar{b}^{T}U^{0}\bar{b}}$ and $U^{2}\triangleq U^{0}-U^{1}$. Then we have

$$U^2\bullet\bigl(aa^T-C_1^TC_1 \bigr)=U^0\bullet\bigl(aa^T-C_1^TC_1 \bigr)\geqslant 0. $$

We can check all the required conditions in $\mathcal{K}$ to verify that $U^{1}, U^{2} \in\mathcal{K}$. Since U ⁰ is an extreme point of $\mathcal{K}'$, we have $U^{0}=U^{1}={U^{0}\bar{b}\bar{b}^{T}U^{0}\over\bar{b}^{T}U^{0}\bar{b}}$. From $U^{0}\bar{b}\in\mathcal{H}_{\mathcal{F}}$, we know $U^{0}\in\mathcal {D}_{\mathcal{F}}^{*}$.

When $U^{0}\bar{b}=0$, similar to the situation of U ⁰ b=0, we can show that $U^{0}={U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}\in\mathcal{D}_{\mathcal{F}}^{*}$.

When U ⁰ b≠0 and $U^{0}\bar{b}\neq0$, define $U^{1}\triangleq {U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}$ and U ²≜U ⁰−U ¹. We first consider that $U^{2}\bar{b}=0$. In this case, $U^{0}\bar{b}={b^{T}U^{0}\bar {b}\over b^{T}U^{0}b}U^{0}b$. Noticing that $U^{2}b=U^{2}\bar{b}=0$ and

$$U^2\bullet\bigl(aa^T-C_1^TC_1 \bigr)=U^0\bullet\bigl(aa^T-C_1^TC_1 \bigr)\geqslant 0, $$

we have $U^{1}, U^{2} \in\mathcal{K}$. Since U ⁰ is an extreme point of $\mathcal{K}'$, we have $U^{0}=U^{1}={U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}$. Noticing $U^{0}\bar{b}\in \mathcal{H}_{\mathcal{F}}$, we have $U^{0}\in\mathcal{D}_{\mathcal {F}}^{*}$. Then we consider that $U^{2}\bar{b}\neq0$. In this case, $U^{2}a={1\over a_{5}-a_{4}}(a_{4}U^{2}\bar{b}+a_{5}U^{2}b)={1\over a_{5}-a_{4}}a_{4}U^{2}\bar {b}\neq0$. Let $U^{3}\triangleq{U^{2}aa^{T}U^{2}\over a^{T}U^{2}a}$ and U ⁴≜U ²−U ³=U ⁰−U ¹−U ³. From Lemma 11, U ⁴ is positive semidefinite. Therefore,

$$U^0\bullet\bigl(C_1^TC_1\bigr)= \bigl(U^1+U^3+U^4\bigr)\bullet \bigl(C_1^TC_1\bigr)\geqslant \bigl(U^1+U^3 \bigr)\bullet \bigl(C_1^TC_1\bigr). $$

Notice that

Let $\tau_{1}\triangleq{a_{5}\over a_{5}-a_{4}}$ and τ ₂≜τ ₁−1. Then, $a=\tau_{1}b+\tau_{2}\bar{b}$. From a ^T U ⁰ b=∥C ₁ U ⁰ b∥ and $a^{T}U^{0}\bar{b}=\|C_{1}U^{0}\bar{b}\|$, we know

and

$$\bigl(a^TU^0a\bigr)^2=\tau_1 \bigl(a^TU^0b\bigr)a^TU^0a+ \tau_2^2\bigl(a^TU^0\bar{b} \bigr)^2+\tau_1\tau _2\bigl(a^TU^0b \bigr)a^TU^0\bar{b}. $$

Therefore,

From $a^{T}U^{0}a=\tau_{1}a^{T}U^{0}b+\tau_{2}a^{T}U^{0}\bar{b}=\tau_{1}\|C_{1}U^{0}b\|+\tau _{2}\|C_{1}U^{0}\bar{b}\|\geqslant \|C_{1}U^{0}a\|$, we have $a^{T}U^{2}a [(U^{1}+U^{3})\bullet(C_{1}^{T}C_{1}) - a^{T}U^{0}a ]\geqslant 0$. Consequently, $U^{0}\bullet(C_{1}^{T}C_{1})\geqslant a^{T}U^{0}a$. The equality sign holds if and only if $U^{4}\bullet(C_{1}^{T}C_{1})=0$ and the two vectors ${ a^{T}U^{0}a\brack C_{1}U^{0}a }$ and ${ a^{T}U^{0}b\brack C_{1}U^{0}b }$ are linearly dependent. This also implies that ${ a^{T}U^{0}\bar{b}\brack C_{1}U^{0}\bar{b} }$ and ${ a^{T}U^{0}b\brack C_{1}U^{0}b }$ are linearly dependent. From this result, we can verify that U ¹ and U ² are both in $\mathcal{K}$. Since U ⁰ is an extreme point in $\mathcal{K}'$, we have $U^{0}=U^{1}={U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}$. Again, noticing $U^{0}b\in\mathcal{H}_{\mathcal{F}}$, we have $U^{0}\in\mathcal {D}_{\mathcal{F}}^{*}$. This completes the proof of case (ii).
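The identity $a=\tau_{1}b+\tau_{2}\bar{b}$ used in case (ii) follows directly from the definitions $b=a-a_{4}e_{1}$ and $\bar{b}=a_{5}e_{1}-a$. As a short worked check, added here for convenience:

$$\tau_1b+\tau_2\bar{b}=(\tau_1-\tau_2)a+(\tau_2a_5-\tau_1a_4)e_1 =a+{a_4a_5-a_5a_4\over a_5-a_4}e_1=a, $$

where we used $\tau_{1}-\tau_{2}=1$, $\tau_{1}={a_{5}\over a_{5}-a_{4}}$ and $\tau_{2}={a_{4}\over a_{5}-a_{4}}$. Both coefficients are nonnegative because $a_{5}>a_{4}\geqslant 0$, which is what allows the triangle inequality $\tau_{1}\|C_{1}U^{0}b\|+\tau_{2}\|C_{1}U^{0}\bar{b}\|\geqslant\|C_{1}U^{0}a\|$.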

For case (iii): We let $U^{1}\triangleq\lambda U^{0}\bar{b}\bar {b}^{T}U^{0}$ and $U^{2}\triangleq U^{0}-U^{1}$ with $\lambda>0$ being a sufficiently small number. One can easily check that $U^{1}\in\mathcal {K}$. Notice that when $\lambda$ is sufficiently small, $U^{2}$ is positive semidefinite. From $a^{T}U^{0}b>\|C_{1}U^{0}b\|$, we know ${ a^{T}U^{0}b\brack C_{1}U^{0}b }$ is an interior point of $\mathcal{SOC}(n_{1})$. Therefore, ${ a^{T}U^{2}b\brack C_{1}U^{2}b }={ a^{T}U^{0}b\brack C_{1}U^{0}b }-\lambda(b^{T}U^{0}\bar{b}){ a^{T}U^{0}\bar{b}\brack C_{1}U^{0}\bar{b} } \in\mathcal{SOC}(n_{1})$ when $\lambda$ is sufficiently small, i.e., $a^{T}U^{2}b\geqslant \|C_{1}U^{2}b\|$. We can also see that

$$\begin{array} {l} U^2\bullet\bigl(aa^T-C_1^TC_1 \bigr)=\bigl(U^0-U^1\bigr)\bullet\bigl(aa^T-C_1^TC_1 \bigr)=U^0\bullet \bigl(aa^T-C_1^TC_1 \bigr)\geqslant 0, \\[4pt] b^TU^2e_1=b^TU^0e_1- \lambda\bigl(b^TU^0\bar{b}\bigr)\bar{b}^TU^0e_1 \geqslant 0 \quad \textrm{and} \\[4pt] a^TU^2\bar{b}=\bigl(1-\lambda\bigl(\bar{b}^TU^0 \bar{b}\bigr)\bigr)a^TU^0\bar{b}\geqslant \bigl\| \bigl(1-\lambda \bigl(\bar{b}^TU^0\bar{b}\bigr)\bigr)C_1U^0 \bar{b}\bigr\|=\bigl\|C_1U^2\bar{b}\bigr\|. \end{array} $$

Consequently, $U^{2} \in\mathcal{K}$. Since $U^{0}$ is an extreme point of $\mathcal{K}'$, we have $U^{0}={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}U^{1}={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{2}}U^{2}$. However, $a^{T}U^{0}b ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}a^{T}U^{1}b ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}\lambda(\bar {b}^{T}U^{0}b)(a^{T}U^{0}\bar{b}) ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}\lambda(\bar{b}^{T}U^{0}b)\| C_{1}U^{0}\bar{b}\| ={\mathrm{tr\ } U^{0}\over\mathrm{tr\ } U^{1}}\|C_{1}U^{1}b\| =\|C_{1}U^{0}b\|$, which contradicts $a^{T}U^{0}b>\|C_{1}U^{0}b\|$. This shows that no extreme point of $\mathcal{K}'$ exists in case (iii).

For case (iv): The proof is similar to that of case (iii).

For case (v): Let

Since U ⁰ is an extreme point of $\mathcal{K}'$, the corresponding $(U^{0}, S_{1}^{0}, S_{2}^{0}, s^{0}_{1}, s^{0}_{2}, s^{0}_{3}, s^{0}_{4}, s^{0}_{5})$ is an extreme point of $\mathcal{L}'$.

From Lemma 9, letting $r_{U}\triangleq\mathrm{rank}(U^{0}), r_{S_{1}}\triangleq\mathrm{rank}(S_{1}^{0}), r_{S_{2}}\triangleq\mathrm{rank}(S_{2}^{0})$ and $r_{i}\triangleq\mathrm{rank}(s_{i}^{0})$, $i=1,\ldots,5$ (treating each nonnegative number as a matrix of order one), we have

$$r_U(r_U+1)+r_{S_1}(r_{S_1}+1)+r_{S_2}(r_{S_2}+1)+ \sum_{i=1}^5r_i(r_i+1) \leqslant 2(n_1+1) (n_1+2)+10. $$

Under the conditions of case (v), we have $a^{T}U^{0}b>\|C_{1}U^{0}b\|$ and $a^{T}U^{0}\bar{b}>\|C_{1}U^{0}\bar{b}\|$. From Lemma 10, $r_{S_{1}}=r_{S_{2}}=1+n_{1}$. Furthermore, $s^{0}_{2}={1\over a_{5}}b^{T}U^{0}(a+\bar {b})>0$ and $s^{0}_{3}={1\over a_{5}}\bar{b}^{T}U^{0}(a+\bar{b})>0$. Hence the above inequality becomes

$$r_U(r_U+1)+r_1(r_1+1)+r_4(r_4+1)+r_5(r_5+1) \leqslant 6. $$

If r _U=1, then one can easily verify that $U^{0}\in\mathcal {D}_{\mathcal{F}}^{*}$. If r _U=2, we show that U ⁰ cannot be an extreme point of $\mathcal{K}'$. In this situation, r ₁=r ₄=r ₅=0, i.e., $s_{1}^{0}=s_{4}^{0}=s_{5}^{0}=0$. From a ^T U ⁰ b>0 and $a^{T}U^{0}\bar{b}>0$, we have U ⁰ b≠0 and $U^{0}\bar{b}\neq0$. Define $U^{1} \triangleq {U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}$ and U ²≜U ⁰−U ¹. From Lemma 11, U ² is positive semidefinite and rank(U ²)=1. Since $s_{4}^{0}=\bar{b}^{T}U^{0}b=0$, we have $U^{2}\bar {b}=U^{0}\bar{b}\neq0$. Therefore, $U^{2}={U^{2}\bar{b}\bar{b}^{T}U^{2}\over\bar {b}^{T}U^{2}\bar{b}}={U^{0}\bar{b}\bar{b}^{T}U^{0}\over\bar{b}^{T}U^{0}\bar{b}}$. This means $U^{0}={U^{0}bb^{T}U^{0}\over b^{T}U^{0}b}+{U^{0}\bar{b}\bar{b}^{T}U^{0}\over \bar{b}^{T}U^{0}\bar{b}}$. Notice that $U^{0}\bar{b}$ and U ⁰ b are linearly independent. (Otherwise, $0\neq b^{T}U^{0}b=\tau b^{T}U^{0}\bar{b}=0$ for some τ≠0, which causes a contradiction.) One can further verify that ${\mathrm{tr\ }U^{0}\over\mathrm{tr\ } U^{1}}U^{1}$ and ${\mathrm{tr\ }U^{0}\over\mathrm{tr\ } U^{2}}U^{2}$ are both in $\mathcal{K}'$ and U ⁰ is a convex combination of these two distinct points. This shows that U ⁰ cannot be an extreme point of $\mathcal{K}'$.

After checking all the cases, we know $\mathcal{K}\subseteq\mathcal {D}_{\mathcal{F}}^{*}$ and, consequently, $\mathcal{K}= \mathcal {D}_{\mathcal{F}}^{*}$. The proof of the remaining part of this theorem is similar to that of Theorem 1 and is omitted here. □

Remark 6

The proofs of Theorem 1 and Theorem 2 are similar. Here we provide an intuitive but less rigorous discussion of how the two theorems are related. Note that $\bar {b}=a_{5}e_{1}-a$. When $a_{5}\rightarrow\infty$, the term $a_{5}e_{1}$ dominates $a$ in the definition of $\bar{b}$. Therefore, $\bar{b}$ is effectively replaced by a positive multiple of $e_{1}$ and the computable representation in Theorem 2 degenerates to the one in Theorem 1. However, this limiting argument does not carry over to the proofs themselves, which differ, for example, in case (i) of each.
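The heuristic in this remark can be written as a one-line limit. This is an informal illustration in the same spirit as the remark, not a proof:

$${1\over a_5}\bar{b}=e_1-{1\over a_5}a\longrightarrow e_1\quad \textrm{as } a_5\rightarrow\infty. $$

Conditions such as $a^{T}U\bar{b}\geqslant \|C_{1}U\bar{b}\|$ are positively homogeneous in $\bar{b}$, so replacing $\bar{b}$ by ${1\over a_{5}}\bar{b}$ does not change them; in the limit they turn into the corresponding conditions on $e_{1}$, namely $a^{T}Ue_{1}\geqslant \|C_{1}Ue_{1}\|$, which appear in Theorem 1.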

As in the previous case, when a ₄=0, the results of Theorem 2 can be simplified.

Corollary 2

Given a nonempty set $\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times \mathbb{R}^{n_{2}}~|~\|x\|\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y, a_{1}+a_{2}^{T}x+a_{3}^{T}y\leqslant a_{5}\}$ with $a_{1}\in\mathbb{R}$, $a_{2}\in\mathbb{R}^{n_{1}}$, $a_{3}\in \mathbb{R}^{n_{2}}$ and a ₅⩾0. Let

Then we have

Moreover, the corresponding QCQP and LCoP have the same optimal value.

If there is $(\bar{x},\bar{y})\in\mathbb{R}^{n_{1}}\times\mathbb {R}^{n_{2}}$ such that $\|\bar{x}\|< a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}$ and $a_{1}+a_{2}^{T}\bar{x}+a_{3}^{T}\bar{y}<a_{5}$, then

$$\mathcal{D}_{\mathcal{F}} = \left \{M\in\mathcal{S}^{1+n_1+n_2}\left \vert \begin{array}{l}M-\lambda_1 C_3 - \lambda_2 \bigl(e_1\bar{b}^T+\bar{b}e_1^T\bigr)\\[4pt] \quad {}- \bigl(\bar{b}\psi^TC_2^T + C_2\psi\bar{b}^T \bigr)\in\mathcal {S}^{1+n_1+n_2}_+,\\[4pt] \lambda_1,\lambda_2\geqslant 0, \psi\in\mathcal{SOC}(n_1) \end{array} \right . \right \}. $$

Moreover, the corresponding LCoD problem

(13)

attains the same optimal value as that of the original QCQP.

Proof

It is sufficient to show that the four constraints in $\mathcal {D}_{\mathcal{F}}^{*}$ of this corollary imply the seven constraints in that of Theorem 2, when a ₄=0. Let b≜a−a ₄ e ₁=a. From $U\bullet(aa^{T}-C_{1}^{T}C_{1})\geqslant 0$ and

we have $(a^{T}Ua)^{2}\geqslant (a^{T}Ua)\mathrm{tr\ } C_{1}UC_{1}^{T}\geqslant \mathrm{tr\ } C_{1}Uaa^{T}UC_{1}^{T}=\|C_{1}Ua\|^{2}$, which shows that a ^T Ub⩾∥C ₁ Ub∥. From $a^{T}U\bar{b}\geqslant 0$ and $e_{1}={1\over a_{5}}(a+\bar{b})$, we have $b^{T}Ue_{1}={1\over a_{5}}a^{T}U(a+\bar{b})\geqslant 0$. Moreover, the last constraint is satisfied due to the fact that $\bar{b}^{T}Ub=\bar{b}^{T}Ua\geqslant 0$. Since all of the seven constraints are satisfied, the rest follows Theorem 2. □

Remark 7

In the literature, a widely used form of the second-order cone constraint is c ^T x+d⩾∥Ax+b∥, in which $c\in \mathbb{R}^{n}, d\in\mathbb{R}, A\in\mathbb{R}^{m\times n}$ and $b\in\mathbb{R}^{m}$. In this case, the domain $\mathcal{F}$ can be equivalently written as $\mathcal{F}\triangleq\{(x,y_{0},y)\in \mathbb{R}^{n}\times\mathbb{R}\times\mathbb{R}^{m}~|~y_{0}\geqslant \|y\|, Ax+b=y, c^{T}x+d=y_{0}\}$. From Lemma 8 and Corollary 1, we have

where $B\triangleq\begin{bmatrix} A & 0 & -I_{m}\\ c^{T} & -1 & 0 \end{bmatrix}$. Therefore, a computable representation is also available for the domain defined by the second-order cone constraint in the widely used form.

Similarly, one can obtain the computable representation of c ^T x+d⩾∥Ax+b∥ with l⩽c ^T x+d⩽u, in which $c\in \mathbb{R}^{n}, d, l, u\in\mathbb{R}, A\in\mathbb{R}^{m\times n}$ and $b\in\mathbb{R}^{m}$.
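The lifting in this remark can be illustrated numerically. The sketch below assumes NumPy and random data; the right-hand side $-(b,d)$ of the linear system is an assumption inferred from $Ax+b=y$ and $c^{T}x+d=y_{0}$, since the corresponding display is not reproduced here. It builds $B$ acting on the lifted variable $(x,y_{0},y)$ and checks that a point of the original form maps to a point satisfying the linear system, with the same cone membership.

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 4, 3
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
c = rng.standard_normal(n)
d = 1.0

# B = [[A, 0, -I_m], [c^T, -1, 0]] acts on the lifted variable (x, y0, y).
B = np.block([
    [A, np.zeros((m, 1)), -np.eye(m)],
    [c.reshape(1, n), -np.ones((1, 1)), np.zeros((1, m))],
])
rhs = -np.concatenate((b, [d]))        # assumed: B (x, y0, y) = -(b, d)

x = rng.standard_normal(n)
y = A @ x + b                          # y  = A x + b
y0 = c @ x + d                         # y0 = c^T x + d
lifted = np.concatenate((x, [y0], y))

residual = np.linalg.norm(B @ lifted - rhs)
in_cone_lifted = bool(y0 >= np.linalg.norm(y))
in_cone_original = bool(c @ x + d >= np.linalg.norm(A @ x + b))
```

In the lifted variables the cone constraint $y_{0}\geqslant \|y\|$ has the form covered by Corollary 1, and the affine coupling is carried entirely by the linear equations.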

Remark 8

According to Lemma 6, from Theorems 1, 2 and Corollary 2, a bigger set $\mathcal{F}=\{(x,y)\in\mathbb{R}^{n_{1}}\times\mathbb{R}^{n_{2}}| x^{T}x\leqslant (a_{1}+a_{2}^{T}x+a_{3}^{T}y)^{2}, a_{4}\leqslant a_{1}+a_{2}^{T}x+a_{3}^{T}y\leqslant a_{5}\}$ with $a_{4}, a_{5}\in\mathbb{R}$ can be treated as the union of several sets discussed in the above theorems. Consequently, this set $\mathcal{F}$ also has a computable representation.
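One way to read the union in this remark is to split the set by the sign of $s=a_{1}+a_{2}^{T}x+a_{3}^{T}y$. The sketch below is an illustration only: the two-piece split, the function names and the numerical data are assumptions, and the paper does not spell out which pieces it has in mind. It checks on a grid that the quadratic description agrees with the union of two second-order cone pieces, one in $s$ and one in $-s$.

```python
import itertools

def affine(x, y, a1, a2, a3):
    """s = a1 + a2^T x + a3^T y."""
    return a1 + sum(p * q for p, q in zip(a2, x)) + sum(p * q for p, q in zip(a3, y))

def soc(norm_sq, s):
    """||x|| <= s, written as s >= 0 and ||x||^2 <= s^2."""
    return s >= 0 and norm_sq <= s * s

def in_big_set(x, y, a1, a2, a3, a4, a5):
    s = affine(x, y, a1, a2, a3)
    return sum(v * v for v in x) <= s * s and a4 <= s <= a5

def in_union(x, y, a1, a2, a3, a4, a5):
    s = affine(x, y, a1, a2, a3)
    norm_sq = sum(v * v for v in x)
    upper = soc(norm_sq, s) and max(a4, 0.0) <= s <= a5        # piece with s >= 0
    lower = soc(norm_sq, -s) and max(-a5, 0.0) <= -s <= -a4    # piece with s <= 0 (data negated)
    return upper or lower

a1, a2, a3, a4, a5 = 0.5, (1.0, -0.5), (2.0,), -3.0, 4.0
grid = [-2.0, -1.0, -0.25, 0.0, 0.25, 1.0, 2.0]
points = [((p, q), (r,)) for p, q, r in itertools.product(grid, repeat=3)]
agree = all(
    in_big_set(x, y, a1, a2, a3, a4, a5) == in_union(x, y, a1, a2, a3, a4, a5)
    for x, y in points
)
hits_lower = any(
    affine(x, y, a1, a2, a3) < 0 and in_big_set(x, y, a1, a2, a3, a4, a5)
    for x, y in points
)
```

Each piece is a set of the type treated in the theorems above once the data $(a_{1},a_{2},a_{3})$ are negated for the lower piece.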

4 Concluding Remarks

In this paper, we have developed an exact computable representation of the QCQP problem whose feasible domain is defined by one second-order cone constraint and two special linear constraints. In each case, the representation involves a linear conic programming problem with linear, second-order cone and semidefinite constraints. We have shown that finding an optimal extreme solution to such a linear conic program can lead to an optimal solution to the original QCQP problem. In particular, we now know that the problem of optimizing a nonconvex quadratic function subject to one general second-order cone constraint is computable. We expect the results obtained will further advance the study of copositive programming problems.

References

Burer, S.: On the copositive representation of binary and continuous nonconvex quadratic programs. Math. Program. 120, 479–495 (2009)
Article MathSciNet MATH Google Scholar
Burer, S., Anstreicher, K.M.: Second-order cone constraints for extended trust-region subproblems. SIAM J. Optim. (2012). www.optimizationonline.org/DBHTML/2011/03/2957.html
Burer, S.: Copositive programming. In: Anjos, M.F., Lasserre, J.B. (eds.) Handbook on Semidefinite, Conic and Polynomial Optimization. Springer, Berlin (2011)
Google Scholar
Burer, S., Dong, H.: Representing quadratically constrained quadratic programs as generalized copositive programs. Oper. Res. Lett. 40, 203–206 (2011)
Article MathSciNet Google Scholar
Eichfelder, G., Povh, J.: On the set-semidefinite representation of non-convex quadratic programs with cone constraints. Croat. Oper. Res. Rev. 1, 26–39 (2011)
MathSciNet Google Scholar
Eichfelder, G., Povh, J.: On the set-semidefinite representation of nonconvex quadratic programs over arbitrary feasible sets. Optim. Lett. (2012). doi:10.1007/s11590-012-0450-3
MATH Google Scholar
Kim, S., Kojima, M.: Exact solutions of some nonconvex quadratic optimization problems via SDP and SOCP relaxations. Comput. Optim. Appl. 26, 143–154 (2003)
Article MathSciNet MATH Google Scholar
Kojima, M., Kim, S., Waki, H.: A general framework for convex relaxation of polynomial optimization problems over cones. J. Oper. Res. Soc. Jpn. 46, 125–144 (2003)
MathSciNet MATH Google Scholar
Nemirovskii, A., Scheinberg, K.: Extension of Karmarkar’s algorithm onto convex quadratically constrained quadratic problems. Math. Program. 72, 273–289 (1996)
MathSciNet MATH Google Scholar
Pardalos, P.M., Vavasis, S.A.: Quadratic programming with one negative eigenvalue is NP-Hard. J. Glob. Optim. 1, 15–22 (1991)
Article MathSciNet MATH Google Scholar
Pataki, G.: On the rank of extreme matrices in semidefinite programs and the multiplicity of optimal eigenvalues. Math. Oper. Res. 23, 339–358 (1998)
Article MathSciNet MATH Google Scholar
Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1972)
Google Scholar
Sturm, J.F., Zhang, S.: On cones of nonnegative quadratic functions. Math. Oper. Res. 28, 246–267 (2003)
Ye, Y., Zhang, S.: New results on quadratic minimization. SIAM J. Optim. 14, 245–267 (2003)

Acknowledgements

The authors would like to thank the anonymous reviewers for their valuable comments and suggestions to improve the quality of the paper.

Author information

Authors and Affiliations

Department of Management Science and Engineering, Zhejiang University, Hangzhou, 310058, China
Qingwei Jin
School of Business Administration, Southwestern University of Finance and Economics, Chengdu, 611130, China
Ye Tian
Edward P. Fitts Department of Industrial and Systems Engineering, North Carolina State University, Raleigh, NC, 27695, USA
Zhibin Deng & Shu-Cherng Fang
Department of Mathematical Sciences, Tsinghua University, Beijing, 100084, China
Wenxun Xing

Corresponding author

Correspondence to Ye Tian.

Additional information

This work was generously supported by the US Army Research Office (Grant No. W911NF-04-D-0003), by the North Carolina State University Edward P. Fitts Fellowship, and by the National Natural Science Foundation of China (No. 11171177). It is the policy of the Army Research Office that university personnel do not need to do joint work with ARO personnel in order to receive grants from the Army Research Office.

About this article

Cite this article

Jin, Q., Tian, Y., Deng, Z. et al. Exact Computable Representation of Some Second-Order Cone Constrained Quadratic Programming Problems. J. Oper. Res. Soc. China 1, 107–134 (2013). https://doi.org/10.1007/s40305-013-0009-8

Received: 16 December 2012
Revised: 26 February 2013
Accepted: 02 March 2013
Published: 20 March 2013
Issue Date: March 2013
DOI: https://doi.org/10.1007/s40305-013-0009-8