Keywords

1 Introduction

The standard approach to uncertainty in portfolio selection models is to refer to a unique real-world probability measure \(\mathbf {P}\), usually estimated from data. Nevertheless, sometimes, due to the presence of unobserved variables or since the information is partial, misspecified or “imprecise”, it is no longer possible to handle uncertainty through a single probability measure. This has led to the development of models taking care of ambiguity, i.e., coping with a class of probability measures.

In recent years a growing interest has been addressed towards ambiguity in decision and economic literature (see, e.g., the survey papers [5] and [6]). Mainly, the aim is to propose models generalizing the classical subjective expected utility model in a way to avoid paradoxical situations like that discussed in [4], showing preference patterns under ambiguity that are not consistent with the expected utility paradigm. In particular, referring to the Anscombe-Aumann framework, the seminal papers [17] and [7] incorporate ambiguity in decisions, respectively, either through a non-additive uncertainty measure or through a class of probability measures. The two approaches are generally not equivalent but in case of a 2-monotone capacity [19] they reduce to Choquet expected utility. Choquet expected utility has been investigated also in a von Neumann-Morgenstern-like framework, by referring to “objective” completely monotone capacities modeling generalized lotteries [2, 10, 14].

In this paper, we formalize a dynamic portfolio selection problem under ambiguity, referring to an \(\epsilon \)-contamination [9] of a binomial market model [3] and to a CRRA utility function [18]. Thanks to the completeness of the market, we formulate the dynamic portfolio selection in terms of the final wealth. Moreover, since the lower envelope of the \(\epsilon \)-contamination class, obtained from the real-world probability \(\mathbf {P}\), is a completely monotone capacity, we assume that the investors’ preferences are consistent with the Choquet expected utility theory. The problem amounts to finding a final wealth maximizing the corresponding Choquet expected utility functional. We stress that, due to the properties of the Choquet integral [8], an agent which is a Choquet expected utility maximizer is actually an expected utility maximinimizer, with respect to the \(\epsilon \)-contamination class.

Notice that, referring to a one-period model, a portfolio selection problem under ambiguity via maximinimization has been faced in [15, 16]. Moreover, our problem is analogous to that formulated in Appendix C of [11], the latter differing from ours for working in a continuous setting and considering a distortion of the probability \(\mathbf {P}\).

We show that our problem can be reduced to a family of linearly constrained concave problems, indexed by the set of all permutations of the sample space. Moreover, the initial problem is proved to have a unique optimal solution. Focusing on the special case of uniformly distributed stock returns (determining a uniform probability distribution on the sample space) we provide a characterization of the optimal solution relying only on a number of permutations equal to the cardinality of the sample space. Yet in the uniform case, we study the effect of the contamination parameter \(\epsilon \) on the optimal portfolio, showing the presence of a threshold above which the optimal self-financing strategy reduces to 0 for all times: this highlights that with such values of \(\epsilon \) the ambiguity in the model is so high to make the risk-free portfolio the most suitable choice.

This paper differs from the approach used in [12], where among their results, the authors characterize a generalization of the random walk by replacing a probability with a capacity on the branches of a tree, for each time and by imposing dynamic consistency (or rectangularity). We stress that dynamic consistency is a mathematically helpful property in order to capture a form of martingale property, but it is a strong limitation for a non-additive measure.

2 Dynamic Portfolio Selection in the Binomial Model

We refer to the multi-period binomial model introduced by Cox, Ross and Rubinstein [3]. Such model considers a perfect (competitive and frictionless) market under no-arbitrage, where two basic securities are traded: a non-dividend-paying stock and a risk-free bond.

The evolution of the prices of such securities is expressed by the stochastic processes \(\{S_0,\ldots ,S_T\}\) and \(\{B_0,\ldots ,B_T\}\), with \(T \in \mathbb {N}\), \(S_0=s > 0\), \(B_0 = 1\), and for \(t = 1,\ldots ,T\),

$$ \frac{S_t}{S_{t - 1}} = \left\{ \begin{array}{ll} u, &{} \text{ with } \text{ probability } p\\ d, &{} \text{ with } \text{ probability } 1 - p \end{array} \right. \quad \text{ and } \quad \frac{B_t}{B_{t-1}} = (1 + r), $$

where \(u>d>0\) are the “up” and “down” stock price coefficients, r is the risk-free interest rate over each period, satisfying \(u> (1+r) > d\), and \(p \in (0,1)\) is the probability of an “up” movement for the stock price.

All the processes we consider are defined on a filtered probability space \((\varOmega ,\mathcal {F},\{\mathcal {F}_t\}_{t=0,\ldots ,T},\mathbf {P})\), where \(\varOmega = \{1,\ldots ,2^T\}\) and \(\mathcal {F}_t\) is the algebra generated by random variables \(\{S_0,\ldots ,S_t\}\), for \(t = 0,\ldots ,T\), with \(\mathcal {F}_0 = \{\emptyset ,\varOmega \}\) and \(\mathcal {F}_T = \mathcal {F}= \wp (\varOmega )\), where the latter denotes the power set of \(\varOmega \). As usual \(\mathbf {E}^\mathbf {P}\) denotes the expected value with respect to \(\mathbf {P}\).

The process \(\{S_0,\ldots ,S_T\}\) is a multiplicative binomial process where the returns \(\frac{S_1}{S_0},\ldots ,\frac{S_T}{S_{T-1}}\) are i.i.d. random variables, and the probability \(\mathbf {P}\), usually said real-world probability, is completely singled out by the parameter p, indeed, for \(t = 0,\ldots ,T\) and \(k = 0, \ldots , t\), we have that

$$ \mathbf {P}(S_t = u^kd^{t-k}s) = \frac{t!}{k!(t-k)!}p^k(1-p)^{t-k}. $$

We consider a self-financing strategy \(\{\theta _0,\ldots ,\theta _{T-1}\}\), that is an adapted process where each \(\theta _t\) is the (random) number of shares of stock to buy (if positive) or short-sell (if negative) at time t up to time \(t+1\). Such strategy determines an adapted wealth process \(\{V_0,\ldots ,V_T\}\), where, for \(t=0,\ldots ,T-1\),

$$\begin{aligned} V_{t+1} = (1+r)V_t + \theta _tS_t\left( \frac{S_{t+1}}{S_t} - (1+r)\right) . \end{aligned}$$
(1)

This market model is said to be complete, i.e., there is a unique probability measure \(\mathbf {Q}\) on \(\mathcal {F}\) equivalent to \(\mathbf {P}\), usually said risk-neutral probability, such that the discounted wealth process of any self-financing strategy is a martingale under \(\mathbf {Q}\):

$$ \frac{V_t}{(1+r)^t} = \mathbf {E}_t^\mathbf {Q}\left[ \frac{V_T}{(1+r)^T}\right] , $$

for \(t=0,\ldots ,T\), where \(\mathbf {E}_t^\mathbf {Q}[\cdot ] = \mathbf {E}^\mathbf {Q}[\cdot |\mathcal {F}_t]\) and \(\mathbf {E}^\mathbf {Q}\) is the expectation with respect to \(\mathbf {Q}\). Notice that \(\mathbf {E}_0^\mathbf {Q}\) can be identified with \(\mathbf {E}^\mathbf {Q}\). In particular, completeness implies that every payoff \(V_T \in \mathbb {R}^\varOmega \) depending only on the stock price history can be replicated by a dynamic self-financing strategy \(\{\theta _0,\ldots ,\theta _{T-1}\}\) and its unique no-arbitrage price at time \(t = 0\) is

$$ V_0 = \frac{\mathbf {E}^\mathbf {Q}[V_T]}{(1+r)^T}. $$

Notice that the process \(\{S_0,\ldots ,S_T\}\) continues to be a multiplicative binomial process also under \(\mathbf {Q}\), the latter being completely characterized by the parameter

$$ q = \frac{(1+r) - d}{u-d} \in (0,1), $$

which replaces p. In particular, every state \(i \in \varOmega \) is identified with the path of the stock price evolution corresponding to the T-digit binary expansion of number \(i - 1\), in which zeroes are interpreted as “up” movements and ones as “down” movements. If in such representation there are k “up” movements and \(T-k\) “down” movements, then (we avoid braces to simplify writing)

$$ \mathbf {P}(i) = p^k(1-p)^{T-k} \quad \text{ and } \quad \mathbf {Q}(i) = q^k(1-q)^{T-k}. $$

In the market model we have just introduced, the real-world probability measure \(\mathbf {P}\) is assumed to encode the beliefs of an investor. Assuming that investor’s preferences are consistent with the expected utility theory, given a utility function u with \(\mathrm {dom}\,u = \mathbb {R}_{++} = (0,+\infty )\), usually assumed to be sufficiently regular, strictly increasing and strictly concave, and an initial wealth \(V_0 > 0\), the dynamic portfolio selection consists in determining a self-financing strategy \(\{\theta _0,\ldots ,\theta _{T-1}\}\) resulting in a final wealth \(V_T \in \mathbb {R}_{++}^\varOmega \) solving

$$ \max _{\theta _0,\ldots ,\theta _{T-1}} \mathbf {E}^\mathbf {P}[u(V_T)]. $$

Due to the dynamic consistency property of the conditional expected value operator, this problem can be efficiently solved through dynamic programming [18].

In this paper we suppose that our investor has ambiguous beliefs, meaning that, rather than having a single probability \(\mathbf {P}\), he/she actually considers a class of probabilities \(\mathcal {P}\). In this case, the choice of \(\{\theta _0,\ldots ,\theta _{T-1}\}\) should take into account all the probabilities in \(\mathcal {P}\), by adopting a suitable criterion of choice, justified by a normative criterion of rationality.

3 Modeling Ambiguity Through the \(\epsilon \)-Contamination Model

Here we consider the binomial market model (see Sect. 2), introducing ambiguity to get an \(\epsilon \)-contaminated binomial model. Given the real-world probability \(\mathbf {P}\) defined on \(\mathcal {F}\) (which is completely singled out by p) and \( \epsilon \in (0,1)\), the corresponding \(\epsilon \)-contamination model (see, e.g., [9, 19]) is the class of probability measures on \(\mathcal {F}\) defined as

$$ \mathcal {P}_{p,\epsilon } = \{\mathbf {P}' = (1-\epsilon )\mathbf {P}+ \epsilon \mathbf {P}'' \,:\,\mathbf {P}'' \text{ is } \text{ a } \text{ probability } \text{ measure } \text{ on } \mathcal {F}\}, $$

whose lower envelope \(\nu _{p,\epsilon } = \min \mathcal {P}_{p,\epsilon }\) is a completely monotone normalized capacity defined on \(\mathcal {F}\) such that

$$ \nu _{p,\epsilon }(A) = \left\{ \begin{array}{ll} (1 - \epsilon )\mathbf {P}(A), &{} \text{ if } A \ne \varOmega ,\\[1ex] 1, &{} \text{ if } A = \varOmega . \end{array} \right. $$

For every permutation \(\sigma \) of \(\varOmega \), we can define a probability measure \(\mathbf {P}^\sigma \) on \(\mathcal {F}\), whose value on the singletons (we avoid braces to simplify writing) is

$$ \mathbf {P}^\sigma (\sigma (i)) = \nu _{p,\epsilon }(E_i^\sigma ) - \nu _{p,\epsilon }(E_{i+1}^\sigma ), $$

where \(E_i^\sigma = \{\sigma (i), \ldots , \sigma (2^T)\}\), for all \(i \in \varOmega \), and \(E_{2^T + 1}^\sigma = \emptyset \). It turns out that the class \(\mathcal {P}_{p,\epsilon }\) is the convex hull of probabilities \(\mathbf {P}^\sigma \)’s.

We consider a CRRA utility function defined, for \(\gamma >0\), \(\gamma \ne 1\), as

$$ u_\gamma (x) = \frac{x^{1-\gamma }}{1-\gamma }, \quad \text{ for } x > 0. $$

For every random variable \(V_T \in \mathbb {R}_{++}^\varOmega \), we can define the functional

where the integral on the right side is of Choquet type. In particular, since \(u_\gamma \) is strictly increasing, for all \(V_T \in \mathbb {R}_{++}^\varOmega \), if \(\sigma \) is a permutation of \(\varOmega \) such that \(V_T(\sigma (1)) \le \ldots \le V_T(\sigma (2^T))\) then the functional \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) can be expressed (see [8]) as follows

$$\begin{aligned} \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T] = \sum \limits _{k=1}^{2^T} \mathbf {P}^\sigma (k)\frac{(V_T(k))^{1-\gamma }}{1 - \gamma }. \end{aligned}$$
(2)

Proposition 1

The functional \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) is concave, that is, for all \(V_T,V_T' \in \mathbb {R}_{++}^\varOmega \)and all \(\alpha \in [0,1]\), it holds

$$ \mathbf {CEU}_{\gamma ,p,\epsilon }[\alpha V_T + (1-\alpha ) V_T'] \ge \alpha \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T] + (1-\alpha ) \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T']. $$

Proof

Since \(u_\gamma \) is (strictly) concave, for all \(i \in \varOmega \) and \(\alpha \in [0,1]\), we have

$$ u_\gamma (\alpha V_T(i) + (1-\alpha ) V_T'(i)) \ge \alpha u_\gamma (V_T(i)) + (1-\alpha ) u_\gamma (V_T'(i)). $$

Hence, since \(\nu _{p,\epsilon }\) is completely monotone and so 2-monotone, Theorems 4.24 and 4.35 in [8] imply (by monotonicity and concavity of the Choquet integral)

By Theorem 4.39 in [8], we also have that

$$ \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T] = \min _{\mathbf {P}' \in \mathcal {P}_{p,\epsilon }} \mathbf {E}^{\mathbf {P}'}[u_\gamma (V_T)], $$

therefore, \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) is a lower expected utility and maximizing it we are actually applying a maximin criterion of choice.

Given an initial wealth \(V_0 > 0\), our aim is to select a self-financing strategy \(\{\theta _0,\ldots ,\theta _{T-1}\}\) resulting in a final wealth \(V_T \in \mathbb {R}_{++}^\varOmega \), solving

$$ \max _{\theta _0,\ldots ,\theta _{T-1}} \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T]. $$

Taking into account the completeness of the market, the above problem can be rewritten maximizing over the final wealth \(V_T\)

$$\begin{aligned} \begin{array}{c} \mathrm {maximize}\; \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T] \\[1ex] \text{ subject } \text{ to: }\\[1ex] \left\{ \begin{array}{l} \mathbf {E}^\mathbf {Q}[V_T] - (1+r)^TV_0 = 0,\\[1ex] V_T \in \mathbb {R}_{++}^\varOmega . \end{array} \right. \end{array} \end{aligned}$$
(3)

By Proposition 1, the objective function in (3) is a concave function on \(\mathbb {R}_{++}^\varOmega \), subject to a linear constraint, therefore every local maximum is a global maximum and, further, the set of global maxima is convex [1].

Since by (2) the computation of \(\mathbf {CEU}_{\gamma ,p,\epsilon }[V_T]\) depends on a permutation \(\sigma \) such that the values of \(V_T\) are increasingly ordered, the above problem (3) can be decomposed in a family of optimization problems, each indexed by a permutation \(\sigma \) of \(\varOmega = \{1,\ldots ,2^T\}\), corresponding to a possible ordering of \(V_T\):

$$\begin{aligned} \begin{array}{c} \mathrm {maximize}\; \left[ \sum \limits _{k=1}^{2^T} \mathbf {P}^\sigma (k)\frac{(V_T(k))^{1-\gamma }}{1 - \gamma }\right] \\[3ex] \text{ subject } \text{ to: }\\[1ex] \left\{ \begin{array}{l} \sum \limits _{k=1}^{2^T} \mathbf {Q}(k)V_T(k) - (1+r)^TV_0 = 0,\\[3ex] V_T(\sigma (i-1)) - V_T(\sigma (i)) \le 0, \quad \text{ for } \text{ all } i \in \varOmega \setminus \{1\},\\[1ex] V_T \in \mathbb {R}_{++}^\varOmega . \end{array} \right. \end{array} \end{aligned}$$
(4)

Denoting by \(V_T^\sigma \) an optimal solution related to permutation \(\sigma \), an optimal solution \(V_T^*\) for (3) can be found by selecting \(V_T^\sigma \) for a permutation \(\sigma \) where the objective function is maximum. Notice that there can be more permutations where the maximum is attained.

Proposition 2

The following statements hold:

  1. (i)

    for every permutation \(\sigma \), problem (4) has an optimal solution and such optimal solution is unique;

  2. (ii)

    problem (3) has an optimal solution and such optimal solution is unique.

Proof

(i). The subset \(\mathcal {V}^\sigma \) of \(\mathbb {R}^\varOmega \) satisfying the equality and inequality constraints in (4) is closed, and \(\mathcal {V}^\sigma \cap \mathbb {R}_{++}^\varOmega \ne \emptyset \), since \(V_T = (1+r)^T V_0 \in \mathcal {V}^\sigma \cap \mathbb {R}_{++}^\varOmega \). Proceeding as in the proof of Theorem 2.12 in [13], we have that problem (4) has an optimal solution \(V_T^\sigma \), and such optimal solution is unique since the objective function in (4) is strictly concave [1], as \(u_\gamma \) is strictly concave.

(ii). Statement (i) implies that problem (3) has an optimal solution \(V_T^*\) obtained by selecting \(V_T^\sigma \) for a permutation \(\sigma \) where the objective function is maximum. Suppose there are two permutations \(\sigma ,\sigma '\) attaining the maximum and such that \(V_T^\sigma \ne V_T^{\sigma '}\). Since the set of optimal solutions of (3) is convex, this implies that, for all \(\alpha \in [0,1]\), \(V_T^\alpha = \alpha V_T^\sigma + (1-\alpha )V_T^{\sigma '}\) is an optimal solution of (3). So, since the number of permutations of \(\varOmega \) is finite, we can find an \(\alpha ^*\) such that \(V_T^{\alpha ^*}\) solves (4) for a permutation \(\sigma ''\) but \(V_T^{\alpha ^*} \ne V_T^{\sigma ''}\), reaching a contradiction. Finally, this implies that problem (3) has a unique optimal solution.

3.1 Characterization of the Case \(p=\frac{1}{2}\)

Here we focus on the special case \(p = \frac{1}{2}\), for which the probability distribution \(\mathbf {P}^\sigma \) on \(\varOmega \) is such that

$$ \mathbf {P}^{\sigma }(\sigma (1)) = \frac{1 + (2^T - 1)\epsilon }{2^T} \quad \text{ and } \quad \mathbf {P}^{\sigma }(\sigma (i)) = \frac{1-\epsilon }{2^T}, \quad \text{ for } \text{ all } i \in \varOmega \setminus \{1\}. $$

The definition of \(\mathbf {P}^\sigma \) shows that only the value \(\sigma (1)\) of every permutation \(\sigma \) deserves attention: all the permutations \(\sigma ,\sigma '\) such that \(\sigma (1) = \sigma '(1)\) can be considered to be equivalent. Therefore, we can restrict to \(2^T\) arbitrary permutations \(\sigma _1,\ldots ,\sigma _{2^T}\) satisfying \(\sigma _h(1) = h\), for all \(h \in \varOmega \).

In this case, the family of optimization problems (4) can be reduced to a family of optimization problems indexed by the permutations \(\sigma _1,\ldots ,\sigma _{2^T}\):

$$\begin{aligned} \begin{array}{c} \mathrm {maximize}\; \left[ \sum \limits _{k=1}^{2^T} \mathbf {P}^{\sigma _h}(k)\frac{(V_T(k))^{1-\gamma }}{1 - \gamma }\right] \\[3ex] \text{ subject } \text{ to: }\\[1ex] \left\{ \begin{array}{l} \sum \limits _{k=1}^{2^T} \mathbf {Q}(k)V_T(k) - (1+r)^TV_0 = 0,\\[3ex] V_T(\sigma _h(1)) - V_T({\sigma _h}(i)) \le 0, \quad \text{ for } \text{ all } i \in \varOmega \setminus \{1\},\\[1ex] V_T \in \mathbb {R}_{++}^\varOmega . \end{array} \right. \end{array} \end{aligned}$$
(5)

Analogously to problem (4), for every permutation \(\sigma _h\), problem (5) has a unique optimal solution \(V_T^{\sigma _h}\). Then, the optimal solution \(V_T^*\) for (3) can be found by selecting a \(V_T^{\sigma _h}\) for a permutation \(\sigma _h\) (possibly non-unique) where the objective function is maximum. This shows that, when \(p = \frac{1}{2}\), problem (3) can be solved by solving \(2^T\) problems (5).

Below we provide a characterization of the optimal solution of problem (3). In what follows, let \(\sigma \) be a permutation of \(\varOmega \) and \(I \subseteq \varOmega \setminus \{1\}\). If \(I \ne \emptyset \), for all \(i \in I\), consider the constants

$$ \begin{array}{c} A^{\sigma ,I}_i = \frac{\mathbf {P}^\sigma (\sigma (i))}{\sum \limits _{k \in I \cup \{1\}}\mathbf {P}^\sigma (\sigma (k))} \left[ \left( \frac{\mathbf {Q}(\sigma (i))}{\mathbf {P}^\sigma (\sigma (i))} - \frac{\mathbf {Q}(\sigma (1))}{\mathbf {P}^\sigma (\sigma (1))} \right) \left( \sum \limits _{k \in (I \cup \{1\}) \setminus \{i\}} \mathbf {P}^\sigma (\sigma (k)) \right) \right. \\ \left. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + \sum \limits _{k \in I \setminus \{i\}} \left( \mathbf {P}^\sigma (\sigma (k)) \left( \frac{\mathbf {Q}(\sigma (1))}{\mathbf {P}^\sigma (\sigma (1))} - \frac{\mathbf {Q}(\sigma (k))}{\mathbf {P}^\sigma (\sigma (k))} \right) \right) \right] , \end{array} $$

where summations over an empty set are intended to be 0.

In turn, the above constants are used to define the following weights:

$$ \lambda _1^{\sigma ,I} = \left( \frac{(1+r)^TV_0}{ \begin{array}{c} \sum \limits _{k=1}^{2^T}\mathbf {Q}(\sigma (k))\left( \frac{1}{\mathbf {P}^\sigma (\sigma (k))} \left( \mathbf {Q}(\sigma (k)) + \mathbf {1}_{\{1\}}(k) \left( \sum \limits _{i \in I}A_i^{\sigma ,I}\right) \right. \right. \\ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \left. \left. - \sum \limits _{i \in I} \mathbf {1}_{\{i\}}(k) A_i^{\sigma ,I} \right) \right) ^{-\frac{1}{\gamma }} \end{array} } \right) ^{-\gamma }, $$

where \(\mathbf {1}_{\{i\}}\) is the indicator function of the singleton \(\{i\}\), while, for all \(i \in I\), set

$$ \lambda _i^{\sigma ,I} = A_i^{\sigma ,I} \lambda _1^{\sigma ,I}, $$

and \(\lambda _i^{\sigma ,I} = 0\), for all \(i \in \varOmega \setminus (I \cup \{1\})\). Notice that, if \(I = \emptyset \), then the two inner summations involving I in the definition of \(\lambda _1^{\sigma ,I}\) are set to 0: in this case, only \(\lambda _1^{\sigma ,I}\) will be non-null.

Theorem 1

For \(p = \frac{1}{2}\) and \(T \ge 1\), a random variable \(V_T \in \mathbb {R}_{++}^\varOmega \) is the optimal solution of problem (3) if and only if there is a permutation \(\sigma \) of \(\varOmega \) and a subset \(I \subseteq \varOmega \setminus \{1\}\) inducing the weights \(\lambda _i^{\sigma ,I}\)’s such that the following conditions hold:

  1. (i)

    \(\lambda _i^{\sigma ,I} \ge 0\), for all \(i \in I\);

  2. (ii)

    \(V_T(\sigma (1)) = \left( \frac{1}{\mathbf {P}^\sigma (\sigma (1))}\left( \mathbf {Q}(\sigma (1))\lambda _1^{\sigma ,I}+\sum \limits _{k=2}^{2^T}\lambda _k^{\sigma ,I}\right) \right) ^{-\frac{1}{\gamma }}\), for all \(i \in \varOmega \setminus \{1\}\), \(V_T(\sigma (i)) = \left( \frac{1}{\mathbf {P}^\sigma (\sigma (i))}\left( \mathbf {Q}(\sigma (i))\lambda _1^{\sigma ,I}-\lambda _i^{\sigma ,I}\right) \right) ^{-\frac{1}{\gamma }}\);

  3. (iii)

    \(V_T(\sigma (1)) \le V_T(\sigma (i))\), for all \(i \in \varOmega \setminus (I \cup \{1\})\);

and there is no other permutation \(\sigma '\) of \(\varOmega \) and no other subset \(I' \subseteq \varOmega \setminus \{1\}\) determining the weights \(\lambda ^{\sigma ',I'}\)’s and the random variable \(V_T' \in \mathbb {R}_{++}^\varOmega \) satisfying (i)–(iii) such that \(\mathbf {CEU}_{\gamma ,p,\epsilon }[V_T'] > \mathbf {CEU}_{\gamma ,p,\epsilon }[V_T]\).

Proof

Let \(\sigma \) be a permutation of \(\varOmega \). We first show that \(V_T \in \mathbb {R}_{++}^\varOmega \) is the optimal solution of the corresponding problem (5) if and only if there is a subset \(I \subseteq \varOmega \setminus \{1\}\) inducing weights \(\lambda _i^{\sigma ,I}\)’s satisfying (i)(iii). Denote by \(f(V_T) = f(V_T(1),\ldots ,V_T(2^T))\) and \(g(V_T) = g(V_T(1),\ldots ,V_T(2^T)) = 0\) the objective function and the equality constraint in (5), which are, respectively, (strictly) concave and linear. We also have that all inequality constraints in (5) are linear, therefore, the Karush-Kuhn-Tucker (KKT) conditions are necessary and sufficient in this case [1]. Define the Lagrangian function

$$ L(V_T,\lambda _1,\ldots ,\lambda _{2^T}) = f(V_T) - \lambda _1g(V_T) - \sum _{k = 2}^{2^T}\lambda _k(V_T(\sigma (1)) - V_T(\sigma (k))), $$

for which it holds

$$\begin{aligned} \frac{\partial L}{\partial V_T(\sigma (1))}= & {} \mathbf {P}^{\sigma }(\sigma (1))(V_T(\sigma (1)))^{-\gamma } - \lambda _1 \mathbf {Q}(\sigma (1)) - \sum _{k = 2}^{2^T} \lambda _k,\\ \frac{\partial L}{\partial V_T(\sigma (i))}= & {} \mathbf {P}^\sigma (\sigma (i))(V_T(\sigma (i)))^{-\gamma } - \lambda _1 \mathbf {Q}(\sigma (i)) + \lambda _i, \quad \text{ for } \text{ all } i \in \varOmega \setminus \{1\}. \end{aligned}$$

Imposing the KKT conditions, we look for \(V_T \in \mathbb {R}_{++}^\varOmega \), \(\lambda _1 \in \mathbb {R}\), \(\lambda _2,\ldots ,\lambda _{2^T} \ge 0\) such that \(\frac{\partial L}{\partial V_T(\sigma (k))} = 0\), for all \(k \in \varOmega \), \(g(V_T) = 0\), \(V_T(\sigma (1)) \le V_T(\sigma (i))\) and \(\lambda _i(V_T(\sigma (1)) - V_T(\sigma (i))) = 0\), for all \(i \in \varOmega \setminus \{1\}\).

By \(\frac{\partial L}{\partial V_T(\sigma (k))} = 0\), for all \(k \in \varOmega \), we derive

$$\begin{aligned} V_T(\sigma (1))= & {} \left( \frac{1}{\mathbf {P}^\sigma (\sigma (1))} \left( \lambda _1 \mathbf {Q}(\sigma (1)) + \sum _{k = 2}^{2^T} \lambda _k \right) \right) ^{-\frac{1}{\gamma }},\\ V_T(\sigma (i))= & {} \left( \frac{1}{\mathbf {P}^\sigma (\sigma (i))} \left( \lambda _1 \mathbf {Q}(\sigma (i)) - \lambda _i \right) \right) ^{-\frac{1}{\gamma }}, \quad \text{ for } \text{ all } i \in \varOmega \setminus \{1\}. \end{aligned}$$

Moreover, by the complementary slackness conditions \(\lambda _i(V_T(\sigma (1)) - V_T(\sigma (i))) = 0\), for all \(i \in \varOmega \setminus \{1\}\), there must exist \(I \subseteq \varOmega \setminus \{1\}\) such that \(\lambda _i = 0\), for all \(i \in \varOmega \setminus (I \cup \{1\})\), while, for all \(i \in I\), \(V_T(\sigma (1)) - V_T(\sigma (i)) = 0\).

The case \(I = \emptyset \) is trivial, thus suppose \(I \ne \emptyset \). For every \(i \in I\), equation \(V_T(\sigma (1)) - V_T(\sigma (i)) = 0\) holds if and only if

$$ \left( \frac{\mathbf {Q}(\sigma (1))}{\mathbf {P}^\sigma (\sigma (1))} - \frac{\mathbf {Q}(\sigma (i))}{\mathbf {P}^\sigma (\sigma (i))} \right) \lambda _1 + \left( \frac{1}{\mathbf {P}^\sigma (\sigma (1))} + \frac{1}{\mathbf {P}^{\sigma }(\sigma (i))} \right) \lambda _i + \frac{1}{\mathbf {P}^\sigma (\sigma (1))} \sum \limits _{k \in I \setminus \{i\}} \lambda _k = 0. $$

Choose an enumeration of \(I \cup \{1\} = \{i_1,i_2,\ldots ,i_n\}\) with \(i_1 = 1\). Then the above equations give rise to the homogeneous linear system \(A \mathbf{x} = \mathbf{0}\), whose unknown is the column vector \(\mathbf{x} = [\lambda _1\; \lambda _{i_2}\;\cdots \;\lambda _{i_n}]^T \in \mathbb {R}^{n \times 1}\) and whose coefficient matrix is \(A = \left[ \mathbf{q} | B \right] \in \mathbb {R}^{(n-1) \times n}\) where

Hence, \(B \in \mathbb {R}^{(n-1) \times (n-1)}\) can be decomposed in the sum of two matrices \(C,D \in \mathbb {R}^{(n-1) \times (n-1)}\), where C is a constant matrix with all entries equal to \(\frac{1}{\mathbf {P}^\sigma (\sigma (1))}\) and D is the diagonal matrix whose diagonal contains the elements \(\frac{1}{\mathbf {P}^\sigma (\sigma (i_2))}, \ldots , \frac{1}{\mathbf {P}^\sigma (\sigma (i_n))}\). Since the determinant does not change subtracting the first row from all other rows, we can consider the matrix

(6)

Applying the Laplace expansion along the first row we get that

$$ \det B = \det (C + D) = \frac{\sum \limits _{j = 1}^n \mathbf {P}^\sigma (\sigma (i_j))}{\prod \limits _{j = 1}^n \mathbf {P}^\sigma (\sigma (i_j))}, $$

and since \(\det B \ne 0\), we have that \(\mathrm {rank}\,B = n-1\) and the system admits non-trivial solutions, depending on one real parameter that we identify with \(\lambda _1\). Now, apply Cramer’s rule to the reduced system \(B \mathbf{y} = - \lambda _1 \mathbf{q}\) with unknown the column vector \(\mathbf{y} = \left[ \lambda _{i_2} \;\cdots \; \lambda _{i_n}\right] ^T \in \mathbb {R}^{(n-1)\times 1}\).

For \(j=2,\ldots ,n\), denote by \(B_{j - 1}\) the matrix obtained by substituting the \((j - 1)\)-th column of B with the vector \(- \lambda _1\mathbf{q}\). Applying the Laplace expansion along the \((j-1)\)-th column of \(B_{j - 1}\) and noticing that all minors can be transformed (by swapping rows and keeping track of sign changes) in the sum of a constant matrix and a diagonal matrix (possibly with a zero on the diagonal), we have

$$ \begin{array}{c} \det B_{j-1} = \left[ \left( \frac{\mathbf {Q}(\sigma (i_j))}{\mathbf {P}^\sigma (\sigma (i_j))} - \frac{\mathbf {Q}(\sigma (1))}{\mathbf {P}^\sigma (\sigma (1))} \right) \frac{\sum \limits _{k \in (I\cup \{1\}) \setminus \{i_j\}}\mathbf {P}^\sigma (\sigma (k))}{\prod \limits _{k \in (I\cup \{1\}) \setminus \{i_j\}}\mathbf {P}^\sigma (\sigma (k))} \right. \\ ~~~~~~~~~~~~~~~ \left. + \sum \limits _{k \in I \setminus \{i_j\}} \left( \left( \frac{\mathbf {Q}(\sigma (1))}{\mathbf {P}^\sigma (\sigma (1))} - \frac{\mathbf {Q}(\sigma (k))}{\mathbf {P}^\sigma (\sigma (k))} \right) \frac{1}{\prod \limits _{s \in (I \cup \{1\}) \setminus \{i_j,k\}} \mathbf {P}^\sigma (\sigma (s))} \right) \right] \lambda _1 \end{array} $$

therefore, \(\lambda _{i_j} = \frac{\det B_{j-1}}{\det B} = A_{i_j}^{\sigma ,I} \lambda _1\).

Substituting in \(g(V_T) = 0\) the expressions of \(V_T(\sigma (k))\), for all \(k \in \varOmega \), and \(\lambda _i\), for all \(i \in \varOmega \setminus \{1\}\), we get for \(\lambda _1\) the expression of \(\lambda _1^{\sigma ,I}\), thus \(\lambda _i\) coincides with \(\lambda _i^{\sigma ,I}\). Hence, \(V_T \in \mathbb {R}_{++}^\varOmega \) is the optimal solution for the problem (5) if and only if there exists \(I \subseteq \varOmega \setminus \{1\}\) inducing weights \(\lambda _i^{\sigma ,I}\)’s satisfying (i)(iii).

For every \(h \in \varOmega \), let \(\sigma _h\) be a permutation of \(\varOmega \) such that \(\sigma _h(1) = h\) and denote by \(V_T^{\sigma _h}\) the optimal solution of problem (5) for \(\sigma _h\). By the definition of the Choquet integral [8], problem (3) is equivalent to maximizing \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) over optimal solutions of the family of problems (4), indexed by all permutations of \(\varOmega \). In turn, since, for every permutations \(\sigma ,\sigma '\) of \(\varOmega \) such that \(\sigma (1) = \sigma '(1)\), it holds \(\mathbf {P}^\sigma = \mathbf {P}^{\sigma '}\), such maximization can be reduced to maximizing \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) over optimal solutions of the family of problems (5), indexed by permutations \(\sigma _1,\ldots ,\sigma _{2^T}\) of \(\varOmega \). This finally proves the theorem.

Theorem 1 allows to find the analytic expression of the optimal solution of problem (3) in the case \(p = \frac{1}{2}\), reducing it to \(2^T\) combinatorial optimization problems, each corresponding to a permutation \(\sigma _h\) of \(\varOmega \) such that \(\sigma _h(1) = h\), for all \(h \in \varOmega \).

Example 1

Consider a two-period model with \(T = 2\), \(\gamma = 2\), \(\epsilon = \frac{1}{50}\), \(p = \frac{1}{2}\), \(u =2\), and \(d = \frac{1}{2}\). Suppose , , and the risk-free interest rate is \(r = 5\%\) over every period. This implies that \(q = \frac{11}{30}\) while, for every permutation \(\sigma \) of \(\varOmega = \{1,2,3,4\}\), it holds \(\mathbf {P}^\sigma (\sigma (1)) = \frac{53}{200}\) and \(\mathbf {P}^\sigma (\sigma (i)) = \frac{49}{200}\).

For \(\sigma _1 = \langle 1,2,3,4\rangle \) the only subset of \(\varOmega \setminus \{1\}\) satisfying (i)(iii) is \(I = \{4\}\) with:

  • \(\lambda _1^{\sigma _1,I} = 0.00822164\), \(\lambda _2^{\sigma _1,I} = 0\), \(\lambda _3^{\sigma _1,I} = 0\), \(\lambda _4^{\sigma _1,I} = 0.00118255\),

  • \(V_2^{\sigma _1}(1) = 10.7623\), \(V_2^{\sigma _1}(2) = 11.328\), \(V_2^{\sigma _1}(3) = 11.328\), \(V_2^{\sigma _1}(4) = 10.7623\),

  • \(\mathbf {CEU}_{\gamma ,p,\epsilon }[V_2^{\sigma _1}] = -0.0906436\).

For \(\sigma _2 = \langle 2,1,3,4\rangle \) the only subset of \(\varOmega \setminus \{1\}\) satisfying (i)(iii) is \(I = \{4\}\) with:

  • \(\lambda _1^{\sigma _2,I} = 0.00803641\), \(\lambda _2^{\sigma _2,I} = 0\), \(\lambda _3^{\sigma _2,I} = 0\), \(\lambda _4^{\sigma _2,I} = 0.000778428\),

  • \(V_2^{\sigma _2}(1) = 15.0585\), \(V_2^{\sigma _2}(2) = 10.0101\), \(V_2^{\sigma _2}(3) = 11.4578\), \(V_2^{\sigma _2}(4) = 10.0101\),

  • \(\mathbf {CEU}_{\gamma ,p,\epsilon }[V_2^{\sigma _2}] = -0.0886014\).

For \(\sigma _3 = \langle 3,1,2,4\rangle \) the only subset of \(\varOmega \setminus \{1\}\) satisfying (i)(iii) is \(I = \{4\}\) with:

  • \(\lambda _1^{\sigma _3,I} = 0.00803641\), \(\lambda _2^{\sigma _3,I} = 0\), \(\lambda _3^{\sigma _3,I} = 0\), \(\lambda _4^{\sigma _3,I} = 0.000778428\),

  • \(V_2^{\sigma _3}(1) = 15.0585\), \(V_2^{\sigma _3}(2) = 11.4578\), \(V_2^{\sigma _3}(3) = 10.0101\), \(V_2^{\sigma _3}(4) = 10.0101\),

  • \(\mathbf {CEU}_{\gamma ,p,\epsilon }[V_2^{\sigma _3}] = -0.0886014\).

For \(\sigma _4 = \langle 4,1,2,3\rangle \) the only subset of \(\varOmega \setminus \{1\}\) satisfying (i)(iii) is \(I = \emptyset \) with:

  • \(\lambda _1^{\sigma _4,I} = 0.0079751\), \(\lambda _2^{\sigma _4,I} = 0\), \(\lambda _3^{\sigma _4,I} = 0\), \(\lambda _4^{\sigma _4,I} = 0\),

  • \(V_2^{\sigma _4}(1) = 15.1162\), \(V_2^{\sigma _4}(2) = 11.5017\), \(V_2^{\sigma _4}(3) = 11.5017\), \(V_2^{\sigma _4}(4) = 9.1017\),

  • \(\mathbf {CEU}_{\gamma ,p,\epsilon }[V_2^{\sigma _4}] = -0.0879255\).

A simple inspection shows that the maximum value of \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) is obtained for \(\sigma _4\), therefore we take \(V^*_2 = V_2^{\sigma _4}\).

Using the martingale property with respect to \(\mathbf {Q}\) of the wealth process \(\{V_0^*,V_1^*,V_2^*\}\), that is \(V_1^* = \frac{\mathbf {E}^\mathbf {Q}_1[V_2^*]}{1+r}\) and \(V_0^* = V_0 = \frac{\mathbf {E}^\mathbf {Q}_0[V_2^*]}{(1+r)^2}\), we can recover the optimal self-financing strategy \(\{\theta _0,\theta _1\}\) through (1):

figure a

4 The Effect of \(\epsilon \) on the Optimal Portfolio

Now we investigate the effect of \(\epsilon \) on the optimal portfolio. Figures 1 and 2 show the optimal value of , \(u \in \{1.2,1.4,1.6,1.8,2\}\), \(d = \frac{1}{u}\), \(r = 5\%\), \(\gamma = 2\), \(p = \frac{1}{2}\) and \(\epsilon \) ranging in [0, 1) with step 0.01, for \(T = 2\) and \(T = 3\), respectively. In particular, \(\epsilon = 0\) stands for absence of ambiguity.

In both figures we can see that, for increasing \(\epsilon \), the optimal value of \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) decreases until reaching a constant value that corresponds to \(u_\gamma ((1+r)^T V_0)\). It actually holds that, once the optimal value of \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) reaches \(u_\gamma ((1+r)^T V_0)\), the optimal portfolio results in the final risk-free wealth \(V_T^* = (1+r)^T V_0\). In such cases, the corresponding self-financing strategy is such that \(\theta _t = 0\), for \(t=0,\ldots ,T-1\), i.e., the optimal portfolio consists of only a risk-free bond investment for all the periods.

Fig. 1.
figure 1

Optimal values of \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) for \(T = 2\), , \(u \in \{1.2,1.4,1.6,1.8,2\}\), \(d = \frac{1}{u}\), \(r = 5\%\), \(\gamma = 2\), \(p = \frac{1}{2}\) and \(\epsilon \) ranging in [0, 1) with step 0.01.

Fig. 2.
figure 2

Optimal values of \(\mathbf {CEU}_{\gamma ,p,\epsilon }\) for \(T = 3\), , \(u \in \{1.2,1.4,1.6,1.8,2\}\), \(d = \frac{1}{u}\), \(r = 5\%\), \(\gamma = 2\), \(p = \frac{1}{2}\) and \(\epsilon \) ranging in [0, 1) with step 0.01.

Figures 1 and 2 highlight the existence of a value \(\epsilon ^*(T,\gamma ,p,u,d,r,V_0)\) above which the optimal self-financing strategy reduces to 0 for all times. In a sense, we may think at \(\epsilon ^*(T,\gamma ,p,u,d,r,V_0)\) as a threshold above which the ambiguity incorporated in the real-world probability \(\mathbf {P}\) through the \(\epsilon \)-contamination is so high to make the risk-free portfolio the most suitable choice. The following Table 1 reports the intervals containing \(\epsilon ^*(T,\gamma ,p,u,d,r,V_0)\) for the parameter setting of Figs. 1 and 2. Let us stress that a numerical approximation of \(\epsilon ^*(T,\gamma ,p,u,d,r,V_0)\) can be achieved by applying a suitable bisection algorithm, while its analytic expression will be the aim of future research.

Table 1. Intervals containing \(\epsilon ^*(T,\gamma ,p,u,d,r,V_0)\) for \(T \in \{2,3\}\), , \(u \in \{1.2,1.4,1.6,1.8,2\}\), \(d = \frac{1}{u}\), \(r = 5\%\), \(\gamma =2\), \(p = \frac{1}{2}\).

5 Conclusions and Future Works

In this paper we formulate a dynamic portfolio selection problem under ambiguity by referring to an \(\epsilon \)-contaminated binomial market model and a CRRA utility function. We provide a characterization of the optimal solution in the case the stock price returns are uniformly distributed and investigate the effect of the contamination parameter \(\epsilon \) on the optimal portfolio. It turns out that, in the uniform case, one can find a threshold for \(\epsilon \) above which the optimal portfolio reduces to a risk-free investment on every period. An analytic characterization of such threshold is reserved for future research. Further, a characterization of the optimal solution in the general case remains open, as well as the generalization to other classes of utility functions. Another line of investigation is the design of efficient algorithms to solve the family of concave optimization problems to which the initial problem can be reduced, since, being indexed by the permutations of the sample space, they constitute a computational challenge.