Risk and uncertainty
By “uncertain” knowledge, let me explain, I do not mean merely to
distinguish what is known for certain from what is only probable.
The sense in which I am using the term is that in which the prospect
of a European war is uncertain….
There is no scientific basis to form any calculable probability
whatever.
We simply do not know.
Risk maximization
Quote from Cover (1991), “Universal portfolios”
In general, volatile uncorrelated stocks lead to great gains for the rate at which a portfolio grows...
Quote from V. Anantharam and V. S. Borkar (2017), “A variational
formula for risk-sensitive reward.”
Work on *risk-sensitive reward maximization* has been relatively uncommon; see, e.g., [24]. Unlike in the case of the classical discounted or ergodic costs, the two risk-sensitive control problems are not trivially equivalent by treating cost as a negative reward. In fact, risk-sensitive reward maximization is the natural set-up in portfolio optimization...
Part I: Distortion riskmetrics + bandits
Risk-sensitive Bandits: Arm Mixture Optimality and Regret-efficient Algorithms
M. Tatli, A. Mukherjee, P. L.A., K. Shanmugam and A. Tajer
AISTATS 2025 (To appear)
Summary
- DRM-sensitive bandit problem
- Many DRMs, solitary arm is not optimal. Instead, it is optimal to
play an arm-mixture
- Learning optimal mixtures
- Estimation: \(K\)-continuous valued mixing
coefficients
- Tracking: optimal mixture
- Regret bounds for ETC-type and UCB-type
algorithms
Risk-neutral bandit problem setup
\(K\) arms /
experiments
Expected return of arms: \(\mathbf{\mu}
= [\mu_1,\cdots,\mu_K]^\top\)
Goal: Arm with largest expected
return:
\[ a^* \in \arg\max_{i\in[K]} \mu_i
\]
Risk-neutral
Objective: Regret minimization (Exploration-Exploitation trade-off)
Minimize cumulative regret:
\[R_T\triangleq T\mu_{a^\star} -
\sum\limits_{s=1}^T \mathbb{E}[X_{A_s}]\]
Main message
Human preferences can be explained using distorted
probabilities
People usually overweight extreme/unlikely events
How to distort the probabilities? Distortion riskmetrics
(DR)
Probabilistic distortions: basis for Nobel-prize winning Prospect
Theory work of Tversky and Kahnemann