Sorta Sharpe
sharpe ratio is a second-order approximation to a higher-order problem
Note: This post has a lot of formal definitions that are really just superfluous for the high level understanding of what I actually think is important. If you are able to understand what a CDF graph is, then that’s all you need to be able to understand this post and feel free to skip the calculus!
“What’s your sharpe?”
I think that this is a pretty common question to get for someone managing a book or fund or whatever. It is a helpful stat to have in the back of your head when trying to understand the return profile that someone is running, but in isolation it’s ~okay~ at best. For example, do you really care if someone runs a 50 sharpe book but it only returns a few thousand dollars? Probably not. I mean at that point it’s kinda just like congrats you made a few thousand dollars. If the scalability to a reasonable size isn’t there then it becomes meaningless especially because if you recall from my previous post1, I (and you) shouldn’t really care about variance for tiny dollar amounts.
Okay so sharpe matters when you have a strategy when you put on size? Yes. But is a strictly higher sharpe better if the EV is the same? Well it can be; it depends. In order for me to continue, we need to go over stochastic dominance (it’s related).
Stochastic Dominance
Stochastic dominance2 is a way to say that one lottery (or strategy/outcome) is strictly better than another without making any assumptions about utility, other than that a high level of wealth is preferable (an increasing utility function). So I guess unless you take Biggy Small’s lyrics literally, this should apply to you.
First Order Stochastic Dominance (FOSD)
As long as someone’s utility function is increasing w.r.t. wealth, then if one lottery (outcome) FOSDs another, it is preferred by all, regardless of the exact curvature of one’s utility function.
How do we know if one lottery FOSDs another? Well it is just that at every wealth level, the probability of ending up with less than or equal to that amount is never higher under the preferred lottery. In other words, the CDF of the preferred outcome is less than or equal to the CDF of the non preferred one at all points.
Below, A FOSDs B.
To be more precise with our definition:
And now for some calculus II flashbacks, we will integrate by parts:
Second Order Stochastic Dominance (SOSD)
As long as a person’s utility function is increasing and concave in wealth (i.e., the person is risk-averse), if one lottery SOSDs another, it is preferred by all such individuals, regardless of the exact degree of risk aversion.
How do we know if one lottery SOSDs another? Unlike FOSD, this allows the distributions to cross (but only at one point!). At every wealth level, the area under the CDF of the preferred lottery is less than or equal to that of the non-preferred one. Equivalently, the cumulative probability of “bad outcomes,” when integrated over all thresholds, is always smaller. (It is worth noting that FOSD implies SOSD, but the reverse is not necessarily true.)
There can only be SOSD if the preferred lottery has an expected value greater than or equal to the non-preferred one. If the expected values are the same, then the preferred lottery must be a mean preserved spread of the non-preferred. In other words, the preferred lottery just has weakly lower higher order moments (variance, skew, and kurtosis). It is not enough to state that just because the variance is lower that a lottery SOSDs another (which is kind of my point with Sharpe that we will loop back to).
The integral condition:
Difference in expected utilities:
Integration by parts:
Mean preservation means that the boundary term vanishes:
Reduced expression:
Define cumulative CDF difference:
Integrate by parts (again!):
Final SOSD condition:
SOSD preference:
Back to Sharpe
Ok, so we can probably see that if the expected value is substantially more, such that the CDFs never intersect, then the sharpe is irrelevant because we can have FOSD even under a lower sharpe ratio. When we have cases of equal mean and variance, higher order moments matter too.
Suppose that we have two lotteries with equal mean and variance, but switched skew:
Ok so clearly A and B have the same mean and variance but the skew is ‘flipped’. We should prefer lottery A under increasing, concave utility. I really like using extremes to highlight examples. Kelly Criterion never has you do anything with a chance of going to 0 since ln(0) → -inf (this is bad…lol). Suppose that you only have 9 dollars to your name, the negative skew can bring you to 0, where as the positive skew lottery cannot, so obviously we prefer A in that case, but even if we have more to our name, our preference should still be the same.
The Taylor expansion of ln(w) makes this a little more clear: +mean, -variance, +skew, -kurtosis, …
And this makes intuitive sense right? because if you look at a concave function, the left is ‘steeper’ than the right so moving equal distance to the left as the right has a bigger change in utility as the vertical distances you move are asymmetric.
And this is why even under two strategies, one having lower variance but the same mean (strictly higher sharpe) is not enough to let you know that you prefer the strategy with the higher sharpe. Perhaps the strategy with the higher sharpe has a rare, but nasty left tail, making the total expected utility less for risk-adverse individuals. The strategy must SOSD the other to be considered objectively better.
Sort of Vol Risk Premium (VRP)
Ok this is a little separate, but assume that options market makers (OMMs) are risk adverse individuals. Based on what we went over, where do you expect their fair value of the option w.r.t. the midpoint of the quotes that they show?
It must be below the midpoint as to be indifferent between a buy and a sell, the OMM needs to ask for strictly more edge on the ask to compensate for the fact that being short vol has negative skew for your payout, while as being long vol has positive skew. Therefore, they need ask for more edge to have added EV to compensate for the ‘reversed’ payout structure that drags down utility.






nice post
the last point about OMM markets is interesting. i would say in practice (at least from my experience), particularly for inventory-wearing OMMs, the risk averseness is in inventory management and asymmetric risk limits for long/short positions on vol/gamma etc. rather than asymmetry in quote widths for bid/ask. now that i think a bit more about it, i think the situation you describe is especially relevant in relatively illiquid markets where there are few OMMs and wider spreads. but again, just my experience