Individual improvements but global deterioration. How is it possible?


Summary

It is normal. It is totally possible to have increasing retention rates locally and a decreasing retention rate globally. This is an illustration of Simpson’s paradox, named after the mathematician Edward Simpson.


I monitor my Acquisition business on a daily basis by checking KPIs such as traffic and conversion rate in the different segments, audiences, etc.
And of course, I monitor my Retention business by checking KPIs such as the retention rate (RR%).

1. Business monitoring

I monitor the subscriptions that are due to renewal every month. These are my Expirations. The monthly Retention Rate (RR%) is defined as the number of Expirations retained over the total number of expirations, this month.

Table #1 : global view.

RR% = Retained / Expirations
For my monitoring needs, I do month over month comparisons to know how my business trends.

2. Monitoring in detail

I want to know my business in detail, be able to anticipate any change in trends. So, I look at my KPIs at a lower level. I segment by source of acquisition, by product, by country, by previous renewal price, by you-name-it …

Table #2 : same view segmented by Product (A and B).

In table #2 above, we have a split by product. Product A and B retain respectively at 75% and 25% and the overall RR% is 65%.

Overall RR% = Sum of Retained / Sum of Expirations.

3. Local Improvements but Global Deterioration.

I keep on comparing, month over month, the RR% of each segment. When I look at the individual RR%, each of them increase ⬆️. However, the overall RR% is decreasing ⬇️.

How is it possible ?🤯
This is the Simpson’s paradox. It’s a statistical paradox in which a phenomenon observed in several groups is reversed when the groups are combined.

Still not convinced? Wanna see some numbers?

I sell subscriptions for a video streaming service📽️. I propose :

  • An advertising-free service (Product A)🎬🍿 very appreciated which constitutes most my product mix.
  • An entry-level service (Product B) 🎬 which attracts a few curious people and retention is low

This month (table #3), I had the same number of subscriptions to renew but the product mix varies compared to last month (Table #2).

Table #3 : Same total number of expirations as in table #2.

The product mix changed slightly toward Product B. Compared to table #2, each product RR% is increasing📈.
The RR% of Product A goes from 75% to 77%: +2 point 👍
The RR% of Product B increases from 25% to 27%: + 2 points👍
The overall RR% goes from 65% to 64%: -1 point. 👎

4. Equally confusing: the opposite is true.

The following month, I have still the same number of subscriptions to renew with a product mix in favor of Product A.

Table #4 : Same total number of expirations as in table #2 and #3.

In table #4, the individual RR% decrease significantly 📉compared to table #2 & #3. However, the overall RR% is up 📈.

The RR% of product A goes from 77% to 72%: -5 points 👎
The RR% of product B goes from 27% to 20%. : -7 points 👎
The overall RR% goes from 65% to 64%: +1.8 points. 👍

5. What is this mathematical mystery?

This is the principle of weighted average. The average RR% isn’t equal to the average of the percentages. In the calculation of the RR%, the weight of expirations is much greater than the weight of the RR%.

Intuitively, we want to believe that an improvement in each RR% will generate an overall improvement in the RR%. This is only true if only one parameter in the equation changes.

Key Takeaways

  1. There’s an 11th commandment:☝️ “You shall not average percentages”.
  2. Telescope 🔭 and microscope 🔬: It’s crucial to monitor your business by having a global view and a detailed view. One doesn’t go without the other.
  3. The Simpson paradox reminds us that it is as easy to make mistake as to manipulate minds by presenting only part of the numbers.

The Simpson paradox on wikipedia

Add a Comment

Your email address will not be published. Required fields are marked *