๐๐ข๐ฆ๐ฉ๐ฌ๐จ๐งโ๐ฌ ๐ฉ๐๐ซ๐๐๐จ๐ฑ is not a school of media criticism about why the earlier seasons of the Simpsons are so superior to the later ones. ๐๐ญโ๐ฌ ๐ญ๐ก๐ ๐ง๐๐ฆ๐ ๐จ๐ ๐ ๐ฉ๐ก๐๐ง๐จ๐ฆ๐๐ง๐จ๐ง ๐ฐ๐ก๐๐ซ๐ ๐ญ๐ก๐ ๐ฌ๐ญ๐๐ญ๐ข๐ฌ๐ญ๐ข๐๐๐ฅ ๐ญ๐๐ง๐๐๐ง๐๐ข๐๐ฌ ๐จ๐ ๐ญ๐ก๐ ๐ฐ๐ก๐จ๐ฅ๐ ๐๐จ๐งโ๐ญ ๐ฅ๐จ๐จ๐ค ๐ฅ๐ข๐ค๐ ๐ญ๐ก๐ ๐ฌ๐ญ๐๐ญ๐ข๐ฌ๐ญ๐ข๐๐๐ฅ ๐ญ๐๐ง๐๐๐ง๐๐ข๐๐ฌ ๐จ๐ ๐ญ๐ก๐ ๐ฉ๐๐ซ๐ญ๐ฌ. And sometimes itโs hard to spot.
I ran into it today when trying to gauge the performance of different message groups across two larger categories of messages. For the sake of anonymity, letโs say we launched messaging in two towns at the same time: Springfield and Shelbyville. The business (our customer) was interested in message-group-wise performance as well as the Springfield vs. Shelbyville comparison.
In the aggregate, Shelbyville has an advantage - you can see that in the bar chart on the left. But the story is different when you look at the performance of each individual message group (the figure on the right). For most messages, performance was better in Springfield than in Shelbyville. Why?
In discussions of Simpsonโs paradox, reference is made to the โlurking variable,โ which is just a menacing way to refer to a confounding variable that tells a different story than the aggregate.ย Here, the โlurking variableโ is which message group youโre talking about.
Take a look at the scatter plot on the right.ย Each point represents an individual message group; the horizontal axis represents the message groupโs performance in Springfield, and the vertical axis represents the message groupโs performance in Shelbyville.ย The diagonal represents what it would look like for a message group to have the exact same performance in both towns.
The first insight that pops out from this plot is that most of the points are to the lower-right of the diagonal.ย In other words, ๐ฆ๐จ๐ฌ๐ญ ๐ฆ๐๐ฌ๐ฌ๐๐ ๐ ๐ ๐ซ๐จ๐ฎ๐ฉ๐ฌ ๐ก๐๐ฏ๐ ๐ ๐ก๐ข๐ ๐ก๐๐ซ ๐ฉ๐๐ซ๐๐จ๐ซ๐ฆ๐๐ง๐๐ ๐ข๐ง ๐๐ฉ๐ซ๐ข๐ง๐ ๐๐ข๐๐ฅ๐ ๐ญ๐ก๐๐ง ๐ญ๐ก๐๐ฒ ๐๐จ ๐ข๐ง ๐๐ก๐๐ฅ๐๐ฒ๐ฏ๐ข๐ฅ๐ฅ๐.
But thereโs another thing represented in the scatter plot - the relative volume messages that went out for each message group, which is encoded in color.ย A blue point represents a message group with low volume; a red point represents high volume.
And you can see pretty immediately thereโs one bright red spot amid a sea of purple-ish blue ones.ย ๐๐ก๐๐ญโ๐ฌ ๐ญ๐ก๐ ๐ก๐ข๐ ๐ก๐๐ฌ๐ญ-๐ฏ๐จ๐ฅ๐ฎ๐ฆ๐ ๐ฆ๐๐ฌ๐ฌ๐๐ ๐ ๐ ๐ซ๐จ๐ฎ๐ฉ, ๐๐ง๐ ๐ข๐ญ ๐ฃ๐ฎ๐ฌ๐ญ ๐ก๐๐ฉ๐ฉ๐๐ง๐ฌ ๐ญ๐จ ๐๐ ๐จ๐ง ๐ญ๐ก๐ ๐๐ก๐๐ฅ๐๐ฒ๐ฏ๐ข๐ฅ๐ฅ๐ > ๐๐ฉ๐ซ๐ข๐ง๐ ๐๐ข๐๐ฅ๐ ๐ฌ๐ข๐๐ ๐จ๐ ๐ญ๐ก๐ ๐ ๐ซ๐๐ฉ๐ก.
In other words, weโve got one group that performs better in Shelbyville than it does in Springfield, and just because that group is way higher-volume than all the others, it shifts the whole average.ย If you just look in aggregate, you might assume thereโs something special about Shelbyville that makes it receptive to your offering.ย But when you break it down by message group, that interpretation starts looking a little less compelling - maybe itโs really just something unique about that one group.ย ย
๐๐ฌ ๐ข๐ญ ๐ซ๐๐๐ฅ๐ฅ๐ฒ ๐ฌ๐๐๐ ๐ญ๐จ ๐ฌ๐๐ฒ ๐ญ๐ก๐๐ญ ๐ญ๐ก๐๐ฒ ๐ฅ๐จ๐ฏ๐ ๐ฒ๐จ๐ฎ ๐ฆ๐จ๐ซ๐ ๐ข๐ง ๐๐ก๐๐ฅ๐๐ฒ๐ฏ๐ข๐ฅ๐ฅ๐ ๐ญ๐ก๐๐ง ๐ข๐ง ๐๐ฉ๐ซ๐ข๐ง๐ ๐๐ข๐๐ฅ๐, ๐ฃ๐ฎ๐ฌ๐ญ ๐๐๐๐๐ฎ๐ฌ๐ ๐ ๐ฌ๐ข๐ง๐ ๐ฅ๐ ๐ก๐ข๐ ๐ก-๐๐ซ๐๐ช๐ฎ๐๐ง๐๐ฒ ๐ฆ๐๐ฌ๐ฌ๐๐ ๐ ๐ ๐ซ๐จ๐ฎ๐ฉ ๐ก๐๐ฉ๐ฉ๐๐ง๐ฌ ๐ญ๐จ ๐๐ฉ๐ฉ๐๐๐ฅ ๐ญ๐จ ๐๐ก๐๐ฅ๐๐ฒ๐ฏ๐ข๐ฅ๐ฅ๐ข๐๐ง๐ฌ ๐ฆ๐จ๐ซ๐?
Why does this matter? Well, we think about this kind of thing at Aampe a lot.ย If you judge performance based on broad overall measures and miss a โlurking variableโ that changes the story, you can end up making suboptimal decisions.
At the same time, itโs hard to figure out which variables matter. If you tried to be really comprehensive about figuring out what influences performance, the complexity could spiral out of control.
Thatโs exactly why you need an agentic platform like Aampe. Aampeโs core offering is a way to look at each of your unique users, figure out what they prefer, and design a user experience just for them - instead of what it looks like most users prefer (which could be misleading in aggregate). ๐๐ ๐๐จ๐งโ๐ญ ๐ฆ๐๐ค๐ ๐๐๐๐ข๐ฌ๐ข๐จ๐ง๐ฌ ๐๐๐ฌ๐๐ ๐จ๐ง ๐ญ๐ก๐ ๐๐จ๐ซ๐๐ฌ๐ญ - ๐ฐ๐ ๐ก๐๐ฏ๐ ๐๐ง ๐๐ ๐๐ง๐ญ ๐๐จ๐ซ ๐๐๐๐ก ๐ญ๐ซ๐๐.