
Imagine you survey 1,000 customers about how they feel about your brand's pricing. Half say it's too expensive. Half say it's about right. You average the responses, and the number that lands in your dashboard tells you customers find your pricing "somewhat acceptable." You adjust nothing. Six months later, you've lost a segment you never knew was leaving.
This is not a hypothetical. It is how aggregate data fails teams every day — not because the data is wrong, but because the analysis obscures exactly what the data was trying to reveal: the gap between the groups.
There's an old observation in statistics: if one foot is in a bucket of ice and the other is in a bucket of boiling water, on average you're comfortable. Averages are mathematically legitimate but can be practically misleading. In a stable, relatively homogeneous market, that limitation is manageable. In the market that actually exists right now, it can be risky.
Consumer behavior in 2025 and into 2026 has bifurcated in ways that make the average customer a near-fiction. According to Moody's Analytics, the top 20% of income earners now account for 59% of total U.S. consumer spending — a near record high — while the bottom 80% accounts for a record-low 41%. That isn't a normal distribution. It's two different economies sitting inside the same aggregate number.
Morgan Stanley puts it similarly: the top 40% of households by income now drive 60% of all consumer spending. When your market data gets averaged across that divide, the "typical customer" you're optimizing for represents a statistical midpoint that corresponds to almost no one's actual situation.
When the top 20% of earners drive nearly 60% of spending, any aggregate metric that treats all consumers equally is describing a market that no longer exists. Your "average customer" is a weighted fiction.
The K-shaped economy — the term economists use to describe trajectories diverging rather than recovering together — has deepened. By January 2026, the gap in spending growth between high-income households and all others had reached its highest level since mid-2022, according to Bank of America Institute.
The divergence isn't only financial. Research from Columbia Business School professor Oded Netzer finds that political polarization is now reshaping where consumers shop and which brands they choose — with 60% of consumers saying they "buycott" products based on political identity. Add to that the pattern identified by Simon-Kucher: consumers are increasingly abandoning the middle, either trading down to private label or trading up to premium, while mid-market brands absorb the squeeze from both directions.
The result is that many consumer datasets are quietly bimodal — two distinct clusters that happen to produce a misleading aggregate when you flatten them together.
Bimodal data looks like normal data when you average it. Before reading any aggregate metric, check whether the underlying distribution has two humps where you expected one. If it does, the average is not your story — the shape is.
Here's the frustrating part: most teams know this problem exists. The problem is well-documented across the industry: marketers consistently report that reaching the right audience remains their single biggest challenge — even as segmentation tools grow more sophisticated and data collection expands. They're refreshing their approach regularly. They're still getting it wrong.
The reason, more often than not, is that segmentation is being updated without changing the underlying question being asked. Teams re-run the same aggregate analysis on new data, discover slightly different averages, and call it an updated view of the customer. What they're actually producing is a new average that still conceals the same split. The segmentation gets newer. The blind spot stays the same.
The fix isn't more segmentation — it's better questions earlier in the process. Before you aggregate anything, ask whether there's a reason this population might divide into meaningfully different groups. Income, obviously. But also tenure as a customer, channel preference, political identity in politically charged categories, and age cohort in markets where economic experience varies dramatically by generation. Flag those fault lines before the data gets collapsed into a mean.
| Segmentation Approach | What It Reveals | What It Misses |
|---|---|---|
| Aggregate average (mean score) | Overall trend direction | Divergent subgroups, bimodal distributions |
| Demographic (age, gender) | Surface-level differences | Within-group income and identity variation |
| Income-tiered segmentation | Spending capacity differences | Identity-driven purchasing behavior |
| Behavioral + attitudinal | How and why people choose | Harder to scale; high resource intensity |
| Distribution-first analysis | Bimodality, variance, emerging splits | Requires deliberate habit change in reporting |
The practical steps aren't complicated, but they require a deliberate change in habit.
Track standard deviation alongside your averages as a routine part of every dashboard. A mean that holds steady while standard deviation grows is the smoking gun of a splitting market — the headline number looks fine, but the audience underneath it is pulling apart.
There's a reason average-first reporting persists: it feels stable. A single number per metric, a clean trend line quarter over quarter, a dashboard that doesn't ask uncomfortable questions. It is genuinely easier to operate this way, and in placid markets it's probably fine.
The market right now is not placid. The consumers your aggregate data describes — the composite, the middle, the mean — are becoming a smaller and smaller share of what drives actual behavior. Building strategy around them is a bit like navigating by a star that burned out a thousand years ago. The light is still reaching you. The source no longer exists.
What makes this particularly costly is that the error is invisible at the aggregate level. Revenue looks fine until it doesn't. Satisfaction scores hold steady until they don't. By the time the average moves enough to trigger concern, the underlying shift has often been underway for two or three quarters. The aggregate metric is the last to know.
The signal is almost always in the variance. Start there.
Research insights, methodology guides, and practical advice — delivered to your inbox.