· Justin B · Data Strategy · 3 min read
Is Big Data Too Big to Scale?
Examining the reality gap between big data promises and practical business insights, featuring Derek Steer's critical analysis of BI tool limitations.
In the era of digital transformation, organizations have collected vast troves of data with the promise that these datasets would yield transformative insights. However, a growing sentiment in the tech industry questions whether the âbig dataâ revolution has delivered on its promises.
The Myth of Universal Big Data
The tech industry has sold a compelling narrative about the value of data. As Derek Steer explains in his talk âWe Donât Actually Have Big Data,â BI tools typically demonstrate their value through a consistent story: an analyst finds a âblipâ in the data, explores it, and uncovers valuable insights - but this narrative, while compelling in sales, rarely matches actual usage.
This narrative has been persistent across the industry. Steer points out how major BI platforms all use similar messaging: âquickly find meaningful insights within your dataâ or âdiscover and share insights that can change your business in the world.â But thereâs a fundamental disconnect between this sales pitch and reality.
The Visualization Problem
The visualization challenge is particularly pronounced. When truly large organizations with massive datasets (like Target with $73 billion in revenue or Facebook with over a billion users) create visualizations, they can produce meaningful charts with clear patterns.
However, as Steer humorously points out: most companies donât have charts that look like smooth, insightful curves - âyour charts look like this,â showing a sparse, ambiguous graph. âWhat do you do with this? You donât slice into it⊠you squint at it and youâre like âitâs up, is⊠I donât know.ââ
This visualization problem is particularly evident when dealing with recent data. Most businesses typically analyze only the last 90 days of data, which often isnât enough to establish meaningful patterns. While large tech companies might have billions of data points to create smooth, insightful visualizations, the average business looking at their last quarter of data is left with sparse charts that resist clear interpretation.
The Small Data Reality
The reality for most businesses is starkly different from the big data success stories we hear about. While we were âpromised previously unimagined insightsâ (an actual line from Snowflakeâs website according to Steer), most companies instead get âdirectional vibesâ where you look at a chart and can only say âitâs uppish, I donât know.â
A Path Forward
Rather than chasing the elusive promise of big data insights, Steer suggests several alternative approaches:
BI tools should focus more on helping interpret data rather than just exploring it - âinterpretation of data is often a lot harder than exploration and what most tools focus on is exploration.â
Sometimes manually examining each data point is better than trying to find patterns in small datasets - âwhen you have reasonably small data, just looking at every example isnât often that hard,â referencing how a colleague realized it was better to manually read seven articles rather than build a sentiment analysis system.
The big data revolution has undoubtedly transformed certain industries, but for most organizations, the promise remains unfulfilled. Perhaps itâs time to acknowledge that rather than continuing to invest in tools that promise to extract insights from supposedly âbigâ data, companies should focus on better understanding the limited data they actually have.