Data-Driven? Think again

Cassie Kozyrkov Cassie Kozyrkov
October 5, 2020 Big Data, Cloud & DevOps

The psychological habit most people lack and why you can’t hope to use data to guide your actions effectively without it

Businesses are hiring data scientists in droves to make rigorous, scientific, unbiased, data-driven decisions.

And now, the bad news: those decisions usually aren’t.

For a decision to be data-driven, it has to be the data — as opposed to something else entirely — that drive it. Seems so straightforward, and yet it’s so rare in practice because decision-makers lack a key psychological habit.

Data-drivenness destroyed

Imagine that you are considering buying something online instead of making a pilgrimage to the other side of town to fetch it. You’ve boiled your decision down to whether or not you trust the online seller. A quick search yields some relevant data: you see that the seller has an average rating of 4.2 out of 5.

Without decision-making fundamentals, your decision will be at best inspired by data, but not driven by it.

Now you can’t use that 4.2 to drive your decision. Game over! Once we’ve seen the answer, we’re free to pick the most convenient question. If the first thing we do is poke around in our data, our decision will be, at best, something I like to call data-inspired.

Data-inspired

That’s where we, like whales encountering plankton, swim around in some numbers, and then reach an emotional tipping point and… decide. There are numbers near our decision somewhere, but those numbers don’t drive it. The decision comes from somewhere else entirely.

The decision-maker’s mind was made up before the data, so the decision was there all along. Turns out humans interact with data selectively to confirm choices we’ve already made in our heart of hearts. We find the most convenient light in which to see evidence, and we don’t always know we’re doing it. Psychologists have a lovely name for this: confirmation bias.

Many people only use data to feel better about decisions they’ve already made.

Fitting the question to the answer

Is 4.2/5 a good number? Depends on your unconscious biases. A decision-maker who really wants to make the online purchase will squint at that 4.2 and sing a happy song about how that’s a high number. “It’s more than 4.0!” They can even show a rigorous analysis about how it is statistically significantly higher than 4.0. (With certainty! It’s the p-value you’ve always wanted.) In the meantime, someone who really doesn’t want to use that seller will find another way to frame the question in response to the data: “Why would I settle for a seller with less than 4.5 stars?” Or perhaps “But look at those 1-star reviews. I don’t like how many there are.” Sound familiar?

The more ways there are to slice the data, the more your analysis is a breeding ground for confirmation bias.

Mathematical complexity doesn’t provide the antidote, it merely makes it harder to see the problem. As a result, what’s obvious in the trivial example we just saw becomes hidden in a jumble of gorgeous Gaussians. Don’t assume your friendly neighborhood data scientist sees it either. The more ways there are to slice the data, the more your analysis is a breeding ground for confirmation bias.

The result? Decision-makers end up using data to feel better about doing what they were going to do anyway.

An expensive hobby

When the analysis is complex or the data are hard to process, a pinch of tragedy finds its way into our comedy. Sometimes boiling everything down to arrive at that 4.2 number takes months of toil by a horde of data scientists and engineers. At the end of a grueling journey, the data science team triumphantly presents the result: it’s 4.2 out of 5! The math was done meticulously. The team worked nights and weekends to get it in on time.

What do the stakeholders do with it? Yup, same as our previous 4.2: look at it through their confirmation bias goggles, with no effect on real-world actions. It doesn’t even matter that it’s accurate—nothing would be different if all those poor data scientists just made some numbers up.

Using data like that to feel better about actions we’re going to take anyway is an expensive (and wasteful) hobby. Data scientist friends, if your organization suffers from this kind of decision-maker, then I suggest sticking to the most lightweight and simple analyses to save time and money. Until the decision-makers are better trained, your showy mathematical jiu jitsu is producing nothing but dissipated heat.

Antidote to confirmation bias

Problem: you’re free to move the goalposts after you find out where the data landed. (Of course you score a goal every time. You’re just that good.)

Solution: set the goalposts in advance and resist temptation to move them later.

In other words, the decision-maker has some homework to do before anyone analyzes the data.

Until decision-makers are better trained, showy mathematical jiu jitsu only produces dissipated heat.

Framing the decision and setting decision criteria is a science of its own (we’ll dive into it in future posts, as the problem we examine here is just the tip of the iceberg), but in the meantime a quick fix that goes a long way is to come up with your decision boundary up front in your data science project.

Practice makes perfect

I recently went clothes shopping in Brooklyn with my friend Emma. Showing off a pretty dress, she tugs at the pricetag on the back. “Hey, what does this say?” she asks me. “If it’s less than 80 bucks, I’ll buy it.”

Now that’s some decision intelligence! Instead of first seeing the price and then talking herself into a decision she’s already made, she uses the data to drive it. With a well-practiced reflex, she weighs how much she likes the dress and her budget, then sets the decision boundary, and only allows herself to see the data (price) once that’s done. She’s in the habit of using data in the right order and that’s a muscle you can exercise too.

People don’t always need to be data-driven and Emma knows that. She doesn’t have to make unimportant decisions that way, but she also knows that practice makes perfect. It’s much easier to build the habit on trivial decisions than to struggle when the important ones come around.

Lessons from negotiation class

This idea is not new. Many different courses teach it, though one that’s almost guaranteed to cover it on day 1 is negotiation. If you haven’t put a value on your BATNA (~ a walk-away point) before entering a negotiation, you may as well paint “no idea what I’m doing” on your forehead. It’s the same thing by a different name: figuring out your decision boundary between your default action and the alternative.

The antidote is setting your decision criteria in advance.

In fact, standard advice for negotiators is to think through the entire range of potential offer combinations and plan your reactions to them in advance, otherwise it’s very easy for an experienced opponent to take advantage of you. Even without all the persuasion tactics at your counterpart’s disposal, irrelevant short-term factors like your blood sugar levels, your mood, how much the other party is smiling, and whether the sun is shining can have a disproportionate effect on the deal. Again, the same goes for data analysis — think of the data as negotiating with you to change your mind. The antidote there is planning your response in advance. Next time you’re negotiating a salary, for example, make sure you’ve thought about your number before you hear theirs.

It’s easy when you get the hang of it

Whether you think about what a number means to you before or after you see it, you still have to think about it. Doing it beforehand helps you counter some of the bugs in your human programming, with large payoffs in decision quality and negotiation performance. Improving the order of operations here is a valuable habit to cultivate and crucial if you’d like to be involved in data-driven decision-making. And here’s some bonus good news: with practice it’ll feel automatic.

  • Experfy Insights

    Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Cassie Kozyrkov

    Tags
    Data ScientistsData-Driven DecisionsUnbiased
    © 2021, Experfy Inc. All rights reserved.
    Leave a Comment
    Next Post
    Introduction to Reinforcement Learning

    Introduction to Reinforcement Learning

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    More in Big Data, Cloud & DevOps
    Big Data, Cloud & DevOps
    Cognitive Load Of Being On Call: 6 Tips To Address It

    If you’ve ever been on call, you’ve probably experienced the pain of being woken up at 4 a.m., unactionable alerts, alerts going to the wrong team, and other unfortunate events. But, there’s an aspect of being on call that is less talked about, but even more ubiquitous – the cognitive load. “Cognitive load” has perhaps

    5 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    How To Refine 360 Customer View With Next Generation Data Matching

    Knowing your customer in the digital age Want to know more about your customers? About their demographics, personal choices, and preferable buying journey? Who do you think is the best source for such insights? You’re right. The customer. But, in a fast-paced world, it is almost impossible to extract all relevant information about a customer

    4 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    3 Ways Businesses Can Use Cloud Computing To The Fullest

    Cloud computing is the anytime, anywhere delivery of IT services like compute, storage, networking, and application software over the internet to end-users. The underlying physical resources, as well as processes, are masked to the end-user, who accesses only the files and apps they want. Companies (usually) pay for only the cloud computing services they use,

    7 MINUTES READ Continue Reading »

    About Us

    Incubated in Harvard Innovation Lab, Experfy specializes in pipelining and deploying the world's best AI and engineering talent at breakneck speed, with exceptional focus on quality and compliance. Enterprises and governments also leverage our award-winning SaaS platform to build their own customized future of work solutions such as talent clouds.

    Join Us At

    Contact Us

    1700 West Park Drive, Suite 190
    Westborough, MA 01581

    Email: [email protected]

    Toll Free: (844) EXPERFY or
    (844) 397-3739

    © 2025, Experfy Inc. All rights reserved.