Ain’t No Such a Thing as a Citizen Data Scientist

Venkat Raman Venkat Raman
November 2, 2020 Big Data, Cloud & DevOps

Dear Aspiring Data Scientist,

Before you start using ‘low code’ or ‘drag & drop’ data science tools, please learn the fundamentals.

Why aspire to be ‘Citizen Data Scientist’ when you can truly become a ‘Data Scientist.

Don’t get swayed by the fancy titles like ‘Citizen Data Scientist.’ It is funny that so much hard selling is happening in data science.

I mean, just because we know how to use a thermometer or operate BP machine, should we start calling ourselves ‘Citizen Doctor’?

Strategy — undermine the difficulty of doing data science!

The undermining of difficulty in doing data science is not healthy. Many ‘become a data scientist in a 1-month course’ sellers and ‘low code data science solution’ sellers use this strategy.

The ‘low code/no-code solution’ sellers will often argue that one could gain intuition by *doing* things. The counter-argument to that is, using a low code/no-code solution is like using a calculator. Before one can operate a calculator, one needs to have numeracy skills. Learning the fundamentals in data science is like acquiring numeracy skills.

Why 85 % of Data Science projects fail?

85 % of Data Science projects fail in the enterprise because people think it is easy to do data science but only do it wrongly. The realization often comes late.

Many fall victim to the  become a data scientist in 1 month/ 6 months type courses’ and often wonder why they are not being hired.

The market is the ultimate truth-teller.

It somehow knows who the good players are and operates an excellent filtering mechanism. The reason being, the market is comprised of companies that have ‘skin in the game.’

Companies having ‘skin in the game’ don’t gamble. They hire genuine talent. The simple ‘skin in the game’ test one can do by themselves is ask one simple question. Would I use the machine learning classifier myself?

Also, the real utility of heart disease prediction or earthquake prediction is not the prediction that it will happen with x% certainty, but WHEN

This ‘temporal’ part no model can predict accurately.

Doing Data Science is easy. Or is it?

One of the reasons data scienceseems *easy to do* is because many algorithms can be fit in 2–3 lines of code. There is simply no intellectual pain.

Compare this to programming. A person has to think about the syntax, design pattern, and logic. When things go astray in programming, there are multiple checkpoints in the form of error alerts like Runtime, Syntax error, and compiler error. One gets an immediate reality check on how good or bad a programmer he/she is. As a result, one does not go up and about calling themselves ‘citizen software engineer.’

On the flip side, When it comes to data science, there is no runtime or syntax error equivalent. There are no warning signs that says one can’t apply a particular algorithm on the data. There is no immediate reality of check-in data science.

This is one reason why people who advocate ‘learning the fundamentals is not important’ go scot-free. This is why fancy but harmful titles like ‘citizen Data Scientist’ arise.

The above criticism might sound rude/bitter, but it is all in the hope that one day we can all say 85% of Data Science projects succeed rather than fail.

  • Experfy Insights

    Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Venkat Raman

    Tags
    Citizen Data ScientistData ScienceData Scientist
    © 2021, Experfy Inc. All rights reserved.
    Leave a Comment
    Next Post
    How Magento Web Development Is The Best Choice For Your E-Commerce Site?

    How Magento Web Development Is The Best Choice For Your E-Commerce Site?

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    More in Big Data, Cloud & DevOps
    Big Data, Cloud & DevOps
    Cognitive Load Of Being On Call: 6 Tips To Address It

    If you’ve ever been on call, you’ve probably experienced the pain of being woken up at 4 a.m., unactionable alerts, alerts going to the wrong team, and other unfortunate events. But, there’s an aspect of being on call that is less talked about, but even more ubiquitous – the cognitive load. “Cognitive load” has perhaps

    5 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    How To Refine 360 Customer View With Next Generation Data Matching

    Knowing your customer in the digital age Want to know more about your customers? About their demographics, personal choices, and preferable buying journey? Who do you think is the best source for such insights? You’re right. The customer. But, in a fast-paced world, it is almost impossible to extract all relevant information about a customer

    4 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    3 Ways Businesses Can Use Cloud Computing To The Fullest

    Cloud computing is the anytime, anywhere delivery of IT services like compute, storage, networking, and application software over the internet to end-users. The underlying physical resources, as well as processes, are masked to the end-user, who accesses only the files and apps they want. Companies (usually) pay for only the cloud computing services they use,

    7 MINUTES READ Continue Reading »

    About Us

    Incubated in Harvard Innovation Lab, Experfy specializes in pipelining and deploying the world's best AI and engineering talent at breakneck speed, with exceptional focus on quality and compliance. Enterprises and governments also leverage our award-winning SaaS platform to build their own customized future of work solutions such as talent clouds.

    Join Us At

    Contact Us

    1700 West Park Drive, Suite 190
    Westborough, MA 01581

    Email: [email protected]

    Toll Free: (844) EXPERFY or
    (844) 397-3739

    © 2025, Experfy Inc. All rights reserved.