{"id":1991,"date":"2019-10-04T03:22:30","date_gmt":"2019-10-04T03:22:30","guid":{"rendered":"http:\/\/kusuaks7\/?p=1596"},"modified":"2024-03-14T13:15:20","modified_gmt":"2024-03-14T13:15:20","slug":"six-bits-of-advice-for-data-scientists","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/bigdata-cloud\/six-bits-of-advice-for-data-scientists\/","title":{"rendered":"Six bits of advice for Data Scientists"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"1991\" class=\"elementor elementor-1991\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-509f69a9 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"509f69a9\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-b827f3a\" data-id=\"b827f3a\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-39a93722 elementor-widget elementor-widget-text-editor\" data-id=\"39a93722\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tTo err is human.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ef9c8b8 elementor-widget elementor-widget-text-editor\" data-id=\"ef9c8b8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThe important thing is to look at our mistakes. 
And learn from them.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-70252e0 elementor-widget elementor-widget-text-editor\" data-id=\"70252e0\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tA data scientist needs to be\u00a0critical\u00a0and always on the lookout for something that others miss.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5828f6e elementor-widget elementor-widget-text-editor\" data-id=\"5828f6e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tBut sometimes, in our day-to-day job and in coding per se, we get lost in our train of thought and fail to look at the overall picture.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-44acc29 elementor-widget elementor-widget-text-editor\" data-id=\"44acc29\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tIn the end,\u00a0our business partners have hired us only to generate value, and we won\u2019t be able to generate value unless we learn to think critically about the business.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3fca443 elementor-widget elementor-widget-heading\" data-id=\"3fca443\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>1. 
Beware of the Clean Data Syndrome<\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0a6dcf5 elementor-widget elementor-widget-image\" data-id=\"0a6dcf5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/3460\/0*KtQBD3Xq1f97befj\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-33be6ea elementor-widget elementor-widget-text-editor\" data-id=\"33be6ea\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\nHow often do we start working straight on the data we get, start creating models, or even present automatically generated descriptive analytics to our business counterparts?\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8309828 elementor-widget elementor-widget-text-editor\" data-id=\"8309828\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tBut do you ever ask \u2014\u00a0<strong>Does this data make sense?<\/strong>\n<blockquote>Falsely assuming that the data is clean can lead you to the wrong hypotheses.<\/blockquote>\n<strong><em>You can actually discern a lot of important patterns by looking at discrepancies in the data.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-97f3b47 elementor-widget elementor-widget-text-editor\" data-id=\"97f3b47\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div 
class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tFor example, if you notice that a particular column has more than 50% of its values missing, you might think about dropping the column.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5f26269 elementor-widget elementor-widget-text-editor\" data-id=\"5f26269\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>But\u00a0what if the values are missing because a data collection instrument is faulty?\u00a0By raising it, you could help the business improve the process.<\/em><\/strong>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e65fc85 elementor-widget elementor-widget-text-editor\" data-id=\"e65fc85\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tOr let us say you have a distribution of Male vs. 
Female as 90:10 in a women\u2019s cosmetics business.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9844fc4 elementor-widget elementor-widget-text-editor\" data-id=\"9844fc4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>One may assume the data is clean and show the results as they are, or one can use common sense and ask the business partner whether the labels are switched.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-da390b7 elementor-widget elementor-widget-heading\" data-id=\"da390b7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>2. Be Aware<\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8c902dd elementor-widget elementor-widget-image\" data-id=\"8c902dd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/7500\/0*nM35n0vo--4yaKVV\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ef1eab5 elementor-widget elementor-widget-text-editor\" data-id=\"ef1eab5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tWe all know fab.com. 
For those who don\u2019t, it is a website that sells \u201cCURATED HEALTH, FITNESS &amp; WELLNESS PRODUCTS.\u201d\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2de99c6 elementor-widget elementor-widget-text-editor\" data-id=\"2de99c6\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tBut it was not always so.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-032b2f8 elementor-widget elementor-widget-text-editor\" data-id=\"032b2f8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tFab.com started up as fabulis.com,\u00a0<strong><em>a site to help gay men meet people.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9aff016 elementor-widget elementor-widget-text-editor\" data-id=\"9aff016\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tOne of the site\u2019s popular features was the \u201cGay deal of the Day.\u201d\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-677617f elementor-widget elementor-widget-text-editor\" data-id=\"677617f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>One day the deal was for Hamburgers \u2014 and half of the buyers were women. 
Why were women on the site?<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b486289 elementor-widget elementor-widget-text-editor\" data-id=\"b486289\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>This observation led the data team to realize that there was a market for selling goods to women.\u00a0<\/em><\/strong>So fabulis.com changed its business model and became fab.com, a sales site for designer products.\n<blockquote>Be on the lookout for anything out of the ordinary. Be ready to ask questions. If you find something, you may have struck gold.<\/blockquote>\n<strong><em>Data can help a business optimize revenue, but sometimes data also has the power to change the direction of the company.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-cdcec83 elementor-widget elementor-widget-text-editor\" data-id=\"cdcec83\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAnother example:\u00a0<a href=\"https:\/\/www.fastcompany.com\/1783127\/flickr-founders-glitch-can-game-wants-you-play-nice-be-blockbuster\" target=\"_blank\" rel=\"noopener noreferrer\" class=\"broken_link\">Flickr started out as a multiplayer game<\/a>. 
Only when the founders noticed that people were using it as a photo upload service did they pivot to a photo-sharing app.\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e7c0d53 elementor-widget elementor-widget-text-editor\" data-id=\"e7c0d53\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThere are countless such examples.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-40b6dbe elementor-widget elementor-widget-text-editor\" data-id=\"40b6dbe\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tTry to make your company\u2019s business a good example as well.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9087890 elementor-widget elementor-widget-heading\" data-id=\"9087890\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>3. 
Start Focusing on the right metrics<\/h3>\n<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-aa56f0f elementor-widget elementor-widget-image\" data-id=\"aa56f0f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/5910\/0*1oD9f4y5peVVvxqb\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1799ab8 elementor-widget elementor-widget-text-editor\" data-id=\"1799ab8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>What do we want to optimize for?<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2fa90a1 elementor-widget elementor-widget-text-editor\" data-id=\"2fa90a1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tMost businesses fail to answer this simple question.\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1f3ebc4 elementor-widget elementor-widget-text-editor\" data-id=\"1f3ebc4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>Every business problem is a little different and should be optimized differently.<\/em><\/strong>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-032b5e6 elementor-widget elementor-widget-text-editor\" data-id=\"032b5e6\" data-element_type=\"widget\" data-e-type=\"widget\" 
data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tFor example, a website owner might ask you to optimize for active users.\u00a0<strong><em>But is it the\u00a0<\/em><\/strong><strong><em>right metric<\/em><\/strong><strong><em>?<\/em><\/strong>\u00a0A raw count of active users is often just a vanity metric, since it tends to rise as the user base grows.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5fd47e8 elementor-widget elementor-widget-text-editor\" data-id=\"5fd47e8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tI would instead try to optimize the percentage of users who are active, which says more about how the product is performing.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8e0401a elementor-widget elementor-widget-text-editor\" data-id=\"8e0401a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAnother example: we have all created classification models. 
Much of the time, we have tried to increase the accuracy of our models.<strong><em>\u00a0But do we want accuracy as the metric of our model\u2019s performance?<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c45b302 elementor-widget elementor-widget-text-editor\" data-id=\"c45b302\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\n<strong><em>What if we are predicting the number of asteroids that will hit the earth?<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-78813e6 elementor-widget elementor-widget-text-editor\" data-id=\"78813e6\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tJust say zero all the time, and you will be 99% accurate. Such a model can be reasonably accurate but not at all valuable. A better metric would be the F score.\n<blockquote>Designing a Data Science project is much more important than the modeling itself.<\/blockquote>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6b8d75a elementor-widget elementor-widget-heading\" data-id=\"6b8d75a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h3>4. 
Statistics Lie sometimes, maybe a lot of times<\/h3><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0831d7e elementor-widget elementor-widget-image\" data-id=\"0831d7e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/783\/1*oSmF-71XGwU7PUtPtSMuvA.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0681c9a elementor-widget elementor-widget-text-editor\" data-id=\"0681c9a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: center;\"><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/771\/1*snJmT9Iy0TtngpJFW5GGEQ.png\" width=\"617\" height=\"504\" \/><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7390dd1 elementor-widget elementor-widget-text-editor\" data-id=\"7390dd1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: center;\"><a href=\"https:\/\/medium.com\/@david.a.ortiz\/lying-with-stats-everything-you-thought-you-knew-about-statistics-1caaed95dcc6\" target=\"_blank\" rel=\"noopener noreferrer\" class=\"broken_link\">Source<\/a><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-28010ab elementor-widget elementor-widget-text-editor\" data-id=\"28010ab\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>Be critical of 
everything that gets quoted to you.\u00a0<\/em><\/strong>Statistics have been\u00a0used to lie\u00a0in advertisements, in workplaces and a lot of other marketing venues in the past. People will do anything to get sales or promotions.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0a8728a elementor-widget elementor-widget-text-editor\" data-id=\"0a8728a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tFor example:\u00a0<a href=\"http:\/\/marketinglaw.osborneclarke.com\/retailing\/colgates-80-of-dentists-recommend-claim-under-fire\/\" target=\"_blank\" rel=\"noopener noreferrer\"><strong><em>Do you remember Colgate\u2019s claim that 80% of dentists recommended their brand?<\/em><\/strong><\/a>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1f04f67 elementor-widget elementor-widget-text-editor\" data-id=\"1f04f67\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThis statistic seems pretty good at first. All dentists use Colgate; I should too. Right?\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6922daf elementor-widget elementor-widget-text-editor\" data-id=\"6922daf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>It turns out that at the time of surveying the dentists, they could choose several brands \u2014 not just one. 
So other brands could be just as popular as Colgate.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9ed465d elementor-widget elementor-widget-text-editor\" data-id=\"9ed465d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThe marketing department is just a myth creation machine. I can understand that.\n<blockquote>The marketing department is just a myth creation machine.<\/blockquote>\nBut it is painful to see this sort of thing in research. For instance, the\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Small_Arms_Survey\" target=\"_blank\" rel=\"noopener noreferrer\">Small Arms Survey<\/a>\u00a0suggests that\u00a0for every 100 Americans, there are 120 guns.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d90e34f elementor-widget elementor-widget-text-editor\" data-id=\"d90e34f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\nIt feels reasonable to assume that every American must be packing heat. 
Then there is another study that shows that\u00a0<a href=\"http:\/\/www.gallup.com\/poll\/150353\/self-reported-gun-ownership-highest-1993.aspx\" target=\"_blank\" rel=\"noopener noreferrer\">only 47 percent of households have guns in them<\/a>.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-cc18c0e elementor-widget elementor-widget-text-editor\" data-id=\"cc18c0e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAre you confused yet?\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f5a841c elementor-widget elementor-widget-text-editor\" data-id=\"f5a841c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tSome households own a great many guns, so it is not reasonable to conclude that every American is armed.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-184a527 elementor-widget elementor-widget-text-editor\" data-id=\"184a527\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>Also,\u00a0never trust a chart that doesn\u2019t label the Y-axis.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-054415d elementor-widget elementor-widget-text-editor\" data-id=\"054415d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThe chart below was shown by\u00a0<strong>Rep. 
Jason Chaffetz<\/strong>\u00a0(R-UT) to the president of Planned Parenthood during a congressional hearing, purporting to show that abortions are going up and life-saving procedures are down.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-851fc93 elementor-widget elementor-widget-image\" data-id=\"851fc93\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/1060\/0*CyiHRHqjbibFh_3-.jpg\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c0920be elementor-widget elementor-widget-text-editor\" data-id=\"c0920be\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tLook at the real picture with labeled axes, and you can see the lies being told\u2026\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b12575e elementor-widget elementor-widget-image\" data-id=\"b12575e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/1499\/0*3ayNAl4UsrZ5oOUF.jpg\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9f28595 elementor-widget elementor-widget-text-editor\" data-id=\"9f28595\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAdd the fact that the U.S. 
Preventive Services Task Force changed its recommendation for cancer screenings from every year to every two years, and one can even explain the decline in cancer screenings.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9864411 elementor-widget elementor-widget-text-editor\" data-id=\"9864411\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote>Be objective when reading charts presented by politicians.<\/blockquote>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-468ee9f elementor-widget elementor-widget-heading\" data-id=\"468ee9f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>5. The Long string rule of Probability<\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8e34340 elementor-widget elementor-widget-image\" data-id=\"8e34340\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/2400\/1*UJegPs3SgESdTy7ueYkJoQ.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-aa10b16 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"aa10b16\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div 
class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-5779d65\" data-id=\"5779d65\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-3bfbdbc elementor-widget elementor-widget-text-editor\" data-id=\"3bfbdbc\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tIt happened during the summer of 1913 in a Casino in Monaco.\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-924a0ce elementor-widget elementor-widget-text-editor\" data-id=\"924a0ce\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tGamblers watched in amazement as a casino\u2019s roulette wheel landed on black\u00a0<strong>26 times in a row.<\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0f39494 elementor-widget elementor-widget-text-editor\" data-id=\"0f39494\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAnd since the\u00a0probability\u00a0of a Red vs. 
Black is close to half,\u00a0<strong><em>they were confident that red was \u201cdue\u201d.<\/em><\/strong>\u00a0It was a field day for the Casino \u2014 a perfect example of\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Gambler%27s_fallacy\" target=\"_blank\" rel=\"noopener noreferrer\">Gambler\u2019s fallacy<\/a>, aka the Monte Carlo fallacy.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-261b2b7 elementor-widget elementor-widget-text-editor\" data-id=\"261b2b7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAnd this happens in real life.\u00a0<a href=\"https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=2538147\" target=\"_blank\" rel=\"noopener noreferrer\" class=\"broken_link\"><strong><em>People tend to avoid long strings of the same answer<\/em><\/strong><\/a><strong><em>, sometimes sacrificing accuracy of judgment for the sake of a pattern of decisions that looks fairer or more probable.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-db5551e elementor-widget elementor-widget-text-editor\" data-id=\"db5551e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tFor example,<strong><em>\u00a0an admissions officer may reject the next application after approving three applications in a row,\u00a0<\/em><\/strong>even if the application should have been accepted on merit.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0f0fa9e elementor-widget elementor-widget-text-editor\" data-id=\"0f0fa9e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div 
class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>Don\u2019t give in to such fallacies.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-82d41eb elementor-widget elementor-widget-text-editor\" data-id=\"82d41eb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThe world works on probabilities. There are seven billion of us, each experiencing some event every second of our lives.\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-70fc1aa elementor-widget elementor-widget-text-editor\" data-id=\"70fc1aa\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tRare events are bound to happen. But don\u2019t put your money on them.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-46496b7 elementor-widget elementor-widget-heading\" data-id=\"46496b7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>6. 
Correlation does not imply Causation<\/h3>\n<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c980448 elementor-widget elementor-widget-image\" data-id=\"c980448\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/1083\/0*iG3obzSx-myY9UfL.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-cabca9f elementor-widget elementor-widget-text-editor\" data-id=\"cabca9f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\n<strong><em>Can you believe this: autism being caused by organic food? Or is it just the opposite:\u00a0does autism increase organic food sales?\u00a0Not really. 
Or maybe.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-235eaa5 elementor-widget elementor-widget-text-editor\" data-id=\"235eaa5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tJust because two variables move in tandem doesn\u2019t necessarily mean that one causes the other.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9a9e5a6 elementor-widget elementor-widget-text-editor\" data-id=\"9a9e5a6\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote>Correlation does not imply causation.<\/blockquote>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d6e2c8d elementor-widget elementor-widget-text-editor\" data-id=\"d6e2c8d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tIt is the Holy\u00a0Grail\u00a0of a data scientist\u2019s toolbox.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-dcc5fb9 elementor-widget elementor-widget-text-editor\" data-id=\"dcc5fb9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThere have been other hilarious examples of this in the past. 
Some of my favorites are:\n<ul>\n \t<li><strong><em>Looking at fire department data, you might infer that the more firemen are sent to a fire, the more damage is done.<\/em><\/strong><\/li>\n \t<li>When investigating the cause of crime in New York City in the 80s, an academic found a\u00a0<strong><em>strong correlation between the amount of serious crime committed and the amount of ice cream sold by street vendors!<\/em><\/strong> Obviously, there was an unobserved variable causing both. Summer is when crime is highest and when the most ice cream is sold. So ice cream sales don\u2019t cause crime, nor does crime increase ice cream sales.<\/li>\n<\/ul>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-94b728b elementor-widget elementor-widget-text-editor\" data-id=\"94b728b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<strong><em>This article appeared in <a href=\"https:\/\/towardsdatascience.com\/6-bits-of-advice-for-data-scientists-6e5758c52fb2\" class=\"broken_link\" rel=\"noopener\">Towards Data Science<\/a>.<\/em><\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>To err is human. The important thing is to look at our mistakes. 
And learn from them. A data scientist needs to be\u00a0critical\u00a0and always on the lookout for something that others miss. But sometimes in our day-to-day job and coding per se, we get lost in our train of thought and fail to look at the overall<\/p>\n","protected":false},"author":653,"featured_media":4142,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[187],"tags":[94],"ppma_author":[3409],"class_list":["post-1991","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-bigdata-cloud","tag-data-science"],"authors":[{"term_id":3409,"user_id":653,"is_guest":0,"slug":"rahul-agarwal","display_name":"Rahul Agarwal","avatar_url":"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/04\/medium_cc5785b8-8195-44e6-a0de-2e33be05d7cb-150x150.png","user_url":"http:\/\/bit.ly\/384SBYb","last_name":"Agarwal","first_name":"Rahul","job_title":"","description":"Rahul Agarwal is a Data Scientist at Walmart 
Labs."}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1991","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/653"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=1991"}],"version-history":[{"count":4,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1991\/revisions"}],"predecessor-version":[{"id":36446,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1991\/revisions\/36446"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/4142"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=1991"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=1991"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=1991"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=1991"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}