{"id":1988,"date":"2019-10-03T03:45:14","date_gmt":"2019-10-03T03:45:14","guid":{"rendered":"http:\/\/kusuaks7\/?p=1593"},"modified":"2024-03-18T15:17:30","modified_gmt":"2024-03-18T15:17:30","slug":"the-hidden-risk-of-ai-and-big-data","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/ai-ml\/the-hidden-risk-of-ai-and-big-data\/","title":{"rendered":"The Hidden Risk of AI and Big Data"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"1988\" class=\"elementor elementor-1988\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-2ed24839 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-eae-slider=\"15802\" data-id=\"2ed24839\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-60b648b0\" data-eae-slider=\"89116\" data-id=\"60b648b0\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-cae418d elementor-widget elementor-widget-heading\" data-id=\"cae418d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>Recent advances in AI have been made possible through access to \u201cBig Data\u201d and cheap computing power. But can it go wrong?<\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-48732d9 elementor-widget elementor-widget-text-editor\" data-id=\"48732d9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tBig data is suddenly everywhere. From scarcity and difficulty to find data (and information), we now have a deluge of data. In recent years, the amount of available data has been growing in an exponential pace. This is in turn made possible due to the immense growth in number of devices recording data, as well as the connectivity between all these devices through the internet of things.<em>\u00a0<\/em>Everyone seems to be collecting, analyzing, making money from and celebrating (or fearing) the powers of Big data. By combining the power of modern computing, it promises to solve virtually any problem &#8211; just by crunching the numbers.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c0105e2 elementor-widget elementor-widget-text-editor\" data-id=\"c0105e2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tBut, can big data really deliver on all this hype? In some cases, yes, in others, maybe not. On the one hand, there is no doubt that big data has already had a critical impact in certain areas. For instance, almost every successful artificial intelligence solution involves some serious number crunching.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3133fb7 elementor-widget elementor-widget-text-editor\" data-id=\"3133fb7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\nThe first thing to note is that although AI is currently very good at finding patterns and relationships within big datasets, it is still not very intelligent (depending on your definition of intelligence, but that\u2019s another story!). Crunching the numbers can effectively identify and find subtle patterns in our data, but it cannot directly tell us which of those correlations are actually meaningful.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1e1123d elementor-widget elementor-widget-heading\" data-id=\"1e1123d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3><strong>Correlation vs. Causation<\/strong><\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-a773571 elementor-widget elementor-widget-text-editor\" data-id=\"a773571\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tWe all know (or should know!) that \u201c<a href=\"https:\/\/en.wikipedia.org\/wiki\/Correlation_does_not_imply_causation\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Correlation doesn\u2019t imply causation<\/a>\u201d. However, the human mind is hardwired to look for patterns, and when we see lines sloping together and apparent patterns in our data, it is hard for us to resist the urge to assign a reason.\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3de8f10 elementor-widget elementor-widget-text-editor\" data-id=\"3de8f10\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tStatistically we can\u2019t make that leap, however. Tyler Vigen, the author of\u00a0<a href=\"http:\/\/www.tylervigen.com\/\" target=\"_blank\" rel=\"nofollow noopener noreferrer\"><em>Spurious Correlations,<\/em><\/a>\u00a0has made sport of this on his website (which I can very much recommend visiting for a look at some entertaining statistics!) Some examples of such spurious correlations can be found in the figures below, where I have collected a few examples showing how Ice cream are apparently causing a lot of bad things, ranging from forest fires to shark attacks and polio outbreaks.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fa7be2c elementor-widget elementor-widget-text-editor\" data-id=\"fa7be2c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tHaving a look at these plots, one could argue that we should probably have banned Ice cream a long time ago. And, actually, in the 1940s Polio example, public health experts recommended that people stop eating ice cream as part of an &#8220;anti-polio diet&#8221;. Fortunately, they eventually came to realize that the correlation between polio outbreaks and ice-cream consumption was simply caused by the fact that polio outbreaks were most common during summer.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-68ef535 elementor-widget elementor-widget-text-editor\" data-id=\"68ef535\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tIn\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Statistics\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">statistics<\/a>, a\u00a0spurious relationship\u00a0or\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Spurious_relationship\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">spurious correlation<\/a>\u00a0is a\u00a0mathematical relationship\u00a0in which two or more events or variables are\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Correlation_and_dependence\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">associated<\/a>\u00a0but\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Correlation_does_not_imply_causation\" target=\"_blank\" rel=\"nofollow noopener noreferrer\"><em>not<\/em><\/a>\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Causality\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">causally related<\/a>, due to either coincidence or the presence of a certain third, unseen factor (referred to as a &#8220;common response variable&#8221;, &#8220;confounding factor&#8221;, or &#8220;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Lurking_variable\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">lurking variable<\/a>&#8220;). An example of such \u201clurking variables\u201d could e.g. be the seeming correlation between ice cream sales and shark attacks (I feel quite confident that increased sales of ice cream does not cause sharks to attack people). However, there is a common link behind these two numbers, namely temperature. Higher temperature causes more people to buy ice cream as well as more people to go for a\u00a0swim. Thus, this \u201clurking variable\u201d is really the cause of the apparent correlation. Luckily, we have learned to separate correlation from causation, and we can still enjoy some ice cream on a hot summer day without fearing polio outbreaks and shark attacks!\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-52c14b8 elementor-widget elementor-widget-heading\" data-id=\"52c14b8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3><strong>The power and limits of correlations<\/strong><\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c7880bf elementor-widget elementor-widget-text-editor\" data-id=\"c7880bf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tWith enough data, computing power and statistical algorithms patterns will be found. But are these patterns of any interest? Not all of them will be, as spurious patterns could easily outnumber the meaningful ones. Big data combined with algorithms can be an extremely useful tool when applied correctly to the right problems. However, no scientist thinks you can solve the problem by crunching data alone, no matter how powerful the statistical analysis. You should always start your analysis based on an underlying understanding of the problem you are trying to solve.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-39008d5 elementor-widget elementor-widget-heading\" data-id=\"39008d5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h3><strong>Data science is the end of science (or is it?)<\/strong><\/h3><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2bd7dba elementor-widget elementor-widget-text-editor\" data-id=\"2bd7dba\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tIn June 2008, C. Anderson, former editor-in-chief of Wired Magazine, wrote a provocative essay titled: \u201c<a href=\"https:\/\/www.wired.com\/2008\/06\/pb-theory\/\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">The End of Theory: The Data Deluge Makes the Scientific Method Obsolete\u201d,<\/a>\u00a0in which he states that \u201cwith enough data, the numbers speak for themselves\u201d. \u201cCorrelation supersedes causation, and science can advance even without coherent models and unified theories\u201d.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f58ee5c elementor-widget elementor-widget-text-editor\" data-id=\"f58ee5c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\nThe strength and generality of this approach relies on the amount of data: the more data, the more powerful and effective is the method based on computationally discovered correlations. We can simply feed the numbers into powerful computers and let statistical algorithms automatically find interesting patterns and insight.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8fdbc8a elementor-widget elementor-widget-text-editor\" data-id=\"8fdbc8a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tUnfortunately, this simplified way of analysis has some potential pitfalls, which can be illustrated nicely through an example found on the\u00a0<a href=\"https:\/\/john-poppelaars.blogspot.com\/2015\/04\/do-numbers-really-speak-for-themselves.html\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">blog by John Poppelaars<\/a>:\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-209cb4f elementor-widget elementor-widget-text-editor\" data-id=\"209cb4f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote><em>Suppose we would like to create a prediction model for some variable Y. This could for example be the stock price of a company, the click-through rates of online ads or next week\u2019s weather. Next we gather all the data we can lay your hands on and put it in some statistical procedure to find the best possible prediction model for Y. A common procedure is to first estimate the model using all the variables, screen out the unimportant ones (the ones not significant at some predefined significance level) and re-estimate the model with the selected subset of variables and repeat this procedure until a significant model is found. Simple enough, isn&#8217;t it?<\/em><\/blockquote>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3ae8b99 elementor-widget elementor-widget-text-editor\" data-id=\"3ae8b99\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote><em>Anderson suggested way of analysis has some serious drawbacks however. Let me illustrate. Following the above example, I created a set of data points for Y by drawing 100 samples from a uniform distribution between zero and one, so it\u2019s random noise. Next I created a set of 50 explanatory variables X(i) by drawing 100 samples from a uniform distribution between zero and one for each of them. So, all 50 explanatory variables are random noise as well. I estimate a linear regression model using all X(i) variables to predict Y. Since nothing is related (all uniform distributed and independent variables) an R squared of zero is expected, but in fact it isn&#8217;t. It turns out to be 0.5. Not bad for a regression based on random noise! Luckily, the model is not significant. The variables that are not significant are eliminated step by step and the model re-estimated. This procedure is repeated until a significant model is found. After a few steps a significant model is found with an Adjusted R squared of 0.4 and 7 variables at a significance level of at least 99%. Again, we are regressing random noise, there is absolutely no relationship in it, but still we find a significant model with 7 significant parameters. This is what would happen if we just feed data to statistical algorithms to go find patterns.\u201d<\/em><\/blockquote>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c427d33 elementor-widget elementor-widget-heading\" data-id=\"c427d33\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3><strong>The larger the data set, the stronger the noise<\/strong><\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1a1b402 elementor-widget elementor-widget-text-editor\" data-id=\"1a1b402\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tRecent\u00a0<a href=\"http:\/\/www.di.ens.fr\/users\/longo\/files\/BigData-Calude-LongoAug21.pdf\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">research\u00a0<\/a>has provided proof that as data sets grow larger they have to contain arbitrary correlations. These correlations appear simply due to the size of the data, which indicates that many of the correlations will be spurious. Unfortunately, too much information tends to behave like very little information.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-faba4a4 elementor-widget elementor-widget-text-editor\" data-id=\"faba4a4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThis is a major concern in applications where you work with high-dimensional data. As an example, let\u2019s say you gather sensor data from thousands of sensors on an industrial plant, and then mine these data for patterns to optimize performance. In such cases, you could easily be fooled into acting upon phantom correlations rather than real indicators of operational performance. This could potentially be very bad news, both financially and in terms of safe operation of the plant.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d92c8a7 elementor-widget elementor-widget-heading\" data-id=\"d92c8a7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3><strong>Adding data vs. adding information<\/strong><\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9b95377 elementor-widget elementor-widget-text-editor\" data-id=\"9b95377\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tAs data scientists, we might often claim that the best solution to improving our AI model is to \u201cadd more data\u201d. However, the idea that just \u201cadding more data\u201d will magically improve the performance of your model might not be the case. What we should focus on is rather to \u201cadd more information\u201d. The distinction between \u201cadding data\u201d and \u201cadding information\u201d is crucial: Adding more data does not equal adding more information (at least useful and correct information). On the contrary, by blindly adding more and more data, we encounter the risk of adding data that contains misinformation that can accordingly downgrade the performance of our models. With the abundant access of data, as well as the computing power to process it, this becomes increasingly important to consider.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4c6b321 elementor-widget elementor-widget-heading\" data-id=\"4c6b321\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h3><strong>Outlook<\/strong><\/h3><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-eed0607 elementor-widget elementor-widget-text-editor\" data-id=\"eed0607\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tSo, should the above challenges stop you from adopting data driven decision making? No, far from it. Data driven decision making is here to stay. It will become increasingly valuable as we gain more knowledge on how to best harness all available data and information to drive performance, that being clicks on your website or optimal operation of an industrial plant.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5343ade elementor-widget elementor-widget-text-editor\" data-id=\"5343ade\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tHowever, it is important to be aware that it requires more than just hardware and lots of data to succeed. Big data and computing power are important ingredients, but it is not the full solution. Instead, you should understand the underlying mechanisms that connect the data. Data will not speak for itself, we give numbers their meaning. The Volume, Variety or Velocity of data cannot change that.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5f8055c elementor-widget elementor-widget-text-editor\" data-id=\"5f8055c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tThis article is also published on <a href=\"https:\/\/www.linkedin.com\/pulse\/hidden-risk-ai-big-data-vegard-flovik-phd\/\" rel=\"noopener\">LinkedIn<\/a>.\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-700fc8f elementor-widget elementor-widget-heading\" data-id=\"700fc8f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3>Sources:<\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0c67ba4 elementor-widget elementor-widget-text-editor\" data-id=\"0c67ba4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div><a href=\"https:\/\/www.di.ens.fr\/users\/longo\/files\/BigData-Calude-LongoAug21.pdf\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Calude, C.S. &amp; Longo, G. Found Sci (2017) 22: 595: The Deluge of Spurious Correlations<\/a><\/div>\n<div>blogs.crikey.com: Does big data equal big problems?<\/div>\n<div><a href=\"https:\/\/core.ac.uk\/download\/pdf\/144286974.pdf\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Guiseppe Longo: Mathematical Use and Abuse of Big Data<\/a><\/div>\n<div><a href=\"https:\/\/www.nytimes.com\/2014\/04\/07\/opinion\/eight-no-nine-problems-with-big-data.html\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">NY Times: Eight (No, Nine!) problems with big data<\/a><\/div>\n<div><a href=\"https:\/\/www.wired.com\/2008\/06\/pb-theory\/\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">The Wire: THE END OF THEORY: THE DATA DELUGE MAKES THE SCIENTIFIC METHOD OBSOLETE<\/a><\/div>\n<div><a href=\"https:\/\/john-poppelaars.blogspot.com\/2015\/04\/do-numbers-really-speak-for-themselves.html\" target=\"_blank\" rel=\"nofollow noopener noreferrer\" data-artdeco-is-focused=\"true\">John Poppelars: Do numbers really speak for themselves<\/a><\/div>\n<div><\/div>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Recent advances in AI have been made possible through access to \u201cBig Data\u201d and cheap computing power. But can it go wrong? Big data is suddenly everywhere. From scarcity and difficulty to find data (and information), we now have a deluge of data. In recent years, the amount of available data has been growing in<\/p>\n","protected":false},"author":651,"featured_media":4126,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[183],"tags":[97],"ppma_author":[3401],"class_list":["post-1988","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-ml","tag-artificial-intelligence"],"authors":[{"term_id":3401,"user_id":651,"is_guest":0,"slug":"vegard-flovik","display_name":"Vegard Flovik","avatar_url":"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/04\/medium_10d5ff84-76ce-4fa5-80dc-3a0dae50f16d-150x150.jpg","author_category":"","user_url":"http:\/\/www.axbit.com","last_name":"Flovik","first_name":"Vegard","job_title":"","description":"Vegard Flovik, Ph.D., is a Lead Data Scientist. Machine learning and advanced analytics at Axbit AS, a professional ICT systems partner."}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1988","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/651"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=1988"}],"version-history":[{"count":0,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1988\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/4126"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=1988"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=1988"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=1988"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=1988"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}