{"id":7877,"date":"2020-05-22T07:48:13","date_gmt":"2020-05-22T07:48:13","guid":{"rendered":"http:\/\/blog.experfy.com\/?p=7877"},"modified":"2023-12-11T15:32:32","modified_gmt":"2023-12-11T15:32:32","slug":"how-deep-learning-can-help-scientific-research","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/ai-ml\/how-deep-learning-can-help-scientific-research\/","title":{"rendered":"How Deep Learning Can Help Scientific Research"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"7877\" class=\"elementor elementor-7877\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-58743e14 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"58743e14\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-26d63563\" data-id=\"26d63563\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-360795df elementor-widget elementor-widget-text-editor\" data-id=\"360795df\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Whether we take it for granted or not, deep learning algorithms have become an inseparable part of our daily lives. Personalized feeds, face and voice recognition, web search, smart speakers, digital assistants, email, and many other applications that we can\u2019t part ways with use\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/02\/15\/what-is-deep-learning-neural-networks\/\" target=\"_blank\" rel=\"noreferrer noopener\">deep learning algorithms<\/a>\u00a0under the hood.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>But how effective is deep learning in scientific research, where problems are often much more complex than classifying an image and requirements are much more sensitive than recommending what to buy next?<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f949b92 elementor-widget elementor-widget-text-editor\" data-id=\"f949b92\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>To answer this question, former Google CEO Eric Schmidt and Google AI researcher Maithra Raghu have put together a comprehensive guide on the different deep learning techniques and their application to scientific research.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cThe amount of data collected in a wide array of scientific domains is dramatically increasing in both size and complexity,\u201d the authors write, adding that along with advances in machine learning, this rich corpus of data can provide \u201cmany exciting opportunities for deep learning applications in scientific settings.\u201d<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c0b2c17 elementor-widget elementor-widget-text-editor\" data-id=\"c0b2c17\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Titled \u201c<a href=\"https:\/\/arxiv.org\/abs\/2003.11755\" target=\"_blank\" rel=\"noreferrer noopener\">A Survey of Deep Learning for Scientific Discovery<\/a>,\u201d their guide provides a very accessible overview of deep learning and\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/08\/05\/what-is-artificial-neural-network-ann\/\" target=\"_blank\" rel=\"noreferrer noopener\">neural networks<\/a>\u00a0for scientists who aren\u2019t necessarily versed in the complex language of artificial intelligence algorithms.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>I strongly recommend reading the entire 48-page document and visit many of its references. But here are some key takeaways.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6f4bd2a elementor-widget elementor-widget-heading\" data-id=\"6f4bd2a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><!-- wp:heading -->\n<h2>You don\u2019t necessarily need to do deep learning<\/h2>\n<!-- \/wp:heading --><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3acfc52 elementor-widget elementor-widget-text-editor\" data-id=\"3acfc52\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>With deep learning being all the rage, it\u2019s easy to be tempted to apply it to anything and everything. After all, the basic proposition is very attractive: It\u2019s an end-to-end AI model that takes a bunch of data, develops a mathematical representation, and performs complex classification and prediction tasks.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Deep neural networks can tackle problems previously solved by other types of <a href=\"https:\/\/www.experfy.com\/blog\/ai-ml\/machine-learning-algorithms-in-laymans-terms-part-1\/\" target=\"_blank\" rel=\"noreferrer noopener\">machine learning algorithms,<\/a> such as content recommendation or fraud detection. They can also handle problems that were traditionally difficult to handle with other machine learning techniques, including complex\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/01\/14\/what-is-computer-vision\/\" target=\"_blank\" rel=\"noreferrer noopener\">computer vision<\/a>\u00a0and natural language processing (NLP) tasks.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>However, Schmidt and Raghu warn, when formulating a problem, it is important to consider whether deep learning provides the right set of tools to solve it. \u201cIn many settings, deep learning may not be the best technique to start with or best suited to the problem,\u201d they write.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9d534fa elementor-widget elementor-widget-text-editor\" data-id=\"9d534fa\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>For many problems, simpler\u00a0<a href=\"https:\/\/bdtechtalks.com\/2017\/08\/28\/artificial-intelligence-machine-learning-deep-learning\/\" target=\"_blank\" rel=\"noreferrer noopener\">machine learning algorithms<\/a>\u00a0often provide more efficient solutions. For instance, if you want to find the most relevant of a set of chemical characteristics of different substances, you might be better off using \u201cdimensionality reduction,\u201d a technique that can find the features that contribute most to outcomes.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>On the other hand, if you have limited data or if your data has been neatly arranged in a tabular format, you might want to consider trying a regression model before using neural networks. Neural networks usually (but not always) need lots of data. They are also difficult to interpret. In contrast, linear and logistic regression algorithms can provide more accurate results when the data is scarce, especially if the problem is linear in nature. Regression models also provide a clear mathematical equation with coefficients that explain the relevance of each feature in the dataset.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-a4e3835 elementor-widget elementor-widget-heading\" data-id=\"a4e3835\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2>Deep learning for image-related scientific tasks<\/h2>\n<!-- \/wp:heading --><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6d0564b elementor-widget elementor-widget-image\" data-id=\"6d0564b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/i2.wp.com\/bdtechtalks.com\/wp-content\/uploads\/2020\/04\/covid-net-coronavirus-detection.png?resize=696%2C684&#038;ssl=1\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fbae8d5 elementor-widget elementor-widget-text-editor\" data-id=\"fbae8d5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t&#8220;Covid-net coronavirus detection algorithm&#8221; \n<figcaption>Scientists are using deep learning algorithms to detect signs of COVID-19 infection in chest x-rays of patients (source: <a href=\"https:\/\/arxiv.org\/abs\/2003.09871\" target=\"_blank\" rel=\"noreferrer noopener\">COVID-Net<\/a>)\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-88cbc6e elementor-widget elementor-widget-text-editor\" data-id=\"88cbc6e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>One area where deep learning algorithms have been very effective is the processing of visual data. The authors describe\u00a0<a href=\"https:\/\/bdtechtalks.com\/2020\/01\/06\/convolutional-neural-networks-cnn-convnets\/\" target=\"_blank\" rel=\"noreferrer noopener\">convolutional neural networks<\/a>\u00a0as \u201cthe most well known family of neural networks\u201d and \u201cvery useful in working with any kind of image data.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Aside from the\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/12\/30\/computer-vision-applications-deep-learning\/\" target=\"_blank\" rel=\"noreferrer noopener\">commercial and industrial applications<\/a>, CNNs have found their way into many scientific domains. One of the best known applications of convolutional neural networks is medical imaging analysis. There are already many deep learning algorithms that examine CT scans and x-rays and help in the diagnosis of diseases such as cancer. Recently, scientists have been using CNNs to\u00a0<a href=\"https:\/\/bdtechtalks.com\/2020\/03\/09\/artificial-intelligence-covid-19-coronavirus\/\" target=\"_blank\" rel=\"noreferrer noopener\">find symptoms of the novel coronavirus<\/a>\u00a0in chest x-rays.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Some of the visual applications of deep learning are less known. For instance, neuroscientist are experimenting with pose-detection neural networks to\u00a0<a href=\"http:\/\/www.mousemotorlab.org\/deeplabcut\" target=\"_blank\" rel=\"noreferrer noopener\">track the movements of animals<\/a>\u00a0and analyze their behavior.<\/p>\n<!-- \/wp:paragraph -->\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0c11a6f elementor-widget elementor-widget-heading\" data-id=\"0c11a6f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2>NLP technology can expand to other fields<\/h2>\n<!-- \/wp:heading --><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2112794 elementor-widget elementor-widget-text-editor\" data-id=\"2112794\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>Another area that has benefitted immensely from advances in deep learning algorithms is\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/02\/20\/ai-machine-learning-nlg-nlp\/\" target=\"_blank\" rel=\"noreferrer noopener\">natural language processing<\/a>. Recurrent neural networks, long short-term memory (LSTM) networks, and Transformers have proven to be especially good at performing language-related tasks such as translation and question-answering.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>To be clear, current AI algorithms process language in\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/10\/22\/ai-deep-learning-human-language\/\" target=\"_blank\" rel=\"noreferrer noopener\">fundamentally different\u2014and inferior\u2014ways\u00a0<\/a>than the human brain. Even the largest neural network will fail at some of the simplest tasks that a human child with a very rudimentary understanding of language can perform.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>This is because like all other types of neural networks, RNNs and Transformers are at their very core pattern-matching machines. They can find recurring patterns in sequences of data, whether it be text or any other kind of information. According to Schmidt and Raghu, these structures can be used in \u201cProblems where the data has a sequential nature (with different sequences of varying length), and prediction problems such as determining the next sequence token, transforming one sequence to another, or determining sequence similarities are important tasks.\u201d<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4ac986c elementor-widget elementor-widget-text-editor\" data-id=\"4ac986c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>While this scheme presents\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/10\/07\/rebooting-ai-gary-marcus-ernest-davis\/\" target=\"_blank\" rel=\"noreferrer noopener\">limits in dealing with the abstract and implied meanings of language<\/a>, it has some very interesting applications in scientific research in areas such genomics and proteomics, where sequential structures play an important role.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Transformers have proven to be especially efficient in scientific research. In\u00a0<a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/622803v2\" target=\"_blank\" rel=\"noreferrer noopener\">one recent project<\/a>, AI researchers used unsupervised learning to train a bidirectional Transformer on 86 billion amino acids across 250 million sequences spanning evolutionary diversity. \u201cThe resulting model maps raw sequences to representations of biological properties without labels or prior domain knowledge,\u201d the researchers write. This is an important step toward understanding protein sequences and extracting general and transferable information about proteins from raw sequences.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-361d30b elementor-widget elementor-widget-heading\" data-id=\"361d30b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><!-- wp:heading -->\n<h2>What if you don\u2019t have a lot of data?<\/h2>\n<!-- \/wp:heading --><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9c0d1fc elementor-widget elementor-widget-image\" data-id=\"9c0d1fc\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/i0.wp.com\/bdtechtalks.com\/wp-content\/uploads\/2019\/12\/random-vectors.jpg?resize=696%2C392&#038;ssl=1\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-090b37d elementor-widget elementor-widget-text-editor\" data-id=\"090b37d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>One of the\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/02\/27\/limits-challenges-deep-learning-gary-marcus\/\" target=\"_blank\" rel=\"noreferrer noopener\">main criticisms against deep learning<\/a>\u00a0is its need for vast amounts of training data. In many fields of science, there\u2019s not enough labeled data available. In others such as medicine, the data collection is prohibitively expensive and subject to the laws of handling sensitive personal information.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Deep neural networks also consume\u00a0<a href=\"https:\/\/www.pcmag.com\/news\/ai-could-save-the-world-if-it-doesnt-ruin-the-environment-first\" target=\"_blank\" rel=\"noreferrer noopener\">a lot of compute resources and electricity<\/a>\u00a0during training, requirements that many people and organizations can\u2019t meet.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>But not every deep learning model requires lots of training data. In the past few years, advances in\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/06\/10\/what-is-transfer-learning\/\" target=\"_blank\" rel=\"noreferrer noopener\">transfer learning<\/a>\u00a0have enabled many developers to create deep learning models without the need for a lot of data and computation resources. Transfer learning involves finetuning a pre-trained AI model for a new task. Transfer learning has had remarkable success in computer vision, and there are many freely available AI models that have already been trained on millions of examples.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-16392a3 elementor-widget elementor-widget-text-editor\" data-id=\"16392a3\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>As long as the new problem is close enough to the domain of the base model and you have a decent set of examples, you\u2019ll have a reasonable chance of being able to finetune the AI model for the new task.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cTypically, performing transfer learning is an excellent way to start work on a new problem of interest. There is the benefit of using a well-tested, standard neural network architecture, aside from the knowledge reuse, stability and convergence boosts offered by pretrained weights,\u201d the authors write.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Meanwhile, they also warn: \u201cNote however that the precise effects of transfer learning are not yet fully understood, and an active research area.\u201d<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-bf60594 elementor-widget elementor-widget-text-editor\" data-id=\"bf60594\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\n<!-- wp:paragraph -->\n<p>Another area that is worth watching in the coming months is\u00a0<a href=\"https:\/\/bdtechtalks.com\/2020\/03\/23\/yann-lecun-self-supervised-learning\/\" target=\"_blank\" rel=\"noreferrer noopener\">self-supervised learning<\/a>, a branch of artificial intelligence that can learn from raw data without the need for human-labeled examples. Self-supervised learning is still in a very preliminary stage, however, and also an active area of research.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>But an area that has already yielded result is generative models such as\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/05\/28\/generative-adversarial-networks-artificial-intelligence-ian-goodfellow\/\" target=\"_blank\" rel=\"noreferrer noopener\">generative adversarial networks<\/a>\u00a0(GAN). GANs can generate fake data that resembles their real counterparts. Perhaps they\u2019re best known for the natural-but-nonexistent human faces they can create. Artists are now\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/10\/29\/deep-learning-arts-music-literature\/\" target=\"_blank\" rel=\"noreferrer noopener\">using GANs to generate art<\/a>\u00a0that can sell at stellar prices.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b656803 elementor-widget elementor-widget-text-editor\" data-id=\"b656803\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:html -->\n<figure><iframe src=\"https:\/\/www.youtube.com\/embed\/BIZg_PPuj_0?version=3&amp;rel=1&amp;fs=1&amp;autohide=2&amp;showsearch=0&amp;showinfo=1&amp;iv_load_policy=1&amp;wmode=transparent\" width=\"100%\" height=\"392\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/figure>\n<!-- \/wp:html -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-be2175c elementor-widget elementor-widget-text-editor\" data-id=\"be2175c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>For simplicity, let\u2019s assume there are three customers (c1, c2, c3) in this batch, and one vehicle (v1) information is provided as a sale.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:list -->\n<ul><li>P(C=c1) represents the likelihood of c1 to buy any car. Assuming no prior knowledge about each customer, their likelihood of buying any car should be the same: P(C=c1) = P(C=c2) = P(C=c3), which equals a constant (e.g. 1\/3 in this situation)<\/li><li>P(V=v1) is the likelihood for v1 to be sold, given it is shown in this batch, this should be 1 (100% likelihood to be sold)<\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:paragraph -->\n<p>Since there is only one customer making the purchase, this probability can be extended into:<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>P(V=v1) = P(C=c1, V=v1) + P(C=c2, V=v1) + P(C=c3, V=v1) = 1.0<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>For each of the item, given the following formula<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>P(C=c1, V=v1) = P(C=c1|V=v1) * P(V=v1) = P(V=v1|C=c1) * P(C=c1)<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>We can see P(C=c1|V=v1) is proportional to P(V=v1|C=c1). So now, we can get the formula for the probability calculation:<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>P(C=c1|V=v1) = P(V=v1|C=c1) \/ (P(V=v1|C=c1) + P(V=v1|C=c2) + P(V=v1|C=c3))<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>and the key is to get the probability for each P(V|C). Such a formula can be verbally explained as: the likelihood for a vehicle to be purchased by a specific customer is proportional to the likelihood for the customer to buy this specific vehicle.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The above formula may look too \u201cmathematical\u201d, so let me put it into an intuitive context: assuming three people were in a room, one is a musician, one is an athlete, and one is a data scientist. You were told there is a violin in this room belong to one of them. Now guess, whom do you think is the owner of the violin? This is pretty straightforward, right? given the likelihood of musician to own a violin is high, and the likelihood of athlete and data scientists to own a violin is lower, it is much more likely for the violin to belong to the musician. The \u201cmathematical\u201d thinking process is illustrated below.<\/p>\n<!-- \/wp:paragraph -->\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-c574677 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"c574677\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-3fdd119\" data-id=\"3fdd119\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6e98bae elementor-widget elementor-widget-heading\" data-id=\"6e98bae\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2>Scientific research and deep learning\u2019s interpretability issues<\/h2>\n<!-- \/wp:heading --><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4521119 elementor-widget elementor-widget-text-editor\" data-id=\"4521119\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Another challenge that deep learning often presents is interpretability. Deep neural networks are complex functions with parameters that can span in the millions\u00a0<a href=\"https:\/\/bdtechtalks.com\/2020\/02\/03\/google-meena-chatbot-ai-language-model\/\" target=\"_blank\" rel=\"noreferrer noopener\">or even billions<\/a>, and making sense of how they solve problems and make predictions is often perplexing.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>This can pose a challenge to many areas of scientific research, where the focus is on understanding rather than prediction, and the researchers seek to identify the underlying mechanisms behind the patterns observed in the data. \u201cWhen applying deep learning in scientific settings, we can use these observed phenomena as prediction targets, but the ultimate goal remains to understand what attributes give rise to these observations,\u201d Schmidt and Raghu write.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Fortunately, advances in\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/09\/25\/explainable-interpretable-ai\/\" target=\"_blank\" rel=\"noreferrer noopener\">explainable artificial intelligence<\/a>\u00a0have helped, to some degree, overcome these barriers. While fully understanding and controlling the step-by-step decision-making mechanisms of neural networks remains a challenge, techniques developed in the past few years help us interpret the process.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-00ea865 elementor-widget elementor-widget-text-editor\" data-id=\"00ea865\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>Schmidt and Raghu AI interpretability techniques into two broad categories: Feature attribution and model inspection.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Feature attribution helps us better understand which features in a specific sample have contributed to a neural network\u2019s output. These techniques produce saliency maps that highlight these features. For instance, if you\u2019re inspecting an image classifier, the saliency map would highlight the parts of the image that the AI has homed in on when determining its category.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>There are different techniques that produce saliency maps, including GradCAM, LIME, and\u00a0<a href=\"https:\/\/bdtechtalks.com\/2018\/10\/15\/kate-saenko-explainable-ai-deep-learning-rise\/\" target=\"_blank\" rel=\"noreferrer noopener\">RISE<\/a>. They are good methods for inspecting the output of neural networks to understand whether their decisions are based on the right or wrong features.<\/p>\n<!-- \/wp:paragraph -->\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-235824c elementor-widget elementor-widget-image\" data-id=\"235824c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/i1.wp.com\/bdtechtalks.com\/wp-content\/uploads\/2018\/10\/RISE-explainable-AI-example-saliency-map.png?resize=696%2C524&#038;ssl=1\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-479db20 elementor-widget elementor-widget-text-editor\" data-id=\"479db20\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t&#8220;RISE explainable AI example saliency map&#8221; \n<figcaption>Examples of saliency maps produced by RISE\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-451e27d elementor-widget elementor-widget-text-editor\" data-id=\"451e27d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Model inspection, on the other hand, tries to probe neurons in the hidden layers of a network and find the kind of input that activates them. These techniques provide better insights into the general workings of the AI model. Some of the interesting work done in this area is\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/02\/04\/explainable-ai-gan-dissection-ibm-mit\/\" target=\"_blank\" rel=\"noreferrer noopener\">GANPaint<\/a>, which lets you examine the effects of manipulating individual neurons, and\u00a0<a href=\"https:\/\/bdtechtalks.com\/2019\/03\/11\/openai-google-neural-networks-visualization\/\" target=\"_blank\" rel=\"noreferrer noopener\">Activation Atlassses<\/a>, a tool that visualizes interactions between neurons in a neural network.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The opportunities for applying deep learning to scientific discovery are numerous, and the paper compiled by Schmidt and Raghu provides a great starting guide for aspiring scientists.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cAs the amount of data collected across many diverse scientific domains continues to increase in both sheer amount and complexity, deep learning methods offer many exciting possibilities for both fundamental predictive problems as well as revealing subtle properties of the underlying data generation process,\u201d the authors write.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>How effective is deep learning in scientific research, where problems are often much more complex than classifying an image and requirements are much more sensitive than recommending what to buy next? The opportunities for applying deep learning to scientific discovery are numerous.<\/p>\n","protected":false},"author":109,"featured_media":7878,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[183],"tags":[97,206],"ppma_author":[1946],"class_list":["post-7877","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-ml","tag-artificial-intelligence","tag-deep-learning"],"authors":[{"term_id":1946,"user_id":109,"is_guest":0,"slug":"ben-dickson","display_name":"Ben Dickson","avatar_url":"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/04\/medium_8aaf6bea-c4c1-455f-8156-8007d70910f8-150x150.jpg","user_url":"https:\/\/bdtechtalks.com\/","last_name":"Dickson","first_name":"Ben","job_title":"","description":"Ben Dickson is an experienced software engineer and tech blogger. He contributes regularly to major tech websites such as the Next Web, the Daily Dot, PCMag.com, Cointelegraph, VentureBeat, International Business Times UK, and The Huffington Post."}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/7877","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/109"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=7877"}],"version-history":[{"count":6,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/7877\/revisions"}],"predecessor-version":[{"id":34859,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/7877\/revisions\/34859"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/7878"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=7877"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=7877"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=7877"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=7877"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}