{"id":9235,"date":"2020-08-06T07:16:06","date_gmt":"2020-08-06T07:16:06","guid":{"rendered":"https:\/\/www.experfy.com\/blog\/?p=9235"},"modified":"2023-11-22T10:36:19","modified_gmt":"2023-11-22T10:36:19","slug":"deep-learning-for-anomaly-detection-a-comprehensive-survey","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/ai-ml\/deep-learning-for-anomaly-detection-a-comprehensive-survey\/","title":{"rendered":"Deep Learning for Anomaly Detection: A Comprehensive Survey"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"9235\" class=\"elementor elementor-9235\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-3a155c69 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-eae-slider=\"93320\" data-id=\"3a155c69\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-5be84c45\" data-eae-slider=\"84128\" data-id=\"5be84c45\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-21361fb elementor-widget elementor-widget-text-editor\" data-id=\"21361fb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Reviewing challenges, methods and opportunities in deep anomaly detection<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This post summarizes a comprehensive survey paper on deep learning for anomaly detection \u2014 \u201c<a 
href=\"https:\/\/arxiv.org\/abs\/2007.02500\" target=\"_blank\" rel=\"noreferrer noopener\">Deep Learning for Anomaly Detection: A Review<\/a>\u201d [1], discussing challenges, methods and opportunities in this direction.<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f7dc6ab elementor-widget elementor-widget-text-editor\" data-id=\"f7dc6ab\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Anomaly detection, a.k.a. outlier detection, has been an active research area for several decades, due to its broad applications in a large number of key domains such as risk management, compliance, security, financial surveillance, health and medical risk, and AI safety. Although it is a problem widely studied in various communities including data mining, machine learning, computer vision and statistics, there are still some unique problem complexities and challenges that require advanced approaches. In recent years, deep learning enabled anomaly detection has emerged as a critical direction towards addressing these challenges. However, there is a lack of systematic review and discussion of the research progress in this direction. 
We aim to present a comprehensive review of this direction, covering the main challenges, a large number of state-of-the-art methods, how they address the challenges, and future opportunities.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:separator --><hr class=\"wp-block-separator\" \/><!-- \/wp:separator -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-bd7ea6d elementor-widget elementor-widget-heading\" data-id=\"bd7ea6d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Largely Unsolved Challenges in Anomaly Detection<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-478ffe1 elementor-widget elementor-widget-text-editor\" data-id=\"478ffe1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Although anomaly detection has been an active research area for years, a number of challenges remain largely unsolved due to the unique and complex nature of anomalies, e.g.,\u00a0<em>unknownness\u00a0<\/em>(they remain unknown until they actually occur),\u00a0<em>heterogeneity\u00a0<\/em>(different anomalies demonstrate completely different abnormal characteristics),\u00a0<em>rareness\u00a0<\/em>(anomalies are rarely occurring data instances), and\u00a0<em>diverse forms of anomalies<\/em>\u00a0(point anomaly, contextual anomaly, and group anomaly).<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>One of the most challenging issues is\u00a0<strong>the difficulty of achieving a high anomaly detection recall rate (Challenge #1)<\/strong>. Since anomalies are highly rare and heterogeneous, it is difficult to identify all of the anomalies. 
Many normal instances are wrongly reported as anomalies while true yet sophisticated anomalies are missed.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-81a19cf elementor-widget elementor-widget-text-editor\" data-id=\"81a19cf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p><strong>Anomaly detection in high-dimensional and\/or not-independent data (Challenge #2)<\/strong>\u00a0is also a significant challenge. Anomalies often exhibit evident abnormal characteristics in a low-dimensional space yet become hidden and unnoticeable in a high-dimensional space. High-dimensional anomaly detection has been a long-standing problem. Subspace\/feature selection-based methods may be a straightforward solution. However, identifying intricate (e.g., high-order, nonlinear and heterogeneous) feature interactions and couplings may be essential in high-dimensional data yet remains a major challenge for anomaly detection.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ba25aaf elementor-widget elementor-widget-text-editor\" data-id=\"ba25aaf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Due to the difficulty and cost of collecting large-scale <a href=\"https:\/\/www.experfy.com\/blog\/labeled-data-first-step-towards-ai\/\" target=\"_blank\" rel=\"noreferrer noopener\">labeled <\/a>anomaly data, it is important to have\u00a0<strong>data-efficient learning of normality\/abnormality (Challenge #3)<\/strong>. 
Two major challenges are how to learn expressive normality\/abnormality representations with a small amount of labeled anomaly data and how to learn detection models that generalize to novel anomalies not covered by the given labeled anomaly data.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Many weakly\/semi-supervised anomaly detection methods assume that the given labeled training data is clean, which makes them highly vulnerable to noisy instances that are mistakenly labeled with the opposite class. One main challenge here is how to develop\u00a0<strong>noise-resilient anomaly detection (Challenge #4)<\/strong>.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-18d4344 elementor-widget elementor-widget-text-editor\" data-id=\"18d4344\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>Most existing methods are designed for point anomalies and cannot be used for conditional and group anomalies, since these exhibit completely different behaviors from point anomalies. One main challenge here is to incorporate the concept of conditional\/group anomalies into anomaly measures\/models for the\u00a0<strong>detection of those complex anomalies (Challenge #5)<\/strong>.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-499e8dd elementor-widget elementor-widget-text-editor\" data-id=\"499e8dd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>In many critical domains there may be some major risks if anomaly detection models are directly used as black-box models. 
For example, the\u00a0<em>rare\u00a0<\/em>data instances reported as anomalies may lead to algorithmic bias against the minority groups represented in the data, such as under-represented groups in fraud detection and crime detection systems. An effective approach to mitigating this type of risk is to have\u00a0<strong>anomaly explanation (Challenge #6)<\/strong>\u00a0algorithms that provide straightforward clues about why a specific data instance is identified as an anomaly. Providing such explanations can be as important as detection accuracy in some applications. Deriving anomaly explanations from specific detection methods is still a largely unsolved problem, especially for complex models. Developing inherently interpretable anomaly detection models is also crucial, but balancing the model\u2019s interpretability and effectiveness remains a major challenge.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:separator --><hr class=\"wp-block-separator\" \/><!-- \/wp:separator -->\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2ca614f elementor-widget elementor-widget-heading\" data-id=\"2ca614f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2 id=\"0da4\">Addressing the Challenges with Deep Anomaly Detection<\/h2><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5ddbc07 elementor-widget elementor-widget-text-editor\" data-id=\"5ddbc07\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>In a nutshell, deep anomaly detection aims to learn feature representations or anomaly scores via neural networks for the sake of anomaly detection. 
In recent years, a large number of deep anomaly detection methods have been introduced, demonstrating significantly better performance than conventional anomaly detection on addressing challenging detection problems in a variety of real-world applications. We systematically review the current deep anomaly detection methods and their capabilities in addressing the aforementioned challenges.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>To have a thorough understanding of the area, we introduce a hierarchical taxonomy to classify existing deep anomaly detection methods into three main categories and 11 fine-grained categories from the modeling perspective. An overview of the taxonomy of the methods, together with the challenges they address, is shown in Figure 1. Specifically, deep anomaly detection consists of three conceptual paradigms \u2014\u00a0<em>Deep Learning for Feature Extraction<\/em>,\u00a0<em>Learning Feature Representations of Normality<\/em>, and\u00a0<em>End-to-end Anomaly Score Learning<\/em>.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1fd372d elementor-widget elementor-widget-image\" data-id=\"1fd372d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/2138\/1*_UjVx_B8TJtCWtks3IKeYg.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b0f6508 elementor-widget elementor-widget-text-editor\" data-id=\"b0f6508\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>In the\u00a0<em>Deep Learning for Feature Extraction\u00a0<\/em>framework<em>,\u00a0<\/em>deep 
learning and anomaly detection are fully separated in the first main category, so deep learning techniques are used only as independent feature extractors. The two modules depend on each other in some form in the second main category,\u00a0<em>Learning Feature Representations of Normality<\/em>, whose objective is to learn expressive representations of normality. This category of methods can be further divided into two subcategories based on whether traditional anomaly measures are incorporated into their objective functions. These two subcategories encompass seven fine-grained categories of methods, with each category taking a different approach to formulating its objective function. The two modules are fully unified in the third main category,\u00a0<em>End-to-end Anomaly Score Learning<\/em>, in which the methods are dedicated to learning anomaly scores via neural networks in an end-to-end fashion. These methods are further grouped into four categories based on the formulation of neural network-enabled anomaly scoring.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>For each category of methods, we review the detailed methodology and algorithms, covering their key intuitions, objective functions, underlying assumptions, advantages and disadvantages, and discuss how they address the aforementioned challenges. The full details are too extensive to present here. 
See the full paper below for detail.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-14d6ebd elementor-widget elementor-widget-text-editor\" data-id=\"14d6ebd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:verse -->\n<pre class=\"wp-block-verse\"><a href=\"https:\/\/arxiv.org\/abs\/2007.02500\" target=\"_blank\" rel=\"noreferrer noopener\">Deep Learning for Anomaly Detection: A Review<\/a>  <a href=\"https:\/\/arxiv.org\/abs\/2007.02500\" target=\"_blank\" rel=\"noreferrer noopener\">Anomaly detection, a.k.a. outlier detection, has been a lasting yet active research area in various research\u2026arxiv.org<\/a><\/pre>\n<!-- \/wp:verse -->\n\n<!-- wp:paragraph -->\n<p>&nbsp;<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Anomaly detection has been an active research area for several decades, due to its broad applications in a large number of key domains such as  risk management, compliance, security, financial surveillance, health and medical risk, and AI safety.<\/p>\n","protected":false},"author":878,"featured_media":9236,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[183],"tags":[524,206],"ppma_author":[3792],"class_list":["post-9235","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-ml","tag-anomaly-detection","tag-deep-learning"],"authors":[{"term_id":3792,"user_id":878,"is_guest":0,"slug":"guansong-pang","display_name":"Guansong 
Pang","avatar_url":"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/08\/Guansong-Pang-150x150.png","author_category":"","user_url":"https:\/\/sites.google.com\/site\/gspangsite\/","last_name":"Pang","first_name":"Guansong","job_title":"","description":"Guansong Pang is Research Fellow in Data Mining and Machine Learning at the Australian Institute for Machine Learning, University of Adelaide."}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/9235","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/878"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=9235"}],"version-history":[{"count":0,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/9235\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/9236"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=9235"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=9235"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=9235"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=9235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}