{"id":1867,"date":"2019-08-06T03:43:26","date_gmt":"2019-08-06T03:43:26","guid":{"rendered":"http:\/\/kusuaks7\/?p=1472"},"modified":"2024-07-19T13:10:12","modified_gmt":"2024-07-19T13:10:12","slug":"the-data-fabric-for-machine-learning-part-1","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/ai-ml\/the-data-fabric-for-machine-learning-part-1\/","title":{"rendered":"The Data Fabric for Machine Learning \u2013 Part 1"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"1867\" class=\"elementor elementor-1867\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-2654259b elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"2654259b\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-7b6f75b6\" data-id=\"7b6f75b6\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-158a2d1 elementor-widget elementor-widget-heading\" data-id=\"158a2d1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\"><h3 style=\"color: #aaa;font-style: italic\">How the new advances in semantics and the data fabric can help us be better at Machine Learning. 
Also, a new definition of machine learning.<\/h3><\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-440d5c4 elementor-widget elementor-widget-heading\" data-id=\"440d5c4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\"><h1 id=\"5fb1\" data-selectable-paragraph=\"\">Introduction<\/h1><\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-75201db elementor-widget elementor-widget-text-editor\" data-id=\"75201db\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"6c9e\" data-selectable-paragraph=\"\">If you search for machine learning online, you\u2019ll find around 2,050,000,000 results. Yeah, for real. It\u2019s not easy to find a description or definition that fits every use case, but there are amazing ones. 
Here I\u2019ll propose a different definition of machine learning, focusing on a new paradigm, the data fabric.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c8d3551 elementor-widget elementor-widget-heading\" data-id=\"c8d3551\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\"><h1 id=\"408b\" data-selectable-paragraph=\"\">Objectives<\/h1><\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-15c4475 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"15c4475\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-e44a7a7\" data-id=\"e44a7a7\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6947815 elementor-widget elementor-widget-heading\" data-id=\"6947815\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2 id=\"23aa\" data-selectable-paragraph=\"\">General<\/h2>\n<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4d773a2 elementor-widget elementor-widget-text-editor\" data-id=\"4d773a2\" data-element_type=\"widget\" data-e-type=\"widget\" 
data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote>Explain the data fabric\u2019s connection with machine learning.<\/blockquote>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-97ca138 elementor-widget elementor-widget-heading\" data-id=\"97ca138\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2 id=\"ff84\" data-selectable-paragraph=\"\">Specifics<\/h2><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-36c0c5d elementor-widget elementor-widget-text-editor\" data-id=\"36c0c5d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<ul>\n \t<li id=\"0711\" data-selectable-paragraph=\"\">Give a description of the data fabric and the ecosystems used to create it.<\/li>\n \t<li id=\"2bcf\" data-selectable-paragraph=\"\">Explain in a few words what machine learning is.<\/li>\n \t<li id=\"cd11\" data-selectable-paragraph=\"\">Propose a way of visualizing machine learning insights inside the data fabric.<\/li>\n<\/ul>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8ef4c65 elementor-widget elementor-widget-heading\" data-id=\"8ef4c65\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\"><h1 id=\"d9f9\" data-selectable-paragraph=\"\">Main theory<\/h1><\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1039e95 elementor-widget elementor-widget-text-editor\" data-id=\"1039e95\" 
data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"428e\" data-selectable-paragraph=\"\">If we can construct a\u00a0<strong>data fabric<\/strong>\u00a0that supports all the data in the company, then a\u00a0<strong>business<\/strong>\u00a0<strong>insight<\/strong>\u00a0inside of it can be thought of as a\u00a0<strong>dent<\/strong>\u00a0in it. The\u00a0<strong>automatic process<\/strong>\u00a0of discovering that insight is called\u00a0<strong>machine learning<\/strong>.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-a16db07 elementor-widget elementor-widget-heading\" data-id=\"a16db07\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\"><h1 id=\"a873\" data-selectable-paragraph=\"\">Section 1. 
What is the Data Fabric?<\/h1><\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1b13370 elementor-widget elementor-widget-image\" data-id=\"1b13370\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/700\/0*lXF1Gih6svQfmu_k.jpg\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-dfa7170 elementor-widget elementor-widget-text-editor\" data-id=\"dfa7170\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"816b\" data-selectable-paragraph=\"\">I\u2019ve\u00a0<a href=\"https:\/\/towardsdatascience.com\/deep-learning-for-the-masses-and-the-semantic-layer-f1db5e3ab94b\" class=\"broken_link\" rel=\"noopener\">talked before<\/a>\u00a0about the data fabric, and I gave a definition of it (I\u2019ll put it here again below).<\/p>\n<p id=\"edc4\" data-selectable-paragraph=\"\">There are several words we should mention when we talk about the data fabric: graphs, knowledge graph, ontology, semantics, linked data. Read the article linked above if you want those definitions; with them in hand, we can say that:<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3d8e7b2 elementor-widget elementor-widget-text-editor\" data-id=\"3d8e7b2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote><em>The Data Fabric is the platform that supports all the data in the company: how it\u2019s managed, described, combined and universally accessed. 
This platform is formed from an Enterprise Knowledge Graph to create a uniform and unified data environment.<\/em><\/blockquote>\n<p id=\"460f\" data-selectable-paragraph=\"\">Let\u2019s break that definition into parts. The first thing we need is a\u00a0<strong>knowledge graph<\/strong>.<\/p>\n<p id=\"778c\" data-selectable-paragraph=\"\">The knowledge graph consists of integrated collections of data and information that also contain huge numbers of links between different data. The key here is that instead of looking for possible answers, under this new model\u00a0<strong>we\u2019re seeking an answer.<\/strong>\u00a0We want the facts; where those facts come from is less important. The data here can represent concepts, objects, things, people and actually whatever you have in mind. The graph fills in the relationships, the connections between the concepts.<\/p>\n<p id=\"a54b\" data-selectable-paragraph=\"\">Knowledge graphs also allow you to create structures for the relationships in the graph. With them, it\u2019s possible to set up a framework to study data and its relation to other data (<a href=\"https:\/\/towardsdatascience.com\/ontology-and-data-science-45e916288cc5\" class=\"broken_link\" rel=\"noopener\">remember ontology?<\/a>).<\/p>\n<p id=\"56bf\" data-selectable-paragraph=\"\">In this context we can ask our data lake this question:<\/p>\n\n<blockquote>What exists here?<\/blockquote>\n<p id=\"1b4c\" data-selectable-paragraph=\"\">The concept of the data lake is important too because we need a place to store our data, govern it and run our jobs. 
But we need a smart data lake, a place that understands what we have and how to use it; that\u2019s one of the benefits of having a data fabric.<\/p>\n<p id=\"0964\" data-selectable-paragraph=\"\">The data fabric should be uniform and unified, meaning that we should make an effort to organize all the data in the organization in one place and really manage and govern it.<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e4d0880 elementor-widget elementor-widget-heading\" data-id=\"e4d0880\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\"><h1 id=\"1c68\" data-selectable-paragraph=\"\">Section 2. What is Machine Learning?<\/h1>\n<\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-146d523 elementor-widget elementor-widget-image\" data-id=\"146d523\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/1000\/0*qZWN3MaEuxiNIa6U.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2cb4a90 elementor-widget elementor-widget-text-editor\" data-id=\"2cb4a90\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"9004\" data-selectable-paragraph=\"\">Machine Learning has been around for a while now. 
There are great descriptions, books, articles and blogs about it, so I\u2019m not going to bore you with 10 paragraphs on what it is.<\/p>\n<p id=\"2c59\" data-selectable-paragraph=\"\">I just want to make some points clear.<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b7caab7 elementor-widget elementor-widget-text-editor\" data-id=\"b7caab7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote><strong>Machine Learning is not magic.<\/strong>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7f3c090 elementor-widget elementor-widget-text-editor\" data-id=\"7f3c090\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tMachine Learning is a part of the data science workflow, but not the end of it.\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e2e2b65 elementor-widget elementor-widget-text-editor\" data-id=\"e2e2b65\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tMachine Learning needs data to exist. 
At least for now.<\/blockquote>\n<p id=\"18f4\" data-selectable-paragraph=\"\">OK, after that, let me give a kind of borrowed and personalized definition of machine learning:<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c0b23eb elementor-widget elementor-widget-text-editor\" data-id=\"c0b23eb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote>Machine learning is the automatic process of understanding patterns in data and some data representations by using algorithms that are able to extract those patterns without being specifically programmed for that, to create models that solve a particular (or multiple) problem(s).<\/blockquote>\n<p id=\"2f99\" data-selectable-paragraph=\"\">You can agree with this definition or not; there are great ones in the literature right now. I just think this one is simple and useful for what I want to express.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5b081cf elementor-widget elementor-widget-heading\" data-id=\"5b081cf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\">\n<h1 id=\"0b56\" data-selectable-paragraph=\"\">Section 3. 
Doing Machine Learning in the Data Fabric<\/h1><\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fdee86c elementor-widget elementor-widget-image\" data-id=\"fdee86c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/700\/0*efp_U5uvSIhni4rb.jpg\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fb981fb elementor-widget elementor-widget-text-editor\" data-id=\"fb981fb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"f8f7\" data-selectable-paragraph=\"\">In Einstein\u2019s theory of gravity (General Relativity), he proposed mathematically that mass can deform space-time, and that this deformation is what we understand as gravity. I know that if you are not familiar with the theory it can sound weird. 
Let me try to explain it.<\/p>\n<p id=\"1bfa\" data-selectable-paragraph=\"\">In the \u201cflat\u201d space-time of special relativity, where gravity is absent, the laws of mechanics take on an especially simple form: As long as no external force is acting on an object, it will move on a straight line through space-time: at a constant velocity along a straight path (Newton\u2019s first law of mechanics).<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2806a64 elementor-widget elementor-widget-text-editor\" data-id=\"2806a64\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"bb2e\" data-selectable-paragraph=\"\">But when we have mass and acceleration we can say we are in the presence of gravity. Like Wheeler said:<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f77e804 elementor-widget elementor-widget-text-editor\" data-id=\"f77e804\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote>Spacetime tells matter how to move; matter tells spacetime how to curve.<\/blockquote>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-6531acf elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"6531acf\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-f06eafd\" data-id=\"f06eafd\" 
data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-17119af elementor-widget elementor-widget-text-editor\" data-id=\"17119af\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"7fc1\" data-selectable-paragraph=\"\">In the image above the \u201ccubes\u201d are a representation of the space-time fabric. When mass moves within it, it deforms it, and the way the \u201clines\u201d bend tells us how a nearby object will behave. So gravity is something like:<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5a9816f elementor-widget elementor-widget-text-editor\" data-id=\"5a9816f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"b2fc\" data-selectable-paragraph=\"\">So when we have mass we can make a \u201cdent\u201d in space-time, and after that what we see when we are close to that dent is gravity. We have to be close enough to the object to feel it.<\/p>\n<p id=\"9e85\" data-selectable-paragraph=\"\">That\u2019s exactly what I\u2019m proposing machine learning can be in the data fabric. I know I sound crazy. Let me explain myself.<\/p>\n<p id=\"c8c4\" data-selectable-paragraph=\"\">Let\u2019s say we have created a data fabric. 
For me, the best tool out there is Anzo, as I mentioned in other articles.<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-69fb147 elementor-widget elementor-widget-image\" data-id=\"69fb147\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*v2nboJXUzRq9OmMM7LpjTA.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8c20740 elementor-widget elementor-widget-text-editor\" data-id=\"8c20740\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: center;\" data-selectable-paragraph=\"\"><a href=\"https:\/\/www.cambridgesemantics.com\/\" rel=\"noopener\">https:\/\/www.cambridgesemantics.com\/<\/a><\/p>\n<p id=\"12f0\" data-selectable-paragraph=\"\">You can build something called \u201cThe Enterprise Knowledge Graph\u201d with Anzo, and of course create your data fabric.<\/p>\n<p id=\"f31b\" data-selectable-paragraph=\"\">The nodes and edges of the graph flexibly capture a high-resolution twin of every data source, structured or unstructured. 
The graph can help users answer any question quickly and interactively, allowing users to converse with the data to uncover\u00a0<strong>insights<\/strong>.<\/p>\n<p id=\"c824\" data-selectable-paragraph=\"\">By the way, this is how I\u2019m picturing an insight:<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-f185545 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"f185545\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-efcb14a\" data-id=\"efcb14a\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-9f45839 elementor-widget elementor-widget-image\" data-id=\"9f45839\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/700\/1*0w6fXJHqIbgfmYUSIBz1BQ.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-22a16a1 elementor-widget elementor-widget-text-editor\" data-id=\"22a16a1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: center;\" data-selectable-paragraph=\"\">Image by\u00a0<a href=\"https:\/\/www.instagram.com\/heizelvazquez\/\" rel=\"noopener\">H\u00e9izel V\u00e1zquez<\/a><\/p>\n<p 
id=\"f16b\" data-selectable-paragraph=\"\">If we have the data fabric:<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-cd3b959 elementor-widget elementor-widget-image\" data-id=\"cd3b959\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/2400\/1*b04zqRXIGDWoAdUsKiiL1w.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-15004f1 elementor-widget elementor-widget-text-editor\" data-id=\"15004f1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: center;\" data-selectable-paragraph=\"\">Image by\u00a0<a href=\"https:\/\/www.instagram.com\/heizelvazquez\/\" rel=\"noopener\">H\u00e9izel V\u00e1zquez<\/a><\/p>\n<p id=\"2577\" data-selectable-paragraph=\"\">what I\u2019m proposing is that an insight can be thought of as a\u00a0<strong>dent<\/strong>\u00a0in it. 
And the automatic process of discovering that insight is machine learning.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-158ebd6 elementor-widget elementor-widget-image\" data-id=\"158ebd6\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/2400\/1*h0eLec-nM_Qn7UEFW4PoRw.png\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fc54d34 elementor-widget elementor-widget-text-editor\" data-id=\"fc54d34\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: center;\" data-selectable-paragraph=\"\">Image by\u00a0<a href=\"https:\/\/www.instagram.com\/heizelvazquez\/\" rel=\"noopener\">H\u00e9izel V\u00e1zquez<\/a><\/p>\n<p id=\"ba2e\" data-selectable-paragraph=\"\">So now we can say:<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-db6787d elementor-widget elementor-widget-text-editor\" data-id=\"db6787d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<blockquote><mark>Machine learning is the automatic process of discovering hidden insights in the data fabric by using algorithms that are able to find those insights without being specifically programmed for that, to create models that solve a particular (or multiple) problem(s).<\/mark><\/blockquote>\n<p id=\"22a6\" data-selectable-paragraph=\"\">Insights generated with the fabric are themselves new data that becomes explicit\/manifest as part of the fabric. 
Insights can grow the graph, potentially yielding further insights.<\/p>\n<p id=\"5738\" data-selectable-paragraph=\"\">We come to the data fabric with a problem, trying to find the hidden insights in the data, and with machine learning we can discover them. How would this look in real life?<\/p>\n<p id=\"a046\" data-selectable-paragraph=\"\">The people at\u00a0<a href=\"https:\/\/www.cambridgesemantics.com\/\" rel=\"noopener\">Cambridge Semantics<\/a>\u00a0have an answer with Anzo too. The Anzo for Machine Learning solution replaces tedious, error-prone data preparation work with a modern data platform designed to rapidly integrate, harmonize and transform data from all relevant data sources into optimized Machine Learning-ready feature datasets.<\/p>\n<p id=\"4903\" data-selectable-paragraph=\"\">The data fabric provides the advanced data transformation functionality essential for fast and effective feature engineering to help separate key business signals from irrelevant noise.<\/p>\n<p id=\"866d\" data-selectable-paragraph=\"\">Remember,\u00a0<strong>data come first<\/strong>: this new paradigm integrates and harmonizes all relevant data sources, structured and unstructured data alike, using a built-in graph database and semantic data layer. 
The data fabric conveys the business context and meaning of your data, making it easier for business users to understand and properly utilize.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ac95011 elementor-widget elementor-widget-text-editor\" data-id=\"ac95011\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"134a\" data-selectable-paragraph=\"\">Reproducibility is important for data science and of course machine learning, so we need an easy way to reuse harmonized structured and unstructured data. The data fabric provides this by managing catalogs of data sets as well as ongoing aspects of data integration such as data quality processing. It also retains end-to-end lineage and provenance for the data comprising machine learning datasets, so that it is easy to find out what data transformations are required when it comes to using models in production.<\/p>\n<p id=\"42e4\" data-selectable-paragraph=\"\">In the following articles I\u2019ll give a concrete example of how to do machine learning in this new framework.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4681226 elementor-widget elementor-widget-heading\" data-id=\"4681226\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\"><h1 id=\"e4cb\" data-selectable-paragraph=\"\">Conclusions<\/h1><\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-978430f elementor-widget elementor-widget-text-editor\" data-id=\"978430f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div 
class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"81c8\" data-selectable-paragraph=\"\">Machine learning is not new, but there is a new paradigm for doing it, and maybe it\u2019s the future of the field (how optimistic of me). Inside the data fabric, we have new concepts like ontology, semantics, layers, knowledge graph, etc., and all of those can improve the way we think about and do machine learning.<\/p>\n<p id=\"8412\" data-selectable-paragraph=\"\">In this paradigm, we discover hidden insights in the data fabric by using algorithms that are able to find those insights without being specifically programmed for that, to create models that solve a particular (or multiple) problem(s).<\/p>\n\n<\/section>\n\n<hr \/>\n\n<section>\n<p id=\"1556\" data-selectable-paragraph=\"\">Thanks to the amazing team at\u00a0<a href=\"http:\/\/www.cienciaydatos.org\/\" class=\"broken_link\" rel=\"noopener\">Ciencia y Datos<\/a>\u00a0for helping with this article.<\/p>\n\n<\/section>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Machine learning is not new, but there is a new paradigm for doing it, and maybe it&rsquo;s the future of the field. Inside the data fabric, we have new concepts like ontology, semantics, layers, knowledge graph, etc., and all of those can improve the way we think about and do machine learning. 
In this paradigm, we discover hidden insights in the data fabric by using algorithms that are able to find those insights without being specifically programmed for that.<\/p>\n","protected":false},"author":252,"featured_media":3531,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[183],"tags":[92],"ppma_author":[2881],"class_list":["post-1867","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-ml","tag-machine-learning"],"authors":[{"term_id":2881,"user_id":252,"is_guest":0,"slug":"favio-vazquez","display_name":"Favio V\u00e1zquez","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/?s=96&d=mm&r=g","user_url":"","last_name":"V\u00e1zquez","first_name":"Favio","job_title":"","description":"<a href=\"https:\/\/towardsdatascience.com\/@favio.vazquezp?source=post_header_lockup\">Favio V&aacute;zquez<\/a>, physicist and computer engineer, is Data Scientist at <a href=\"http:\/\/www.bbvadata.com\/\">BBVA Data &amp; Analytics<\/a>. He works on Big Data, Data Science, Machine Learning and Computational Cosmology. 
Since 2015, he&#039;s been part of the Apache Spark collaboration, with some minor bug fixes, and improvement of documentation."}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1867","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/252"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=1867"}],"version-history":[{"count":6,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1867\/revisions"}],"predecessor-version":[{"id":36901,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1867\/revisions\/36901"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/3531"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=1867"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=1867"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=1867"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=1867"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}