{"id":9639,"date":"2020-09-10T09:31:14","date_gmt":"2020-09-10T09:31:14","guid":{"rendered":"https:\/\/www.experfy.com\/blog\/?p=9639"},"modified":"2023-11-08T10:31:12","modified_gmt":"2023-11-08T10:31:12","slug":"data-engineering-in-2020","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/bigdata-cloud\/data-engineering-in-2020\/","title":{"rendered":"Data engineering in 2020"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"9639\" class=\"elementor elementor-9639\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-35eb18d2 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-eae-slider=\"85890\" data-id=\"35eb18d2\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-2c48ff9b\" data-eae-slider=\"4976\" data-id=\"2c48ff9b\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"has_eae_slider elementor-section elementor-top-section elementor-element elementor-element-84e6531 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-eae-slider=\"19190\" data-id=\"84e6531\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"has_eae_slider elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-41344fc\" data-eae-slider=\"71744\" data-id=\"41344fc\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-7d75bb3 elementor-widget elementor-widget-text-editor\" data-id=\"7d75bb3\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><mark>It is incredible how fast data processing tools and technologies are evolving. And with it, the nature of the data engineering discipline is changing as well. Tools I am using today are very different from what I used ten or even five years ago, however, many lessons learned are still relevant today.<\/mark><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7338f3d elementor-widget elementor-widget-text-editor\" data-id=\"7338f3d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>I have started to work in the data space long before\u00a0<a href=\"https:\/\/medium.com\/free-code-camp\/the-rise-of-the-data-engineer-91be18f1e603\" target=\"_blank\" rel=\"noreferrer noopener\">data engineering became a thing<\/a>\u00a0and\u00a0<a href=\"https:\/\/hbr.org\/2012\/10\/data-scientist-the-sexiest-job-of-the-21st-century\" target=\"_blank\" rel=\"noreferrer noopener\">data scientist became the sexiest job of the 21st century<\/a>. I \u2018officially\u2019 became a big data engineer six years ago, and I know firsthand the challenges developers with a background in \u201ctraditional\u201d data development have going through this journey. Of course, this transition is not easy for software engineers either, it is just different.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-42317e9 elementor-widget elementor-widget-text-editor\" data-id=\"42317e9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Even though technologies keep changing \u2014 and that\u2019s the reality for anyone working in the tech industry \u2014 some of the skills I had to learn are still relevant, but often overlooked by data developers who are just starting to make the transition to data engineering. These usually are the skills that software developers often take for granted.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>In this post, I will talk about the evolution of data engineering and what skills \u201ctraditional\u201d data developers might need to learn today (Hint: it is not Hadoop).<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3ee5aa1 elementor-widget elementor-widget-heading\" data-id=\"3ee5aa1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">The birth of the data engineer.<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b771699 elementor-widget elementor-widget-text-editor\" data-id=\"b771699\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Data teams before the Big Data craze were composed of business intelligence and ETL developers. Typical BI \/ ETL developer activities involved moving data sets from location A to location B (ETL) and building the web-hosted dashboards with that data (BI). Specialised technologies existed for each of those activities, with the knowledge concentrated within the IT department. However, apart from that, BI and ETL development had very little to do with software engineering, the discipline which was maturing heavily at the beginning of the century.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>As the data volumes grew and interest in data analytics increased, in the past ten years, new technologies were invented. Some of them died, and others became widely adopted, that in turn changed demands in skills and teams\u2019 structures. As modern BI tools allowed analysts and business people to create dashboards with minimal support from IT teams, data engineering became a new discipline, applying software engineering principles to ETL development using a new set of tools.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1a2d4d2 elementor-widget elementor-widget-heading\" data-id=\"1a2d4d2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">The challenges.<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b6cf04a elementor-widget elementor-widget-image\" data-id=\"b6cf04a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"683\" src=\"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-1024x683.jpeg\" class=\"attachment-large size-large wp-image-33972\" alt=\"\" srcset=\"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-1024x683.jpeg 1024w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-300x200.jpeg 300w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-768x512.jpeg 768w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-1536x1024.jpeg 1536w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-2048x1365.jpeg 2048w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-610x407.jpeg 610w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-750x500.jpeg 750w, https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/1_P8BWYM7YJO_OV3vTzW4LRQ-1140x760.jpeg 1140w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2dfae91 elementor-widget elementor-widget-text-editor\" data-id=\"2dfae91\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Creating a data pipeline may sound easy, but at big data scale, this meant bringing together a dozen different technologies (or more!). A data engineer had to understand a myriad of technologies in-depth, pick the right tool for the job and write code in Scala, Java or Python to create resilient and scalable solutions. A data engineer had to know their data to be able to create jobs which benefit from the power of distributed processing. A data engineer had to understand the infrastructure to be able to identify reasons for failed jobs.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Conceptually, many of those data pipelines were typical ETL jobs \u2014 collecting data sets from a number of data sources, putting them in a centralised data store ready for analytics and transforming them for business intelligence or machine learning. However, \u201ctraditional\u201d ETL developers didn\u2019t have the necessary skills to perform these tasks in the Big Data world.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-eed47ae elementor-widget elementor-widget-heading\" data-id=\"eed47ae\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Is it still the case today?<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7976417 elementor-widget elementor-widget-text-editor\" data-id=\"7976417\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>I have reviewed many articles describing what skills data engineers should have. Most of them advise learning technologies like Hadoop, Spark, Kafka, Hive, HBase, Cassandra, MongoDB, Oozie, Flink, Zookeeper, and the list goes on.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>While I agree that it won\u2019t hurt to know these technologies, I find that in many cases today, in 2020, it is enough to \u201cknow about them\u201d \u2014 what particular use cases they are designed to solve, where they should or shouldn\u2019t be used and what are the alternatives. Rapidly evolving cloud technology has given rise to a huge range of cloud-native applications and services in recent years. In the same way as modern BI tools made data analysis more accessible to the wider business several years ago, modern cloud-native data stack simplifies data ingestion and transformation tasks.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:quote -->\n<blockquote class=\"wp-block-quote\">\n<p><em>I do not think that technologies like Apache Spark will become any less popular in the next few years as they are great for complex data transformations.<\/em><\/p>\n<p><em>Still, the high rate of adoption of cloud data warehouses such as Snowflake and Google BigQuery indicates that there are certain advantages they provide. One of them is that Spark requires highly specialised skills, whereas ETL solutions on top of cloud data platforms are heavily reliant on SQL skills even for big data \u2014 such roles are much easier to fill.<\/em><\/p>\n<\/blockquote>\n<!-- \/wp:quote -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-92975ff elementor-widget elementor-widget-heading\" data-id=\"92975ff\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">What skills do data developers need to have?<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ae1c04d elementor-widget elementor-widget-image\" data-id=\"ae1c04d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/1733\/1*C7hAYweTAH6qmvXe1EBbwA.jpeg\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-139cfac elementor-widget elementor-widget-text-editor\" data-id=\"139cfac\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>BI \/ ETL developers usually possess a strong understanding of database fundamentals, data modelling and SQL. These skills are still valuable today and mostly transferable to a modern data stack \u2014 which is leaner and easier to learn than the Hadoop ecosystem.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Below are three areas I often observe \u201ctraditional\u201d data developers having gaps in their knowledge because, for a long time, they didn\u2019t have tools and approaches software engineers did. Understanding and fixing those gaps will not take a lot of time, but might make a transition to a new set of tools much smoother.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-51f32e1 elementor-widget elementor-widget-heading\" data-id=\"51f32e1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">1. <strong>Version control (Git) and understanding of CI\/CD pipeline<\/strong><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b5c69be elementor-widget elementor-widget-text-editor\" data-id=\"b5c69be\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>SQL code is a code, and as such, software engineering principles should be applied.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:list -->\n<ul>\n<li>It is important to know who, when and why changed the code<\/li>\n<li>Code should come with the tests which can be automatically run<\/li>\n<li>Code should be easily deployable to different environments<\/li>\n<\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:paragraph -->\n<p>I am a big fan of\u00a0<a href=\"https:\/\/www.getdbt.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">DBT\u00a0<\/a>\u2014 an open-source tool which brings software engineering best practices to SQL world and simplifies all these steps. It does\u00a0<strong>much more<\/strong>\u00a0than that so I strongly advise to check it out.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-870c3fa elementor-widget elementor-widget-heading\" data-id=\"870c3fa\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\"><strong>2. Good understanding of the modern cloud data analytics stack<\/strong><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-eddef71 elementor-widget elementor-widget-text-editor\" data-id=\"eddef71\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>We tend to stick with the tools we know because they often make us more productive. However, many challenges we are facing are not unique, and often can be solved today more efficiently.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>It might be intimidating trying to navigate in the cloud ecosystem at first. One workaround is to learn from other companies\u2019 experiences.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Many successful startups are very open about their data stack and the lessons they learnt on their journey. These days, it is common to adopt a version of a cloud data warehouse and several other components for data ingestion (such as Fivetran or Segment) and data visualisation. Seeing a few architectures is usually enough to get a 10,000-foot view and know what to research further when needed \u2014 e.g. dealing with events or streaming data might be an entirely new concept.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6c7c6e0 elementor-widget elementor-widget-heading\" data-id=\"6c7c6e0\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\"><strong>3. Knowing a programming language in addition to SQL<\/strong><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5da9f36 elementor-widget elementor-widget-text-editor\" data-id=\"5da9f36\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>As much as I love Scala, Python seems to be a safe bet to start with today. It is reasonably easy to pick up, loved by data scientists and supported pretty much by all components of cloud ecosystems. SQL is great for many data transformations, but sometimes it is easier to parse complex data structure with Python before ingesting it into a table or use it to automate specific steps in the data pipeline.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:quote -->\n<blockquote class=\"wp-block-quote\">\n<p><em>This is not an exhaustive list, and different companies might require different skills, what brings me to my last point \u2026<\/em><\/p>\n<\/blockquote>\n<!-- \/wp:quote -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d62e29c elementor-widget elementor-widget-image\" data-id=\"d62e29c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/miro.medium.com\/max\/6016\/1*ap5aI2L5eP2-ptHv7vkvIg.jpeg\" alt=\"\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-77795c2 elementor-widget elementor-widget-text-editor\" data-id=\"77795c2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Data processing tools and technologies have evolved massively over the last few years. Many of them have evolved to the point where they can easily scale as the data volume grows while working well with the \u201csmall data\u201d too. That can significantly simplify both the data analytics stack and the skills required to use it.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Does it mean that the role of a data engineer is changing? I think so. It doesn\u2019t mean it gets easier \u2014 business demands grow as technology advances. However, it seems that this role might become more specialised or split into a few different disciplines.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>New tools allow data engineers to focus on core data infrastructure, performance optimisation, custom data ingestion pipelines and overall pipeline orchestration. At the same time, data transformation code in those pipelines can be owned by anyone who is comfortable with SQL. For example,\u00a0<a href=\"https:\/\/www.datacouncil.ai\/blog\/emerging-data-roles-the-analytics-engineer\" target=\"_blank\" rel=\"noreferrer noopener\">analytics engineering<\/a>\u00a0is starting to become a thing. This role sits at the intersection of data engineering and data analytics and focuses on data transformation and data quality. Cloud data warehouse engineering is another one.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1325958 elementor-widget elementor-widget-text-editor\" data-id=\"1325958\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<!-- wp:paragraph -->\n<p>Regardless of whether the distinction in job titles will become widely adopted or not, I believe that \u201ctraditional\u201d data developers possess many fundamental skills to be successful in many data engineering related activities today \u2014 strong SQL and data modelling are some of them. By understanding the modern cloud analytics data stack and how different components can be combined together, learning a programming language and getting used to version control, this transition can be reasonably seamless.<\/p>\n<!-- \/wp:paragraph -->\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>It is incredible how fast data processing tools and technologies are evolving. And with it, the nature of the data engineering discipline is changing as well. <\/p>\n","protected":false},"author":909,"featured_media":9640,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[187],"tags":[611,610],"ppma_author":[3933],"class_list":["post-9639","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-bigdata-cloud","tag-data-engineering","tag-data-processing-tools"],"authors":[{"term_id":3933,"user_id":909,"is_guest":0,"slug":"maria-patrakova","display_name":"Maria Patrakova","avatar_url":"https:\/\/www.experfy.com\/blog\/wp-content\/uploads\/2020\/09\/Maria-Patrakova-150x150.jpg","author_category":"","user_url":"http:\/\/www.finity.com.au\/","last_name":"Patrakova","first_name":"Maria","job_title":"","description":"Maria Patrakova is Head Of Data Engineering at Finity Consulting,  Australia\u2019s largest independent actuarial and analytics consulting firm."}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/9639","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/909"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=9639"}],"version-history":[{"count":0,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/9639\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/9640"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=9639"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=9639"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=9639"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=9639"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}