{"id":1851,"date":"2016-11-04T15:41:57","date_gmt":"2016-11-04T19:41:57","guid":{"rendered":"https:\/\/solutionsreview.com\/data-integration\/?p=1851"},"modified":"2016-11-04T16:26:22","modified_gmt":"2016-11-04T20:26:22","slug":"the-future-of-etl-and-the-argument-for-spark-augmentation","status":"publish","type":"post","link":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/","title":{"rendered":"The Future of ETL and the Argument for Spark Augmentation"},"content":{"rendered":"<p style=\"text-align: justify\">In managing databases, extract, transform, load (ETL) refers to three separate functions combined into a single programming tool.\u00a0Fragmentation in the legacy Data Integration market made us wonder whether or not traditional integration tools were <a href=\"https:\/\/solutionsreview.com\/data-integration\/is-data-integration-dead\/\" target=\"_blank\">becoming obsolete<\/a> before our very eyes. With this in mind, we <a href=\"https:\/\/solutionsreview.com\/data-integration\/crowd-etl-still-very-much-alive\/\" target=\"_blank\">asked the crowd<\/a> whether or not they believed Data Integration as we\u2019ve known it was dying. To our surprise, the answer was a resounding no, and it appears that legacy tools are still being used in many verticals as enterprises prepare for the next wave in data tools.<\/p>\n<p style=\"text-align: justify\">In a recent presentation at Spark Summit EU, ING&#8217;s Chapter Lead in Analytics Bas Geerdink spoke to this very topic, recommending a migration from ETL to Apache Spark for data processing and movement. Geerdink, who is also a certified Spark developer argues that ETL has seen no real technological or market evolution like BI and the data warehouse have in recent years.\u00a0ETL tools don&#8217;t seem to have a major role in the future outside of niche use cases, with this slideshow even referring to these solutions as &#8220;<a href=\"https:\/\/solutionsreview.com\/data-integration\/data-latency-is-turning-etl-into-et-hell\/\" target=\"_blank\">ETL Hell<\/a>.&#8221; Make your own conclusion, and click through the presentation to learn more.<\/p>\n<div class=\"column one-fourth\"><p>&nbsp;<\/p><\/div>\n<div class=\"column half\"><p><iframe loading=\"lazy\" title=\"Spark Summit EU talk by Bas Geerdink\" src=\"https:\/\/www.slideshare.net\/slideshow\/embed_code\/key\/sNiuMB2C2krqmC\" width=\"427\" height=\"356\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" style=\"border:1px solid #CCC; border-width:1px; margin-bottom:5px; max-width: 100%;\" allowfullscreen> <\/iframe> <\/p>\n<div style=\"margin-bottom:5px\"> <strong> <a href=\"https:\/\/www.slideshare.net\/SparkSummit\/spark-summit-eu-talk-by-bas-geerdink-68139317\" title=\"Spark Summit EU talk by Bas Geerdink\" target=\"_blank\">Spark Summit EU talk by Bas Geerdink<\/a> <\/strong> from <strong><a href=\"https:\/\/www.slideshare.net\/SparkSummit\" target=\"_blank\">Spark Summit<\/a><\/strong> <\/div>\n<p>&nbsp;<\/p><\/div>\n<div class=\"hr hr\"><\/div>\n<br \/>Widget not in any sidebars<br \/>\n<p>[youtube https:\/\/www.youtube.com\/watch?v=beLcYabuj6c&amp;amp;w=560&amp;amp;h=315]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In managing databases, extract, transform, load (ETL) refers to three separate functions combined into a single programming tool.\u00a0Fragmentation in the legacy Data Integration market made us wonder whether or not traditional integration tools were becoming obsolete before our very eyes. With this in mind, we asked the crowd whether or not they believed Data Integration [&hellip;]<\/p>\n","protected":false},"author":23,"featured_media":1855,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[1,4],"tags":[89,396,194,397,395],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>The Future of ETL and the Argument for Spark Augmentation<\/title>\n<meta name=\"description\" content=\"In a presentation at Spark Summit EU, ING&#039;s Chapter Lead in Analytics Bas Geerdink takes the stance that ETL tools should be replaced by Apache Spark.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Tim King\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/\"},\"author\":{\"name\":\"Tim King\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c\"},\"headline\":\"The Future of ETL and the Argument for Spark Augmentation\",\"datePublished\":\"2016-11-04T19:41:57+00:00\",\"dateModified\":\"2016-11-04T20:26:22+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/\"},\"wordCount\":259,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#organization\"},\"image\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg\",\"keywords\":[\"Apache Spark\",\"Bas Geerdink\",\"ETL\",\"ING\",\"Spark Summit\"],\"articleSection\":[\"Best Practices\",\"Presentations\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/\",\"name\":\"The Future of ETL and the Argument for Spark Augmentation\",\"isPartOf\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg\",\"datePublished\":\"2016-11-04T19:41:57+00:00\",\"dateModified\":\"2016-11-04T20:26:22+00:00\",\"description\":\"In a presentation at Spark Summit EU, ING's Chapter Lead in Analytics Bas Geerdink takes the stance that ETL tools should be replaced by Apache Spark.\",\"breadcrumb\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg\",\"contentUrl\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg\",\"width\":800,\"height\":350,\"caption\":\"The Future of ETL and the Argument for Spark Augmentation\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/solutionsreview.com\/data-integration\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Future of ETL and the Argument for Spark Augmentation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#website\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/\",\"name\":\"Best Data Integration Vendors, News &amp; Reviews for Big Data, Applications, ETL and Hadoop\",\"description\":\"Data Integration Buyers Guide and Best Practices\",\"publisher\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/solutionsreview.com\/data-integration\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#organization\",\"name\":\"Solutions Review\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/02\/Solutions_Review_Header_Data_Integration_225.png\",\"contentUrl\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/02\/Solutions_Review_Header_Data_Integration_225.png\",\"width\":225,\"height\":90,\"caption\":\"Solutions Review\"},\"image\":{\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c\",\"name\":\"Tim King\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2023\/12\/tk.jpg\",\"contentUrl\":\"https:\/\/solutionsreview.com\/data-integration\/files\/2023\/12\/tk.jpg\",\"caption\":\"Tim King\"},\"description\":\"Tim is Solutions Review's Executive Editor covering the human impact of AI on the future of work and learning. He is also the Media Strategist behind Insight Jam (1M+ on YouTube) events and programming. A 2017 and 2018 Most Influential Business Journalist and 2021 \\\"Who's Who\\\" in multiple categories, Tim is a recognized thought leader in enterprise tech and AI.\",\"url\":\"https:\/\/solutionsreview.com\/data-integration\/author\/timking\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The Future of ETL and the Argument for Spark Augmentation","description":"In a presentation at Spark Summit EU, ING's Chapter Lead in Analytics Bas Geerdink takes the stance that ETL tools should be replaced by Apache Spark.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/","twitter_misc":{"Written by":"Tim King","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#article","isPartOf":{"@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/"},"author":{"name":"Tim King","@id":"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c"},"headline":"The Future of ETL and the Argument for Spark Augmentation","datePublished":"2016-11-04T19:41:57+00:00","dateModified":"2016-11-04T20:26:22+00:00","mainEntityOfPage":{"@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/"},"wordCount":259,"commentCount":0,"publisher":{"@id":"https:\/\/solutionsreview.com\/data-integration\/#organization"},"image":{"@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage"},"thumbnailUrl":"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg","keywords":["Apache Spark","Bas Geerdink","ETL","ING","Spark Summit"],"articleSection":["Best Practices","Presentations"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/","url":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/","name":"The Future of ETL and the Argument for Spark Augmentation","isPartOf":{"@id":"https:\/\/solutionsreview.com\/data-integration\/#website"},"primaryImageOfPage":{"@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage"},"image":{"@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage"},"thumbnailUrl":"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg","datePublished":"2016-11-04T19:41:57+00:00","dateModified":"2016-11-04T20:26:22+00:00","description":"In a presentation at Spark Summit EU, ING's Chapter Lead in Analytics Bas Geerdink takes the stance that ETL tools should be replaced by Apache Spark.","breadcrumb":{"@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#primaryimage","url":"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg","contentUrl":"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/11\/oie_4205943OeFPLDZB.jpg","width":800,"height":350,"caption":"The Future of ETL and the Argument for Spark Augmentation"},{"@type":"BreadcrumbList","@id":"https:\/\/solutionsreview.com\/data-integration\/the-future-of-etl-and-the-argument-for-spark-augmentation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/solutionsreview.com\/data-integration\/"},{"@type":"ListItem","position":2,"name":"The Future of ETL and the Argument for Spark Augmentation"}]},{"@type":"WebSite","@id":"https:\/\/solutionsreview.com\/data-integration\/#website","url":"https:\/\/solutionsreview.com\/data-integration\/","name":"Best Data Integration Vendors, News &amp; Reviews for Big Data, Applications, ETL and Hadoop","description":"Data Integration Buyers Guide and Best Practices","publisher":{"@id":"https:\/\/solutionsreview.com\/data-integration\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/solutionsreview.com\/data-integration\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/solutionsreview.com\/data-integration\/#organization","name":"Solutions Review","url":"https:\/\/solutionsreview.com\/data-integration\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/logo\/image\/","url":"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/02\/Solutions_Review_Header_Data_Integration_225.png","contentUrl":"https:\/\/solutionsreview.com\/data-integration\/files\/2016\/02\/Solutions_Review_Header_Data_Integration_225.png","width":225,"height":90,"caption":"Solutions Review"},"image":{"@id":"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c","name":"Tim King","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/data-integration\/#\/schema\/person\/image\/","url":"https:\/\/solutionsreview.com\/data-integration\/files\/2023\/12\/tk.jpg","contentUrl":"https:\/\/solutionsreview.com\/data-integration\/files\/2023\/12\/tk.jpg","caption":"Tim King"},"description":"Tim is Solutions Review's Executive Editor covering the human impact of AI on the future of work and learning. He is also the Media Strategist behind Insight Jam (1M+ on YouTube) events and programming. A 2017 and 2018 Most Influential Business Journalist and 2021 \"Who's Who\" in multiple categories, Tim is a recognized thought leader in enterprise tech and AI.","url":"https:\/\/solutionsreview.com\/data-integration\/author\/timking\/"}]}},"_links":{"self":[{"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/posts\/1851"}],"collection":[{"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/users\/23"}],"replies":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/comments?post=1851"}],"version-history":[{"count":0,"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/posts\/1851\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/media\/1855"}],"wp:attachment":[{"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/media?parent=1851"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/categories?post=1851"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/solutionsreview.com\/data-integration\/wp-json\/wp\/v2\/tags?post=1851"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}