{"id":1241,"date":"2016-03-03T04:10:52","date_gmt":"2016-03-03T09:10:52","guid":{"rendered":"https:\/\/solutionsreview.com\/business-intelligence\/?p=1241"},"modified":"2016-11-07T09:57:53","modified_gmt":"2016-11-07T14:57:53","slug":"data-quality-trumps-visualization","status":"publish","type":"post","link":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/","title":{"rendered":"Data Quality Trumps Data Visualization"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1663\" src=\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg\" alt=\"Trump\" width=\"800\" height=\"350\" srcset=\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg 800w, https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v-450x197.jpg 450w, https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v-768x336.jpg 768w, https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v-600x263.jpg 600w, https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v-180x79.jpg 180w, https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v-400x175.jpg 400w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/p>\n<p><strong>By\u00a0Raghu Thiagarajan<\/strong><\/p>\n<p style=\"text-align: justify\">Data is messy \u2014 always has been, always will be. Visualization has become the poster child for addressing complex data problems that could be easily overlooked and\u00a0simplifies it using design. Patterns or insights may go unnoticed in a data spreadsheet, but if we put the same information on a bubble \u00a0chart, the insights become obvious. However, it seems that often times data quality is being overlooked in favor of visualization.<\/p>\n<p style=\"text-align: justify\">With the plethora of data sources that are popping up \u2013 structured, unstructured and semi-structured \u2013 platforms like Hadoop and NoSQL databases are becoming popular because they can accommodate messiness by not forcing you to normalize everything into predefined data models with consistent dimensions of data. However, this can often delay the problem of data cleanliness. Too often, business analysts arrive at the visualization of their analysis, only to find missing, partial or incorrect information.<\/p>\n<p><strong>Data Aggregation Improves Accuracy<\/strong><\/p>\n<p style=\"text-align: justify\">Businesses have lots of data, in lots of formats, in lots of locations. Some of it is in the cloud, some of it is in legacy databases and some of it is in spreadsheets saved to desktops. With so many sources, it\u2019s common that some information is left out, and only partial data makes its way to the final visualization stage. You need to make sure that you are aggregating data from every source. Combined data helps provide context and can lead to insights that wouldn\u2019t be noticed with only limited data access. Indeed, accuracy improves with scale, and trends and exceptions stand out more clearly.<\/p>\n<p><strong>Data Quality at Every Stage<\/strong><\/p>\n<p style=\"text-align: justify\">Data quality must be maintained by inspecting for dirty, invalid or inconsistent data at any stage in the complex analytics pipeline. Everything from minor errors to blatant mistakes must be able to be tracked and viewed. For example, if you are importing and analyzing gender and do not check for data quality you may overlook inputs of \u201cM\/F\u201d which do not compute. There need to be easy access points and stop checks throughout the entire analytics process so users can make sure that all of the data is clean and going through correctly.<\/p>\n<p style=\"text-align: justify\">With access to viewing data lineage you can check at any point who was responsible for manipulating data and exactly what they did to it, and in turn maintain data quality throughout. Data quality and consistency are imperative when it comes to ultimately extracting value from Big Data. If at any point in the data pipeline there is a question about data validity, the overall value of the resulting insights is in question.<\/p>\n<p style=\"text-align: justify\">While some may gloss over the importance of data quality, it is still absolutely crucial to success.<\/p>\n<p style=\"text-align: justify\">As Big Data tools and solutions aim to be fully enterprise-ready, they must look at combating the data quality issue head on and not wait until the last step. The challenge for analytics tools is to build in data quality protocols, and for end users to think beyond their visualization needs for a solution that tackles the more difficult problems.<\/p>\n<p style=\"text-align: justify\"><em><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-1242 alignleft\" src=\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_1222650l38XrWdC.jpg\" alt=\"Raghu Thiagarajan\" width=\"175\" height=\"129\" srcset=\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_1222650l38XrWdC.jpg 380w, https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_1222650l38XrWdC-300x221.jpg 300w\" sizes=\"(max-width: 175px) 100vw, 175px\" \/>Raghu Thiagarajan is responsible for directing <a href=\"https:\/\/www.datameer.com\/\" target=\"_blank\">Datameer<\/a>\u2019s Big Data analytics products across the company\u2019s platform, cloud and application portfolio. He has over 20 years of experience in the software industry, and has previously held leadership positions in engineering, product management and product strategy at Sybase, CrossWorlds Software, IBM, Tibco and Hortonworks. He is interested in the convergence of transactional, event-oriented and analytic workloads and easing Big Data consumption for business users. <a href=\"https:\/\/www.linkedin.com\/pub\/raghu-thiagarajan\/0\/6ab\/497\" target=\"_blank\">Connect with him on LinkedIn<\/a>.<\/em><\/p>\n<p style=\"text-align: justify\"><div class=\"widget\"><div class=\"aside-card\">\t\t\t<div class=\"textwidget\"><p><a href=\"https:\/\/insightjam.com\"><img decoding=\"async\" title=\"Insight Jam Ad\" src=\"https:\/\/solutionsreview.com\/wp-content\/uploads\/2023\/11\/ij2.jpg\" alt=\"Insight Jam Ad\" \/><\/a><\/p>\n<\/div>\n\t\t<\/div><\/div><\/p>\n","protected":false},"excerpt":{"rendered":"<p>By\u00a0Raghu Thiagarajan Data is messy \u2014 always has been, always will be. Visualization has become the poster child for addressing complex data problems that could be easily overlooked and\u00a0simplifies it using design. Patterns or insights may go unnoticed in a data spreadsheet, but if we put the same information on a bubble \u00a0chart, the insights [&hellip;]<\/p>\n","protected":false},"author":23,"featured_media":1663,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[4],"tags":[14,184,35,298,183,299],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data Quality Trumps Data Visualization<\/title>\n<meta name=\"description\" content=\"The challenge for analytics tools is to build in data quality protocols, and for end users to think beyond their visualization needs.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Tim King\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/\",\"url\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/\",\"name\":\"Data Quality Trumps Data Visualization\",\"isPartOf\":{\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg\",\"datePublished\":\"2016-03-03T09:10:52+00:00\",\"dateModified\":\"2016-11-07T14:57:53+00:00\",\"author\":{\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c\"},\"description\":\"The challenge for analytics tools is to build in data quality protocols, and for end users to think beyond their visualization needs.\",\"breadcrumb\":{\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#primaryimage\",\"url\":\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg\",\"contentUrl\":\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg\",\"width\":800,\"height\":350},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/solutionsreview.com\/business-intelligence\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Quality Trumps Data Visualization\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/#website\",\"url\":\"https:\/\/solutionsreview.com\/business-intelligence\/\",\"name\":\"Best Business Intelligence and Data Analytics Tools, Software, Solutions &amp; Vendors\",\"description\":\"BI Guides, Analysis and Best Practices\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/solutionsreview.com\/business-intelligence\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c\",\"name\":\"Tim King\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/business-intelligence\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2023\/12\/tk.jpg\",\"contentUrl\":\"https:\/\/solutionsreview.com\/business-intelligence\/files\/2023\/12\/tk.jpg\",\"caption\":\"Tim King\"},\"description\":\"Tim is Solutions Review's Executive Editor and leads coverage on data management and analytics. A 2017 and 2018 Most Influential Business Journalist and 2021 \\\"Who's Who\\\" in Data Management, Tim is a recognized industry thought leader and changemaker. Story? Reach him via email at tking@solutionsreview.com.\",\"url\":\"https:\/\/solutionsreview.com\/business-intelligence\/author\/timking\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data Quality Trumps Data Visualization","description":"The challenge for analytics tools is to build in data quality protocols, and for end users to think beyond their visualization needs.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/","twitter_misc":{"Written by":"Tim King","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/","url":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/","name":"Data Quality Trumps Data Visualization","isPartOf":{"@id":"https:\/\/solutionsreview.com\/business-intelligence\/#website"},"primaryImageOfPage":{"@id":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#primaryimage"},"image":{"@id":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#primaryimage"},"thumbnailUrl":"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg","datePublished":"2016-03-03T09:10:52+00:00","dateModified":"2016-11-07T14:57:53+00:00","author":{"@id":"https:\/\/solutionsreview.com\/business-intelligence\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c"},"description":"The challenge for analytics tools is to build in data quality protocols, and for end users to think beyond their visualization needs.","breadcrumb":{"@id":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#primaryimage","url":"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg","contentUrl":"https:\/\/solutionsreview.com\/business-intelligence\/files\/2015\/08\/oie_3221485UYRhW3v.jpg","width":800,"height":350},{"@type":"BreadcrumbList","@id":"https:\/\/solutionsreview.com\/business-intelligence\/data-quality-trumps-visualization\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/solutionsreview.com\/business-intelligence\/"},{"@type":"ListItem","position":2,"name":"Data Quality Trumps Data Visualization"}]},{"@type":"WebSite","@id":"https:\/\/solutionsreview.com\/business-intelligence\/#website","url":"https:\/\/solutionsreview.com\/business-intelligence\/","name":"Best Business Intelligence and Data Analytics Tools, Software, Solutions &amp; Vendors","description":"BI Guides, Analysis and Best Practices","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/solutionsreview.com\/business-intelligence\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/solutionsreview.com\/business-intelligence\/#\/schema\/person\/154e152a275103e373e24ada7f2feb5c","name":"Tim King","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/business-intelligence\/#\/schema\/person\/image\/","url":"https:\/\/solutionsreview.com\/business-intelligence\/files\/2023\/12\/tk.jpg","contentUrl":"https:\/\/solutionsreview.com\/business-intelligence\/files\/2023\/12\/tk.jpg","caption":"Tim King"},"description":"Tim is Solutions Review's Executive Editor and leads coverage on data management and analytics. A 2017 and 2018 Most Influential Business Journalist and 2021 \"Who's Who\" in Data Management, Tim is a recognized industry thought leader and changemaker. Story? Reach him via email at tking@solutionsreview.com.","url":"https:\/\/solutionsreview.com\/business-intelligence\/author\/timking\/"}]}},"_links":{"self":[{"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/posts\/1241"}],"collection":[{"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/users\/23"}],"replies":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/comments?post=1241"}],"version-history":[{"count":0,"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/posts\/1241\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/media\/1663"}],"wp:attachment":[{"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/media?parent=1241"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/categories?post=1241"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/solutionsreview.com\/business-intelligence\/wp-json\/wp\/v2\/tags?post=1241"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}