{"id":1024,"date":"2024-01-01T15:17:03","date_gmt":"2024-01-01T15:17:03","guid":{"rendered":"https:\/\/solutionsreview.com\/expert\/?p=1024"},"modified":"2024-02-02T14:43:07","modified_gmt":"2024-02-02T14:43:07","slug":"the-dangers-of-dirty-data","status":"publish","type":"post","link":"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/","title":{"rendered":"The Dangers of Dirty Data"},"content":{"rendered":"<p>We all think we know what dirty data is, but it can mean very different things to different people, and depending on who you speak to you could end up with many different definitions.\u00a0 But, at it\u2019s most basic level, dirty data is anything thst\u2019s incorrect.<\/p>\n<p>Within procurement that it could be misspelt vendors, incorrect invoice descriptions, missing product codes, a lack of standard units of measure (e.g. ltr, l, litres), currency issues, duplicate invoices or incorrect\/partially classified data.<\/p>\n<p>Dirty data can affect the whole organisation, and we all have an impact on, and responsibility for the data we work with.\u00a0 Accurate data should be everyone\u2019s responsibility,\u00a0 but currently across many organisations data is the sole responsibility of a person or department, and everyone trusts them to make sure the data is accurate.<\/p>\n<p>How many times have you been working with a data set and noticed a small error but not said anything, or just manually corrected something from an automated report, just get it out the door on time. These small errors can filter all the way up to the top of an organisation through reports and dashboards where critical decisions are being made.<\/p>\n<p><strong>How does this affect my organisation?<\/strong><\/p>\n<p>One of the most widespread and noticeable impacts is around reporting and analytics.\u00a0 If you\u2019re in senior management, you will most likely receive a dashboard from your team that you could be using to review cost savings, supplier negotiations, rationalisation, forecasting or budgets.<\/p>\n<p>What if within that dashboard was \u00a325k of cleaning spend under IBM?\u00a0 I can already hear you saying \u201cthat\u2019s ridiculous\u201d- well, it is obvious when pointed out, but I have seen with my own eyes. \u00a0It can happen easily and occurs more frequently than you might think.<\/p>\n<p>When there are tens or hundreds of thousands of rows of data, errors will occur multiple times across many suppliers.\u00a0 And for the wider organisation, this could affect demand, planning, sales, marketing and financial decisions.<\/p>\n<p>Think back to the IBM example, each quarter the data is refreshed automatically with the cleaning classification, that \u00a325k becomes \u00a350k, then \u00a375k the following quarter, it\u2019s only when the value becomes significant that someone notices the issue.\u00a0 By this stage, how many decisions have been based on this incorrect information?<\/p>\n<p><strong>So, how do I fix it?<\/strong><\/p>\n<p>There\u2019s no magic bullet or miracle solution out there to improve the accuracy of your data, you have to use your team or an experienced professional to get the job done.\u00a0 Get your team to familiarise themselves with the data, if they are reviewing and maintaining it regularly they will soon be able to spot errors in the data quickly and efficiently.<\/p>\n<p>Your data should always have its COAT on and should always be:<\/p>\n<p><strong>Consistent \u2013<\/strong>\u00a0everyone working to the same standards<\/p>\n<p><strong>Organised<\/strong>\u00a0\u2013 categorised properly<\/p>\n<p><strong>Accurate\u00a0<\/strong>\u2013 correct.<\/p>\n<p><strong>Trustworthy\u00a0<\/strong>\u2013 you wouldn\u2019t drive around in a car without a regular inspection would you?<\/p>\n<p><strong>How do I get a data COAT?<\/strong><\/p>\n<p>With a spreadsheet of spend transactions over a period of time such as 12 to 24 months the first step should be Supplier Normalisation. This is where a new column is added to consolidate several versions of the same company to get a true picture of spend with that supplier.\u00a0 For example, I.B.M, IBM Ltd, I.B.M. would all be normalised to IBM.<\/p>\n<p>Data can be classified using minimum information, such as Supplier Name, Invoice\/PO line description and value. To get more from the data, other factors can then be added in, such as unit price. Where unit price information is not available, the quantity can be divided by the overall value.<\/p>\n<p>A suitable taxonomy will then need to be found to classify the data.\u00a0 It can be an off the shelf product such as ProClass, UNSPSC, PROC-HE or a taxonomy can be customised so that it is specific to your organisation or industry.<\/p>\n<p>This initial stage may take months is you are working with large volumes of data, it might be worth considering outsourcing this initial task to professionals experienced in this area, who will be able to complete the project in a shorter time, with greater accuracy.<\/p>\n<p><strong>It\u2019ll save you money in the long run<\/strong><\/p>\n<p>Data accuracy is an investment, not a cost. \u00a0Address the issues at the beginning \u2013 while it might seem like a costly exercise, you will undoubtedly spend less than if you have a to resolve an issue further down the line with a time-consuming and costly data clean-up operation.\u00a0 And by involving the whole team or organisation, it will be much easier to manage and maintain the most accurate data possible.<\/p>\n<p>Spend data classification shows you the whole picture, as long as it\u2019s accurate.\u00a0 You can get a true view of your spend, allowing improved cost savings, better contract compliance and possibly the most important \u2013 preventing costly mistakes before they happen.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We all think we know what dirty data is, but it can mean very different things to different people, and depending on who you speak to you could end up with many different definitions.\u00a0 But, at it\u2019s most basic level, dirty data is anything thst\u2019s incorrect. Within procurement that it could be misspelt vendors, incorrect [&hellip;]<\/p>\n","protected":false},"author":564,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[11],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>The Dangers of Dirty Data<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Dangers of Dirty Data\" \/>\n<meta property=\"og:description\" content=\"We all think we know what dirty data is, but it can mean very different things to different people, and depending on who you speak to you could end up with many different definitions.\u00a0 But, at it\u2019s most basic level, dirty data is anything thst\u2019s incorrect. Within procurement that it could be misspelt vendors, incorrect [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/\" \/>\n<meta property=\"og:site_name\" content=\"Solutions Review Thought Leaders\" \/>\n<meta property=\"article:published_time\" content=\"2024-01-01T15:17:03+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-02-02T14:43:07+00:00\" \/>\n<meta name=\"author\" content=\"Susan Walsh\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Susan Walsh\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/\",\"url\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/\",\"name\":\"The Dangers of Dirty Data\",\"isPartOf\":{\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#website\"},\"datePublished\":\"2024-01-01T15:17:03+00:00\",\"dateModified\":\"2024-02-02T14:43:07+00:00\",\"author\":{\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/a3b7a3132efb3cbfe8fe6dc043ed3ed8\"},\"breadcrumb\":{\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/solutionsreview.com\/thought-leaders\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Dangers of Dirty Data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#website\",\"url\":\"https:\/\/solutionsreview.com\/thought-leaders\/\",\"name\":\"Solutions Review Thought Leaders\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/solutionsreview.com\/thought-leaders\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/a3b7a3132efb3cbfe8fe6dc043ed3ed8\",\"name\":\"Susan Walsh\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/513fc157c05d89cdc78f2a84b09476d1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/513fc157c05d89cdc78f2a84b09476d1?s=96&d=mm&r=g\",\"caption\":\"Susan Walsh\"},\"description\":\"Susan is a specialist in data classification, taxonomy customization, and data cleansing, as well as the founder of The Classification Guru. She is an industry thought leader, TEDx speaker, and author of the published \u2018Between the Spreadsheets: Classifying and Fixing Dirty Data\u2019. Susan has also developed a methodology to accurately and efficiently classify, cleanse and check data for errors which will help prevent costly mistakes.\",\"url\":\"https:\/\/solutionsreview.com\/thought-leaders\/author\/susan-walsh\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The Dangers of Dirty Data","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"The Dangers of Dirty Data","og_description":"We all think we know what dirty data is, but it can mean very different things to different people, and depending on who you speak to you could end up with many different definitions.\u00a0 But, at it\u2019s most basic level, dirty data is anything thst\u2019s incorrect. Within procurement that it could be misspelt vendors, incorrect [&hellip;]","og_url":"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/","og_site_name":"Solutions Review Thought Leaders","article_published_time":"2024-01-01T15:17:03+00:00","article_modified_time":"2024-02-02T14:43:07+00:00","author":"Susan Walsh","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Susan Walsh","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/","url":"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/","name":"The Dangers of Dirty Data","isPartOf":{"@id":"https:\/\/solutionsreview.com\/thought-leaders\/#website"},"datePublished":"2024-01-01T15:17:03+00:00","dateModified":"2024-02-02T14:43:07+00:00","author":{"@id":"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/a3b7a3132efb3cbfe8fe6dc043ed3ed8"},"breadcrumb":{"@id":"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/solutionsreview.com\/thought-leaders\/the-dangers-of-dirty-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/solutionsreview.com\/thought-leaders\/"},{"@type":"ListItem","position":2,"name":"The Dangers of Dirty Data"}]},{"@type":"WebSite","@id":"https:\/\/solutionsreview.com\/thought-leaders\/#website","url":"https:\/\/solutionsreview.com\/thought-leaders\/","name":"Solutions Review Thought Leaders","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/solutionsreview.com\/thought-leaders\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/a3b7a3132efb3cbfe8fe6dc043ed3ed8","name":"Susan Walsh","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/513fc157c05d89cdc78f2a84b09476d1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/513fc157c05d89cdc78f2a84b09476d1?s=96&d=mm&r=g","caption":"Susan Walsh"},"description":"Susan is a specialist in data classification, taxonomy customization, and data cleansing, as well as the founder of The Classification Guru. She is an industry thought leader, TEDx speaker, and author of the published \u2018Between the Spreadsheets: Classifying and Fixing Dirty Data\u2019. Susan has also developed a methodology to accurately and efficiently classify, cleanse and check data for errors which will help prevent costly mistakes.","url":"https:\/\/solutionsreview.com\/thought-leaders\/author\/susan-walsh\/"}]}},"_links":{"self":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/posts\/1024"}],"collection":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/users\/564"}],"replies":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/comments?post=1024"}],"version-history":[{"count":0,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/posts\/1024\/revisions"}],"wp:attachment":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/media?parent=1024"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/categories?post=1024"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/tags?post=1024"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}