{"id":1501,"date":"2024-01-01T16:01:10","date_gmt":"2024-01-01T16:01:10","guid":{"rendered":"https:\/\/solutionsreview.com\/thought-leaders\/?p=1501"},"modified":"2024-02-02T14:47:58","modified_gmt":"2024-02-02T14:47:58","slug":"the-myth-of-perfect-data","status":"publish","type":"post","link":"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/","title":{"rendered":"The Myth of Perfect Data"},"content":{"rendered":"<p>In the last week I&#8217;ve spoken to a number of data management professionals about data and LLMs.<\/p>\n<p>Most say we can&#8217;t create LLMs as the data isn&#8217;t perfect! I have a question about that, but, it might be a little too reflective for those folks than is necessary right now!<\/p>\n<p>So, what I want to do is talk about the elephant in the room: the myth of 100% perfect data for building out LLMs.<\/p>\n<p>The reality is that perfect data doesn&#8217;t exist! There I said it! It never has!<\/p>\n<p>In the realm of data management, the pursuit of flawless data is akin to chasing unicorns.<\/p>\n<p>The truth is, perfect data is a myth. \ud83d\ude2c<\/p>\n<p>Humans, who are the architects of data, themselves don&#8217;t always have the complete set of information, leading to occasional &#8220;hallucinations&#8221; in our understanding of reality.<\/p>\n<p>So, why should we expect our models to be infallible?<\/p>\n<p>We humans draw conclusions based on incomplete information, LLMs can also &#8220;hallucinate&#8221; when faced with data gaps. It&#8217;s a natural byproduct of the learning process. Instead of viewing this as a flaw, we should see it as an opportunity for improvement and iteration.<\/p>\n<p>Yes the buzz around\u00a0<a href=\"https:\/\/www.linkedin.com\/feed\/hashtag\/?keywords=ai&amp;highlightedUpdateUrns=urn%3Ali%3Aactivity%3A7131262839628951552\" target=\"_blank\" rel=\"noopener nofollow\" data-test-app-aware-link=\"\" class=\"external\">ai<\/a>\u00a0is rampant, and LLMs are powerful tools, but, right now they are not a substitute for human judgment. The key lies in acknowledging the role of humans in the loop. We need experts who can critically assess and validate the outputs, filling in the gaps where the model may falter.<\/p>\n<p>It&#8217;s time to dispel the notion that all data management professionals demand perfection. Rather than fixating on an unattainable ideal, let&#8217;s focus on the practical aspects of data, understanding that imperfections are part of the game.<\/p>\n<p>This may sound strange to you and I&#8217;m attempting to get my head around this. If we flip the script, instead of obsessing over an unattainable perfect data utopia, let&#8217;s revel in the quirks and charms of imperfect data. It&#8217;s the imperfections that give character to our models, making them more real and relatable.<\/p>\n<p>Do they Samir? I think you are crazy!<\/p>\n<p>Collaboration, fact checking, etc. will be the key aspects where we will work with the true potential of LLMs, a collaborative effort between business teams, data professionals and of course the AI itself.<\/p>\n<p>The world that we should be attempting to create is where we can create a safe space that capitalizes on the strengths of both human intuition and machine efficiency.<\/p>\n<p>In the ever-evolving landscape of AI, it&#8217;s essential to debunk myths and embrace the reality that perfection is an illusion. Let&#8217;s celebrate imperfections, learn from them, and work together to build more robust and reliable systems.<\/p>\n<p>That all starts with the right data &amp; AI\u00a0<a href=\"https:\/\/www.linkedin.com\/feed\/hashtag\/?keywords=strategy&amp;highlightedUpdateUrns=urn%3Ali%3Aactivity%3A7131262839628951552\" target=\"_blank\" rel=\"noopener nofollow\" data-test-app-aware-link=\"\" class=\"external\">strategy<\/a>\u00a0that leads the business strategy.<\/p>\n<p>Do you think we can do that?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In the last week I&#8217;ve spoken to a number of data management professionals about data and LLMs. Most say we can&#8217;t create LLMs as the data isn&#8217;t perfect! I have a question about that, but, it might be a little too reflective for those folks than is necessary right now! So, what I want to [&hellip;]<\/p>\n","protected":false},"author":546,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[10],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>The Myth of Perfect Data - Solutions Review Thought Leaders<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Myth of Perfect Data - Solutions Review Thought Leaders\" \/>\n<meta property=\"og:description\" content=\"In the last week I&#8217;ve spoken to a number of data management professionals about data and LLMs. Most say we can&#8217;t create LLMs as the data isn&#8217;t perfect! I have a question about that, but, it might be a little too reflective for those folks than is necessary right now! So, what I want to [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/\" \/>\n<meta property=\"og:site_name\" content=\"Solutions Review Thought Leaders\" \/>\n<meta property=\"article:published_time\" content=\"2024-01-01T16:01:10+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-02-02T14:47:58+00:00\" \/>\n<meta name=\"author\" content=\"Samir Sharma\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Samir Sharma\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/\",\"url\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/\",\"name\":\"The Myth of Perfect Data - Solutions Review Thought Leaders\",\"isPartOf\":{\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#website\"},\"datePublished\":\"2024-01-01T16:01:10+00:00\",\"dateModified\":\"2024-02-02T14:47:58+00:00\",\"author\":{\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/162438574208d9b00329141f488170b9\"},\"breadcrumb\":{\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/solutionsreview.com\/thought-leaders\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Myth of Perfect Data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#website\",\"url\":\"https:\/\/solutionsreview.com\/thought-leaders\/\",\"name\":\"Solutions Review Thought Leaders\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/solutionsreview.com\/thought-leaders\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/162438574208d9b00329141f488170b9\",\"name\":\"Samir Sharma\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/9f1640121b982389e455b3f6574bf477?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/9f1640121b982389e455b3f6574bf477?s=96&d=mm&r=g\",\"caption\":\"Samir Sharma\"},\"description\":\"Samir\u2019s focus is to empower business users by enabling informed decision making through data. As principal at datazuum, Samir has the ability to tackle both business and technology challenges. He has 20 years of international experience in the UK, Europe, Africa, USA, all gained from consulting and strategy roles across a variety of sectors \u2013 Aerospace &amp; Defence, Government, High End Luxury Retail, Postal, Telecoms. Prior to datazuum, Samir led the decision sciences team for a large organisation where he delivered strategic data driven Business Intelligence &amp; Analytics solutions across all sectors.\",\"url\":\"https:\/\/solutionsreview.com\/thought-leaders\/author\/samir-sharma\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The Myth of Perfect Data - Solutions Review Thought Leaders","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"The Myth of Perfect Data - Solutions Review Thought Leaders","og_description":"In the last week I&#8217;ve spoken to a number of data management professionals about data and LLMs. Most say we can&#8217;t create LLMs as the data isn&#8217;t perfect! I have a question about that, but, it might be a little too reflective for those folks than is necessary right now! So, what I want to [&hellip;]","og_url":"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/","og_site_name":"Solutions Review Thought Leaders","article_published_time":"2024-01-01T16:01:10+00:00","article_modified_time":"2024-02-02T14:47:58+00:00","author":"Samir Sharma","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Samir Sharma","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/","url":"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/","name":"The Myth of Perfect Data - Solutions Review Thought Leaders","isPartOf":{"@id":"https:\/\/solutionsreview.com\/thought-leaders\/#website"},"datePublished":"2024-01-01T16:01:10+00:00","dateModified":"2024-02-02T14:47:58+00:00","author":{"@id":"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/162438574208d9b00329141f488170b9"},"breadcrumb":{"@id":"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/solutionsreview.com\/thought-leaders\/the-myth-of-perfect-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/solutionsreview.com\/thought-leaders\/"},{"@type":"ListItem","position":2,"name":"The Myth of Perfect Data"}]},{"@type":"WebSite","@id":"https:\/\/solutionsreview.com\/thought-leaders\/#website","url":"https:\/\/solutionsreview.com\/thought-leaders\/","name":"Solutions Review Thought Leaders","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/solutionsreview.com\/thought-leaders\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/162438574208d9b00329141f488170b9","name":"Samir Sharma","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/solutionsreview.com\/thought-leaders\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/9f1640121b982389e455b3f6574bf477?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/9f1640121b982389e455b3f6574bf477?s=96&d=mm&r=g","caption":"Samir Sharma"},"description":"Samir\u2019s focus is to empower business users by enabling informed decision making through data. As principal at datazuum, Samir has the ability to tackle both business and technology challenges. He has 20 years of international experience in the UK, Europe, Africa, USA, all gained from consulting and strategy roles across a variety of sectors \u2013 Aerospace &amp; Defence, Government, High End Luxury Retail, Postal, Telecoms. Prior to datazuum, Samir led the decision sciences team for a large organisation where he delivered strategic data driven Business Intelligence &amp; Analytics solutions across all sectors.","url":"https:\/\/solutionsreview.com\/thought-leaders\/author\/samir-sharma\/"}]}},"_links":{"self":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/posts\/1501"}],"collection":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/users\/546"}],"replies":[{"embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/comments?post=1501"}],"version-history":[{"count":0,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/posts\/1501\/revisions"}],"wp:attachment":[{"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/media?parent=1501"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/categories?post=1501"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/solutionsreview.com\/thought-leaders\/wp-json\/wp\/v2\/tags?post=1501"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}