Top 15 Best Data Integration Books You Should Read

Top 15 Best Data Integration Books You Should Read

There are loads of free resources on the web surrounding data integration in all of its forms (including Solutions Review’s Buyer’s Guides and best practices). Those are great, but sometimes it’s best to do things the old fashioned way, and there are few resources that can match the in-depth, comprehensive detail of a great book.

With this in mind, our editors have done the research for you, having reviewed many of these books. We’ve carefully selected the top data integration books based on relevance, popularity, review ratings, publish date, and ability to add business value. Below you will find a library of books from recognized leaders, experts, and technology professionals in the field.

Managing Data in Motion: Data Integration Best Practice Techniques and Technologies

Managing Data in Motion: Data Integration Best Practice Techniques and Technologies“Managing Data in Motion describes techniques that have been developed for significantly reducing the complexity of managing system interfaces and enabling scalable architectures. Author April Reeve brings over two decades of experience to present a vendor-neutral approach to moving data between computing environments and systems. Readers will learn the techniques, technologies, and best practices for managing the passage of data between computer systems and integrating disparate data together in an enterprise environment.”

Principles of Data Integration

Principles of Data Integration“This book provides an extensive introduction to the theory and concepts underlying today’s data integration techniques, with detailed, instruction for their application using concrete examples throughout to explain the concepts. Data integration is the problem of answering queries that span multiple data sources (e.g., databases, web pages). Data integration problems surface in multiple contexts, including enterprise information integration, query processing on the Web, coordination between government agencies and collaboration between scientists. In some cases, data integration is the key bottleneck to making progress in a field.”

Data Integration Blueprint and Modeling: Techniques for a Scalable and Sustainable Architecture

Data Integration Blueprint and Modeling: Techniques for a Scalable and Sustainable Architecture“This book presents the solution: a clear, consistent approach to defining, designing, and building data integration components to reduce cost, simplify management, enhance quality, and improve effectiveness. Leading IBM data management expert Tony Giordano brings together best practices for architecture, design, and methodology, and shows how to do the disciplined work of getting data integration right.”

Business Intelligence Guidebook: From Data Integration to Analytics

Business Intelligence Guidebook: From Data Integration to Analytics“After reading this book, you will be able to design the overall architecture for functioning business intelligence systems with the supporting data warehousing and data-integration applications. You will have the information you need to get a project launched, developed, managed and delivered on time and on budget – turning the deluge of data into actionable information that fuels business knowledge. Finally, you’ll give your career a boost by demonstrating an essential knowledge that puts corporate BI projects on a fast-track to success.”

Connecting the Data: Data Integration Techniques for Building an Operational Data Store (ODS)

Connecting the Data: Data Integration Techniques for Building an Operational Data Store (ODS)“Business data integration is a complex problem that must be solved when organizations change or enhance their internal structures. The goal of this book is to present a simple yet thorough resource that describes the challenges of business data integration and the solutions to these challenges such as schema integration, illustrated through an Operational Data Store (ODS) case study. This book contains three sections spanning ten chapters. Section I, Foundational Concepts, will provide you with the necessary basic concepts and discuss schema integration. Section II, Preparation and Design, introduces the case study and we will reverse engineer each of the data sources to create a set of data dictionary reports which will provide us with the meta data we need to apply the schema integration process. Section III, Physical Implementation, will present scripts to populate each of the source databases and spreadsheets and use reports to create Extract, Transform, and Load (ETL) specifications.”

Data Virtualization for Business Intelligence Systems: Revolutionizing Data Integration for Data Warehouses

Data Virtualization for Business Intelligence Systems: Revolutionizing Data Integration for Data Warehouses“Data virtualization can help you accomplish your goals with more flexibility and agility. Learn what it is and how and why it should be used with Data Virtualization for Business Intelligence Systems. In this book, expert author Rick van der Lans explains how data virtualization servers work, what techniques to use to optimize access to various data sources and how these products can be applied in different projects. You’ll learn the difference is between this new form of data integration and older forms, such as ETL and replication, and gain a clear understanding of how data virtualization really works.”

Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility

Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility“Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility, the first book ever written on the topic of data virtualization, introduces the technology that enables data virtualization and presents ten real-world case studies that demonstrate the significant value and tangible business agility benefits that can be achieved through the implementation of data virtualization solutions. The core of the book is a rich set of in-depth data virtualization case studies that describe how ten enterprises across a wide range of industries and domains have successfully adopted data virtualization to increase their business agility.”

The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling

The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling“The first edition of Ralph Kimball’s The Data Warehouse Toolkit introduced the industry to dimensional modeling, and now his books are considered the most authoritative guides in this space. This new third edition is a complete library of updated dimensional modeling techniques, the most comprehensive collection ever. It covers new and enhanced star schema dimensional modeling patterns, adds two new chapters on ETL techniques, includes new and expanded business matrices for 12 case studies, and more.”

Entity Information Life Cycle for Big Data: Master Data Management and Information Integration

Entity Information Life Cycle for Big Data: Master Data Management and Information Integration“Entity Information Life Cycle for Big Data walks you through the ins and outs of managing entity information so you can successfully achieve master data management (MDM) in the era of big data. This book explains big data’s impact on MDM and the critical role of entity information management system (EIMS) in successful MDM. Expert authors Dr. John R. Talburt and Dr. Yinle Zhou provide a thorough background in the principles of managing the entity information life cycle and provide practical tips and techniques for implementing an EIMS, strategies for exploiting distributed processing to handle big data for EIMS, and examples from real applications.”

I Heart Logs: Event Data, Stream Processing, and Data Integration

I Heart Logs: Event Data, Stream Processing, and Data Integration“Why a book about logs? That’s easy: the humble log is an abstraction that lies at the heart of many systems, from NoSQL databases to cryptocurrencies. Even though most engineers don’t think much about them, this short book shows you why logs are worthy of your attention. Based on his popular blog posts, LinkedIn principal engineer Jay Kreps shows you how logs work in distributed systems, and then delivers practical applications of these concepts in a variety of common uses—data integration, enterprise architecture, real-time stream processing, data system design, and abstract computing models.”

Big Data Integration Theory: Theory and Methods of Database Mappings, Programming Languages, and Semantics (Texts in Computer Science)

Big Data Integration Theory: Theory and Methods of Database Mappings, Programming Languages, and Semantics (Texts in Computer Science)“This book presents a novel approach to database concepts, describing a categorical logic for database schema mapping based on views, within a framework for database integration/exchange and peer-to-peer. Database mappings, database programming languages, and denotational and operational semantics are discussed in depth. An analysis method is also developed that combines techniques from second order logic, data modeling, co-algebras and functorial categorical semantics.”

Customer Data Integration: Reaching a Single Version of the Truth (SAS Institute Inc.)

Customer Data Integration: Reaching a Single Version of the Truth (SAS Institute Inc.)“Whatever business you’re in, you’re ultimately in the customer business. No matter what your product, customers pay the bills. But the strategic importance of customer relationships hasn’t brought companies much closer to a single, authoritative view of their customers. Written from both business and technical perspectives, Customer Data Integration shows companies how to deliver an accurate, holistic, and long-term understanding of their customers through CDI.”

Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools, Techniques, NoSQL, and Graph

Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools, Techniques, NoSQL, and Graph“Big Data Analytics will assist managers in providing an overview of the drivers for introducing big data technology into the organization and for understanding the types of business problems best suited to big data analytics solutions, understanding the value drivers and benefits, strategic planning, developing a pilot, and eventually planning to integrate back into production within the enterprise.”

Data Integration Life Cycle Management with SSIS: A Short Introduction by Example

Data Integration Life Cycle Management with SSIS: A Short Introduction by Example“Data Integration Life Cycle Management with SSIS shows you how to bring DevOps benefits to SSIS integration projects. Practices in this book enable faster time to market, higher quality of code, and repeatable automation. Code will be created that is easier to support and maintain. The book teaches you how to more effectively manage SSIS in the enterprise environment by drawing on the art and science of modern DevOps practices.”

Data Integration: The Relational Logic Approach (Synthesis Lectures on Artificial Intelligence and Machine Learning)

Data Integration: The Relational Logic Approach (Synthesis Lectures on Artificial Intelligence and Machine Learning)“This book is an introduction to the problem of data integration and a rigorous account of one of the leading approaches to solving this problem, viz., the relational logic approach. Relational logic provides a theoretical framework for discussing data integration. Moreover, in many important cases, it provides algorithms for solving the problem in a computationally practical way. In many respects, relational logic does for data integration what relational algebra did for database theory several decades ago. A companion web site provides interactive demonstrations all of the algorithms.”

Timothy King
Follow Tim

Timothy King

Editor, Data and Analytics at Solutions Review
Timothy covers BI and data analytics at Solutions Review. He is a recognized thought leader and influencer in the enterprise middleware space. Timothy has also been named a top-75 global business journalist by Richtopia.
Timothy King
Follow Tim