Four Data Integration Delivery Methods: An Introduction

Four Data Integration Delivery Methods: An Introduction

Data Integration is a combination of technical and business processes used to combine different data from disparate sources for analysis. This process generally supports the analytical processing of data by aligning, combining, and presenting each data store to an end-user, and is usually executed in a data warehouse with integration software.

If you’re just beginning your search for a new Data Integration tool, knowing the different feature offerings each solution offers is key. Between traditional ETL (extract, transform, load), virtualization, Integration Platform as a Service (iPaaS), Big Data and self-service, it can be difficult to distinguish between what a particular platform does as its main focus. Sifting through the various styles of integration tools can be confusing, so we put together this list of the four major data delivery techniques common in integration tools:

Bulk/Batch

Acts as a support mechanism for extract, transform and load (ETL) processes to consolidate data from primary databases. This involves bulk and or batch data extraction that draws data from across system and organizational data stores. This is an efficient way of processing large data volumes over a period of time. Data is collected and processed, then batch results are created.

Data Virtualization

Allows users to create a virtual abstract layer than can be mirrored to provide one single view of all the data that resides in the database instead of having to run through the process of ETL to get the data loaded up into an analytic framework. This enables the user to piece together databases, data warehouses, and even cloud services to gain a comprehensive view of the data that matters most.

Message-Oriented Movement

Groups data into messages that applications can read so data can be exchanged in real-time. This depends on a message bus which becomes triggered by events and delivers data packets to application integration technologies. Oftentimes middleware is involved, acting as a software or hardware infrastructure that supports sending and receiving messages between distributed systems.

Data Replication

Frequent copying of data from a database to another that allows all users to share the same level of information, resulting in a distributed database that enables user access to data relevant to their own tasks. This provides data synchronization that enables users to manage growing data volumes while gaining access to real-time information.

Timothy King
Follow Tim

Timothy King

Editor, Data and Analytics at Solutions Review
Timothy leads Solutions Review's Business Intelligence, Data Integration and Data Management areas of focus. He is recognized as one of the top authories in Big Data, and the number-one authority in enterprise middleware. Timothy has also been named one of the world's top-75 most influential business journalists by Richtopia.
Timothy King
Follow Tim

Leave a Reply

Your email address will not be published. Required fields are marked *