Our coverage of the DBTA 100 2020 features 13 data integration vendors our editors are tracking. For a larger list, see our Data Integration Solutions Directory.
Database Trends and Applications recently released its DBTA 100 2020, an annual listing of data, information management, big data, and data science companies that are preparing for the future. The list spans the spectrum of both “well-established and cutting edge” solution providers offering data platforms, tools, and technologies to organizations around the globe. Some of the vendor entries are published alongside View From the Top articles, which are penned by executives from the companies themselves.
The editors at Solutions Review have reviewed the DBTA 100 2020 with a fine-tooth comb to identify the data integration vendors that matter most. Look at it as a sneak peek at the solution providers we track in our daily coverage of the marketplace. You’ll even find some of these company names below in our freshly pressed Buyer’s Guide for Data Integration Tools.
Actian offers data integration software in on-prem and cloud editions, DataConnect and DataCloud. DataConnect is a hybrid solution that enables users to design, deploy, and manage integrations without limits on data types or volumes. DataCloud is an elastic, on-demand services platform for deploying and managing hybrid, on-prem, or cloud-to-cloud integrations, and it is powered by Amazon Web Services.
Alluxio enables data orchestration for compute in any cloud. The product unifies data silos on-prem and across any cloud to provide data locality, accessibility, and elasticity. Alluxio is scalable to over a billion files in a single cluster, and its distributed architecture is built on three core components: Alluxio Master (manages file and object metadata), Alluxio Worker (manages node local space), and Alluxio Client (AI/ML application interface). The product also includes support for hyperscale workloads, flexible APIs, security, monitoring, and management.
The Denodo Platform offers data virtualization for joining multistructured data sources from database management systems, documents, and a wide variety of other big data, cloud, and enterprise sources. Connectivity support includes relational databases, legacy data, flat files, XML, packaged applications, and emerging data types including Hadoop. Denodo is the only data virtualization solution to be provisioned as a virtual image on AWS Marketplace.
HVR offers a variety of data integration capabilities, including cloud, data lake, and real-time integration, database and file replication, and database migration. The product allows organizations to move data bi-directionally between on-prem solutions and the cloud. Real-time data movement continuously analyzes changes in data generated by transactional systems, machines, sensors, mobile devices, and websites. The company has raised funding recently to expand its footprint in hybrid integration scenarios.
Informatica’s data integration tools portfolio includes both on-prem and cloud deployments for a number of enterprise use cases. The vendor combines advanced hybrid integration and governance functionality with self-service business access for various analytic functions. Augmented integration is possible via Informatica’s CLAIRE Engine, a metadata-driven AI engine that applies machine learning. Informatica touts strong interoperability between its growing list of data management software products.
Precisely’s solution portfolio is broken into five distinct categories based on use case. Integrate is its data integration line, which features Precisely Connect, Ironstream, Assure, and Syncsort. The Verify unit of data quality tools includes Precisely Spectrum Quality, Spectrum Context, and Trillium. The location intelligence line (Locate) touts Precisely Spectrum Spatial, Spectrum Geocoding, MapInfo, and Confirm, while Enrich features Precisely Streets, Boundaries, Points Of Interest, Addresses, and Demographics. There’s also Precisely Engage in the company’s Engage unit.
The Qlik product suite features a range of data integration capabilities that span four distinct product lines. The flagship product is Qlik Replicate, a tool that replicates, synchronizes, distributes, consolidates, and ingests data across all major databases, data warehouses, and Hadoop. The portfolio of products is buoyed by Qlik Compose and Qlik Visibility. The provider also offers Qlik CloudBeam, an Integration Platform as a Service tool, which provides cloud-optimized data replication from all major on-prem sources to Amazon Web Services, Microsoft Azure, and Google Cloud.
SAS is the largest independent vendor in the data integration tools market. The provider offers its core capabilities via SAS Data Management, where data integration and quality tools are interwoven. It includes flexible query language support, metadata integration, push-down database processing, and various optimization and performance capabilities. The company’s data virtualization tool, Federation Server, enables advanced data masking and encryption that allows users to determine who’s authorized to view data.
SnapLogic’s Intelligent Integration Platform integrates across applications, databases, data warehouses, big data streams, and IoT deployments. It allows both IT and business users to create data pipelines that can be deployed on-prem or in the cloud. It features an HTML5 visual designer and a proprietary AI algorithm called Iris that learns common integration patterns and drives self-service by recommending flows. Complete support for complex transformations, conditional operations, triggers, parameterization, aggregation, and reuse maximizes the tool’s flexibility.
StreamSets offers a DataOps platform that features smart data pipelines with built-in data drift detection and handling, as well as a hybrid architecture. The product also includes automation and collaboration capabilities across the design-deploy-operate lifecycle. StreamSets monitors data in-flight to detect changes and predicts downstream issues to ensure continuous delivery without errors or data loss. The tool’s live data map, data performance SLAs, and data protection functionality are major value-adds.
Talend offers an expansive portfolio of data integration and data management tools. The company’s flagship tool, Open Studio for Data Integration, is available via a free open-source license. Talend Integration Cloud is offered in three separate editions (SaaS, hybrid, elastic), and provides broad connectivity, built-in data quality, and native code generation to support big data technologies. Big data components and connectors include Hadoop, NoSQL, MapReduce, Spark, machine learning, and IoT.
Tamr offers a machine learning-based data integration product called Unify. The solution allows organizations to connect to any tabular data and publish it anywhere. Users can map schemas with machine learning suggestions and normalize data formats using Spark and SQL. Tamr’s Master Records feature also provides a complete view of all entities, built by asking users simple yes-or-no questions. Tamr has also begun offering an issue tracker specifically designed for data called Steward (beta).
Trifacta offers a suite of what it has dubbed ‘data wrangling’ tools in three different iterations: Trifacta Wrangler, Wrangler Edge, and Wrangler Enterprise. Trifacta allows users to do data prep without having to manually write code or use mapping-based systems. The Predictive Transformation function enables the exploration of data content so users can define a recipe for how the data should be transformed. Data Wrangler also includes data discovery, structuring, cleaning, enriching, and validation capabilities.
By Timothy King, July 24, 2020