2021 CRN Big Data 100: 9 Data Integration Tools Companies to Consider
IT news and analysis outlet CRN recently released its 2021 (and ninth annual) Big Data 100, a ranking of prominent big data technology vendors that solution providers should be aware of. The list is made up of established and emerging big data tools vendors. The list is broken down into five distinct product categories that include business analytics, database systems, data management and data integration software, big data platforms, and data science and machine learning tools.
CRN pre-published a list of The Coolest Data Management and Integration Tool Companies included in the overall list via an interactive slideshow. Though the Big Data 100 is aimed at highlighting software vendors for the purposes of solution provider partnering, Solutions Review is most interesting in highlighting the vendors from that offer unique products and platforms for enterprise organizations. As such, we’ve read through CRN’s complete rankings, available here, to analyze the trending data integration tools companies we think matter most. For an even deeper breakdown of data integration software, tools, vendors and platforms, consult our popular Buyer’s Guide.
Datameer offers a data analytics lifecycle and engineering platform that covers ingestion, data preparation, exploration and consumption. The product features more than 70 source connectors to ingest structured, semi-structured and unstructured data. Users can directly upload data or use unique data links to pull data on demand. Datameer’s intuitive and interactive spreadsheet-style interface lets you transform, blend and enrich complex data toward the creation of data pipelines.
The Denodo Platform offers data virtualization for joining multistructured data sources from database management systems, documents, and a wide variety of other big data, cloud, and enterprise sources. Connectivity support includes relational databases, legacy data, flat files, CML, packed applications, and emerging data types including Hadoop. Denodo is the only data virtualization solution to be provisioned as a virtual image on Amazon AWS Marketplace.
Fivetran is an automated data integration platform that delivers ready-to-use connectors, transformations and analytics templates that adapt as schemas and APIs change. The product can sync data from cloud applications, databases, and event logs. Integrations are built for analysts who need data centralized but don’t want to spend time maintaining their own pipelines or ETL systems. Fivetran is easy to deploy, scalable, and offers some of the best security features of any provider in the space.
HVR is a high-volume real-time data replication solution that solves a variety of data integration use cases, including cloud, data lake, database and file replication, and database migration. The product allows organizations to move data bi-directionally between on-prem solutions and the cloud. Real-time data movement enables the ability to continuously analyze changes in data generated by transactional systems, machines, sensors, mobile devices, and more.
Informatica’s data integration tools portfolio includes both on-prem and cloud deployments for a number of enterprise use cases. The vendor combines advanced hybrid integration and governance functionality with self-service business access for various analytic functions. Augmented integration is possible via Informatica’s CLAIRE Engine, a metadata-driven AI engine that applies machine learning. Informatica touts strong interoperability between its growing list of data management software products.
Matillion offers data integration software for cloud data warehouses, and was designed for Amazon Redshift, Snowflake, and Google BigQuery. The product works by allowing users to consolidate large data sets and quickly perform data transformations. It features expert technical support as well, all by Matillion solution architects and comes free of charge. Full support is offered throughout the customer lifecycle, including trial and complex use case development. Matillion includes more than 70 pre-built connectors.
Striim offers a real-time data integration solution that enables continuous query processing and streaming analytics. Striim integrates data from a wide variety of sources, including transaction/change data, events, log files, application and IoT sensor, and real-time correlation across multiple streams. The platform features pre-built data pipelines, out-of-the-box wizards for configuration and coding, and a drag-and-drop dashboard builder.
Talend offers an expansive portfolio of data integration and data management tools. The company’s flagship tool, Open Studio for Data Integration, is available via a free open-source license. Talend Integration Cloud is offered in three separate editions (SaaS, hybrid, elastic), and provides broad connectivity, built-in data quality, and native code generation to support big data technologies. Big data components and connectors include Hadoop, NoSQL, MapReduce, Spark, machine leaning and IoT.
Trifacta offers a suite of what its dubbed ‘data wrangling’ tools in three different iterations: Trifacta Wrangler, Wrangler Edge, and Wrangler Enterprise. Trifacta allows users to do data prep without having to manually write code or use mapping-based systems. The Predictive Transformation function enables the exploration of data content so users can define a recipe for how the data should be transformed. Data Wrangler also includes data discovery, structuring, cleaning, enriching, and validation capabilities.