Strategic advisory and analyst firm Constellation Research recently released findings from its Constellation ShortList for Self-Service Data Preparation. The report, which was authored by Doug Henschen, highlights 4 solutions to know, presenting vendors in different categories of the market relevant to early adopters. Constellation compiled the list through conversations with early adopters, independent analysts, and briefings with solution providers. The ShortList evaluation is updated on a 180-day schedule.
Constellation Research evaluated more than 15 solutions categorized in this market. The analyst house uses a proprietary threshold criteria for vendor inclusion which includes data connection, exploration and machine learning-assisted profiling, support for a variety of user personas, flexible data blending, transformation and enrichment, support for sharing, repeatability and control via collaboration, and stand-alone deployment and data delivery options.
The Q1 2021 report features the following solution providers:
Alteryx Designer is a part of the company’s flagship analytics and data science platform. The tool features an intuitive user interface that enables users to connect and cleanse data from data warehouses, cloud applications, spreadsheets, and other sources. Users can leverage data quality, integration and transformation features as well. Alteryx Designer also includes data blending for spatial data files so they can be joined with third-party data such as demographics.
DataRobot Paxata Self-Service Data Preparation features flexible deployment and self-service operation. The app is built on a visual user interface that has familiar spreadsheet metaphors so users don’t have to learn an entirely new tool. The app also boasts Assisted Intelligence that provides algorithmic assistance to infer the meaning of data, and machine learning captures steps for future data work.
Informatica’s data integration tools portfolio includes both on-prem and cloud deployments for a number of enterprise use cases. The vendor combines advanced hybrid integration and governance functionality with self-service business access for various analytic functions. Augmented integration is possible via Informatica’s CLAIRE Engine, a metadata-driven AI engine that applies machine learning. Informatica touts strong interoperability between its growing list of data management software products.
Trifacta offers a suite of what its dubbed ‘data wrangling’ tools in three different iterations: Trifacta Wrangler, Wrangler Edge, and Wrangler Enterprise. Trifacta allows users to do data prep without having to manually write code or use mapping-based systems. The Predictive Transformation function enables the exploration of data content so users can define a recipe for how the data should be transformed. Data Wrangler also includes data discovery, structuring, cleaning, enriching, and validation capabilities.
Latest posts by Timothy King (see all)
- The Three Best Data Engineering Books on Our Reading List - April 8, 2021
- The 8 Best Data Engineering Courses and Online Training for 2021 - April 8, 2021
- Trifacta Launches Industry First Data Engineering Cloud - April 8, 2021