Hitachi Data Systems recently announced that they have closed on their acquisition of Pentaho, a leading data integration, visualization, and analytics solutions vendor. Under the acquisition agreement, Pentaho will continue to operate independently as itself under the umbrella of Hitachi Data Systems. Pentaho’s platform will continue to be offered independently and has been integrated into Hitachi’s advanced analytics foundation software, where it will be used to help enhance the parent company’s existing big data and processing technologies.
According to Hitachi: “HDS is a rapidly emerging global leader in the Internet of Things (IoT), operational technology, big data and machine-to-machine (M2M) analytics. Its big data analytics solutions help organizations transform vast quantities of structured and unstructured data from disparate sources into knowledge through the application of advanced data analytics, connected intelligence from IoT devices, and operational technologies (OT). Through its integration with the Pentaho platform, HDS is now extending its data integration, refinement, monitoring, management, and orchestration capabilities to deliver an incomparably sophisticated data analytics stack.”
In addition, Pentaho announced today the availability of Pentaho 5.4 at Hadoop Summit San Jose. The solution, which the company described “future-proofs” the enterprise in expanding big data universe, adds new integration for Amazon EMR, SAP HANA, and Apache Spark. Pentaho’s newest offering allows users the capabilities to build on a pragmatic platform for big data orchestration and analytics at scale, empowering organizations to drive value with Pentaho’s Big Data Blueprint use case designs.
Big data deployment in cloud, Amazon EMR support
Pentaho customers can now use Amazon EMR to natively transform data as well as design and run Hadoop MapReduce in-cluster on EMR. Organizations now have powerful new ways to operationalize a cloud-based data refinery architecture for on-demand governed data set delivery.
Blended data delivery, SAP HANA support
SAP HANA’s capabilities can now be leveraged by enterprises for use with wider varieties of data. Version 5.4 with SAP HANA integration enables governed data delivery across multiple structured and unstructured sources.
Hadoop scaling for big data environment
Data volumes increase over time, making reliable performance and scalability priorities to any data-driven business. A recent Pentaho control study demonstrated sustained processing performance of Pentaho MapReduce running at scale on a 29-node Hadoop cluster. The results showed high-performance processing at enterprise scale in big data deployments,
Other features of version 5.4 include:
- A modern, refreshed look
- Language options (French, German, Japanese)
- New APIs for applications embedding
- Apache Spark integration, enabling Spark jobs
Latest posts by Timothy King (see all)
- Solutions Review’s Second Annual BI Insight Jam: Event Live Blog - December 1, 2020
- The 19 Best Excel Data Analysis Books on Our Reading List - November 23, 2020
- What to Expect During the Second Annual Solutions Review BI Insight Jam - November 20, 2020