Storage, networking, analytics, machine learning, and artificial intelligence solution provider, Amazon Web Services (AWS), recently announced the general availability of AWS Lake Formation. This is a fully managed service that facilitates the building, securing, and management of data lakes. AWS Lake Formation simplifies and automates the manual steps usually needed to create a data lake, such as collecting, cleaning, and cataloging data, as well as securely making that data available for analytics.
Users are enabled to bring their data into a data lake from a range of sources using pre-defined templates, classify and prepare the data automatically, and define granular data access policies in order to govern access by different groups within an organization. Users are able to then analyze this data using their choice of AWS analytics and machine learning services, including Amazon Athena, Amazon Redshift, and AWS Glue. Amazon EMR, Amazon QuickSight, and Amazon SageMaker will be following in the coming months.
In a press statement, Vice President of Databases, Analytics, and Machine Learning at AWS, Raju Gulabani said, “our customers tell us that Amazon S3 is the ideal place to house their data lakes, which is why AWS hosts more data lakes than anyone else – with tens of thousands and growing every day. They’ve also told us that they want it to be easier and faster to set up and manage their data lakes. That’s why we built AWS Lake Formation, so customers can spend more time learning from their data and innovating, rather than wrestling that data into functioning data lakes. AWS Lake Formation is available today and we’re excited to see how customers use it as one of the building blocks for growing and transforming their businesses and customer experiences.”
AWS Lake Formation cleans and deduplicates data by utilizing machine learning to improve data consistency and quality. To simplify security and data access, AWS’ platform offers a single, centralized place to set up and manage data access policies, governance, and auditing across Amazon S3 and multiple analytics engines. To reduce the amount of time data scientists and analysts spend searching for the right data set for their needs, AWS Lake Formation provides a searchable catalog that describes available data sets and their appropriate business use.
To Read AWS’ full press release, click here.
Latest posts by Tess Hanna (see all)
- AWS Announces the General Availability of AWS Lake Formation - August 13, 2019
- Pros and Cons: Is Cloud Storage Right for You? - August 12, 2019
- Four Key Things to Look for When Evaluating Block Storage Vendors - August 6, 2019