AWS announces Lake Formation
Guillaume Jaudouin, our AWS Practice Lead shares his thoughts on one of the major AWS announcements from the re:Invent week, Lake Formation. Below are some of Guillaume’s initial thoughts on this recently announced service.
This is one of the most exciting announcements from the AWS re:Invent week (at least for us with a focus on data and analytics). At first, and given that the name reminds me of CloudFormation, I thought Lake Formation would be a set of templates to help deploy a Data Lake after modifying and configuring them. However, it seems to be more than that, as it will integrate many services that are currently deployed separately when implementing Data Lakes on AWS.
The typical native AWS services for acquiring, transforming, storing and managing data are included under this new service.
AWS Glue seems to be at the centre of it all, being responsible for crawling the data, transforming and cataloguing it. Another interesting aspect is the use of Machine Learning for Data Quality purposes.
This service can also be very useful to kick-start greenfield Data Lake implementations and to have best practice security in place from the get-go. It will be interesting to see how easily Lake Formation can be extended and used in conjunction with third-party components.
This is one of the services we are keen to see in action when it becomes available. Watch this space as other posts will come with a full review.