AWS Lake Formation - Mini deep-dive

Ref: https://www.udemy.com/course/aws-certified-machine-learning-engineer-associate-mla-c01/learn/lecture/46730185

Process for building a Data Lake

  1. Create an IAM user for Data Analyst
  2. Create AWS Glue connection to your data source(s)
  3. Create S3 bucket for data lake
  4. Register S3 bucket path in Lake Formation, grant permissions
  5. Create DB in Lake Formation for data catalog, grant permissions
  6. Use a blueprint for a workflow (e.g. making DB snapshots)
  7. Run the workflow
  8. Grant SELECT permissions to whoever reads the lake (Athena, Redshift Spectrum, etc.)
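
Steps 4 and 8 can be sketched in Python as the request payloads that would be passed to the boto3 Lake Formation client (register_resource and grant_permissions). This is a minimal sketch, not the course's own code: the helper names are hypothetical, and the bucket ARN, database name, and principal ARN in the usage note are placeholders. The helpers only build dictionaries, so nothing touches AWS until a real client is passed to apply_grants.

```python
# Minimal sketch: build Lake Formation request payloads for steps 4 and 8.
# All helper names are hypothetical; only the payload keys follow the
# boto3 "lakeformation" client's register_resource / grant_permissions calls.


def register_location_request(bucket_arn):
    """Step 4: kwargs for register_resource (register the S3 path)."""
    return {"ResourceArn": bucket_arn, "UseServiceLinkedRole": True}


def data_location_grant(principal_arn, bucket_arn):
    """Step 4: kwargs for grant_permissions on the registered S3 location."""
    return {
        "Principal": {"DataLakePrincipalIdentifier": principal_arn},
        "Resource": {"DataLocation": {"ResourceArn": bucket_arn}},
        "Permissions": ["DATA_LOCATION_ACCESS"],
    }


def select_grant(principal_arn, database, table=None):
    """Step 8: kwargs for grant_permissions giving SELECT to a reader.

    With table=None the grant covers every table in the database.
    """
    table_resource = {"DatabaseName": database}
    if table is None:
        table_resource["TableWildcard"] = {}
    else:
        table_resource["Name"] = table
    return {
        "Principal": {"DataLakePrincipalIdentifier": principal_arn},
        "Resource": {"Table": table_resource},
        "Permissions": ["SELECT"],
    }


def apply_grants(lf_client, grants):
    """Send each grant payload through the given Lake Formation client."""
    for grant in grants:
        lf_client.grant_permissions(**grant)
    return len(grants)
```

Usage, assuming boto3 is installed and credentials are configured: create the client with boto3.client("lakeformation"), call lf.register_resource(**register_location_request("arn:aws:s3:::my-data-lake")), then apply_grants(lf, [select_grant("arn:aws:iam::123456789012:user/data-analyst", "lake_db")]). Keeping the payloads separate from the client call makes them easy to inspect before anything is granted.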

Governed Tables in S3