Apache Hadoop and Apache Spark

Amazon Elastic MapReduce (EMR)

LAB: TF-IDF on Spark and EMR

Feature Engineering

AWS Glue and AWS Lake Formation

LAB: Cleaning Data with AWS Glue DataBrew

Amazon Athena

Athena and Glue

Amazon QuickSight