AWS Big Data Course Overview
In this AWS Big Data certification course, you will become familiar with the concepts of cloud computing and its deployment models. This course covers Amazon’s AWS cloud platform, Kinesis Analytics, AWS big data storage, processing, analysis, visualization and security services, machine learning algorithms and much more.
Skills Covered
AWS Quicksight
Kinesis streams
AWS Lambda and Glue
s3 and DynamoDB
Redshift
Amazon RDS
Hive on EMR
HBase with EMR
AWS Aurora
AWS Big Data Course Curriculum
Elgibility
This AWS Big Data certification course is well-suited for experienced technology professionals who want to excel in the data engineering space.
-
Overview of AWS Certified Data Analytics - Speciality Course
-
Overview of the Certification
-
Overview of the Course
-
Project highlights
-
Course Completion Criteria
-
Introduction to Cloud Computing
-
Cloud Computing Deployments Models
-
Types of Cloud Computing Services
-
AWS Fundamentals
-
AWS Cloud Economics
-
AWS Virtuous Cycle
-
AWS Cloud Architecture Design Principles
-
Why AWS for Big Data - Challenges
-
Databases in AWS
-
Relational vs Non Relational Databases
-
Data Warehousing in AWS
-
AWS Services for collecting, processing, storing, and analyzing big data
-
Key Takeaways
-
Deploy a Data Warehouse Using Amazon Redshift
-
AWS Big Data Collection Services
-
Fundamentals of Amazon Kinesis
-
Loading Data into Kinesis Stream
-
Assisted Practice: Loading Data into Amazon Storage
-
Kinesis Data Stream High-Level Architecture
-
Kinesis Stream Core Concepts
-
AWS Services and Amazon Kinesis Data Stream
-
How to Put Data into Kinesis Stream?
-
Kinesis Connector Library
-
Amazon Kinesis Data Firehose
-
Assisted Practice: Transfer Data into Delivery Stream using Firehose
-
Assisted Practice: Transfer VPC Flow log to Splunk using Firehose
-
Data Transfer using AWS Lambda
-
Assisted Practice: Backing up data in Amazon S3 using AWS Lambda
-
Amazon SQS
-
IoT and Big Data
-
Amazon IoT Greengrass
-
AWS Data Pipeline
-
Components of Data Pipeline
-
Assisted Practice: Export MySQL Data to Amazon S3 Using AWS Data Pipeline
-
Key Takeaways
-
Streaming Data with Kinesis Data Analytics
-
AWS Bigdata Storage services
-
Data lakes and Analytics
-
Data Management
-
Data Life Cycle
-
Fundamentals of Amazon Glacier
-
Glacier and Big Data
-
DynamoDB Introduction
-
DynamoDB: Core Components
-
Assisted Practice: Perform operations on DynamoDB table
-
DynamoDB in AWS Eco-System
-
DynamoDB Partitions
-
Data Distribution
-
DynamoDB GSI and LSI
-
DynamoDB Streams
-
Use cases: Capturing Table Activity with DynamoDB Streams
-
Cross-Region Replication
-
Assisted Practice: Create a Global Table using DynamoDB
-
DynamoDB Performance: Deep Dive
-
Partition Key Selection
-
Snowball & AWS BigData
-
Assisted Practice: Data Migration using AWS Snowball
-
AWS DMS
-
AWS Aurora in BigData
-
Assisted Practice: Create and Modify Aurora DB Cluster
-
Storing and Retrieving the Data from DynamoDB
-
AWS Bigdata Processing Services
-
Overview of Amazon Elastic MapReduce (EMR)
-
EMR Cluster Architecture
-
Apache Hadoop
-
Apache Hadoop Architecture
-
Storage Options
-
EMR Operations
-
AWS Cluster
-
Assisted Practice: Create a cluster in S3
-
Assisted Practice: Monitor a Cluster in S3
-
Using Hue with EMR
-
Assisted Practice: Launch HUE Web Interface on Amazon EMR
-
Setup Hue for LDAP
-
Assisted Practice: Configure HUE for LDAP Users
-
Hive on EMR
-
Assisted Practice: Set Up a Hive Table to Run Hive Commands
-
Key Takeaways
-
Using HBase with EMR
-
HBase Architecture
-
Assisted Practice: Create a cluster with HBase
-
HBase and EMRFS
-
Presto with EMR
-
Presto Architecture
-
Fundamentals of Apache Spark
-
Apache Spark Architecture
-
Assisted Practice: Create a cluster with Spark
-
Apache Spark Integration with EMR
-
Fundamentals of EMR File System
-
Amazon Simple Workflow
-
AWS Lambda in Big Data Ecosystem
-
AWS Lambda and Kinesis Stream
-
AWS Lambda and RedShift
-
HCatalog
-
Key Takeaways
-
Real-Time Application with Apache Spark and AWS EMR
-
Introduction to AWS Bigdata Analysis Services
-
Fundamentals of Amazon Redshift
-
Amazon RedShift Architecture
-
Assisted Practice: Launch a Cluster, Load Dataset, and Execute Queries
-
RedShift in the AWS Ecosystem
-
Columnar Databases
-
Assisted Practice: Monitor RedShift Maintenance and Operations
-
RedShift Table Design
-
Choosing the Distribution Style
-
Redshift Data types
-
RedShift Data Loading
-
COPY Command for Data Loading
-
RedShift Loading Data
-
Key Takeaways
-
Fundamentals of Machine Learning
-
Workflow of Amazon Machine Learning
-
Use cases
-
Machine learning Algorithms
-
Amazon SageMaker
-
Machine learning with Amazon Sagemaker
-
Assisted Practice: Build, Train, and Deploy a Machine Learning Model
-
Elasticsearch
-
Amazon Elasticsearch Service
-
Zone Awareness
-
Logstash
-
RStudio
-
Assisted Practice: Fetch the File and Run Analysis using RStudio
-
Amazon Athena
-
Assisted Practice: Execute Interactive SQL Queries in Athena
-
AWS Glue
-
Key Takeaways
-
Fraud Detection Using Classification Algorithms on AWS Sagemaker
-
Introduction to AWS Bigdata Visualization Services
-
Amazon QuickSight
-
Amazon QuickSight - Workflow and Use Cases
-
Assisted Practice: Analyze the marketing campaign
-
Working with data
-
Assisted Practice: Analyze the marketing campaign using data from Amazon S3
-
Assisted Practice: Analyze the marketing campaign using data from Presto
-
Amazon QuickSight: Visualization
-
Assisted Practice: Create Visuals
-
Amazon QuickSight: Stories
-
Assisted Practice: Create a Storyboard
-
Amazon QuickSight: Dashboard
-
Assisted Practice: Create a Dashboard
-
Data Visualization: Other Tools
-
Kibana
-
Assisted Practice: Create a Dashboard on Kibana
-
Key Takeaways
-
Exploratory Data Analysis Using AWS QuickSight
-
Introduction to AWS Bigdata Security
-
EMR Security
-
EMR Security: Best Practices
-
Roles
-
Fundamentals of Redshift Security
-
Data Protection and Encryption
-
Master Key, Encryption, and Decryption Process
-
Amazon Redshift Database Encryption
-
Key Management Services(KMS) Overview
-
Encryption using Hardware Security Modules
-
STS and Cross Account Access
-
Cloud Trail
-
Key Takeaways