Data Engineering on AWS

Code training GK910032
Duur 3 dagen

Andere trainingsmethoden

Virtueel leren Prijs

eur1,995.00

(excl. BTW)

Vraag een groepstraining aan Schrijf je in

Ga naar:

Methode

Deze training is in de volgende formats beschikbaar:

Klassikale training

Klassikaal leren
Op locatie klant

Op locatie klant
Virtueel leren

Virtueel leren

Vraag deze training aan in een andere lesvorm.

Trainingsbeschrijving

Naar boven

This comprehensive course provides a deep dive into data engineering practices and solutions on Amazon Web Services (AWS). Participants will learn how to design, build, optimize, and secure data engineering solutions by using AWS services. Topics range from foundational concepts to hands-on implementation of data lakes, data warehouses, and both batch and streaming data pipelines.

This course includes presentations, demonstrations, and hands-on labs.

Virtual Learning

This interactive training can be taken from any location, your office or home and is delivered by a trainer. This training does not have any delegates in the class with the instructor, since all delegates are virtually connected. Virtual delegates do not travel to this course, Global Knowledge will send you all the information needed before the start of the course and you can test the logins.

Data

Naar boven

Doelgroep

Naar boven

- Data engineers
- Solutions architects
- DevOps engineers
- IT professionals
- Data analysts looking to expand into data engineering.

Trainingsdoelstellingen

Naar boven

In this course, you will learn to do the following:

Design and implement scalable data lakes and data warehouses on AWS.
Build, optimize, and secure batch data processing pipelines.
Develop and manage streaming data solutions.
Apply best practices for data governance and security.
Automate data engineering workflows by using AWS services.
Implement access control and security measures for data solutions.

Inhoud training

Naar boven

Module 1: Data Engineering Roles and Key Concepts

The role of a data engineer
Data discovery for a data analytics system
AWS services for data workflows
Continuous integration and continuous delivery
Networking considerations

Module 2:Designing and Implementing Data Lakes

Data lake introduction
Data lake storage
Ingest data
Catalog data
Transform data
Serve data for consumption
Lab: Setting up a Data Lake on AWS

Module 3: Optimizing and Securing Data Lake Solutions

Optimizing performance
Security using Lake Formation
Setting permissions with Lake Formation
Security and governance
Troubleshooting
Lab: Automating Data Lake Creation using AWS Lake Formation Blueprints

Module 4: Data Warehouse Architecture and Design Principles

Introduction to data warehouses
Amazon Redshift overview
Ingesting data into Amazon Redshift
Processing data
Serving data for consumption
Lab: Setting up a Data Warehouse using Amazon Redshift Serverless

Module 5: Performance Optimization Techniques for Data Warehouses

Monitoring and optimization options
Data optimization in Amazon Redshift
Query optimization in Amazon Redshift
Data orchestration

Module 6: Security and Access Control for Data Warehouses

Authentication and access control in Amazon Redshift
Data security in Amazon Redshift
Lab: Working with Amazon Redshift

Module 7: Designing Batch Data Pipelines

Introduction to batch data pipelines
Designing a batch data pipeline
Ingesting batch data

Module 8: Implementing Strategies for Batch Data Pipelines

Processing and transforming data
Transforming data formats
Integrating your data
Cataloging data
Serving data for consumption
Lab: A Day in the Life of a Data Engineer

Module 9: Optimizing, Orchestrating, and Securing Batch Data Pipelines

Optimizing the batch data pipeline
Orchestrating the batch data pipeline
Securing the batch data pipeline
Lab: Orchestrating Data Processing in Spark using AWS Step Functions

Module 10: Streaming Data Architecture Patterns

Introduction to streaming data pipelines
Ingesting data from stream sources
Storing streaming data
Processing streaming data
Analyzing streaming data
Lab: Streaming Analytics with Amazon Managed Service for Apache Flink

Module 11: Optimizing and Securing Streaming Solutions

Optimizing a streaming data solution
Securing a streaming data pipeline
Lab: Access Control with Amazon Managed Streaming for Apache Kafka

Module 12: Compliance and Cost Optimization

Compliance considerations
Cost optimization tools

Module 13: Course Wrap-Up

Voorkennis

Naar boven

Basic understanding of AWS services
Familiarity with database concepts
Basic programming or scripting knowledge
Understanding of data processing fundamentals

Onderwerpen

Vendoren

Certificeringen-per-vendor

Klassikale training

Klassikaal leren

Op locatie klant

Op locatie klant

Virtueel leren

Virtueel leren

Onderwerpen

Vendoren

Certificeringen-per-vendor

Data Engineering on AWS

Andere trainingsmethoden

Virtueel leren Prijs

Ga naar:

Methode

Klassikale training Klassikaal leren

Op locatie klant Op locatie klant

Virtueel leren Virtueel leren

Trainingsbeschrijving

Data

Doelgroep

Trainingsdoelstellingen

Inhoud training

Voorkennis

Klassikale training

Klassikaal leren

Op locatie klant

Op locatie klant

Virtueel leren

Virtueel leren