Implement a Data Analytics Solution with Azure Databricks (DP-3011)

  • Course Code M-DP3011
  • Duration 1 day

Public Classroom Price

$412.00

Course Delivery

This course is available in the following formats:

  • Company Event

    On-site training delivered at your company

  • Public Classroom

    Traditional instructor-led classroom training

  • Virtual Learning

    Live instructor-led training delivered online

Course Overview

This course explores how to use Databricks and Apache Spark on Azure to take data projects from exploration to production. You’ll learn how to ingest, transform, and analyze large-scale datasets with Spark DataFrames, Spark SQL, and PySpark, while building confidence in managing distributed data processing. Along the way, you’ll get hands-on experience in the Databricks workspace, navigating clusters and creating and optimizing Delta tables.

You’ll also dive into data engineering practices, including designing ETL pipelines, handling schema evolution, and enforcing data quality. The course then moves into orchestration, showing you how to automate and manage workloads with Lakeflow Jobs and pipelines. To round things out, you’ll explore governance and security capabilities such as Unity Catalog and Microsoft Purview integration, ensuring you can work with data in a secure, well-managed, and production-ready environment.
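
To give a feel for the hands-on work, here is a minimal PySpark sketch of the ingest-transform-load pattern the course builds on. The file path, column names, and table name are illustrative assumptions rather than course materials, and writing in Delta format assumes a Databricks or otherwise Delta-enabled environment.

    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col

    # On Databricks a `spark` session is provided by the workspace;
    # building one here keeps the sketch self-contained.
    spark = SparkSession.builder.getOrCreate()

    # Ingest: read raw CSV into a DataFrame (path and columns are hypothetical).
    raw = spark.read.option("header", True).csv("/mnt/raw/sales.csv")

    # Transform: drop incomplete rows and cast the amount column to a number.
    clean = raw.filter(col("amount").isNotNull()).withColumn(
        "amount", col("amount").cast("double")
    )

    # Load: persist the result as a managed Delta table.
    clean.write.format("delta").mode("overwrite").saveAsTable("sales_clean")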

Course Objectives

Students will learn to:

  • Explore Azure Databricks
  • Perform data analysis with Azure Databricks
  • Use Apache Spark in Azure Databricks
  • Manage data with Delta Lake
  • Build data pipelines with Lakeflow Declarative Pipelines (formerly Delta Live Tables)
  • Deploy workloads with Lakeflow Jobs (formerly Azure Databricks Workflows)
  • Use SQL Warehouses in Azure Databricks
  • Run Azure Databricks Notebooks with Azure Data Factory

Course Content

Module 1: Implement a Data Analytics Solution with Azure Databricks

  • Explore Azure Databricks
  • Perform data analysis with Azure Databricks
  • Use Apache Spark in Azure Databricks
  • Manage data with Delta Lake
  • Build Lakeflow Declarative Pipelines (see the sketch after this list)
  • Deploy workloads with Lakeflow Jobs
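
For a sense of what "declarative" means here, below is a minimal sketch using the Python dlt module that Lakeflow Declarative Pipelines (the evolution of Delta Live Tables) is built around. The table name, source path, and cleaning logic are invented for illustration, and the `spark` session is supplied by the pipeline runtime.

    import dlt
    from pyspark.sql.functions import col

    # Declare a pipeline table; the engine manages dependencies, retries,
    # and incremental runs instead of hand-written orchestration code.
    @dlt.table(comment="Orders cleaned from a raw JSON source (hypothetical).")
    def orders_clean():
        # `spark` is provided by the pipeline runtime.
        return (
            spark.read.format("json")
            .load("/mnt/raw/orders/")  # hypothetical source path
            .filter(col("order_id").isNotNull())
        )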

Course Prerequisites

Before taking this course, learners should already be comfortable with:

  • Python fundamentals, including writing simple scripts and working with common data structures
  • SQL queries that filter, join, and aggregate data
  • Common file formats such as CSV, JSON, and Parquet
  • The Azure portal and core services such as Azure Storage
  • General data concepts, such as batch versus streaming processing and structured versus unstructured data

While not mandatory, prior exposure to big data frameworks like Spark and experience working with Jupyter notebooks can make the transition to Databricks smoother.
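
As a rough self-check, you should be able to read snippets like the following without difficulty. The data, table names, and schema are invented for illustration.

    # Python: working with common data structures (illustrative data).
    orders = [{"id": 1, "amount": 42.0}, {"id": 2, "amount": None}]
    total = sum(o["amount"] for o in orders if o["amount"] is not None)
    print(f"Total: {total}")

    # SQL: filtering, joining, and aggregating (illustrative schema).
    query = """
        SELECT c.region, SUM(o.amount) AS total_sales
        FROM orders AS o
        JOIN customers AS c ON o.customer_id = c.id
        WHERE o.status = 'complete'
        GROUP BY c.region
    """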