home > training > DP-3011: Implementing a data analytics solution with Azure Databricks

Implementing a data analytics solution with Azure Databricks

MOC: DP-3011

$675

  • 1 Day
  • Replay™ Class Recordings Included
  • Included in the "all-you-can-eat" Microsoft Live Training Subscription Learn More

Interface Gold™Gold Benefits: Retake this course for one year. Replay™ class recordings included. Money-back guarantee. Price Match available. MS SubscriptionIncluded in the "all-you-can-eat" Microsoft Live Training Subscription.

Dates Available
Class Time
Guaranteed
to Run
Attend
Live Online
Rewatch with
Replay™
 
Jun 11
Replay™ AvailableThis class date includes Interface Replay™ class recordings, available for online viewing 1 hour after each class day ends.
7AM - 3PM (PHX)
This class date is Guaranteed to Run and will not change.
Guaranteed To Run
Attend Live Online
Rewatch with Replay™
Jul 22
Replay™ AvailableThis class date includes Interface Replay™ class recordings, available for online viewing 1 hour after each class day ends.
7AM - 3PM (PHX)
This class date is Guaranteed to Run and will not change.
Guaranteed To Run
Attend Live Online
Rewatch with Replay™
Sep 9
Replay™ AvailableThis class date includes Interface Replay™ class recordings, available for online viewing 1 hour after each class day ends.
7AM - 3PM (PHX)
This class date is Guaranteed to Run and will not change.
Guaranteed To Run
Attend Live Online
Rewatch with Replay™
Register today - no risk!  No cancellation fees.  Full money back guarantee!
x
Course:
  • This field is for validation purposes and should be left unchanged.

Course Description

This course is designed to be delivered in one full day.

Learn how to harness the power of Apache Spark and powerful clusters running on the Azure Databricks platform to run large data engineering workloads in the cloud.

Outline

Module:  Explore Azure Databricks

Azure Databricks is a cloud service that provides a scalable platform for data analytics using Apache Spark..

Lessons

  • Get started with Azure Databricks
  • Identify Azure Databricks workloads
  • Understand key concepts

Exercise

  • Explore Azure Databricks

 

Module:  Use Apache Spark in Azure Databricks

Azure Databricks is built on Apache Spark and enables data engineers and analysts to run Spark jobs to transform, analyze and visualize data at scale.

Lessons

  • Get to know Spark
  • Create a Spark cluster
  • Use Spark in notebooks
  • Use Spark to work with data files
  • Visualize data

Exercise

  • Use Spark in Azure Databricks

 

Module:  Use Delta Lake in Azure Databricks

Delta Lake is an open source relational storage area for Spark that you can use to implement a data lakehouse architecture in Azure Databricks.

Lessons

  • Get Started with Delta Lake
  • Create Delta Lake tables
  • Create and query catalog tables
  • Use Delta Lake for streaming data

Exercise

  • Use Delta Lake in Azure Databricks

 

Module:  Use SQL Warehouses in Azure Databricks

Azure Databricks provides SQL Warehouses that enable data analysts to work with data using familiar relational SQL queries.

Lessons

  • Get started with SQL Warehouses
  • Create databases and tables
  • Create queries and dashboards

Exercise

  • Use a SQL Warehouse in Azure Databricks

 

Module:  Run Azure Databricks Notebooks with Azure Data Factory

Using pipelines in Azure Data Factory to run notebooks in Azure Databricks enables you to automate data engineering processes at cloud scale.

Lessons

  • Understand Azure Databricks notebooks and pipelines
  • Create a linked service for Azure Databricks
  • Use a Notebook activity in a pipeline
  • Use parameters in a notebook

Exercise

  • Run an Azure Databricks Notebook with Azure Data Factory
Audience

You should be an Azure data engineer with subject matter expertise in designing, implementing, managing, and maintaining Microsoft Azure Virtual Desktop experiences and remote apps for any device.

Prerequisites

None