DP-203T00 - Data Engineering on Microsoft Azure
Course Description
In this course, the student will learn how to implement and manage data engineering workloads on Microsoft Azure, using Azure services such as Azure Synapse Analytics, Azure Data Lake Storage Gen2, Azure Stream Analytics, Azure Databricks, and others. The course focuses on common data engineering tasks such as orchestrating data transfer and transformation pipelines, working with data files in a data lake, creating and loading relational data warehouses, capturing and aggregating streams of real-time data, and tracking data assets and lineage.
Duration: 4 days
Who should attend
The primary audience for this course is data professionals, data architects, and business intelligence professionals who want to learn about data engineering and building analytical solutions using data platform technologies that exist on Microsoft Azure. The secondary audience for this course includes data analysts and data scientists who work with analytical solutions built on Microsoft Azure.
Pre-requisites
Successful students start this course with knowledge of cloud computing and core data concepts and professional experience with data solutions. Specifically completing: AZ-900 - Azure Fundamentals DP-900 - Microsoft Azure Data Fundamentals
Skills gained
Explore compute and storage options for data engineering workloads in Azure Run interactive queries using serverless SQL pools Perform data Exploration and Transformation in Azure Databricks Explore, transform, and load data into the Data Warehouse using Apache Spark Ingest and load Data into the Data Warehouse Transform Data with Azure Data Factory or Azure Synapse Pipelines Integrate Data from Notebooks with Azure Data Factory or Azure Synapse Pipelines Support Hybrid Transactional Analytical Processing (HTAP) with Azure Synapse Link Perform end-to-end security with Azure Synapse Analytics Perform real-time Stream Processing with Stream Analytics Create a Stream Processing Solution with Event Hubs and Azure Databricks
Summary of Modules covered
Introduction to data engineering on Azure Introduction to Azure Data Lake Storage Gen2 Introduction to Azure Synapse Analytics Use Azure Synapse serverless SQL pool to query files in a data lake Use Azure Synapse serverless SQL pools to transform data in a data lake Create a lake database in Azure Synapse Analytics Analyze data with Apache Spark in Azure Synapse Analytics Transform data with Spark in Azure Synapse Analytics Use Delta Lake in Azure Synapse Analytics Analyze data in a relational data warehouse Load data into a relational data warehouse Build a data pipeline in Azure Synapse Analytics Use Spark Notebooks in an Azure Synapse Pipeline Plan hybrid transactional and analytical processing using Azure Synapse Analytics Implement Azure Synapse Link with Azure Cosmos DB Implement Azure Synapse Link for SQL Get started with Azure Stream Analytics Ingest streaming data using Azure Stream Analytics and Azure Synapse Analytics Visualize real-time data with Azure Stream Analytics and Power BI Introduction to Microsoft Purview Integrate Microsoft Purview and Azure Synapse Analytics Explore Azure Databricks Use Apache Spark in Azure Databricks Run Azure Databricks Notebooks with Azure Data Factory