Data Engineering on Microsoft Azure: Databrick Processing
When working with big data there needs to be a mechanism to process and transform this data quickly and efficiently. Azure Databricks is a service that provides the latest version of Apache Spark that provides functionality processing data from Azure Storage.
In this course you will learn about the types of processing that can be performed with Azure Databricks such as stream batch image and parallel processing. Next you;ll learn how to create an Azure Databricks workspace using an Apache Spark cluster run jobs in the Azure Databricks Workspace jobs using a service principal and query data in SQL server using an Azure Databricks notebook. Next you;ll learn how to retrieve data from an Azure Blob Storage using Azure Databricks and the Azure Key Vault implement a Cosmos DB service endpoint for Azure Databricks and extract transform and load data using Azure Databricks. Finally you;ll learn how to stream data into Azure Databricks by using Event Hubs and perform sentiment analysis for steam data by making use of Azure Databricks.
This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.