Highlight
Azure Databricks is fast, easy to use and scalable big data collaboration platform. Based on Apache Spark brings high performance and benefits of spark without need of having high technical knowledge. You just write Python/Scala scripts and you are ready to go.
Intro
In this video I will cover basics of Databricks and show common Blob Storage JSON to Blob Storage CSV transformation scenario.
Code samples: https://github.com/MarczakIO/azure4everyone-samples/tree/master/azure-databricks-introduction
Agenda
Today we will cover
- Azure Databricks and Databricks platform Overview
- Key Features of Databricks
- Demo of Blob ingestion using Python and Spark SQL script and data visualisation
- Demo of Blob to Blob tranfromation using Scala and Spark SQL
Video
Final thoughts
Azure Databricks is a one of those hot topics right now. This introduction is seond in the series of data transformation in Azure. Stay tuned to see more.
Source Code