Source Code

Highlight

Azure Databricks is fast, easy to use and scalable big data collaboration platform. Based on Apache Spark brings high performance and benefits of spark without need of having high technical knowledge. You just write Python/Scala scripts and you are ready to go.

Intro

In this video I will cover basics of Databricks and show common Blob Storage JSON to Blob Storage CSV transformation scenario.

Code samples: https://github.com/MarczakIO/azure4everyone-samples/tree/master/azure-databricks-introduction

Agenda

Today we will cover

  1. Azure Databricks and Databricks platform Overview
  2. Key Features of Databricks
  3. Demo of Blob ingestion using Python and Spark SQL script and data visualisation
  4. Demo of Blob to Blob tranfromation using Scala and Spark SQL

Video

Final thoughts

Azure Databricks is a one of those hot topics right now. This introduction is seond in the series of data transformation in Azure. Stay tuned to see more.

Source Code

Adam Marczak

Programmer, architect, trainer, blogger, evangelist are just a few of many titles. What I really am, is a passionate technology enthusiast. I take great pleasure in learning new technologies and finding ways in which it can aid people every day. My latest passion is running an Azure 4 Everyone YouTube channel, where I show that Azure really is for everyone!

Did you enjoy the article?

Share it!

More tagged posts