Session Title: Best Practices for Building a Data Lake with Azure Databricks
Speaker(s): Oskari Heikkinen
Abstract: Databricks is a Unified Analytics Platform that makes it easier than ever to do big data analytics in the cloud. However, there is a lot you need to know and take into account before diving headfirst into a Data Lake. This session is intended for architects and developers who are looking to build a massive-scale data storage and processing solution. I will go through the best practices for building one. In addition, I will demonstrate how to unify real-time and batch processing using Azure Databricks.
500+ sessions are now available on-demand from Data Platform Summit 2022, 2021 & 2020 at no cost. Browse all sessions.