Session Title: Massively scalable machine learning on Spark with Synapse
Speaker(s): Nellie Gustafsson & Mark Hamilton

Abstract: SynapseML (Previously MMLSpark) is an open-source library, aiming to simplify the creation of massively scalable machine learning pipelines. Composing tools from different ecosystems often requires considerable “glue” code, and many frameworks aren’t designed with thousand-machine elastic clusters in mind. SynapseML resolves this challenge by unifying several existing ML frameworks and new Microsoft algorithms in a single, scalable API that’s usable across Python, R, Scala, and Java. This session will provide an overview of these features, a demonstration, and resources to learn more.

500+ sessions are now available on-demand from Data Platform Summit 2022, 2021 & 2020 at no cost. Browse all sessions.

Stay tuned, more learning coming your way.