Keep up to date with every new upload!

Join free & follow Channel 9
Share
  • 1 year ago
Spark Performance Tuning - Part 3 | Data Exposed

Spark Performance Tuning - Part 3 | Data Exposed

This week's Data Exposed show welcomes back Maxim Lukiyanov to talk more about Spark performance tuning with Spark 2.x. Maxim is a Senior PM on the big data HDInsight team and is in the studio today to present Part 3 of his 4-part series.Topics in today's video:[00:45] - Recap and overview of the first two videos[03:40] - Join Types (SortMerge and Broadcast)[09:30] - Cost-based Optimizer[21:35] - Outliers and Data SkewSpark 2.2 rc4 on Azure HDInsight: Script action https://github.com/hdinsight/script-action/tree/master/install-spark2-2 

Comments