Rock The JVM - Spark Optimization with Scala » Free Download

TutorialsPublished by : LeeAndro | Date : 13-09-2020 | Views : 407

Rock The JVM - Spark Optimization with Scala

Learn the ins and outs of Spark and get the best out your code.
In this course, we cut the weeds at the root. We dive deep into Spark and understand how it works under the hood. We'll see that we have incredible leverage, IF we write intelligent code, and you will do exactly that. You will learn 20+ techniques and optimization strats. Each of them individually can give at least a 2x perf boost for your jobs (some of them even 10x), and I show it on camera.

You'll understand Spark internals and how Spark works behind the scenes

You'll be able to predict in advance if a job will take a long

You'll diagnose performance problems in the Spark UI

You'll write smart joins with no shuffles

You'll organize your data intelligently so expensive operations are no longer a problem

You'll use RDD capabilities for bespoke, high-performance jobs

You'll leverage the JVM for performance-critical applications

You'll save hours of computation in this course alone (let alone in prod!)

Plus some extra perks:

You'll have access to the entire code I write on camera (~1400 LOC)

You'll be invited to our private Slack room where I'll share latest updates, discounts, talks, conferences, and recruitment opportunities

(soon) You'll have access to the takeaway slides

(soon) You'll be able to the videos for your offline view

Deep understanding of Spark internals so you can predict job performance

stage & task decomposition

reading query plans before jobs will run

reading DAGs while jobs are running

performance differences between the different Spark APIs

packaging and deploying a Spark app

configuring Spark in 3 different ways

DataFrame and Spark SQL Optimizations

understanding join mechanics and why they are expensive

writing broadcast joins, or what to do when you join a large and a small DataFrame

write pre-join optimizations: column pruning, pre-partitioning

bucketing for fast access

fixing data skews, "straggling" tasks and OOMs

Optimizing RDDs

using broadcast joins "manually"

cogrouping RDDs in multi-way joins

fixing data skews

writing optimizations that Spark doesn't generate for us

Optimizing key-value RDDs, as most useful transformations need them

using the different _byKey methods intelligently

reusing JVM objects for when performance is critical and even a few seconds count

using the powerful iterator-to-iterator pattern for arbitrary efficient processing

This course is for Scala and Spark programmers who need to improve the run of their jobs. If you've never done Scala or Spark, this course is not for you.

DOWNLOAD
uploadgig

https://uploadgig.com/file/download/22AA493a55b816e4/VQ8uWVsS__Spark_Opti.part1.rar
https://uploadgig.com/file/download/577b70edaf4274fa/VQ8uWVsS__Spark_Opti.part2.rar

rapidgator

https://rapidgator.net/file/7e9a87e04322912b932574780eadf3ee/VQ8uWVsS__Spark_Opti.part1.rar
https://rapidgator.net/file/358c90733ea33c3507edf1673ac67113/VQ8uWVsS__Spark_Opti.part2.rar

nitroflare

http://nitroflare.com/view/3F93D2C8064F049/VQ8uWVsS__Spark_Opti.part1.rar
http://nitroflare.com/view/E858BA91585635A/VQ8uWVsS__Spark_Opti.part2.rar

Information

Users of Guests are not allowed to comment this publication.

Login
Password

Privacy Policy Contact DMCA Information Top

Privacy Policy

Contact

DMCA Information

Top