Tutorials | Published by: 0nelove | Date: 30-10-2020 | Views: 61
Taming Big Data with Apache Spark and Python - Hands On! (Updated 9/2020)
Duration: 6h 54m | Video: .MP4 1280x720, 30 fps(r) | Audio: AAC, 44100 Hz, 2ch | Size: 3.6 GB
Genre: eLearning | Language: English


Dive right in with 20+ hands-on examples of analyzing large data sets with Apache Spark, on your desktop or on Hadoop!

What you'll learn
Use DataFrames and Structured Streaming in Spark 3
Frame big data analysis problems as Spark problems
Use Amazon's Elastic MapReduce service to run your job on a cluster with Hadoop YARN
Install and run Apache Spark on a desktop computer or on a cluster
Use Spark's Resilient Distributed Datasets to process and analyze large data sets across many CPUs
Implement iterative algorithms such as breadth-first search using Spark
Use the MLlib machine learning library to answer common data mining questions
Understand how Spark SQL lets you work with structured data
Understand how Spark Streaming lets you process continuous streams of data in real time
Tune and troubleshoot large jobs running on a cluster
Share information between nodes on a Spark cluster using broadcast variables and accumulators
Understand how the GraphX library helps with network analysis problems
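
To give a feel for what "iterative algorithms such as breadth-first search" means in practice, here is a minimal sketch in plain Python (no Spark required). It is not the course's code; the function names, the sample graph, and the tuple layout are all made up for illustration. Each pass has a "map" step that emits candidate distances and a "reduce by key" step that keeps the smallest one per node. In a real PySpark job those steps would typically be written with the RDD methods flatMap and reduceByKey, and the loop would run once per pass over the cluster.

```python
INF = float("inf")


def bfs_pass(nodes, frontier):
    """Run one BFS iteration and return the next frontier.

    nodes maps node id -> (neighbour list, best known distance).
    """
    # "Map" step: every frontier node proposes distance + 1 to its neighbours.
    emitted = []
    for node in frontier:
        neighbours, dist = nodes[node]
        for neighbour in neighbours:
            emitted.append((neighbour, dist + 1))

    # "Reduce by key" step: keep the smallest proposed distance per node.
    best = {}
    for neighbour, dist in emitted:
        best[neighbour] = min(dist, best.get(neighbour, INF))

    # Merge step: nodes whose distance improved form the next frontier.
    next_frontier = []
    for node, dist in best.items():
        neighbours, old_dist = nodes[node]
        if dist < old_dist:
            nodes[node] = (neighbours, dist)
            next_frontier.append(node)
    return next_frontier


def bfs_distances(graph, start):
    """Repeat passes until no distance changes (hypothetical helper)."""
    nodes = {node: (list(adj), INF) for node, adj in graph.items()}
    nodes[start] = (nodes[start][0], 0)
    frontier = [start]
    while frontier:
        frontier = bfs_pass(nodes, frontier)
    return {node: dist for node, (_, dist) in nodes.items()}


# Hypothetical sample graph: F points at A but nothing points at F,
# so F stays unreachable from A.
graph = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D"],
    "D": ["E"],
    "E": [],
    "F": ["A"],
}
distances = bfs_distances(graph, "A")
print(distances)
```

The point of the shape is that no pass needs to see the whole graph at once: each step only looks at one node and its neighbours, which is what lets a cluster split the work.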

Requirements
Access to a personal computer. This course uses Windows, but the sample code will work fine on Linux as well.
Some prior programming or scripting experience. Python experience will help a lot, but you can pick it up as we go.

Description
New! Updated for Spark 3, more hands-on exercises, and a stronger focus on DataFrames and Structured Streaming.

"Big data" analysis is a hot and highly valuable skill, and this course will teach you the hottest technology in big data: Apache Spark. Employers including Amazon, eBay, NASA JPL, and Yahoo all use Spark to quickly extract meaning from massive data sets across a fault-tolerant Hadoop cluster. You'll learn those same techniques, using your own Windows system right at home. It's easier than you might think.

Learn and master the art of framing data analysis problems as Spark problems through over 20 hands-on examples, and then scale them up to run on cloud computing services in this course. You'll be learning from an ex-engineer and senior manager from Amazon and IMDb.

Learn the concepts of Spark's DataFrames and Resilient Distributed Datasets

Develop and run Spark jobs quickly using Python

Translate complex analysis problems into iterative or multi-stage Spark scripts

Scale up to larger data sets using Amazon's Elastic MapReduce service

Understand how Hadoop YARN distributes Spark across computing clusters

Learn about other Spark technologies, like Spark SQL, Spark Streaming, and GraphX

By the end of this course, you'll be running code that analyzes gigabytes worth of information, in the cloud, in a matter of minutes.

This course uses the familiar Python programming language; if you'd rather use

Download
http://nitroflare.com/view/C512D26E72C2167/taming-big-data-with-apache-spark-hands-on.part1.rar
http://nitroflare.com/view/695E4C3147A6436/taming-big-data-with-apache-spark-hands-on.part2.rar
http://nitroflare.com/view/BD2ED4B8C7AFAAA/taming-big-data-with-apache-spark-hands-on.part3.rar
http://nitroflare.com/view/CAE90723DDC781E/taming-big-data-with-apache-spark-hands-on.part4.rar

or
http://rapidgator.net/file/b6b951b4ce6f211c32c667d1fa20bf50/taming-big-data-with-apache-spark-hands-on.part1.rar.html
http://rapidgator.net/file/9dfc0b7d798bc534f63e05d58043dd5b/taming-big-data-with-apache-spark-hands-on.part2.rar.html
http://rapidgator.net/file/4780338a7a7a3f25b545b9c5473cd6b8/taming-big-data-with-apache-spark-hands-on.part3.rar.html
http://rapidgator.net/file/6ff7bc0b8aa47248727078fe5787b379/taming-big-data-with-apache-spark-hands-on.part4.rar.html