Advent of 2021, Day 25 – Spark literature, documentation, courses and books

[This article was first published on R – TomazTsql, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Series of Apache Spark posts:

To wrap up this year’s Advent of Spark 2021 – series of blogposts on Spark – it is essential to look at the list of additional learning resources for you to continue with this journey. Let’s divide this list not by type of the resource (book, on-line documentation, on-line courses, articles, Youtube channels, Discord channels, and others) but rather divide them by language flavour. Scala/Spark, R, and Python.

Spark – Scala

  • Spark Official Documentation – link
  • Spark: The definitive Guide – link
  • Stream processing with Apache Spark – link
  • Data Engineering with Apache Spark, Delta Lake, and Lakehouse – link
  • Programming Scala – 3rd edition – link
  • Scala & Spark – Master Big Data with Scala and Spark – link
  • Getting started with Apache Spark on Databricks – link to course
  • Apache Spark – link

R Language

  • Mastering Spark with R – link
  • SparkR documentation – link
  • Sparklyr: R interface for Apache Spark – link
  • R and Spark: How to Analyze Data Using RStudio’s Sparklyr and H2O’s Rsparkling Packages – link
  • Sparklyr in SQL Server Big Data cluster – link
  • Big data in R – Intro to Sparklyr – link

Python

  • Spark with PySpark – link
  • Spark and Python for Big Data with PySpark – link to course
  • PySpark intro – link
  • Apache Spark 3 for Data Engineering and Analytics with Python – link

Wrapping up this year’s series of Advent of Spark! Merry Christmas and Happy new Year 2022!

Compete set of code, documents, notebooks, and all of the materials will be available at the Github repository: https://github.com/tomaztk/Spark-for-data-engineers

Happy Spark Advent of 2021! ?

To leave a comment for the author, please follow the link and comment on their blog: R – TomazTsql.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)