Using Spark from R for performance with arbitrary code – Part 3 – Using R to construct SQL queries and let Spark execute them
Introduction
In the previous part of this series, we looked at writing R functions that can be executed directly by Spark without serialization overhead with a focus on writing functions as combinations of dplyr verbs and investigated how the SQL is generated and Spark plans created.
In this third part, ...
