About the Book – Accelerating Apache Spark 3

Apache Spark is a powerful execution engine for large-scale parallel data processing across a cluster of machines. It enables rapid application development and high performance.

In this ebook, learn how Spark 3 innovations make it possible to use the massively parallel architecture of GPUs to further accelerate Spark data processing.

Fill out the form below to download the ebook and learn about the following:

  • The data processing evolution, from Hadoop to GPUs and the NVIDIA RAPIDS™ library
  • Spark: what it is, what it does, and why it matters
  • GPU acceleration in Spark
  • DataFrames and Spark SQL
  • A Spark regression example with a random forest model
  • An example of an end-to-end, GPU-accelerated machine learning workflow with XGBoost