Mastering big data requires an aptitude at every step of information processing. Post-processing, one of the most important steps, is where you find Apache Spark frequently employed. Spark Succinctly, by Marko Švaljek, addresses Spark’s use in the ultimate step in handling big data. This e-book, the third installment in Švaljek’s IoT series, teaches the basics of using Spark and explores how to work with RDDs, Scala and Python tasks, JSON files, and Cassandra.

Marko Švaljek

Marko Švaljek works as a software developer, and in his ten years of experience he has worked for the leading financial and telecom companies in southeast Europe with emphasis on the Internet of Things, mobile banking, and e-commerce solutions. The main focus of his interest in the past couple of years has been the Internet of Things. Until now, Marko had only authored two books, Cassandra Succinctly and Arduino Succinctly. In the context of the Internet of Things, the first book deals with how to store persistent data generated by various devices, and the second one focuses on how to create the sensors that actually generate various readings in the first place.

  1. Introduction
  2. Installing Spark
  3. Hello Spark
  4. Spark Internals
  5. Data Input and Output with Spark
  6. Conclusion