Apache Spark is an open source distributed data processing engine written in Scala providing a unified API and distributed data sets to users. Use Cases for Apache Spark often are related to machine/deep learning, graph processing.

- Stackoverflow.com Wiki
2 articles, 3 books. Go to books ↓

How to Handle 10B Requests a Day


Here’s an overview of Spark, an open source framework for big data. With its exceptional performance characteristics, Spark is well-suited for use with machine learning systems. James McCaffrey shows how you can install and run it on a Windows machine.