Apache Spark is an open source distributed data processing library for large-scale in-memory data analytics computing.

- Stackoverflow.com Wiki
2 articles, 3 books. Go to books ↓

How to Handle 10B Requests a Day

Here’s an overview of Spark, an open source framework for big data. With its exceptional performance characteristics, Spark is well-suited for use with machine learning systems. James McCaffrey shows how you can install and run it on a Windows machine.