Appendix C. Resources: where to go for more

 

C.1. Spark

The number of books on Spark finally started growing in 2015—six years after Spark development first began. But Spark development is still moving fast, and the best resources are online.

Apache mailing lists

As with any open source project, especially one from Apache, the mailing lists are the best sources of information, and subscribing to them—and asking questions when you can’t find answers on the web—should be considered the minimum you have to do. The mailing lists are known as user@spark.apache.org and dev@spark.apache.org. You can subscribe to them from https://spark.apache.org/community.html.

Databricks forums

Databricks is the commercialization of Spark that offers a commercial product of a Spark notebook in the cloud. But the forums on www.databricks.com aren’t limited to only the commercial product. As a large percentage of the commits to Apache Spark come from Databricks, the Databricks forums also contain a lot of general-purpose information about Spark, including future plans that pertain to the open source Apache Spark as well as the commercial Databricks product.

Conference and meetup videos

There are four major sources of Spark videos. None should be overlooked; they are all outstanding. Spark is moving fast, and watching these videos on your smartphone while on the treadmill or as a bedtime story is sometimes the only way to keep up:

1.  Spark Summit (West, East, and Europe)

2.  AMPLab AMPCamp

C.2. Scala

C.3. Graphs

sitemap