Spark in Action, Second Edition MEAP V16 cover


Chère lectrice, cher lecteur,

Merci d’avoir acheté et de soutenir Spark in Action, seconde édition. J’espère que vous prendrez plaisir à lire ce livre, même s’il est écrit en anglais.


Dear reader,

Thank you so much for purchasing the MEAP (Manning Early Access Program) for Spark in Action, second edition. If you just bought it, you’re a winner: this is almost the final book, edited, remastered, adapted to Spark 3.0.

Apache Spark is a real game changer in terms of massive, parallel data processing. It may be intimidating at first, especially if you have thin or no Big Data experience or if you don’t know Scala…

I wrote my first book 25 years ago, in college (as a student): we saw that something was missing if you wanted to learn C and transition to C++. Today, we are in a different world. You will find a lot of information on Spark, but if your background is as a Java software engineer or a (relational) data engineer, you might feel overwhelmed by information that is not relevant or assume you must master other fields of data engineering like Hadoop.

If you want to do Big Data, you know Java and relational databases, and you don’t see the point in learning Scala or Hadoop (like me), then this book is your perfect companion on this journey. If you already know about Scala or Hadoop, this book is ideal for you as well; I don’t teach those skills.