Chapter 12. Case studies

 

This chapter covers

  • The New York Times
  • China Mobile
  • StumbleUpon
  • IBM

We’ve been through many exercises and sample programs by now. The next step is to integrate what you’ve learned about Hadoop into your own real-world applications. To help you in that transition, this chapter provides examples of how other enterprises have used Hadoop as part of the solution to their data processing problems.

The case studies serve two purposes. One is to step back and see the broader systems that utilize Hadoop as a critical part. You’ll discover complementary tools, such as Cascading, HBase, and Jaql. The second purpose is to demonstrate the variety of businesses that have used Hadoop to solve their operational challenges. Our case studies span industries, including media (the New York Times), telecom (China Mobile), internet (StumbleUpon), and enterprise software (IBM).

12.1. Converting 11 million image documents from the New York Times archive

12.2. Mining data at China Mobile

12.3. Recommending the best websites at StumbleUpon

12.4. Building analytics for enterprise search—IBM’s Project ES2