Copyright
Brief Table of Contents
Table of Contents
Foreword
Preface
Acknowledgments
About this Book
About the Cover Illustration
Chapter 1. Introducing Storm
1.1. What is big data?
1.1.1. The four Vs of big data
1.1.2. Big data tools
1.2. How Storm fits into the big data picture
1.2.1. Storm vs. the usual suspects
1.3. Why you’d want to use Storm
1.4. Summary
Chapter 2. Core Storm concepts
2.1. Problem definition: GitHub commit count dashboard
2.1.1. Data: starting and ending points
2.1.2. Breaking down the problem
2.2. Basic Storm concepts
2.2.1. Topology
2.2.2. Tuple
2.2.3. Stream
2.2.4. Spout
2.2.5. Bolt
2.2.6. Stream grouping
2.3. Implementing a GitHub commit count dashboard in Storm
2.3.1. Setting up a Storm project
2.3.2. Implementing the spout
2.3.3. Implementing the bolts
2.3.4. Wiring everything together to form the topology
2.4. Summary
Chapter 3. Topology design
3.1. Approaching topology design
3.2. Problem definition: a social heat map
3.2.1. Formation of a conceptual solution
3.3. Precepts for mapping the solution to Storm
3.3.1. Consider the requirements imposed by the data stream
3.3.2. Represent data points as tuples
3.3.3. Steps for determining the topology composition
3.4. Initial implementation of the design
3.4.1. Spout: read data from a source
3.4.2. Bolt: connect to an external service
3.4.3. Bolt: collect data in memory
3.4.4. Bolt: persist to a data store
3.4.5. Defining stream groupings between the components