2 Clustering techniques
This chapter covers
- Clustering techniques and salient use cases in the industry
- Simple k-means, hierarchical, and density-based spatial clustering algorithms
- Implementation of algorithms in Python
- A case study on cluster analysis
Simplicity is the ultimate sophistication.
Nature loves simplicity and teaches us to follow the same path. Most of the time, our decisions are simple choices. Simple solutions are easier to comprehend, less time-consuming, and painless to maintain and reason about. The machine learning world is no different. An elegant machine learning solution is not the one that uses the most complicated algorithm available but the one that solves the business problem. A robust machine learning solution is simple enough to decipher readily and pragmatic enough to implement. Clustering solutions generally fit this description: they are among the easier machine learning solutions to understand.
In the previous chapter, we defined unsupervised learning and discussed the various unsupervised algorithms available. We will cover each of those algorithms as we work through this book; in this second chapter, we focus on the first of these: clustering algorithms.