2 Clustering techniques
In this second chapter, we are going to cover the following topics:
- Clustering techniques and salient use cases in the industry
- Various clustering algorithms available
- K-means, hierarchical clustering, and DBSCAN clustering
- Implementation of algorithms in Python
- Case study on cluster analysis
“Simplicity is the ultimate sophistication” – Leonardo da Vinci
Nature loves simplicity, and teaches us to follow the same path. Most of the time, our decisions are simple choices. Simple solutions are easier to comprehend, less time consuming, and painless to maintain and ponder over. The machine learning world is no different. An elegant machine learning solution is not one which is the most complicated algorithm available, but one which solves the business problem. A robust machine learning solution is easy enough to readily decipher and pragmatic enough to implement. Clustering solutions are generally easier to be understood.
In the previous chapter, we defined unsupervised learning and discussed the various unsupervised algorithms available. We will cover each of those algorithms as we work through this book; in this second chapter we are going to focus in on the first of these: Clustering algorithms.