chapter two
2 Graph Data Engineering
This chapter covers:
- The main challenges related to Big Data as input to machine learning
- How to handle Big Data analysis with graph models and graph databases
- The shape and the features of a Graph Database
The previous chapter highlights the key role played by data in a machine learning project. Training the learning algorithm on a larger quantity of high quality data increases the accuracy of the model more than fine tuning or replacing the algorithm itself. Greg Linden, who invented for Amazon the now widely used item-to-item collaborative filtering algorithm, in an interview about big data replied:
“Big data is why Amazon’s recommendations work so well. Big data is what tunes search and helps us find what we need. Big data is what makes web and mobile intelligent.” [Peadar Coyle, 2016]