
2 Graph Embeddings

 

This chapter covers

  • Graph embeddings and their importance
  • Creating node embeddings using non-GNN and GNN methods
  • Comparing node embeddings on a semi-supervised problem
  • A deeper dive into embedding methods

In this chapter, we explore graph embeddings, an essential tool in graph-based machine learning. Graph embeddings transform the intricate structure of a graph, whether the entire graph, individual nodes, or edges, into a more manageable, lower-dimensional space. We do this to compress a complex dataset into a form that's easier to work with without losing its inherent patterns and relationships; this compact representation is the information to which we then apply a GNN or another machine learning method.
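To make this concrete, the sketch below shows what a node embedding ultimately looks like in code: each node is mapped to a short numeric vector, and vectors that sit close together indicate nodes that play similar roles in the graph. The node names, vector values, and helper function here are invented purely for illustration and are not taken from the chapter's later listings.

```python
import numpy as np

# Toy embedding table: each node maps to a 3-dimensional vector.
# Node names and values are invented for illustration only.
embedding = {
    "alice":   np.array([0.12, -0.40, 0.33]),
    "bob":     np.array([0.10, -0.38, 0.35]),   # close to "alice": a similar position in the graph
    "charlie": np.array([-0.50, 0.22, -0.10]),
}

def cosine_similarity(u, v):
    """Cosine similarity between the embedding vectors of nodes u and v."""
    a, b = embedding[u], embedding[v]
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(cosine_similarity("alice", "bob"))      # high: the vectors point in the same direction
print(cosine_similarity("alice", "charlie"))  # low: structurally dissimilar nodes
```

Downstream tasks such as clustering or classification then operate on these vectors rather than on the raw graph structure.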

Graphs, as we have learned, encapsulate relationships and interactions within a network, whether that network is a social network, a biological network, or any other system where entities are interconnected. Embeddings capture these real-world relationships in a compact form, facilitating tasks like visualization, clustering, and predictive modeling.

There are numerous strategies for deriving these embeddings, each with its own approach and applications. They range from classical graph algorithms that leverage the network's topology, to linear algebra techniques that decompose matrices representing the graph, to more advanced methods such as GNNs. GNNs stand out because they integrate the embedding process directly into the learning algorithm itself.
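As one concrete example of the linear-algebra family, the following sketch computes a spectral embedding: it decomposes the graph's normalized Laplacian and uses the eigenvectors associated with the smallest non-trivial eigenvalues as two-dimensional node coordinates. The choice of networkx and the karate club graph is an illustrative assumption, not code from the sections that follow.

```python
import networkx as nx
import numpy as np

# Illustrative graph; any networkx graph would do.
G = nx.karate_club_graph()

# Matrix representation of the graph: the normalized Laplacian.
L = nx.normalized_laplacian_matrix(G).toarray()

# Decompose the matrix. eigh returns eigenvalues in ascending order,
# so columns 1 and 2 correspond to the smallest non-trivial eigenvalues.
eigenvalues, eigenvectors = np.linalg.eigh(L)
spectral_embedding = eigenvectors[:, 1:3]   # one 2-D coordinate per node

print(spectral_embedding.shape)             # (34, 2) for the karate club graph
```

Methods like Node2Vec and GNNs, covered next, arrive at node vectors by very different routes, but the end product is the same kind of per-node coordinate matrix.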

2.1 Creating embeddings with Node2Vec

2.1.1 Loading data, setting parameters, and creating embeddings

2.1.2 Demystifying embeddings

2.1.3 Transforming and visualizing the embeddings

2.1.4 Beyond visualization: Applications and considerations of Node2Vec embeddings

2.2 Creating embeddings with a GNN

2.2.1 Constructing the embeddings

2.2.2 GNN vs. Node2Vec embeddings

2.2.3 Adaptability to new graphs

2.2.4 Enhanced feature integration

2.2.5 Task-specific optimization

2.3 Using node embeddings

2.4 Embeddings in an end-to-end model

2.5 Under the hood

2.5.1 Representation and embeddings

2.5.2 Transductive and inductive methods

2.5.3 Node2Vec: Random walks across graphs

2.5.4 Message passing as deep learning

2.6 Summary

2.7 References and further reading
