Node Similarity-Based Graph Convolution for Link Prediction in Biological Networks

Loading...
Publication Logo

Date

2021

Journal Title

Journal ISSN

Volume Title

Publisher

Oxford Univ Press

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

68

OpenAIRE Views

92

Publicly Funded

No
Impulse
Top 1%
Influence
Top 10%
Popularity
Top 1%

Research Projects

Journal Issue

Abstract

Background: Link prediction is an important and well-studied problem in network biology. Recently, graph representation learning methods, including Graph Convolutional Network (GCN)-based node embedding have drawn increasing attention in link prediction. Motivation: An important component of GCN-based network embedding is the convolution matrix, which is used to propagate features across the network. Existing algorithms use the degree-normalized adjacency matrix for this purpose, as this matrix is closely related to the graph Laplacian, capturing the spectral properties of the network. In parallel, it has been shown that GCNs with a single layer can generate more robust embeddings by reducing the number of parameters. Laplacian-based convolution is not well suited to single-layered GCNs, as it limits the propagation of information to immediate neighbors of a node. Results: Capitalizing on the rich literature on unsupervised link prediction, we propose using node similarity-based convolution matrices in GCNs to compute node embeddings for link prediction. We consider eight representative node-similarity measures (Common Neighbors, Jaccard Index, Adamic-Adar, Resource Allocation, Hub- Depressed Index, Hub-Promoted Index, Sorenson Index and Salton Index) for this purpose. We systematically compare the performance of the resulting algorithms against GCNs that use the degree-normalized adjacency matrix for convolution, as well as other link prediction algorithms. In our experiments, we use three-link prediction tasks involving biomedical networks: drug-disease association prediction, drug-drug interaction prediction and protein-protein interaction prediction. Our results show that node similarity-based convolution matrices significantly improve the link prediction performance of GCN-based embeddings. Conclusion: As sophisticated machine-learning frameworks are increasingly employed in biological applications, historically well-established methods can be useful in making a head-start.

Description

Coskun, Mustafa/0000-0003-4805-1416

Keywords

PROTEIN INTERACTION NETWORKS, Machine Learning, Libraries, ALGORITHM, INTEGRATION, Algorithms, Gene Library

Fields of Science

0301 basic medicine, 03 medical and health sciences, 0206 medical engineering, 02 engineering and technology

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
45

Source

Bioinformatics

Volume

37

Issue

23

Start Page

4501

End Page

4508
PlumX Metrics
Citations

CrossRef : 18

Scopus : 60

PubMed : 16

Captures

Mendeley Readers : 58

SCOPUS™ Citations

60

checked on Apr 14, 2026

Web of Science™ Citations

49

checked on Apr 14, 2026

Page Views

1

checked on Apr 14, 2026

Downloads

6

checked on Apr 14, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
4.0222
Altmetrics Badge

Sustainable Development Goals

SDG data is not available