Node Similarity-Based Graph Convolution for Link Prediction in Biological Networks

dc.contributor.author Coskun, Mustafa
dc.contributor.author Koyuturk, Mehmet
dc.date.accessioned 2025-09-25T10:53:14Z
dc.date.available 2025-09-25T10:53:14Z
dc.date.issued 2021
dc.description Coskun, Mustafa/0000-0003-4805-1416 en_US
dc.description.abstract Background: Link prediction is an important and well-studied problem in network biology. Recently, graph representation learning methods, including Graph Convolutional Network (GCN)-based node embedding have drawn increasing attention in link prediction. Motivation: An important component of GCN-based network embedding is the convolution matrix, which is used to propagate features across the network. Existing algorithms use the degree-normalized adjacency matrix for this purpose, as this matrix is closely related to the graph Laplacian, capturing the spectral properties of the network. In parallel, it has been shown that GCNs with a single layer can generate more robust embeddings by reducing the number of parameters. Laplacian-based convolution is not well suited to single-layered GCNs, as it limits the propagation of information to immediate neighbors of a node. Results: Capitalizing on the rich literature on unsupervised link prediction, we propose using node similarity-based convolution matrices in GCNs to compute node embeddings for link prediction. We consider eight representative node-similarity measures (Common Neighbors, Jaccard Index, Adamic-Adar, Resource Allocation, Hub- Depressed Index, Hub-Promoted Index, Sorenson Index and Salton Index) for this purpose. We systematically compare the performance of the resulting algorithms against GCNs that use the degree-normalized adjacency matrix for convolution, as well as other link prediction algorithms. In our experiments, we use three-link prediction tasks involving biomedical networks: drug-disease association prediction, drug-drug interaction prediction and protein-protein interaction prediction. Our results show that node similarity-based convolution matrices significantly improve the link prediction performance of GCN-based embeddings. Conclusion: As sophisticated machine-learning frameworks are increasingly employed in biological applications, historically well-established methods can be useful in making a head-start. en_US
dc.description.sponsorship US National Institutes of Health [U01-CA198941]; National Cancer Institute en_US
dc.description.sponsorship This work was supported, in whole or in part, by US National Institutes of Health grants [U01-CA198941] from the National Cancer Institute. en_US
dc.description.sponsorship National Cancer Institute, NCI, (U01CA198941); National Cancer Institute, NCI
dc.identifier.doi 10.1093/bioinformatics/btab464
dc.identifier.issn 1367-4803
dc.identifier.issn 1367-4811
dc.identifier.scopus 2-s2.0-85122184713
dc.identifier.uri https://doi.org/10.1093/bioinformatics/btab464
dc.identifier.uri https://hdl.handle.net/20.500.12573/4286
dc.language.iso en en_US
dc.publisher Oxford Univ Press en_US
dc.relation.ispartof Bioinformatics en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.title Node Similarity-Based Graph Convolution for Link Prediction in Biological Networks en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Coskun, Mustafa/0000-0003-4805-1416
gdc.author.scopusid 57189031203
gdc.author.scopusid 8897820600
gdc.author.wosid Coskun, Mustafa/Kod-5642-2024
gdc.bip.impulseclass C3
gdc.bip.influenceclass C4
gdc.bip.popularityclass C3
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Coskun, Mustafa] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkey; [Coskun, Mustafa] Hakkari Univ, TR-38080 Kayseri, Turkey; [Koyuturk, Mehmet] Case Western Reserve Univ, Dept Comp & Data Sci, Cleveland, OH 44106 USA; [Koyuturk, Mehmet] Case Western Reserve Univ, Ctr Prote & Bioinformat, Cleveland, OH 44106 USA en_US
gdc.description.endpage 4508 en_US
gdc.description.issue 23 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.startpage 4501 en_US
gdc.description.volume 37 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q1
gdc.identifier.openalex W3174753721
gdc.identifier.pmid 34152393
gdc.identifier.wos WOS:000733374500027
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.downloads 68
gdc.oaire.impulse 41.0
gdc.oaire.influence 5.0881352E-9
gdc.oaire.isgreen true
gdc.oaire.keywords PROTEIN INTERACTION NETWORKS
gdc.oaire.keywords Machine Learning
gdc.oaire.keywords Libraries
gdc.oaire.keywords ALGORITHM
gdc.oaire.keywords INTEGRATION
gdc.oaire.keywords Algorithms
gdc.oaire.keywords Gene Library
gdc.oaire.popularity 4.1514262E-8
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0206 medical engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.views 92
gdc.openalex.collaboration International
gdc.openalex.fwci 4.0222
gdc.openalex.normalizedpercentile 0.95
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 45
gdc.plumx.crossrefcites 18
gdc.plumx.mendeley 58
gdc.plumx.pubmedcites 16
gdc.plumx.scopuscites 60
gdc.scopus.citedcount 60
gdc.wos.citedcount 49
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
btab464.pdf
Size:
429.44 KB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: