Dimensionality Reduction for Protein Secondary Structure and Solvent Accesibility Prediction

dc.contributor.author Aydin, Zafer
dc.contributor.author Kaynar, Oguz
dc.contributor.author Gormez, Yasin
dc.date.accessioned 2025-09-25T10:44:47Z
dc.date.available 2025-09-25T10:44:47Z
dc.date.issued 2018
dc.description Gormez, Yasin/0000-0001-8276-2030; en_US
dc.description.abstract Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting. en_US
dc.description.sponsorship TUBITAK National Young Researchers Career Award [113E550, 3501] en_US
dc.description.sponsorship This work is supported by Grant 113E550 from 3501 TUBITAK National Young Researchers Career Award. en_US
dc.description.sponsorship TUBITAK National Young Researchers
dc.identifier.doi 10.1142/S0219720018500208
dc.identifier.issn 0219-7200
dc.identifier.issn 1757-6334
dc.identifier.scopus 2-s2.0-85055476518
dc.identifier.uri https://doi.org/10.1142/S0219720018500208
dc.identifier.uri https://hdl.handle.net/20.500.12573/3630
dc.language.iso en en_US
dc.publisher World Scientific Publ Co Pte Ltd en_US
dc.relation.ispartof Journal of Bioinformatics and Computational Biology en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Secondary Structure Prediction en_US
dc.subject Solvent Accessibility Prediction en_US
dc.subject Feature Selection en_US
dc.subject Dimension Reduction en_US
dc.subject Autoencoder en_US
dc.title Dimensionality Reduction for Protein Secondary Structure and Solvent Accesibility Prediction en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Gormez, Yasin/0000-0001-8276-2030
gdc.author.scopusid 7003852510
gdc.author.scopusid 36559569000
gdc.author.scopusid 57195222392
gdc.author.wosid Kaynar, Oguz/A-6474-2018
gdc.author.wosid Görmez, Yasin/Jef-8096-2023
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Aydin, Zafer] Abdullah Gul Univ, Dept Comp Engn, TR-38080 Kayseri, Turkey; [Kaynar, Oguz; Gormez, Yasin] Cumhuriyet Univ, Dept Management Informat Syst, TR-58000 Sivas, Turkey en_US
gdc.description.issue 5 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q4
gdc.description.startpage 1850020
gdc.description.volume 16 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q4
gdc.identifier.openalex W2886792527
gdc.identifier.pmid 30353781
gdc.identifier.wos WOS:000450008200010
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.diamondjournal false
gdc.oaire.downloads 1
gdc.oaire.impulse 1.0
gdc.oaire.influence 3.024694E-9
gdc.oaire.isgreen true
gdc.oaire.keywords autoencoder
gdc.oaire.keywords Principal Component Analysis
gdc.oaire.keywords Support Vector Machine
gdc.oaire.keywords dimension reduction
gdc.oaire.keywords solvent accessibility prediction
gdc.oaire.keywords Computational Biology
gdc.oaire.keywords Proteins
gdc.oaire.keywords Reproducibility of Results
gdc.oaire.keywords Protein Structure, Secondary
gdc.oaire.keywords feature selection
gdc.oaire.keywords Solvents
gdc.oaire.keywords Neural Networks, Computer
gdc.oaire.keywords Secondary structure prediction
gdc.oaire.keywords Algorithms
gdc.oaire.popularity 3.5242875E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 0206 medical engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.views 2
gdc.openalex.collaboration National
gdc.openalex.fwci 0.1838
gdc.openalex.normalizedpercentile 0.53
gdc.opencitations.count 4
gdc.plumx.crossrefcites 2
gdc.plumx.mendeley 11
gdc.plumx.pubmedcites 1
gdc.plumx.scopuscites 7
gdc.scopus.citedcount 7
gdc.virtual.author Aydın, Zafer
gdc.wos.citedcount 6
relation.isAuthorOfPublication a26c06af-eae3-407c-a21a-128459fa4d2f
relation.isAuthorOfPublication.latestForDiscovery a26c06af-eae3-407c-a21a-128459fa4d2f
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dimensionality reduction for protein secondary structure and solvent accesibility prediction.pdf
Size:
875.71 KB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: