Dimensionality reduction for protein secondary structure and solvent accesibility prediction

dc.contributor.author Aydin, Zafer
dc.contributor.author Kaynar, Oguz
dc.contributor.author Gormez, Yasin
dc.contributor.department AGÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü en_US
dc.date.accessioned 2021-04-27T08:41:49Z
dc.date.available 2021-04-27T08:41:49Z
dc.date.issued 2018 en_US
dc.description This work is supported by Grant 113E550 from 3501 TUBITAK National Young Researchers Career Award. en_US
dc.description.abstract Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting. en_US
dc.description.sponsorship Turkiye Bilimsel ve Teknolojik Arastirma Kurumu (TUBITAK) 113E550 3501 en_US
dc.identifier.issn 0219-7200
dc.identifier.issn 1757-6334
dc.identifier.issue 5 en_US
dc.identifier.uri https://doi.org/10.1142/S0219720018500208
dc.identifier.uri https://hdl.handle.net/20.500.12573/684
dc.identifier.volume Volume: 16 Special Issue: SI en_US
dc.language.iso eng en_US
dc.publisher IMPERIAL COLLEGE PRESS, 57 SHELTON ST, COVENT GARDEN, LONDON WC2H 9HE, ENGLAND en_US
dc.relation.isversionof 10.1142/S0219720018500208 en_US
dc.relation.journal JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY en_US
dc.relation.publicationcategory Makale - Uluslararası - Editör Denetimli Dergi en_US
dc.relation.tubitak 113E550 3501
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject autoencoder en_US
dc.subject dimension reduction en_US
dc.subject feature selection en_US
dc.subject solvent accessibility prediction en_US
dc.subject Secondary structure prediction en_US
dc.title Dimensionality reduction for protein secondary structure and solvent accesibility prediction en_US
dc.type article en_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Dimensionality reduction for protein secondary structure and solvent accesibility prediction.pdf
Size:
875.71 KB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: