Dimensionality Reduction for Protein Secondary Structure and Solvent Accesibility Prediction
Loading...
Date
2018
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
World Scientific Publ Co Pte Ltd
Open Access Color
Green Open Access
Yes
OpenAIRE Downloads
1
OpenAIRE Views
2
Publicly Funded
No
Abstract
Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting.
Description
Gormez, Yasin/0000-0001-8276-2030;
ORCID
Keywords
Secondary Structure Prediction, Solvent Accessibility Prediction, Feature Selection, Dimension Reduction, Autoencoder, autoencoder, Principal Component Analysis, Support Vector Machine, dimension reduction, solvent accessibility prediction, Computational Biology, Proteins, Reproducibility of Results, Protein Structure, Secondary, feature selection, Solvents, Neural Networks, Computer, Secondary structure prediction, Algorithms
Fields of Science
0301 basic medicine, 0206 medical engineering, 02 engineering and technology, 03 medical and health sciences
Citation
WoS Q
Q4
Scopus Q
Q4

OpenCitations Citation Count
4
Source
Journal of Bioinformatics and Computational Biology
Volume
16
Issue
5
Start Page
1850020
End Page
PlumX Metrics
Citations
CrossRef : 2
Scopus : 7
PubMed : 1
Captures
Mendeley Readers : 11
SCOPUS™ Citations
7
checked on Mar 06, 2026
Web of Science™ Citations
6
checked on Mar 06, 2026
Page Views
1
checked on Mar 06, 2026
Downloads
6
checked on Mar 06, 2026
Google Scholar™


