Dimensionality Reduction for Protein Secondary Structure and Solvent Accesibility Prediction

Loading...
Publication Logo

Date

2018

Journal Title

Journal ISSN

Volume Title

Publisher

World Scientific Publ Co Pte Ltd

Open Access Color

Green Open Access

Yes

OpenAIRE Downloads

1

OpenAIRE Views

2

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting.

Description

Gormez, Yasin/0000-0001-8276-2030;

Keywords

Secondary Structure Prediction, Solvent Accessibility Prediction, Feature Selection, Dimension Reduction, Autoencoder, autoencoder, Principal Component Analysis, Support Vector Machine, dimension reduction, solvent accessibility prediction, Computational Biology, Proteins, Reproducibility of Results, Protein Structure, Secondary, feature selection, Solvents, Neural Networks, Computer, Secondary structure prediction, Algorithms

Fields of Science

0301 basic medicine, 0206 medical engineering, 02 engineering and technology, 03 medical and health sciences

Citation

WoS Q

Q4

Scopus Q

Q4
OpenCitations Logo
OpenCitations Citation Count
4

Source

Journal of Bioinformatics and Computational Biology

Volume

16

Issue

5

Start Page

1850020

End Page

PlumX Metrics
Citations

CrossRef : 2

Scopus : 7

PubMed : 1

Captures

Mendeley Readers : 11

SCOPUS™ Citations

7

checked on Mar 06, 2026

Web of Science™ Citations

6

checked on Mar 06, 2026

Page Views

1

checked on Mar 06, 2026

Downloads

6

checked on Mar 06, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.1838

Sustainable Development Goals