Dimensionality reduction for protein secondary structure and solvent accesibility prediction

Aydin, Zafer; Kaynar, Oguz; Gormez, Yasin

Dimensionality reduction for protein secondary structure and solvent accesibility prediction

Files

Dimensionality reduction for protein secondary structure and solvent accesibility prediction.pdf (875.71 KB)

Date

2018

Authors

Aydin, Zafer

Kaynar, Oguz

Gormez, Yasin

Publisher

IMPERIAL COLLEGE PRESS, 57 SHELTON ST, COVENT GARDEN, LONDON WC2H 9HE, ENGLAND

Abstract

Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting.

Description

This work is supported by Grant 113E550 from 3501 TUBITAK National Young Researchers Career Award.

Keywords

autoencoder, dimension reduction, feature selection, solvent accessibility prediction, Secondary structure prediction

Volume

Volume: 16 Special Issue: SI

Issue

5

URI

https://doi.org/10.1142/S0219720018500208
https://hdl.handle.net/20.500.12573/684

Collections

Bilgisayar Mühendisliği Bölümü Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu
WoS İndeksli Yayınlar Koleksiyonu

Full item page

Dimensionality reduction for protein secondary structure and solvent accesibility prediction

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Turkish CoHE Thesis Center URL

Citation

WoS Q

Scopus Q

Source

Volume

Issue

Start Page

End Page

URI

Collections