Protein fragment selection using machine learning

dc.contributor.author EMRE ULUTAŞ, ALPEREN
dc.contributor.department AGÜ, Fen Bilimleri Enstitüsü, Elektrik ve Bilgisayar Mühendisliği Ana Bilim Dalı en_US
dc.contributor.institutionauthor EMRE ULUTAŞ, ALPEREN
dc.date.accessioned 2020-07-21T13:18:53Z
dc.date.available 2020-07-21T13:18:53Z
dc.date.issued 2018 en_US
dc.description.abstract Protein fragment selection is an important step in predicting the three-dimensional (3D) structure of proteins. Selecting the right fragments contributes significantly to accurate prediction of 3D structure. In this thesis, a machine learning approach is employed to predict whether a pair of protein fragments have similar 3D structures or not, which can be used to select fragment structures for a target protein with unknown structure. To design input features, a concepy hierarchy is implemented, which considers sequence profile matrices, predicted secondary structure, solvent accessibility and torsion angle classes as features in various combinations and projections. Several machine learning classifiers and regressors are trained and optimized for predicting the structural similarity of 3-mer and 9-mer fragments including logistic regression, AdaBoost, decision tree, k-nearest neighbor, naive Bayes, random forest, SVM and multi-layer perceptron. The results demonstrate that combining different feature sets through concept hierarcy and model optimization improves the prediction accuracy substantially. Furthermore it is possible to predict the structural similarity of fragment pairs with high accuracy, which is assessed by perforing cross-validation experiments on fragment datasets. When the structural similarity of fragments is defined as a classification problem, the accuracy of different classifiers are obtained as similar to each other. Among the regression methods, random forest provided the best accuracy metrics. en_US
dc.identifier.other Tez No: 513554
dc.identifier.uri https://hdl.handle.net/20.500.12573/316
dc.language.iso eng en_US
dc.publisher Abdullah Gül Üniversitesi en_US
dc.relation.publicationcategory Tez en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Protein Fragment Selection en_US
dc.subject Protein Structure Prediction en_US
dc.subject Secondary Structure Prediction en_US
dc.subject Solvent Accessibility Prediction en_US
dc.subject Torsion Angle Prediction en_US
dc.title Protein fragment selection using machine learning en_US
dc.title.alternative Makine öğrenmesi ile protein parçacık seçimi en_US
dc.type masterThesis en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
PROTEIN FRAGMENT SELECTION.pdf
Size:
2.34 MB
Format:
Adobe Portable Document Format
Description:
Yüksek Lisans Tezi

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: