Evaluation of Protein Secondary Structure Prediction Algorithms on a New Advanced Benchmark Dataset

Has, Canan

Please use this identifier to cite or link to this item: https://hdl.handle.net/11147/3161

Title:	Evaluation of Protein Secondary Structure Prediction Algorithms on a New Advanced Benchmark Dataset
Authors:	Has, Canan
Advisors:	Allmer, Jens
Publisher:	Izmir Institute of Technology
Abstract:	Researchers have been studying protein secondary structure prediction since the 1970s. However, the accuracy of state-of-the-art methods reaches only approximately 80-85%. One reason is the limitations of the datasets used for training and testing the algorithms. Databases of experimentally determined proteins that also carry annotations of function, biochemical properties, and location would directly show how the algorithms perform on particular groups of proteins. They would also give users the opportunity to assess the quality of algorithms on those datasets and to decide which algorithm to use for which type of protein. The objective of this thesis is the development of a new, advanced protein benchmark database containing functional and biochemical information for 64872 experimentally determined proteins in the S2C database, which is derived from the Protein Data Bank (PDB). With this database, seven available predictors are evaluated for their performance on different datasets, defined by the function and subcellular localization of the proteins in the benchmark database. Comparing the results obtained on the proposed benchmark datasets with results on an existing dataset, RS126, shows that grouping proteins by function within their subcellular localizations has a great impact on the measured accuracies of existing algorithms.
Description:	Thesis (Master)--Izmir Institute of Technology, Molecular Biology and Genetics, Izmir, 2011. Includes bibliographical references (leaves: 39-43). Text in English; Abstract: Turkish and English. xi, 43 leaves. Full text release delayed at author's request until 2015.01.17.
URI:	http://hdl.handle.net/11147/3161
Appears in Collections:	Master Degree / Yüksek Lisans Tezleri
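
Illustrative note: the abstract describes comparing predictor accuracy across groups of proteins defined by function and subcellular localization. The record does not state which accuracy measure the thesis uses, so the sketch below assumes plain three-state per-residue accuracy (helix H, strand E, coil C), often called Q3. The function names, group labels, and sequences are hypothetical toy values, not data from the thesis.

```python
from collections import defaultdict


def q3_accuracy(predicted: str, observed: str) -> float:
    """Fraction of residues whose predicted state matches the observed state.

    Both arguments are strings over the three-state alphabet H/E/C
    (an assumption; the thesis record does not specify the encoding).
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must not be empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


def accuracy_by_group(records):
    """Pool residues per group and return {group: per-residue accuracy}.

    `records` is an iterable of (group_label, predicted, observed) tuples.
    Pooling residues weights long proteins more than short ones; averaging
    per-protein accuracies would be an equally valid alternative.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for group, predicted, observed in records:
        if len(predicted) != len(observed):
            raise ValueError(f"length mismatch in group {group!r}")
        correct[group] += sum(p == o for p, o in zip(predicted, observed))
        total[group] += len(observed)
    return {g: correct[g] / total[g] for g in total if total[g] > 0}


# Hypothetical toy records: (group label, predicted states, observed states)
toy_records = [
    ("nucleus", "HHHHCCEE", "HHHCCCEE"),
    ("nucleus", "CCCC", "CCHC"),
    ("membrane", "EEEECC", "EEHHCC"),
]

for group, acc in sorted(accuracy_by_group(toy_records).items()):
    print(f"{group}: {acc:.3f}")
```

Reporting one accuracy per group rather than a single pooled figure is what lets a reader see whether a predictor that does well overall does poorly on a particular class of proteins.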