• English
    • Türkçe
  • English 
    • English
    • Türkçe
  • Login
View Item 
  •   DSpace Home
  • 3. Mühendislik Fakültesi / Faculty of Engineering
  • Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği
  • View Item
  •   DSpace Home
  • 3. Mühendislik Fakültesi / Faculty of Engineering
  • Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
Realtime Access Map

2-D thresholding of the connectivity map following the multiple sequence alignments of diverse datasets

Thumbnail
View/Open
Conference Paper (1.009Mb)
Date
2013
Author
Doğan, Tunca
Karaçalı, Bilge
Metadata
Show full item record
Abstract
Multiple sequence alignment (MSA) is a widely used method to uncover the relationships between the biomolecular sequences. One essential prerequisite to apply this procedure is to have a considerable amount of similarity between the test sequences. It's usually not possible to obtain reliable results from the multiple alignments of large and diverse datasets. Here we propose a method to obtain sequence clusters of significant intragroup similarities and make sense out of the multiple alignments containing remote sequences. This is achieved by thresholding the pairwise connectivity map over 2 parameters. The first one is the inferred pairwise evolutionary distances and the second parameter is the number of gapless positions on the pairwise comparisons of the alignment. Threshold curves are generated regarding the statistical parameter values obtained from a shuffled dataset and probability distribution techniques are employed to select an optimum threshold curve that eliminate as much of the unreliable connectivities while keeping the reliable ones. We applied the method on a large and diverse dataset composed of nearly 18000 human proteins and measured the biological relevance of the recovered connectivities. Our precision measure (0.981) was nearly 20% higher than the one for the connectivities left after a classical thresholding procedure displaying a significant improvement. Finally we employed the method for the functional clustering of protein sequences in a gold standard dataset. We have also measured the performance, obtaining a higher F-measure (0.882) compared to a conventional clustering operation (0.827).
URI
http://doi.org/10.2316/P.2013.791-092
https://hdl.handle.net/11147/7319
Collections
  • Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği [286]
  • Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection [3276]


DSpace software copyright © 2002-2015  DuraSpace
Contact Us | Send Feedback
Theme by 
@mire NV
 

 



| IZTECH OS Policy |
DSpace@IYTE Guide |

DSpace@IZTECH

by OpenAIRE
Advanced Search

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsTypeDepartmentPublisherCategoryLanguageAccess TypeIZTECH AuthorThis CollectionBy Issue DateAuthorsTitlesSubjectsTypeDepartmentPublisherCategoryLanguageAccess TypeIZTECH Author

My Account

LoginRegister

Statistics

View Usage Statistics

DSpace software copyright © 2002-2015  DuraSpace
Contact Us | Send Feedback
Theme by 
@mire NV
 

 


|| IZTECH OS Policy || DSpace@IYTE Guide || Library || IYTE || OAI-PMH ||

IZTECH Library, Gülbahçe Kampüsü - 35430 - Urla, İzmir / TURKEY
If you find any errors in content, please contact: openaccess@iyte.edu.tr.

Creative Commons Lisansı
DSpace@IZTECH by IYTE Institutional repository is licensed under a Creative Commons Attribution-Gayriticari-NoDerivs 3.0 Unported License.

DSpace@IZTECH is member of: