Show simple item record

dc.contributor.authorDoğan, Tunca
dc.contributor.authorKaraçalı, Bilge
dc.date.accessioned2019-10-30T08:35:10Z
dc.date.available2019-10-30T08:35:10Z
dc.date.issued2013en_US
dc.identifier.citationDoğan, T, and Karaçalı, B. (2013, February 13-15). 2-D thresholding of the connectivity map following the multiple sequence alignments of diverse datasets. Paper presented at the 10th IASTED International Conference on Biomedical Engineering, BioMed 2013. doi:10.2316/P.2013.791-092en_US
dc.identifier.isbn978-088986953-0
dc.identifier.urihttp://doi.org/10.2316/P.2013.791-092
dc.identifier.urihttps://hdl.handle.net/11147/7319
dc.description10th IASTED International Conference on Biomedical Engineering, BioMed 2013; Innsbruck; Austria; 13 February 2013 through 15 February 2013en_US
dc.description.abstractMultiple sequence alignment (MSA) is a widely used method to uncover the relationships between the biomolecular sequences. One essential prerequisite to apply this procedure is to have a considerable amount of similarity between the test sequences. It's usually not possible to obtain reliable results from the multiple alignments of large and diverse datasets. Here we propose a method to obtain sequence clusters of significant intragroup similarities and make sense out of the multiple alignments containing remote sequences. This is achieved by thresholding the pairwise connectivity map over 2 parameters. The first one is the inferred pairwise evolutionary distances and the second parameter is the number of gapless positions on the pairwise comparisons of the alignment. Threshold curves are generated regarding the statistical parameter values obtained from a shuffled dataset and probability distribution techniques are employed to select an optimum threshold curve that eliminate as much of the unreliable connectivities while keeping the reliable ones. We applied the method on a large and diverse dataset composed of nearly 18000 human proteins and measured the biological relevance of the recovered connectivities. Our precision measure (0.981) was nearly 20% higher than the one for the connectivities left after a classical thresholding procedure displaying a significant improvement. Finally we employed the method for the functional clustering of protein sequences in a gold standard dataset. We have also measured the performance, obtaining a higher F-measure (0.882) compared to a conventional clustering operation (0.827).en_US
dc.language.isoengen_US
dc.publisherACTA Pressen_US
dc.relation.isversionof10.2316/P.2013.791-092en_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectBiomedical computingen_US
dc.subjectBiostatisticsen_US
dc.subjectSequence analysisen_US
dc.subjectBiomedical engineeringen_US
dc.title2-D thresholding of the connectivity map following the multiple sequence alignments of diverse datasetsen_US
dc.typeconferenceObjecten_US
dc.contributor.authorID0000-0002-7765-6329en_US
dc.contributor.iztechauthorDoğan, Tunca
dc.contributor.iztechauthorKaraçalı, Bilge
dc.relation.journal10th IASTED International Conference on Biomedical Engineering, BioMed 2013en_US
dc.contributor.departmentIzmir Institute of Technology. Electronics and Communication Engineeringen_US
dc.identifier.scopusSCOPUS:2-s2.0-84883861890
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record