Roy, Aparna and Taddese, Bruck and Vohra, Shabana and Thimmaraju, Phani K and Illingworth, Christopher JR and Simpson, Lisa M and Mukherjee, Keya and Reynolds, Christopher A and Chintapalli, Sree V (2014) Identifying subset errors in multiple sequence alignments. Journal of Biomolecular Structure and Dynamics, 32 (3). pp. 364-371. DOI https://doi.org/10.1080/07391102.2013.770371
Roy, Aparna and Taddese, Bruck and Vohra, Shabana and Thimmaraju, Phani K and Illingworth, Christopher JR and Simpson, Lisa M and Mukherjee, Keya and Reynolds, Christopher A and Chintapalli, Sree V (2014) Identifying subset errors in multiple sequence alignments. Journal of Biomolecular Structure and Dynamics, 32 (3). pp. 364-371. DOI https://doi.org/10.1080/07391102.2013.770371
Roy, Aparna and Taddese, Bruck and Vohra, Shabana and Thimmaraju, Phani K and Illingworth, Christopher JR and Simpson, Lisa M and Mukherjee, Keya and Reynolds, Christopher A and Chintapalli, Sree V (2014) Identifying subset errors in multiple sequence alignments. Journal of Biomolecular Structure and Dynamics, 32 (3). pp. 364-371. DOI https://doi.org/10.1080/07391102.2013.770371
Abstract
Multiple sequence alignment (MSA) accuracy is important, but there is no widely accepted method of judging the accuracy that different alignment algorithms give. We present a simple approach to detecting two types of error, namely block shifts and the misplacement of residues within a gap. Given a MSA, subsets of very similar sequences are generated through the use of a redundancy filter, typically using a 70-90% sequence identity cut-off. Subsets thus produced are typically small and degenerate, and errors can be easily detected even by manual examination. The errors, albeit minor, are inevitably associated with gaps in the alignment, and so the procedure is particularly relevant to homology modelling of protein loop regions. The usefulness of the approach is illustrated in the context of the universal but little known [K/R]KLH motif that occurs in intracellular loop 1 of G protein coupled receptors (GPCR); other issues relevant to GPCR modelling are also discussed. © 2013 Taylor & Francis.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | redundancy; errors; multiple sequence alignments; alignment accuracy; alignment errors; homology modelling |
Subjects: | Q Science > QH Natural history > QH301 Biology |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Life Sciences, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 26 Sep 2014 11:19 |
Last Modified: | 30 Oct 2024 20:11 |
URI: | http://repository.essex.ac.uk/id/eprint/10411 |