Research Repository

Identifying subset errors in multiple sequence alignments

Roy, A and Taddese, B and Vohra, S and Thimmaraju, PK and Illingworth, CJR and Simpson, LM and Mukherjee, K and Reynolds, CA and Chintapalli, SV (2014) 'Identifying subset errors in multiple sequence alignments.' Journal of Biomolecular Structure and Dynamics, 32 (3). 364 - 371. ISSN 0739-1102

Full text not available from this repository.


Multiple sequence alignment (MSA) accuracy is important, but there is no widely accepted method of judging the accuracy that different alignment algorithms give. We present a simple approach to detecting two types of error, namely block shifts and the misplacement of residues within a gap. Given a MSA, subsets of very similar sequences are generated through the use of a redundancy filter, typically using a 70-90% sequence identity cut-off. Subsets thus produced are typically small and degenerate, and errors can be easily detected even by manual examination. The errors, albeit minor, are inevitably associated with gaps in the alignment, and so the procedure is particularly relevant to homology modelling of protein loop regions. The usefulness of the approach is illustrated in the context of the universal but little known [K/R]KLH motif that occurs in intracellular loop 1 of G protein coupled receptors (GPCR); other issues relevant to GPCR modelling are also discussed. © 2013 Taylor & Francis.

Item Type: Article
Subjects: Q Science > QH Natural history > QH301 Biology
Divisions: Faculty of Science and Health > Life Sciences, School of
Depositing User: Users 161 not found.
Date Deposited: 26 Sep 2014 11:19
Last Modified: 26 Sep 2019 11:15

Actions (login required)

View Item View Item