Research Repository

Detection of microRNAs in color space.

Marco, Antonio and Griffiths-Jones, Sam (2012) 'Detection of microRNAs in color space.' Bioinformatics, 28 (3). 318 - 323. ISSN 1367-4811

[img]
Preview
Text
Bioinformatics-2012-Marco-318-23.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial.

Download (288kB) | Preview

Abstract

MOTIVATION: Deep sequencing provides inexpensive opportunities to characterize the transcriptional diversity of known genomes. The AB SOLiD technology generates millions of short sequencing reads in color-space; that is, the raw data is a sequence of colors, where each color represents 2 nt and each nucleotide is represented by two consecutive colors. This strategy is purported to have several advantages, including increased ability to distinguish sequencing errors from polymorphisms. Several programs have been developed to map short reads to genomes in color space. However, a number of previously unexplored technical issues arise when using SOLiD technology to characterize microRNAs. RESULTS: Here we explore these technical difficulties. First, since the sequenced reads are longer than the biological sequences, every read is expected to contain linker fragments. The color-calling error rate increases toward the 3(') end of the read such that recognizing the linker sequence for removal becomes problematic. Second, mapping in color space may lead to the loss of the first nucleotide of each read. We propose a sequential trimming and mapping approach to map small RNAs. Using our strategy, we reanalyze three published insect small RNA deep sequencing datasets and characterize 22 new microRNAs. AVAILABILITY AND IMPLEMENTATION: A bash shell script to perform the sequential trimming and mapping procedure, called SeqTrimMap, is available at: http://www.mirbase.org/tools/seqtrimmap/ CONTACT: antonio.marco@manchester.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Item Type: Article
Uncontrolled Keywords: Algorithms, Animals, Bees, Color, High-Throughput Nucleotide Sequencing, MicroRNAs, Sequence Analysis, DNA, Sequence Analysis, RNA, Tribolium
Subjects: Q Science > QR Microbiology
Divisions: Faculty of Science and Health > Life Sciences, School of
Depositing User: Antonio Marco
Date Deposited: 29 Nov 2013 16:11
Last Modified: 14 Oct 2019 23:16
URI: http://repository.essex.ac.uk/id/eprint/8315

Actions (login required)

View Item View Item