Research Repository

NucTools: Analysis of chromatin feature occupancy profiles from high-throughput sequencing data

Vainshtein, Y and Rippe, K and Teif, VB (2017) 'NucTools: Analysis of chromatin feature occupancy profiles from high-throughput sequencing data.' BMC Genomics, 18 (1). ISSN 1471-2164

art%3A10.1186%2Fs12864-017-3580-2.pdf - Published Version
Available under License Creative Commons Attribution.

Abstract

© 2017 The Author(s).

Background: Biomedical applications of high-throughput sequencing methods generate a vast amount of data in which numerous chromatin features are mapped along the genome. The results are frequently analysed by creating binary data sets that link the presence/absence of a given feature to specific genomic loci. However, the nucleosome occupancy or chromatin accessibility landscape is essentially continuous. It is currently a challenge in the field to cope with continuous distributions of deep sequencing chromatin readouts and to integrate the different types of discrete chromatin features to reveal linkages between them.

Results: Here we introduce the NucTools suite of Perl scripts as well as MATLAB- and R-based visualization programs for a nucleosome-centred downstream analysis of deep sequencing data. NucTools accounts for the continuous distribution of nucleosome occupancy. It allows calculations of nucleosome occupancy profiles averaged over several replicates, comparisons of nucleosome occupancy landscapes between different experimental conditions, and the estimation of the changes of integral chromatin properties such as the nucleosome repeat length. Furthermore, NucTools facilitates the annotation of nucleosome occupancy with other chromatin features like binding of transcription factors or architectural proteins, and epigenetic marks like histone modifications or DNA methylation. The applications of NucTools are demonstrated for the comparison of several datasets for nucleosome occupancy in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs).

Conclusions: The typical workflows of data processing and integrative analysis with NucTools reveal information on the interplay of nucleosome positioning with other features, such as binding of the transcription factor CTCF, regions with stable and unstable nucleosomes, and domains of large organized chromatin K9me2 modifications (LOCKs). As potential limitations and problems, we discuss how inter-replicate variability of MNase-seq experiments can be addressed.

Item Type: Article
Subjects: Q Science > QH Natural history > QH426 Genetics
Divisions: Faculty of Science and Health > Biological Sciences, School of
Depositing User: Jim Jamieson
Date Deposited: 02 Mar 2017 12:43
Last Modified: 04 Sep 2018 17:40
URI: http://repository.essex.ac.uk/id/eprint/19199
