Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data

Bao, Yanchun and Vinciotti, Veronica and Wit, Ernst and ’t Hoen, Peter AC (2013) Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data. BMC Bioinformatics, 14 (1). 169. DOI https://doi.org/10.1186/1471-2105-14-169

Abstract

Background: Immunoprecipitation (IP) efficiencies may vary widely between different antibodies and between repeated experiments with the same antibody. These differences have a large impact on the quality of ChIP-seq data: a more efficient experiment will necessarily lead to a higher signal-to-background ratio, and therefore to an apparently larger number of enriched regions, compared to a less efficient experiment. In this paper, we show how IP efficiencies can be explicitly accounted for in the joint statistical modelling of ChIP-seq data.

Results: We fit a latent mixture model to eight experiments on two proteins, from two laboratories where different antibodies are used for the two proteins. We use the model parameters to estimate the efficiencies of individual experiments, and find that these are clearly different for the different laboratories, and amongst technical replicates from the same lab. When we account for ChIP efficiency, we find more regions bound in the more efficient experiments than in the less efficient ones, at the same false discovery rate. A priori knowledge of the same number of binding sites across experiments can also be included in the model for a more robust detection of differentially bound regions between two different proteins.

Conclusions: We propose a statistical model for the detection of enriched and differentially bound regions from multiple ChIP-seq data sets. The framework that we present accounts explicitly for IP efficiencies in ChIP-seq data, and allows replicates and experiments from different proteins to be modelled jointly rather than individually, leading to more robust biological conclusions.

Item Metadata

Item Type:	Article
Uncontrolled Keywords:	DNA-Binding Proteins; Transcription Factors; Models, Statistical; Chromatin Immunoprecipitation; Sequence Analysis, DNA; Binding Sites; High-Throughput Nucleotide Sequencing
Subjects:	H Social Sciences > HA Statistics; Q Science > QH Natural history > QH301 Biology
Divisions:	Faculty of Science and Health; Faculty of Science and Health > Mathematics, Statistics and Actuarial Science, School of
SWORD Depositor:	Unnamed user with email elements@essex.ac.uk
Depositing User:	Unnamed user with email elements@essex.ac.uk
Date Deposited:	04 Dec 2015 13:32
Last Modified:	16 Aug 2025 00:36
URI:	http://repository.essex.ac.uk/id/eprint/15593

Available files

Published Version

Filename: 1471-2105-14-169.pdf

Licence: Creative Commons: Attribution 3.0
