Harrison, Andrew P and Johnston, Caroline E and Orengo, Christine A (2007) Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips. BMC Bioinformatics, 8 (1). 195-. DOI https://doi.org/10.1186/1471-2105-8-195
Harrison, Andrew P and Johnston, Caroline E and Orengo, Christine A (2007) Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips. BMC Bioinformatics, 8 (1). 195-. DOI https://doi.org/10.1186/1471-2105-8-195
Harrison, Andrew P and Johnston, Caroline E and Orengo, Christine A (2007) Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips. BMC Bioinformatics, 8 (1). 195-. DOI https://doi.org/10.1186/1471-2105-8-195
Abstract
Background: Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed. Results: Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments. Conclusion: The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure. © 2007 Harrison et al; licensee BioMed Central Ltd.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Artifacts; Calibration; Oligonucleotide Array Sequence Analysis; Data Interpretation, Statistical; Sensitivity and Specificity; Reproducibility of Results; Gene Expression Profiling; Algorithms; Quality Control; United Kingdom |
Subjects: | Q Science > QH Natural history > QH301 Biology |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Mathematics, Statistics and Actuarial Science, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 19 May 2015 12:55 |
Last Modified: | 23 Oct 2024 06:20 |
URI: | http://repository.essex.ac.uk/id/eprint/13760 |
Available files
Filename: 1471-2105-8-195.pdf
Licence: Creative Commons: Attribution 3.0