Detect Differential ChIP Signal — fitAssayDiff • extraChIPs

Detect differential ChIP signal using one of many approaches

fitAssayDiff(x, ...)

# S4 method for class 'SummarizedExperiment'
fitAssayDiff(
  x,
  assay = "counts",
  design = NULL,
  coef = NULL,
  lib.size = "totals",
  method = c("qlf", "lt", "wald"),
  norm = c("none", "TMM", "RLE", "TMMwsp", "upperquartile"),
  groups = NULL,
  fc = 1,
  lfc = log2(fc),
  asRanges = FALSE,
  offset = NULL,
  weighted = FALSE,
  ...,
  null = c("interval", "worst.case"),
  robust = FALSE,
  type = c("apeglm", "ashr", "normal")
)

Arguments

x: a SummarizedExperiment object
...: Passed to normLibSizes and
assay: The assay to use for analysis
design: The design matrix to use for analysis
coef: The required column from the design matrix
lib.size: The column within the colData element which contains the library size information. If set to NULL, column summaries will be used.
method: the analytic method to be used. Can be 'qlf' which will fit counts using the glmQLFit strategy , or 'lt' which fits the limma-trend model on logCPM, or pre-processed logCPM values. Setting method = 'wald' will call nbinomWaldTest
norm: The normalisation strategy to use when running the glmQLF model or the Wald test. The value 'none' relies solely on library-size normalisation, and is the default. All methods available in normLibSizes are implemented. Ignored when using method = "lt"
groups: character(1) If a column name is supplied here, group-based normalisation will be applied to GLM models treating data in this column as a grouping factor. Ignored when using method = "lt"
fc, lfc: Thresholds passed to treat, glmTreat or lfcShrink
asRanges: logical(1). By default, the returned object will be a SummarizedExperiment object with the results added to the rowData element. Setting asRanges = TRUE will only return the GRanges object from this element
offset: If provided will be used as the offset when a DGEList object is created during model fitting for method = 'qlf'
weighted: logical(1) Passed to normLibSizes. Only used when applying a TMM-type normalisation strategy
null: Passed to glmTreat glmQLFit when method = "qlf". If method = "lt", instead passed to lmFit
robust: Passed to treat and eBayes
type: Passed to lfcShrink

Value

A SummarizedExperiment object with results set as the rowData element. Any existing columns not contained in the differential ChIP results will be retained. Results from testing will contain logCPM, logFC, PValue and any t/F statistic as appropriate, along with an FDR-adjusted p-value.

If specifying a range-based H0 by setting lfc != 0, an additional column p_mu0 will be included which is the p-value for the point H0: logFC = 0. These are not used for FDR-adjusted p-values but can be helpful when integrating multiple ChIP targets due to the increase in false-negatives when using a range-based H0, and when requiring more accurate identification of truly unchanged sites, as opposed to those which simply fail to achieve significance using a range-based H0 where arbitrary cutoff values are used.

Details

Starting with a SummarizedExperiment object this function fits either a glmQLFit model to count data, performs the nbinomWaldTest on count data, or applies the limma-trend model to logCPM data.

If fitting Generalised Linear Models via glmQLFit, options for normalisation are "none", which normalises to library size. Existing library sizes are commonly found in the "totals" column of the colData element and this is attempted by default. All methods provided in normLibSizes are also implemented, with the added possibility of normalising within groups instead of across the entire dataset. To enable this, the column with the grouping factor is expected to be in the colData element and is simply called by column name. No normalisation is applied when using the limma-trend model, as this allows for previous normalisation strategies to be performed on the data.

If testing with nbinomWaldTest, applying RLE normalisation without groups, and using colSums for library sizes (instead of total alignments), the standard normalisation factors from DESeq2::estimateSizeFactors will be used. In all other scenarios, normalisation factors as returned by normLibSizes will be used. The fitType is set to 'local' when estimating dispersions, and this can be easily modified by passing fitType via the dot arguments. Results are additionally returned after applying lfcShrink, including the svalue returned by this approach.

Normalising to ChIP Input samples is not yet implemented. Similarly, the use of offsets when applying the Wald test is not yet implemented.

Range-based hypothesis testing is implemented using glmTreat or treat. Setting fc to 1 (or lfc to 0) will default to a point-based null hypothesis, equivalent to either glmQLFTest (method = "qlf") or eBayes (method = "lt"). When applying nbinomWaldTest, lfcShrink will be applied.

It should also be noted that this is primarily a convenience function and if requiring intermediate output from any steps, then these can be run individually as conventionally specified.

Examples

nrows <- 200; ncols <- 6
counts <- matrix(runif(nrows * ncols, 1, 1e4), nrows)
colnames(counts) <- paste0("Sample_", seq_len(ncols))
df <- DataFrame(treat = c("A", "A", "A", "B", "B", "B"))
df$treat <- as.factor(df$treat)
se <- SummarizedExperiment(
  assays = SimpleList(counts = counts), colData = df
)
X <- model.matrix(~treat, colData(se))
se <- fitAssayDiff(se, design = X, lib.size = NULL)
#> Creating DGE list...
#> Calculating experiment-wide normalisation factors...
#> Estimating dispersions...
#> Running glmQLFit...
rowData(se)
#> DataFrame with 200 rows and 5 columns
#>          logFC    logCPM          F    PValue       FDR
#>      <numeric> <numeric>  <numeric> <numeric> <numeric>
#> 1     0.749037   10.7688  0.2131836  0.649080  0.993722
#> 2     0.203735   13.0209  0.2093590  0.652017  0.993722
#> 3     0.314357   12.5005  0.1605486  0.692741  0.993722
#> 4     0.670556   11.5596  0.2703578  0.608586  0.993722
#> 5    -0.268084   12.1153  0.0644155  0.802142  0.993722
#> ...        ...       ...        ...       ...       ...
#> 196 -0.0503165   12.3107 0.00372841  0.951894  0.993722
#> 197  0.5868742   12.6758 0.82132353  0.375180  0.993722
#> 198 -0.3545575   12.0742 0.10151868  0.753194  0.993722
#> 199 -0.7986674   12.2094 0.66249000  0.424909  0.993722
#> 200 -0.2616856   12.9204 0.28235821  0.600793  0.993722