Gene-Peak correlations used for calling domains of regulatory chromatin (DORCs)
Source:R/DORCs.R
runGenePeakcorr.Rd
Function to compute correlation between RNA expression and peak accessibility for peaks falling within a window around each gene, across single cells, using background peak correlations for significance testing
Usage
runGenePeakcorr(
ATAC.se,
RNAmat,
genome,
geneList = NULL,
windowPadSize = 50000,
normalizeATACmat = TRUE,
nCores = 4,
keepPosCorOnly = TRUE,
keepMultiMappingPeaks = FALSE,
n_bg = 100,
p.cut = NULL
)
Arguments
- ATAC.se
SummarizedExperiment object of the scATAC-seq reads in peak counts.
- RNAmat
Matrix object of the normalized scRNA-seq counts. For example, this will be the normalized count data housed within the `@assays$RNA@data` field if processed using Seurat. Must have same number of cells (i.e. matched) with the scATAC-seq data
- genome
character specifying the reference genome build to use. Must be one of "hg19", "hg38" or "mm10", with no default
- normalizeATACmat
boolean indicating whether or not to normalize the counts present in the ATAC.se object prior to computing correlations. Default is TRUE (i.e. assumes peak counts in ATAC.se are raw)
- nCores
numeric indicating the number of cores to use if parallelizing tasks
- keepPosCorOnly
boolean indicating whether to only filter for positively correlated gene-peak associations (and hence assumes performing a right-tailed Z-test with permuted correlations). Default is TRUE (only keep positively correlated associations)
- keepMultiMappingPeaks
boolean indicating whether or not to keep peaks mapping to more than 1 gene. Default is FALSE (i.e. force 1-1 mapping for peak-gene, keeping peak with higher correlation if mapping to more than one gene)
- n_bg
numeric indicating the number of background correlations to compute per gene-peak pair. Default is 100 (increasing this can significantly increase run time)
- p.cut
numeric indicating p-value cut-off to apply to gene-peak correlations. Default is NULL (i.e. don't apply any filtering of results). Good option is to set to 0.05
- TSSwindow
numeric specifying the window size (in base pairs) to pad around either side of each TSS, to fetch overlapping peaks. Default is 50 kb (so 100 kb window is drawn arund each TSS)