How to integrate more than two scATAC-seq datasets? #455
-
I was wondering how to best integrate more than two datasets. I've got multiple scATAC-seq datasets, each from a different individual classified as either one of two categories (i.e. different reactions when exposed to the same conditions). I want to make comparisons between the two categories. The vignette for integration uses the Thank you for your input! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
I would recommend first creating a unified set of peaks across all of the datasets, then quantifying the unified peak set in each dataset, and merging the resulting Seurat objects together. You can use the If you find that you see a batch effect (cells separate by both cell state and dataset of origin) after merging the objects, then you could also apply data integration methods to remediate this. For >2 scATAC datasets, I'd recommend trying Harmony (example here). We're working on some updates to the Seurat integration to better support single-cell chromatin data, but these updates are not available yet. |
Beta Was this translation helpful? Give feedback.
I would recommend first creating a unified set of peaks across all of the datasets, then quantifying the unified peak set in each dataset, and merging the resulting Seurat objects together. You can use the
GenomicRanges::reduce()
function for this, and there's an example in the merge vignette: https://satijalab.org/signac/articles/merging.htmlIf you find that you see a batch effect (cells separate by both cell state and dataset of origin) after merging the objects, then you could also apply data integration methods to remediate this. For >2 scATAC datasets, I'd recommend trying Harmony (example here). We're working on some updates to the Seurat integration to better support single-cell c…