What constitutes "aggregated data" as input for DoubletFinder?

Hi there, 

I am new to DoubletFinder and RNAseq.

The tutorial states "Do not apply DoubletFinder to aggregated scRNA-seq data representing multiple distinct samples (e.g., multiple 10X lanes)."

The person who performed the sequencing told me my samples were run following the "Multiple Samples, Multiple GEM Well, One Flowcell" 10X workflow. That is "multiple samples are processed through multiple GEM wells which generate multiple libraries and are pooled onto one flowcell." (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/what-is-cell-ranger). 

What I was given was the outputs from the Cell ranger pipeline (count matrix, barcode, and gene files) having the 3 donors combined. The barcode, however, contains an ID that allows me to separate which cell belongs to which donor. 

I hope to get your advice on 2 questions:

1. I assume my samples were multiplexed, do I need to demultiplex it first before running DoubletFinder? 
2. Or, because I sequenced the same cell type and the same "overall" population (CD45RA+ CD4 T cells) but samples were sorted from 3 different donors, can I split the count matrix by donor ID on the barcode, and run DoubletFinder on each donor data separately?

Thank you very much for your help with this! 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

What constitutes "aggregated data" as input for DoubletFinder? #101

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

What constitutes "aggregated data" as input for DoubletFinder? #101

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions