Skip to content

What constitutes "aggregated data" as input for DoubletFinder? #101

@denvercal1234GitHub

Description

@denvercal1234GitHub

Hi there,

I am new to DoubletFinder and RNAseq.

The tutorial states "Do not apply DoubletFinder to aggregated scRNA-seq data representing multiple distinct samples (e.g., multiple 10X lanes)."

The person who performed the sequencing told me my samples were run following the "Multiple Samples, Multiple GEM Well, One Flowcell" 10X workflow. That is "multiple samples are processed through multiple GEM wells which generate multiple libraries and are pooled onto one flowcell." (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/what-is-cell-ranger).

What I was given was the outputs from the Cell ranger pipeline (count matrix, barcode, and gene files) having the 3 donors combined. The barcode, however, contains an ID that allows me to separate which cell belongs to which donor.

I hope to get your advice on 2 questions:

  1. I assume my samples were multiplexed, do I need to demultiplex it first before running DoubletFinder?
  2. Or, because I sequenced the same cell type and the same "overall" population (CD45RA+ CD4 T cells) but samples were sorted from 3 different donors, can I split the count matrix by donor ID on the barcode, and run DoubletFinder on each donor data separately?

Thank you very much for your help with this!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions