10x Genomics Support/Space Ranger/Analysis/

Metrics Summary

The spaceranger count pipeline outputs metrics_summary.csv which contains a number of key metrics about the barcoding and sequencing process.

MetricDescription
Number of Spots Under TissueThe number of barcodes associated with a spot under tissue.
Number of ReadsTotal number of read pairs that were assigned to this library in demultiplexing.
Mean Reads per SpotThe number of reads, both under and outside of tissue, divided by the number of barcodes associated with a spot under tissue.
Mean Reads Under Tissue per SpotThe number of reads under tissue divided by the number of barcodes associated with a spot under tissue.
Fraction of Spots Under TissueThe fraction of the spots under the tissue.
Median Genes per SpotThe median number of genes detected per spot under tissue-associated barcode. Detection is defined as the presence of at least one UMI count.
Median UMI Counts per SpotThe median number of UMI counts per tissue covered spot.
Valid BarcodesFraction of reads with barcodes that match the whitelist after barcode correction.
Valid UMIsFraction of reads with valid UMIs; i.e. UMI sequences that do not contain Ns and that are not homopolymers.
Sequencing SaturationThe fraction of reads originating from an already-observed UMI. This is a function of library complexity and sequencing depth. More specifically, this is the fraction of confidently mapped, valid spot-barcode, valid UMI reads that had a non-unique (spot-barcode, UMI, gene).
Q30 Bases in BarcodeFraction of spot barcode bases with Q-score greater than or equal to 30, excluding very low quality/no-call (Q lesser than or equal to 2) bases from the denominator.
Q30 Bases in RNA ReadFraction of RNA read bases with Q-score greater than or equal to 30, excluding very low quality/no-call (Q lesser than or equal to 2) bases from the denominator. This is Read 2 for the Visium v1 chemistry.
Q30 Bases in UMIFraction of UMI bases with Q-score greater than or equal to 30, excluding very low quality/no-call (Q lesser than or equal to 2) bases from the denominator.
Reads Mapped to GenomeFraction of reads that mapped to the genome.
Reads Mapped Confidently to GenomeFraction of reads that mapped uniquely to the genome. If a gene mapped to exonic loci from a single gene and also to non-exonic loci, it is considered uniquely mapped to one of the exonic loci.
Reads Mapped Confidently to Intergenic RegionsFraction of reads that mapped uniquely to an intergenic region of the genome.
Reads Mapped Confidently to Intronic RegionsFraction of reads that mapped uniquely to an intronic region of the genome.
Reads Mapped Confidently to Exonic RegionsFraction of reads that mapped uniquely to an exonic region of the genome.
Reads Mapped Confidently to TranscriptomeFraction of reads that mapped to a unique gene in the transcriptome. The read must be consistent with annotated splice junctions. These reads are considered for UMI counting.
Reads Mapped Antisense to GeneFraction of reads confidently mapped to the transcriptome, but on the opposite strand of their annotated gene. A read is counted as antisense if it has any alignments that are consistent with an exon of a transcript but antisense to it, and has no sense alignments.
Fraction Reads in Spots Under TissueThe fraction of valid-barcode, confidently-mapped-to-transcriptome reads with spot-associated barcodes.
Total Genes DetectedThe number of genes with at least one UMI count in any tissue covered spot.

The spaceranger count for FFPE pipeline outputs metrics_summary.csv which differs from spaceranger count with regard to a few key metrics.

MetricDescription
Reads Mapped to the Probe SetFraction of reads that map with at least one read half to the probe reference.
Reads Mapped Confidently to the Probe SetFraction of reads that map uniquely with both read halves to the probe reference.
Reads Mapped Confidently to the Filtered Probe SetFraction of reads that map uniquely with both read halves to the filtered probe reference. The probe reference is filtered to remove genes/features where one or more of the probes targeting this feature might hybridize and ligate at a non targeted loci. This metric will be "None" when probe filtering is disabled.
Genes DetectedThe number of unique genes from the filtered probe set with at least one UMI count in any tissue covered spot.
Number of GenesThe number of genes as defined by the probe set.
Number of Genes greater than or equal to 10 UMIsNumber of genes with at least 10 filtered UMIs from tissue-associated barcodes. These genes are used to calculate per-gene enrichments.

Space Ranger computes sequencing quality and application results metrics on each supported library, which currently are Gene Expression and Antibody Capture. These metrics will be computed and displayed only when one of these library types was used. This page describes Antibody Capture libraries metrics for QC of the library preparation and sequencing of Protein Expression libraries, which appear in the metrics_summary.csv file and on the web_summary.html page.

MetricDescription
Antibody Number of ReadsTotal number of reads.
Mean Antibody Reads per SpotThe number of reads, both under and outside of tissue, divided by the number of barcodes associated with a spot under tissue.
Valid Antibody BarcodesFraction of reads with a spot-barcode found in or corrected to one that is found in the whitelist.
Valid Antibody UMIsFraction of reads with valid UMIs; i.e. UMI sequences that do not contain Ns and that are not homopolymers.
Antibody Sequencing SaturationFraction of antibody library reads originating from an already-observed UMI. This is a function of library complexity and sequencing depth. More specifically, this is a ratio where: the denominator is the number of reads with a recognized antibody barcode, valid cell-barcode, and valid UMI, and the numerator is the subset of those reads that had a non-unique combination of (spot-barcode, UMI, antibody barcode).
Q30 Bases in Antibody BarcodeFraction of cell barcode bases with Q-score greater than or equal to 30, excluding very low quality/no-call (Q lesser than or equal to 2) bases from the denominator.
Q30 Bases in Antibody ReadFraction of bases from the read containing the antibody barcode with Q-score greater than or equal to 30, excluding very low quality/no-call (Q lesser than or equal to 2) bases from the denominator. This is Read 2 for the Visium v2 chemistry.
Q30 Bases in Antibody UMIFraction of UMI bases with Q-score greater than or equal to 30, excluding very low quality/no-call (Q lesser than or equal to 2) bases from the denominator.
Fraction Antibody ReadsFraction of reads that contain a recognized antibody barcode
Fraction Antibody Reads UsableFraction of reads that contain a recognized antibody barcode, a valid UMI, and a spot-associated barcode.
Antibody Reads Usable per SpotNumber of antibody reads usable divided by the number of spot-associated barcodes.
Fraction Unrecognized AntibodyAmong all reads, the fraction with an unrecognizable antibody barcode
Antibody Reads in Spots Under TissueAmong Antibody library reads with a recognized antibody barcode, a valid UMI, and a valid barcode, the fraction associated with a spot under tissue.

The spaceranger aggr pipeline outputs summary.json that contains metrics relating to the aggregated datasets. Note: Square brackets denote a variable that depends on the pipeline input, such as, [library_id]_frac_reads_kept means that if your aggregation contains two libraries with IDs sample123 and sample456, there are two output metrics sample123_frac_reads_kept and sample456_frac_reads_kept.

For aggregated datasets that contain both Gene and Protein Expression libraries, there are additional metrics for Protein Capture that include [Antibody] in the metric name.

MetricDescription
filtered_bcs_transcriptome_unionThe estimated number of barcodes associated with a spot under tissue, summed across all input libraries.
[pre/post]_total_readsTotal number of sequenced reads, summed across all input libraries.
[pre/post]_multi_transcriptome_total_raw_reads_per_filtered_bctotal_reads divided by filtered_bcs_transcriptome_union
[library_id]_pre_normalization_raw_reads_per_filtered_bcThe mean total reads per spot prior to depth normalization, for the library denoted by library_id
[library_id]_pre_normalization_cmb_reads_per_filtered_bcThe mean confidently mapped and barcoded (CMB) reads per spot prior to depth normalization, for the library denoted by library_id.
[library_id]_frac_reads_keptThe fraction of reads that were retained after depth normalization for the library denoted by library_id
lowest_frac_reads_keptThe lowest fraction of reads retained, corresponding to the library which lost the most reads during normalization. A low value may indicate a large disparity in the initial depth of the input libraries.

If one or more of the aggregated samples was a Targeted Gene Expression sample, these additional metrics will also appear:

MetricDescription
[library_id]_pre_normalization_targeted_reads_per_filtered_bcThe mean targeted reads per spot prior to depth normalization, for the library denoted by library_id
[library_id]_frac_targeted_reads_keptThe fraction of reads mapped uniquely and confidently to targeted genes that were retained after depth normalization for the library denoted by library_id. This field will be shown instead of the metric [library_id]_frac_reads_kept above