Support homeCell Ranger ARCAnalysis
Cell Ranger ARC Command Line Arguments

Cell Ranger ARC Command Line Arguments

For a list of subcommands, run cellranger-arc --help.

Usage: cellranger-arc <SUBCOMMAND> cellranger-arc --help

SubcommandDescription
countCount ATAC and gene expression reads from a single library
mkfastqRun bcl2fastq on Single Cell Multiome ATAC + Gene Expression sequencing data
mkrefCreate a cellranger-arc-compatible reference package
aggrAggregate data from multiple cellranger-arc count runs
reanalyzeRe-run secondary analysis (dimensionality reduction, clustering, feature linkage etc.) on a completed cellranger-arc count or cellranger-arc aggr run
testrunRun a tiny cellranger-arc count pipeline to verify software integrity
mkgtfFilter a GTF file by attribute prior to creating a 10x reference
uploadUpload analysis logs to 10x Genomics support
sitecheckCollect linux system configuration information
FlagsDescription
-h, --helpPrints help information
-V, --versionPrints version information

Count ATAC and gene expression reads from a single library.

Usage: cellranger-arc count [FLAGS] [OPTIONS] --id <ID> --reference <PATH> --libraries <CSV>

OptionDescription
--id <ID>A unique run id and output folder name [a-zA-Z0-9_-]+ of maximum length 64 characters
--description <TEXT>Sample description to embed in output files
--reference <PATH>Path to folder containing cellranger-arc-compatible reference. Reference packages can be downloaded from www.10xgenomics.com/support or constructed using the cellranger-arc mkref command
--libraries <CSV>Path to 3-column CSV file defining the paths to ATAC and gene expression FASTQ data generated with the Chromium Single Cell Multiome ATAC + Gene Expression solution. A template CSV would look as follows (blank lines are ignored): fastqs,sample,library_type /data/HAWT7ADXX/outs/fastq_path,myATAC,Chromatin Accessibility /data/H35KCBCXY/outs/fastq_path,myGEX,Gene Expression
--min-atac-count <NUM>Cell caller override: define the minimum number of ATAC transposition events in peaks (ATAC counts) for a cell barcode. Note: this option must be specified in conjunction with min-gex-count. With --min-atac-count=X and --min-gex-count=Y a barcode is defined as a cell if it contains at least X ATAC counts AND at least Y GEX UMI counts
--min-gex-count <NUM>Cell caller override: define the minimum number of GEX UMI counts for a cell barcode. Note: this option must be specified in conjunction with min-atac-count. With --min-atac-count=X and --min-gex-count=Y a barcode is defined as a cell if it contains at least X ATAC counts AND at least Y GEX UMI counts
--peaks <BED>Override peak caller: specify peaks to use in downstream analyses from supplied 3-column BED file. The supplied peaks file must be sorted by position and not contain overlapping peaks; comment lines beginning with # are allowed
--jobmode <MODE>Job manager to use. Valid options: local (default), sge, lsf, slurm or path to a .template file. Search for help on "Cluster Mode" at www.10xgenomics.com/support for more details on configuring the pipeline to use a compute cluster [default: local]
--localcores <NUM>Set max cores the pipeline may request at one time. Only applies to local jobs
--localmem <NUM>Set max GB the pipeline may request at one time. Only applies to local jobs
--localvmem <NUM>Set max virtual address space in GB for the pipeline. Only applies to local jobs
--mempercore <NUM>Reserve enough threads for each job to ensure enough memory will be available, assuming each core on your cluster has at least this much memory available. Only applies to cluster jobmodes
--maxjobs <NUM>Set max jobs submitted to cluster at one time. Only applies to cluster jobmodes
--jobinterval <NUM>Set delay between submitting jobs to cluster, in ms. Only applies to cluster jobmodes
--overrides <PATH>The path to a JSON file that specifies stage-level overrides for cores and memory. Finer-grained than --localcores, --mempercore and --localmem. Consult https://www.10xgenomics.com/support/ for an example override file
--uiport <PORT>Serve web UI at http://localhost:PORT
FlagDescription
--gex-exclude-intronsDisable counting of intronic reads. In this mode, only reads that are exonic and compatible with annotated splice junctions in the reference are counted. Note: using this mode will reduce the UMI counts in the feature-barcode matrix
--no-bamDo not generate bam files
--dryDo not execute the pipeline. Generate a pipeline invocation (.mro) file and stop
--disable-uiDo not serve the web UI
--noexitKeep web UI running after pipestance completes or fails
--nopreflightSkip preflight checks
-h, --helpPrints help information

Run Illumina demultiplexer on sample sheets that contain 10x-specific sample index sets, and generate 10x-specific quality metrics after the demultiplex. Any bcl2fastq argument will work (except a few that are set by the pipeline to ensure proper trimming and sample indexing). The FASTQ output generated will be the same as when running bcl2fastq directly.

These bcl2fastq arguments are overridden by this pipeline:

  • --fastq-cluster-count
  • --minimum-trimmed-read-length
  • --mask-short-adapter-reads

Usage: cellranger-arc mkfastq --run=PATH [options] cellranger-arc mkfastq -h | --help | --version

OptionDescription
--run=PATH(required) Path of Illumina BCL run folder.
--id=NAMEName of the folder created by mkfastq. If not supplied, will default to the name of the flowcell referred to by the --run argument.
--csv=PATH, --samplesheet=PATH, --sample-sheet=PATHPath to the sample sheet. The sample sheet can either be a simple CSV with lane, sample and index columns, or an Illumina Experiment Manager-compatible sample sheet. Sample sheet indexes can refer to 10x sample index set names (e.g., SI-GA-A12).
--simple-csv=PATHDeprecated. Same meaning as --csv.
--force-single-indexIf 10x-supplied i7/i5 paired indices are specified, but the flowcell was run with only one sample index, allow the demultiplex to proceed using the i7 half of the sample index pair.
--filter-single-indexOnly demultiplex samples identified by an i7-only sample index, ignoring dual-indexed samples. Dual-indexed samples will not be demultiplexed.
--filter-dual-indexOnly demultiplex samples identified by i7/i5 dual-indices (e.g., SI-TT-A6), ignoring single-index samples. Single-index samples will not be demultiplexed.
--rc-i2-override=BOOLIndicates if the bases in the I2 read are emitted as reverse complement by the sequencing workflow. Set to 'true' for Workflow A / NovaSeq Reagent Kit v1.5 or greater. Set to 'false' for Wokflow B / older NovaSeq Reagent Kits. NOTE: this parameter is autodetected based and should only be passed in special circumstances.
--lanes=NUMSComma-delimited series of lanes to demultiplex. Shortcut for the --tiles argument.
--use-bases-mask=MASKSame as bcl2fastq; override the read lengths as specified in RunInfo.xml. See Illumina bcl2fastq documentation for more information.
--delete-undeterminedDelete the Undetermined FASTQ files left by bcl2fastq. Useful if your sample sheet is only expected to match a subset of the flowcell.
--output-dir=PATHSame as in bcl2fastq. Folder where FASTQs, reports and stats will be generated.
--project=NAMECustom project name, to override the samplesheet or to use in conjunction with the --csv argument.
--jobmode=MODEJob manager to use. Valid options: local (default), sge, lsf, or a .template file.
--localcores=NUMSet max cores the pipeline may request at one time. Only applies to local jobs.
--localmem=NUMSet max GB the pipeline may request at one time. Only applies to local jobs.
--localvmem=NUMSet max virtual address space in GB for the pipeline. Only applies to local jobs.
--mempercore=NUMReserve enough threads for each job to ensure enough memory will be available, assuming each core on your cluster has at least this much memory available. Only applies in cluster jobmodes.
--maxjobs=NUMSet max jobs submitted to cluster at one time. Only applies in cluster jobmodes.
--jobinterval=NUMSet delay between submitting jobs to cluster, in ms. Only applies in cluster jobmodes.
--overrides=PATHThe path to a JSON file that specifies stage-level overrides for cores and memory. Finer-grained than --localcores, --mempercore and --localmem. Consult the 10x support website for an example override file.
--uiport=PORTServe web UI at http://localhost:PORT
--disable-uiDo not serve the UI.
--noexitKeep web UI running after pipestance completes or fails.
--nopreflightSkip preflight checks.
-h --helpShow this message.
--versionShow version.

Reference preparation tool for 10x Genomics Cell Ranger Multiome ATAC + Gene Expression. Build a reference package from a user-supplied genome FASTA and gene GTF file. Creates a new folder named after the genome. NOTE: Multi-species references are not supported by cellranger-arc. If you construct a multi-species reference and run cellranger-arc count you will not be able to generate all the outputs of the pipeline.

Usage: cellranger-arc mkref --config=PATH [options] mkref -h | --help | --version

OptionDescription
--config(required) Path to configuration file containing additional information about the reference. See online documentation for more details.Path to configuration file containing additional information about the reference. See online documentation for more details.
--nthreads=<num>Number of threads used during STAR genome index generation. Defaults to 1.
--memgb=<num>Maximum memory (GB) used when aligning reads with STAR. Defaults to 16.
--ref-version=<str>Optional reference version string to include with reference.
-h --helpShow this message.
--versionShow version.

Aggregate data from multiple cellranger-arc count runs.

Usage: cellranger-arc aggr [FLAGS] [OPTIONS] --id <ID> --reference <PATH> --csv <CSV>

FlagDescription
--nosecondaryDisable secondary analysis, e.g. clustering
--dryDo not execute the pipeline. Generate a pipeline invocation (.mro) file and stop
--disable-uiDo not serve the web UI
--noexitKeep web UI running after pipestance completes or fails
--nopreflightSkip preflight checks
-h, --helpPrints help information
OptionDescription
--id <ID>A unique run id and output folder name [a-zA-Z0-9_-]+ of maximum length 64 characters
--description <TEXT>Sample description to embed in output files [default: ]
--reference <PATH>Path to folder containing cellranger-arc-compatible reference. Reference packages can be downloaded from www.10xgenomics.com/support or constructed using the cellranger-arc mkref command. Note: this reference must match the reference used for the initial cellranger-arc count run
--csv <CSV>Path to CSV file enumerating cellranger-arc count outputs required for aggregation. For example, a CSV for aggregating two samples would look as follows (blank lines are ignored): library_id,atac_fragments,per_barcode_metrics,gex_molecule_info L1,/data/L1/outs/atac_fragments.tsv.gz,/data/L1/outs/per_barcode_metrics.csv,/data/L1/outs/gex_molecule_info.h5 L2,/data/L2/outs/atac_fragments.tsv.gz,/data/L2/outs/per_barcode_metrics.csv,/data/L2/outs/gex_molecule_info.h5 Optionally, metadata associated with these libraries can be specified using additional columns. This information is not used by the pipeline but will be available in the Loupe file for visualization.
--peaks <BED>Override peak caller: specify peaks to use in downstream analyses from supplied 3-column BED file. The supplied peaks file must be sorted by position and not contain overlapping peaks; comment lines beginning with # are allowed
--normalize <MODE>Library depth normalization mode [default: depth] [possible values: none, depth]
--jobmode <MODE>Job manager to use. Valid options: local (default), sge, lsf, slurm or path to a .template file. Search for help on "Cluster Mode" at www.10xgenomics.com/support for more details on configuring the pipeline to use a compute cluster [default: local]
--localcores <NUM>Set max cores the pipeline may request at one time. Only applies to local jobs
--localmem <NUM>Set max GB the pipeline may request at one time. Only applies to local jobs
--localvmem <NUM>Set max virtual address space in GB for the pipeline. Only applies to local jobs
--mempercore <NUM>Reserve enough threads for each job to ensure enough memory will be available, assuming each core on your cluster has at least this much memory available. Only applies to cluster jobmodes
--maxjobs <NUM>Set max jobs submitted to cluster at one time. Only applies to cluster jobmodes
--jobinterval <NUM>Set delay between submitting jobs to cluster, in ms. Only applies to cluster jobmodes
--overrides <PATH>The path to a JSON file that specifies stage-level overrides for cores and memory. Finer-grained than --localcores, --mempercore and --localmem. Consult https://www.10xgenomics.com/support/ for an example override file
--uiport <PORT>Serve web UI at http://localhost:PORT

Re-run secondary analysis (dimensionality reduction, clustering, feature linkage etc.) on a completed cellranger-arc count or cellranger-arc aggr run.

Usage: cellranger-arc reanalyze [FLAGS] [OPTIONS] --id <ID> --reference <PATH> --matrix <H5> --atac-fragments <TSV.GZ>

FlagDescription
--dryDo not execute the pipeline. Generate a pipeline invocation (.mro) file and stop
--disable-uiDo not serve the web UI
--noexitKeep web UI running after pipestance completes or fails
--nopreflightSkip preflight checks
-h, --helpPrints help information
OptionDescription
--id <ID>A unique run id and output folder name [a-zA-Z0-9_-]+ of maximum length 64 characters
--description <TEXT>Sample description to embed in output files [default: ]
--reference <PATH>Path to folder containing cellranger-arc-compatible reference. Reference packages can be downloaded from www.10xgenomics.com/support or constructed using the cellranger-arc mkref command. Note: this reference must match the reference used for the initial cellranger-arc count run
--matrix <H5>Path to a feature barcode matrix H5 generated by cellranger-arc count or aggr. If you intend to subset to a set of barcodes then use the raw matrix, otherwise use the filtered feature barcode matrix
--atac-fragments <TSV.GZ>Path to the atac_fragments.tsv.gz generated by cellranger-arc count or aggr. Note it is assumed that the tabix index file atac_fragments.tsv.gz.tbi is present in the same directory
--params <CSV>Specify key-value pairs in CSV format for analysis: any subset of random_seed, k_means_max_clusters, feature_linkage_max_dist_mb, num_gex_pcs, num_atac_pcs. For example, to override the number of GEX principal components used to 15 and the distance threshold for feature linkage computation to 2.5 megabases, the CSV would take the form (blank lines are ignored): num_gex_pcs,15 feature_linkage_max_dist_mb,2.5
--barcodes <CSV>Specify barcodes to use in analysis. The barcodes could be specified in a text file that contains one barcode per line, like this (blank lines are ignored): ACGT-1 TGCA-1 Or you can supply a CSV (with/without a header) whose first column will be used - exports from Loupe Browser will have this format. For example, Barcode,Cluster ACGT-1,T cells TGCA-1,B cells
--min-atac-count <NUM>Cell caller override: define the minimum number of ATAC transposition events in peaks (ATAC counts) for a cell barcode. Note: this option must be specified in conjunction with min-gex-count. With --min-atac-count=X and --min-gex-count=Y a barcode is defined as a cell if it contains at least X ATAC counts AND at least Y GEX UMI counts
--min-gex-count <NUM>Cell caller override: define the minimum number of GEX UMI counts for a cell barcode. Note: this option must be specified in conjunction with min-atac-count. With --min-atac-count=X and --min-gex-count=Y a barcode is defined as a cell if it contains at least X ATAC counts AND at least Y GEX UMI counts
--peaks <BED>Override peak caller: specify peaks to use in secondary analyses from supplied 3-column BED file. The supplied peaks file must be sorted by position and not contain overlapping peaks; comment lines beginning with # are allowed
--agg <AGGREGATION_CSV>If the input matrix was produced by 'cellranger-arc aggr', it's possible to pass the same aggregation CSV in order to retain per-library tag information in the resulting .cloupe file
--jobmode <MODE>Job manager to use. Valid options: local (default), sge, lsf, slurm or path to a .template file. Search for help on "Cluster Mode" at www.10xgenomics.com/support for more details on configuring the pipeline to use a compute cluster [default: local]
--localcores <NUM>Set max cores the pipeline may request at one time. Only applies to local jobs
--localmem <NUM>Set max GB the pipeline may request at one time. Only applies to local jobs
--localvmem <NUM>Set max virtual address space in GB for the pipeline. Only applies to local jobs
--mempercore <NUM>Reserve enough threads for each job to ensure enough memory will be available, assuming each core on your cluster has at least this much memory available. Only applies to cluster jobmodes
--maxjobs <NUM>Set max jobs submitted to cluster at one time. Only applies to cluster jobmodes
--jobinterval <NUM>Set delay between submitting jobs to cluster, in ms. Only applies to cluster jobmodes
--overrides <PATH>The path to a JSON file that specifies stage-level overrides for cores and memory. Finer-grained than --localcores, --mempercore and --localmem. Consult https://www.10xgenomics.com/support/ for an example override file
--uiport <PORT>Serve web UI at http://localhost:PORT

Run a tiny cellranger-arc count pipeline to verify software integrity.

Usage: cellranger-arc testrun [FLAGS] [OPTIONS] --id <ID>

FlagDescription
--disable-uiDo not serve the web UI
--noexitKeep web UI running after pipestance completes or fails
--nopreflightSkip preflight checks
-h, --helpPrints help information
OptionDescription
--id <ID>A unique run id and output folder name [a-zA-Z0-9_-]+ of maximum length 64 characters
--description <TEXT>Sample description to embed in output files
--jobmode <MODE>Job manager to use. Valid options: local (default), sge, lsf, slurm or path to a .template file. Search for help on "Cluster Mode" at www.10xgenomics.com/support for more details on configuring the pipeline to use a compute cluster [default: local]
--localcores <NUM>Set max cores the pipeline may request at one time. Only applies to local jobs
--localmem <NUM>Set max GB the pipeline may request at one time. Only applies to local jobs
--localvmem <NUM>Set max virtual address space in GB for the pipeline. Only applies to local jobs
--mempercore <NUM>Reserve enough threads for each job to ensure enough memory will be available, assuming each core on your cluster has at least this much memory available. Only applies to cluster jobmodes
--maxjobs <NUM>Set max jobs submitted to cluster at one time. Only applies to cluster jobmodes
--jobinterval <NUM>Set delay between submitting jobs to cluster, in ms. Only applies to cluster jobmodes
--overrides <PATH>The path to a JSON file that specifies stage-level overrides for cores and memory. Finer-grained than --localcores, --mempercore and --localmem. Consult https://www.10xgenomics.com/support/ for an example override file
--uiport <PORT>Serve web UI at http://localhost:PORT

Genes GTF tool for 10x Genomics Cell Ranger Multiome ATAC + Gene Expression. Filter user-supplied GTF files for use as Cell Ranger Multiome ATAC + Gene Expression-compatible genes files for mkref tool.

Usage: mkgtf <input_gtf> <output_gtf> [--attribute=KEY:VALUE...] mkgtf -h | --help | --version

ArgumentDescription
input_gtfPath to input genes GTF file.
output_gtfPath to filtered output genes GTF file.
OptionDescription
--attribute=<key:value>Key-value pair in attributes field to be kept in the GTF file.
-h --helpShow this message.
--versionShow version.

Upload analysis logs to 10x Genomics support.

Usage: cellranger-arc upload <your_email> <file>

Collect Linux system configuration information.

Usage: cellranger-arc sitecheck cellranger-arc sitecheck -h | --help | --version