10x Genomics Support/Cell Ranger/Tutorials/

Installing Cell Ranger

This tutorial is written with Cell Ranger v7.2.0. Commands are compatible with other versions of Cell Ranger unless noted. Please replace x.y.z with the Cell Ranger version you wish to install.

Cell Ranger is delivered as a single, self-contained tar file that can be unpacked anywhere on your system. It bundles all of its required software dependencies, which are pre-compiled to run on a wide range of Linux distributions. For convenience, the reference data package required for Cell Ranger is provided as a separate download.

Alternatively, you can skip Cell Ranger download and installation and use 10x Genomics Cloud Analysis, our recommended method for running 10x Genomics single cell pipelines for most new customers. Cloud Analysis is currently available only in the United States and Canada. Sign up for a free account or learn more.

Starting with Cell Ranger v7.1, the tarball comes in two compression formats, .tar.gz and .tar.xz. Choose either format to install. The .xz compressed file is smaller, which may speed up download times.

Here is a quick start guide to installing Cell Ranger. For detailed instructions, please refer to the tutorial section below.

Step 1Download and unpack the cellranger-x.y.z.tar.gz tar file in any location. In this example, we unpack it in a directory called /opt.

cd /opt # [ download file from downloads page ] tar -xzvf cellranger-x.y.z.tar.gz

If you downloaded Cell Ranger in the .xz compression format, be sure to use the correct file extension tar.xz and tar flags to unpack:

cd /opt # [ download file from downloads page ] tar -xvf cellranger-x.y.z.tar.xz

This unpacks Cell Ranger, its dependencies, and the cellranger script into a new directory called cellranger-x.y.z.

Step 2 – Download and unpack any of the reference data files in a convenient location:

# [ download file from downloads page ] # Example human reference transcriptome tar -xzvf refdata-gex-GRCh38-2020-A.tar.gz

This creates a new directory called refdata-gex-GRCh38-2020-A that contains a single reference (in this case, GRCh38). Each reference contains a set of pre-generated indices and other data required by Cell Ranger.

Step 3 – Prepend the Cell Ranger directory to your $PATH. This will allow you to invoke the cellranger command.

If you unpacked both Cell Ranger and the reference data into /opt, then you would run the following command:

export PATH=/opt/cellranger-x.y.z:$PATH

You may wish to add this command to your .bashrc for convenience.

Here is a detailed tutorial on installing Cell Ranger.

  1. Begin this tutorial by logging in to your remote Linux server. The procedure varies based on the machine, but most likely involves using an ssh command.
ssh user.name@jabberwocky
  1. Sometimes you are prompted for a password. When you type in your password, usually no characters are displayed. This is normal, just keep typing.
user.name@jabberwocky's password:

After logging in, the command prompt looks similar to this:

[user.name@jabberwocky ~]$

Cell Ranger pipelines must run on a Linux system that meets these minimum requirements. It is common to run Cell Ranger on High Performance Computing clusters (HPCs) or stand-alone servers. These resources may be available for you to use through your institution. There are also cloud computing options through service providers such as:

If you are in the U.S. and part of an academic or non-profit institution, you may also be able to request time on a proposal-based resource such as:

Most institutional servers and services have a system administrator tasked with the upkeep of the machine. If available, work with your administrator to log in, identify an appropriate directory to run your analysis, and efficiently allocate system resources for running Cell Ranger.

Sometimes HPC systems have designated compute or interactive nodes for users to log in and monitor output. If available, make sure that Cell Ranger processes do not take up excessive resources and slow down your shared head node for other users.

Note: For the rest of this tutorial, commands are listed for you to copy/paste directly into your command prompt. We alternate between showing you the commands and the expected output.

  1. We will call our working directory the yard. Make a new directory with the following command:
mkdir yard
  1. To keep track of where the yard is located, use the pwd command (print working directory):
pwd

Our yard directory is located here: /mnt/home/user.name/yard. Your directory may be located in a different location depending on your system.

  1. Go into that directory using the command, cd (change directory), and make a folder using mkdir apps to install Cell Ranger.
cd /mnt/home/user.name/yard mkdir apps cd apps
  1. Go to the Cell Ranger Download page and fill out the "10x Genomics End User Software License Agreement" information. Select either the Gzip or Xzip tarball to download. Be sure to copy the whole command from the Downloads page.

  2. Some systems do not have the command, curl (client url) installed. An alternative command provided on the Downloads page is wget. Note: You do not need to do this if the curl command worked for you.

The output looks similar to this:

% Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 767M 100 767M 0 0 124M 0 0:00:06 0:00:06 --:--:-- 131M
  1. List the contents of the apps directory to see the Cell Ranger "tarball" (archive of files).
ls -1

The output looks similar to this:

cellranger-x.y.z.tar.gz
  1. Unpack the tarball using the tar command:
tar -zxvf cellranger-x.y.z.tar.gz

If you downloaded the .tar.xz format (starting with Cell Ranger v7.1), be sure to change the file extension and omit the -z flag:

tar -xvf cellranger-x.y.z.tar.xz

The output looks similar to this:

cellranger-x.y.z/ cellranger-x.y.z/.env.json cellranger-x.y.z/.version cellranger-x.y.z/LICENSE cellranger-x.y.z/builtwith.json cellranger-x.y.z/sourceme.bash cellranger-x.y.z/sourceme.csh cellranger-x.y.z/bin/ cellranger-x.y.z/bin/_cellranger_internal cellranger-x.y.z/bin/cellranger cellranger-x.y.z/bin/rna/ ...

A large amount of information is printed on the screen as it unpacks. When it's finished, a directory containing the cellranger executable and all of its dependencies (except bcl2fastq) is included.

The Illumina software, bcl2fastq2 (version 2.20) is required for cellranger mkfastq. If you know that you will not need to demultiplex and generate FASTQ files, you do not need to install bcl2fastq. If you do need bcl2fastq, download the rpm and then follow these install instructions.

  1. List the contents of the Cell Ranger directory:
ls -l

The output looks similar to this:

cellranger-x.y.z cellranger-x.y.z.tar.gz
  1. Cell Ranger is now installed. To run it requires entering the path to the executable command on the command line. We want to be able to run Cell Ranger by simply entering the word cellranger on the command line. To accomplish this, we add it to your \$PATH variable. Go into the cellranger-x.y.z directory and enter pwd.
cd cellranger-x.y.z pwd

The output looks something like this:

/mnt/home/user.name/yard/apps/cellranger-x.y.z
  1. Then use the export command to add it to the \$PATH variable.
export PATH=/mnt/home/user.name/yard/apps/cellranger-x.y.z:$PATH
  1. Then enter which cellranger to see if we have successfully added cellranger to the $PATH.
which cellranger

The output looks similar to this:

~/yard/apps/cellranger-x.y.z/cellranger

Note that the "tilde" symbol, , is the same as $HOME, which is the same as /mnt/home/user.name/.

  1. For convenience, you may want to add this command to your .bashrc file, a special script that runs every time you log in to your system.

  2. Now you can enter cellranger on the command line from any directory to see the usage statement.

cellranger

The beginning of the usage statement looks something like this:

cellranger cellranger-x.y.z Process 10x Genomics Gene Expression, Feature Barcode, and Immune Profiling data USAGE: cellranger <SUBCOMMAND> FLAGS: -h, --help Prints help information -V, --version Prints version information SUBCOMMANDS: count Count gene expression (targeted or whole- transcriptome) and/or feature barcode reads from a single sample and GEM well multi Analyze multiplexed data or combined gene expression/immune profiling/feature barcode data ...

The usage statement contains a list of cellranger commands. We will run two of them now.

  • sitecheck
  • testrun
  1. The purpose of sitecheck is to check your system to make sure it meets the system requirements for running the cellranger pipeline. Run the command and use the > symbol to re-direct the output to a text file.
cellranger sitecheck > sitecheck.txt
  1. Now use the less command to take a look at this file. Use the up and down arrow keys to scroll through the file. Or to scroll down by page. Press the key on the keyboard to quit out of the less program.
less sitecheck.txt

We will take a look at the following sections of the sitecheck file to see if they meet the minimum requirements for running Cell Ranger:

  • CPU Cores
  • Memory Total
  • Disk Space
  • Global File Limit
  • User Limits:
    • open files
    • max user processes
  1. Scroll down until you find the parts in the file with the CPU Cores information:
... ===================================================================== CPU Cores grep -c processor /proc/cpuinfo --------------------------------------------------------------------- 64 ===================================================================== ...

Running Cell Ranger requires at least 8 CPUs, preferably 16, and at least 64 GB of RAM, preferably 128. We have 64 CPU available.

The Memory Total looks something like this:

... ===================================================================== Memory Total grep MemTotal /proc/meminfo | cut -d ':' -f 2 | sed 's/^[ \t]*//' --------------------------------------------------------------------- 264132348 kB ===================================================================== ...

We need to convert from kB to GB. A quick Google search does that for us. So, we have 264 GB RAM available, which meets the minimum requirements.

The Disk Space looks something like this:

... ===================================================================== Disk Space df -Ph | awk '{print $2, $3, $4}' --------------------------------------------------------------------- Size Used Avail 126G 72K 126G 126G 204K 126G 20G 9.9G 9.7G 50G 1.6G 45G 16T 16T 413G 40T 28T 13T 61G 50G 12G ...

The Global File Limit looks similar to this:

... ===================================================================== Global File Limit cat /proc/sys/fs/file-{max,nr} --------------------------------------------------------------------- 26216544 26624 0 262144 ===================================================================== ...

This value 26216544 meets the minimum requirements of 10k per GB RAM: (10,000 * 264 GB RAM = 2,640,000) < 26,216,544 .

Next, look for the User Limits section, also called ulimit.

... ===================================================================== User Limits bash -c 'ulimit -a' --------------------------------------------------------------------- core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited scheduling priority (-e) 0 file size (blocks, -f) unlimited pending signals (-i) 1031684 max locked memory (kbytes, -l) 64 max memory size (kbytes, -m) unlimited open files (-n) 1024 pipe size (512 bytes, -p) 8 POSIX message queues (bytes, -q) 819200 real-time priority (-r) 0 stack size (kbytes, -s) 8192 cpu time (seconds, -t) unlimited max user processes (-u) 1031684 virtual memory (kbytes, -v) unlimited file locks (-x) unlimited ===================================================================== ...

Note that this machine does meet the max user processes requirement of 64 times the number of CPUs (1031684): (64 * 64 = 4,096) < 1,031,684. However, it does not meet the max open file requirement of 16,000. Some accounts have permission to adjust user limits using the ulimit -n command.

ulimit -n 16000

This account does not have permission to change user limits.

-bash: ulimit: open files: cannot modify limit: Operation not permitted

In this case, contact your system administrator and ask them to adjust it.

Our software support team can review your sitecheck results. Send it to us using cellranger upload and your email address from the command line.

cellranger upload [email protected] sitecheck.txt

The output looks something like this:

Getting upload information... Uploading file... Size: 32,074 bytes 100% [=================================>] 32,074 7.97Mb/s eta 0m 0s Upload complete!

You will receive an email back from support asking what you would like us to do with the file. Reply to that email, and we will review it for you.

Once the user limit for open files is adjusted, run cellranger testrun.

cellranger testrun --id=check_install

The output looks something like this:

Martian Runtime - v4.0.6 Serving UI at ... Running preflight checks (please wait)... Checking sample info... Checking FASTQ folder... Checking reference... ...

This process is completed with the following statement:

Waiting 6 seconds for UI to do final refresh. Pipestance completed successfully! 2022-01-04 12:51:02 Shutting down.

This output indicates a successful testrun and, by extension, a successful Cell Ranger installation.