All datasets

Xenium v1 Human Breast FFPE with Biomarkers & Housekeeping Genes Custom Panel

In Situ Gene Expression dataset analyzed using Xenium Onboard Analysis 4.0.0

Assess data quality
View summary metrics to assess data quality and more.
Visualize and explore data
Discover differentially expressed genes, visualize your favorite genes, and explore your data with our visualization software.

Learn about Xenium analysis

Overview

Dataset companion to the preprint, "Biomarker Quantification in Breast Cancer using Xenium In Situ", which explores: 1) housekeeping gene stability across 12 human breast samples and 2) biomarker normalization in tumor cells, myoepithelial cells, and the peritumoral space.

How to view data

Interactively explore data with Xenium Explorer by downloading the Xenium Output Bundle (or Xenium Explorer subset) file. The subset bundle contains the experiment.xenium, gene_panel.json, morphology_focus/ directory of multi-file OME-TIFF files, analysis_summary.html, cells.zarr.zip, cell_feature_matrix.zarr.zip, transcripts.zarr.zip, and analysis.zarr.zip files.

See the Getting Started with Xenium Explorer page for more details. Follow these instructions to view the post-Xenium H&E image or image alignment file in Xenium Explorer.

Biomaterials

From Supplemental Table S1:

SectionDiseaseTumor StagingGradeHormone Status
S1-TopIDCT3 N0 M0G3HER2-Neg
S1-MiddleDCIST2 N0 M0G2ER+/PR+/HER2-2+
S1-BottomDCIST1c N0 M0G3HER2-1+
S2-TopNormalNA (Normal)NAHER2-Neg
S2-MiddlleDCIST1c N1 MXG2HER2-2+
S2-BottomDCISTis NX MXG3HER2-3+
S3-TopCCH/DCISTis N0G2ER+/PR+/HER2-1+
S3-MiddleDCIST4b N3a M1G2ER+/PR+/HER2-2+
S3-BottomDCISTX NX M0G2ER-/PR-/HER2-2+
S4-TopIDCTis NX MXG3ER+/PR+/HER2-1+
S4-MiddleIDCT2 N0 MXNot providedER+/PR+/HER2-2+
S4-BottomDCIST4a N2aG3HER2-1+

Tissue preparation

Tissue sections were prepared following the Xenium In Situ for FFPE - Tissue Preparation Handbook (CG000578).

Probe hybridization, washing, ligation, amplification, and cell segmentation staining were performed following the Xenium In Situ Gene Expression with Cell Segmentation Staining User Guide (CG000749).

Post-instrument processing followed the Demonstrated Protocol Xenium In Situ Gene Expression - Post-Xenium Analyzer H&E Staining (CG000613).

Gene panels

A 280-gene fully custom standalone panel was designed for this study. The panel enables comprehensive cell type identification in human breast cancer with particular emphasis on basement membrane markers (see Supplemental Table S3) and myoepithelial genes, such as ACTA2, MYH11, KRT14/17, and CNN1. A number of human breast tumor biomarkers were included, such as ERBB2, PGR, ESR1, GATA3, and MKI67. The panel also includes 45 housekeeping (HK) genes that were derived from literature (Tilli et al. 2016; Jo et al. 2019; Jain et al. 2020), Oncogene Dx reference panel (Cronin et al. 2004), and internal 10x Genomics Flex Gene Expression or Xenium In Situ Gene Expression breast datasets. These genes were selected based on their low dispersion features observed in these datasets. Probe set numbers were reduced for some genes due to predicted high expression.

Xenium Analyzer

The instrument run was performed following the Xenium Analyzer User Guide (CG000584). The on-instrument analysis was run with Xenium Onboard Analysis version 4.0.0.

SectionMedian transcripts per cellCells detectedNuclear transcripts per 100 µm²High quality decoded transcripts detectedRegion area (µm²)
S1-Top137209,467361.055,283,58360,267,079.8
S1-Middle288375,529497.5151,961,25879,408,960.4
S1-Bottom241254,480517.9117,652,31188,981,827.0
S2-Top88301,854275.541,712,80775,507,037.7
S2-Middlle73380,739269.565,062,08466,827,148.0
S2-Bottom83358,519295.492,260,45999,470,825.5
S3-Top124308,531273.257,809,58599,869,377.4
S3-Middle304373,150526.7177,879,84782,010,605.8
S3-Bottom168150,145343.637,555,92631,953,714.8
S4-Top123192,898351.055,927,78362,975,500.6
S4-Middle246335,709422.3111,763,69768,547,410.3
S4-Bottom218201,955314.880,016,54655,029,208.1

Custom Cell Groups

Xenium data was imported into Seurat v5.0.1.9007, without transcript coordinate data, according to the vignette, "Analysis of Image-based Spatial Data in Seurat". Cells were retained for downstream analysis only if they contained more than 10 total transcripts and detected over five unique genes. We normalized the data using SCTransform. Using the FindClusters function (resolution = 0.3, dimensionality determined via elbow plot ranking of principle components), the major cell type groups were manually annotated based on differentially expressed genes (using the FindMarkers function). Additional subclustering was performed in one instance: myoepithelial cells from the S1-Top section were subdivided into normal-adjacent and tumor-adjacent populations.

References

This dataset is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. 10x citation guidelines available here.