PanIN & PDAC Spatial Transcriptomics Paper Code
Introduction
This is documentation for the code used for analysis in Preprint Link. Here, we leverage single-cell RNAseq and spatial transcriptomics data to investigate the epithelial and stromal co-evolution in pancreatic cancer.
Data
We used a cohort of scRNAseq samples composed of healthy (n = 24), Adjacent normal to tumor (n = 3) and PDAC (n = 18) samples and 10x Visium samples composed of healthy (n = 5), Adjacent normal to tumor (n = 2) and PDAC (n = 7). The samples are integrated from (Steele et al. 2020), (Carpenter et al. 2023), (Carpenter et al. 2024) and Preprint Link studies.
Downloading raw and processed data
- Raw fastq data can be find in these dbGAP repositories:
- Raw count matrices can be found in these GEO repositories:
- Processed data objects and full resolution H&E are available on Zenodo:
Data acquisition can also be done as described in scRNAseq Acquisition and Spatial Transcriptomics Acquisition
Analysis
The analysis workflow and findings are explained in detail in Preprint Link
scRNAseq Analysis
- Alignment using CellRanger as detailed here CellRanger Alignmnet
- Ambient RNA correction using cellbender (Fleming et al. 2023) as detailed here scRNAseq Ambient RNA Correction
- Quality control, processing and integration as detailed here scRNAseq Data Processing and Integration
- Copy number variation inference was done using Numbat (Gao et al. 2023) as described here CNV Inference using Numbat
- Fibroblasts and Macrophages subpopulation analysis were done as decribed here scRNAseq Fibroblast Subpopulation Analysis and Macrophages Subpopulation Analysis
- Gene set scoring on TCGA-PAAD dataset was done as described here TCGA-PAAD Scoring
Spatial Transcriptomics Analysis
Visium
- Alignment using SpaceRanger was done is described here
- Data normalization and seurat object generation is described here Spatial Transcriptomics Data Processing
- Cell type deconvolution was done using RCTD (Cable et al. 2022) is described here Spatial Transcriptomics Cell Type Deconvolution
- Integration and Spatially-informed clustering using BayesSpace (Zhao et al. 2021) is described here Spatial Transcriptomics Clustering (BayesSpace)
- Ligand-Receptor interaction analysis was done using LIANA+ (Dimitrov et al. 2024) is described here LIANA+ Analysis
- Neighborhood analysis is described here Spatial Transcriptomics Neighborhood Analysis
- Pseudobulk analysis of epithelial compartment is described here Spatial Transcriptomics Epithelial Domains Analysis
- Pseudobulk analysis of stromal compartment is described here Spatial Transcriptomics Stromal Domains Analysis
Xenium
- Segmentation was done using Proseg (Jones et al. 2025) is described here Xenium Resegmentation
- Data processing and integration is described here Xenium Data Analysis
- Spatial regression was done using semla (Larsson et al. 2023) is described here Xenium Spatial Regression
- Visualization of data is done as described here Xenium Polygons Visualization
Interactive Visualization
You can explore the data interactively on https://pascadimagliano-lab.github.io/PancAtlas/
Contact us
If you have any questions please feel free to contact the authors, Ahmed M. Elhossiny (hossiny@umich.edu) and Marina Pasca di Magliano (marinapa@umich.edu)