Slide 1

Slide 1 text

Statistical Image Processing Segmentation, normalization, and cell phenotyping 1

Slide 2

Slide 2 text

• Pixel level processing: work with multichannel tiff files directly • Operates on pixel intensity values • Cell-level processing: work with tabular data after cell segmentation • Operates on median or mean intensity values aggregated at cell level • Segmentation • Pixel-level • Normalization • Pixel-level or cell-level • Phenotyping • Pixel-level or cell-level Statistical image processing 2

Slide 3

Slide 3 text

Cell segmentation 3

Slide 4

Slide 4 text

Identifies cells and nuclei in image 1. Nucleus channel used to identify nucleus 2. Cell membrane or cytoplasm markers used to draw boundary around nucleus Segmentation 4

Slide 5

Slide 5 text

• Proprietary software • inForm by Akoya, Halo automated image analysis software • GUI-based, user-friendly • No manually segmented data required • GUI-based open-source software • CellProfiler, ilastik • User-friendly for non-computer scientists/statisticians • No manually segmented data required* • Deep-learning based open-source software • Best performance* • Need manually segmented data* • Hard to adapt without computational expertise Segmentation approaches 5

Slide 6

Slide 6 text

Segmentation- Mesmer 6 Greenwald et. al. Nature Biotechnology, 2022.

Slide 7

Slide 7 text

Cell phenotyping 7

Slide 8

Slide 8 text

Cell phenotyping is the process of identifying cell types from marker expression values • Conceptually, goal is to create a “cut point” in marker intensity values where cells are either positive or negative for a marker • Can be performed at pixel or cell level Phenotyping 8

Slide 9

Slide 9 text

Cell types by immune markers Thanks to Brooke Fridley and Alex Soupir at the Moffitt Cancer Center for this image

Slide 10

Slide 10 text

• Can be unsupervised, but often requires expert intervention for cell annotation and validation • Not specific to multiplex imaging • Flow and mass cytometry, single-cell RNA-seq • Multiplex imaging has unique challenges • Segmentation error leads to phenotyping error • Hard to differentiate between “marker positive” and “marker negative” Phenotyping 10

Slide 11

Slide 11 text

• Marker intensity distributions are highly right skewed • Often no clear bimodality • Sensitive to upstream image processing • Normalization • Cell segmentation Phenotyping challenges for MI data 11

Slide 12

Slide 12 text

• Marker intensity distributions are highly right skewed • Often no clear bimodality • Sensitive to upstream image processing • Normalization • Cell segmentation Phenotyping challenges for MI data 12

Slide 13

Slide 13 text

• Marker gating • Determine cutpoint in marker intensity histogram to designate “marker positive” and “marker negative” cells • Unsupervised clustering methods • Seurat, Phenograph contain built-in software • Most developed for other single-cell analysis • Semi-supervised • inForm/Halo proprietary software use manual gating to guide phenotyping • MAUI/CU-Anschutz: Deep-learning based pixel-level • Astir-: Deep-learning based cell-level Phenotyping approaches 13

Slide 14

Slide 14 text

Image normalization 14

Slide 15

Slide 15 text

The slide-to-slide problem 15 • Tissues placed on a slide, each contains (10s to 100s) of images • Several slides are imaged in the same experiment • Multiple sources of noise introduced each time: optical effects, instrument parameter tuning, different times of staining for antibodies • Large batch effects!

Slide 16

Slide 16 text

Image transformation, normalization, and batch correction are used to make the data more appropriate for downstream analysis by removing non-biological biases in marker intensity distributions • Transformations: log, arcsinh, square root • Make data more normally distributed, do not adjust for systematic effects • Normalization: adjusts distribution of marker intensities in each slide, image, or channel separately to make distributions appear more similar • Batch correction: explicitly removes systematic bias using variables that account for processing steps Normalizing multiplex imaging data

Slide 17

Slide 17 text

• Normalization makes the same tissue appear similar across slides • Challenging in MI because • No gold standard exists • Slides differ in their composition Evaluating methods for normalization and batch correction 17 Unnormalized Normalized MAP06025 MAP00083 MAP03361 Bottom of crypt Interior of crypt Stroma Top of crypt

Slide 18

Slide 18 text

• Histograms reveal differences in images intensities across subjects and slides • Harris et. Al 2022 explored combinations of batch correction and normalizations • Developed new framework for evaluating quality of normalization • R package mxnorm Batch effects in multiplex imaging data 18 Harmonization method Transformation /Normalization

Slide 19

Slide 19 text

• Most spatial-omics data lack ground truth for evaluating slide effects • Harris et. al 2022 established one 1. Alignment of marker densities 2. Cell phenotyping discordance 3. Proportion of variance due to slide • Mean division is simple and worked well Batch correction evaluation framework 19 Alignment of marker densities

Slide 20

Slide 20 text

• Most spatial-omics data lack ground truth for evaluating slide effects • Harris et. al 2022 established one 1. Alignment of marker densities 2. Cell phenotyping discordance 3. Proportion of variance due to slide • Mean division is simple and worked well Batch correction evaluation framework 20 Cell phenotyping discordance

Slide 21

Slide 21 text

• Most spatial-omics data lack ground truth for evaluating slide effects • Harris et. al 2022 established one 1. Alignment of marker densities 2. Cell phenotyping discordance 3. Proportion of variance due to slide • Mean division is simple and worked well Batch correction evaluation framework 21 Proportion of variance due to slide

Slide 22

Slide 22 text

22 • Allows users to easily allow evaluate normalization procedures in their own data • Allows default normalization options (from Harris et al. 2022) or user specified • mxnorm can be used to evaluate new normalization methods in future papers

Slide 23

Slide 23 text

• Functional markers are not used to define phenotypes • Inform cell function and can be present be present across multiple cell types • Interested in differences in expression of functional expression across cell or patient populations of interest • whether abundance of PD-L1 positive cells differ between responders and non-responders to an immunotherapy • Still should be normalized! • For analysis, using continuous valued intensity is better than thresholding • Methods for differential expression analysis for ST data can be used • Seal et. al. 2022 provides a method to cluster based on marker densities Analysis of functional markers 23