• Pixel level processing: work with multichannel tiff files directly • Operates on pixel intensity values • Cell-level processing: work with tabular data after cell segmentation • Operates on median or mean intensity values aggregated at cell level • Segmentation • Pixel-level • Normalization • Pixel-level or cell-level • Phenotyping • Pixel-level or cell-level Statistical image processing 2
Identifies cells and nuclei in image 1. Nucleus channel used to identify nucleus 2. Cell membrane or cytoplasm markers used to draw boundary around nucleus Segmentation 4
Cell phenotyping is the process of identifying cell types from marker expression values • Conceptually, goal is to create a “cut point” in marker intensity values where cells are either positive or negative for a marker • Can be performed at pixel or cell level Phenotyping 8
• Can be unsupervised, but often requires expert intervention for cell annotation and validation • Not specific to multiplex imaging • Flow and mass cytometry, single-cell RNA-seq • Multiplex imaging has unique challenges • Segmentation error leads to phenotyping error • Hard to differentiate between “marker positive” and “marker negative” Phenotyping 10
• Marker intensity distributions are highly right skewed • Often no clear bimodality • Sensitive to upstream image processing • Normalization • Cell segmentation Phenotyping challenges for MI data 11
• Marker intensity distributions are highly right skewed • Often no clear bimodality • Sensitive to upstream image processing • Normalization • Cell segmentation Phenotyping challenges for MI data 12
The slide-to-slide problem 15 • Tissues placed on a slide, each contains (10s to 100s) of images • Several slides are imaged in the same experiment • Multiple sources of noise introduced each time: optical effects, instrument parameter tuning, different times of staining for antibodies • Large batch effects!
Image transformation, normalization, and batch correction are used to make the data more appropriate for downstream analysis by removing non-biological biases in marker intensity distributions • Transformations: log, arcsinh, square root • Make data more normally distributed, do not adjust for systematic effects • Normalization: adjusts distribution of marker intensities in each slide, image, or channel separately to make distributions appear more similar • Batch correction: explicitly removes systematic bias using variables that account for processing steps Normalizing multiplex imaging data
• Normalization makes the same tissue appear similar across slides • Challenging in MI because • No gold standard exists • Slides differ in their composition Evaluating methods for normalization and batch correction 17 Unnormalized Normalized MAP06025 MAP00083 MAP03361 Bottom of crypt Interior of crypt Stroma Top of crypt
• Histograms reveal differences in images intensities across subjects and slides • Harris et. Al 2022 explored combinations of batch correction and normalizations • Developed new framework for evaluating quality of normalization • R package mxnorm Batch effects in multiplex imaging data 18 Harmonization method Transformation /Normalization
• Most spatial-omics data lack ground truth for evaluating slide effects • Harris et. al 2022 established one 1. Alignment of marker densities 2. Cell phenotyping discordance 3. Proportion of variance due to slide • Mean division is simple and worked well Batch correction evaluation framework 19 Alignment of marker densities
• Most spatial-omics data lack ground truth for evaluating slide effects • Harris et. al 2022 established one 1. Alignment of marker densities 2. Cell phenotyping discordance 3. Proportion of variance due to slide • Mean division is simple and worked well Batch correction evaluation framework 20 Cell phenotyping discordance
• Most spatial-omics data lack ground truth for evaluating slide effects • Harris et. al 2022 established one 1. Alignment of marker densities 2. Cell phenotyping discordance 3. Proportion of variance due to slide • Mean division is simple and worked well Batch correction evaluation framework 21 Proportion of variance due to slide
22 • Allows users to easily allow evaluate normalization procedures in their own data • Allows default normalization options (from Harris et al. 2022) or user specified • mxnorm can be used to evaluate new normalization methods in future papers
• Functional markers are not used to define phenotypes • Inform cell function and can be present be present across multiple cell types • Interested in differences in expression of functional expression across cell or patient populations of interest • whether abundance of PD-L1 positive cells differ between responders and non-responders to an immunotherapy • Still should be normalized! • For analysis, using continuous valued intensity is better than thresholding • Methods for differential expression analysis for ST data can be used • Seal et. al. 2022 provides a method to cluster based on marker densities Analysis of functional markers 23