Visualising trees to choose clusters for scRNA-...

November 27, 2018

790

Visualising trees to choose clusters for scRNA-seq data

Single-cell RNA-sequencing is commonly used to interrogate complex tissues in order to identify cell types, particularly in the developmental setting. A key analysis step is using gene expression to form clusters of cells assumed to be distinct cell types. We have catalogued more than 70 currently available scRNA-seq clustering methods. Most clustering methods have parameters which affect the number of clusters produced, either by specifying an exact number, or indirectly through other parameters. The clustering resolution that is chosen can have a profound effect on further analysis, but it is unclear how to make this choice. Existing clustering quality metrics often score only single clusters or resolutions, or require perturbation and re-clustering which can be infeasible for large datasets.

Here we present clustering trees, a visualisation that shows the relationship between clusters with increasing clustering resolution. In a clustering tree each cluster is represented as a graph node with edges representing the overlap in samples (cells) between clusters at neighbouring resolutions. Clustering trees are a compact, information-dense visualisation that can be used to highlight instability that may indicate over clustering or display a range of information including gene expression. Importantly, clustering trees display information across resolutions, in contrast to more common visualisations which only show a single clustering. Here we explain the methods developed to produce clustering trees used in the clustree R package (http://cran.r-project.org/package=clustree) and illustrate how we have used these trees for visualization of scRNA-seq data from kidney organoids.

Luke Zappia

November 27, 2018

More Decks by Luke Zappia

See All by Luke Zappia

Suggestions for successful scRNA-seq analysis

lazappi

300

Successful scRNA-seq analysis

lazappi

540

Interoperability between Bioconductor and Python for scRNA-seq analysis

lazappi

1.2k

Tools and techniques for single-cell RNA sequencing data

lazappi

1.1k

PhD Europe 2018

lazappi

550

Clustering trees for visualising scRNA-seq data

lazappi

910

clustree: a package for producing clustering trees using ggraph

lazappi

1.4k

BiocAsia 2017

lazappi

590

gi2017: Simulation and analysis tools for single-cell RNA sequencing data

lazappi

1.1k

Other Decks in Science

See All in Science

konakalab

140

SpatialRDDパッケージによる空間回帰不連続デザイン

saltcooky12

270

AkarengaLT vol.40

hashimoto_kei

120

大黒市で発生した大規模インシデントのポストモーテムから読み解く、記憶媒体消去の大切さ

shucho0103

210

フィードフォワードニューラルネットワークを用いた記号入出力制御系に対する制御器設計 / Controller Design for Augmented Systems with Symbolic Inputs and Outputs Using Feedforward Neural Network

konakalab

160

HDC tutorial

michielstock

750

Inside the Mind of an LLM

baggiponte

200

水耕栽培を始める前に知っておきたい植物の科学

grow_design_lab

280

やるべきときにMLをやる AIエージェント開発

fufufukakaka

1.5k

データベース06: SQL (3/3) 副問い合わせ

trycycle

PRO

Cross-Media Technologies, Information Science and Human-Information Interaction

signer

PRO

32k

なぜエネルギーは保存する？〜自由落下でわかる“対称性”とネーターの定理〜

syotasasaki593876

210

Featured

See All Featured

Joys of Absence: A Defence of Solitary Play

codingconduct

420

The Organizational Zoo: Understanding Human Behavior Agility Through Metaphoric Constructive Conversations (based on the works of Arthur Shelley, Ph.D)

kimpetersen

PRO

390

エンジニアに許された特別な時間の終わり

watany

108

250k

Improving Core Web Vitals using Speculation Rules API

sergeychernyshev

1.5k

Leo the Paperboy

mayatellez

1.9k

Building Better People: How to give real-time feedback that sticks.

wjessup

370

20k

Making the Leap to Tech Lead

cromwellryan

135

10k

Lessons Learnt from Crawling 1000+ Websites

charlesmeaden

PRO

1.4k

The Curse of the Amulet

leimatthew05

13k

The Art of Programming - Codeland 2020

erikaheidi

14k

A Modern Web Designer's Workflow

chriscoyier

698

190k

Hiding What from Whom? A Critical Review of the History of Programming languages for Music

tomoyanonymous

Transcript

Visualising trees to choose clusters for scRNA-seq data Luke Zappia
@_lazappi_
None
None
None
None
NPHS1
Cell cycle SC3 stability Number of genes
Summary Choosing the number of clusters is hard but important
A clustering tree can help by showing: - Relationships between clusters - Which clusters are distinct - Where samples are changing Compact, information dense visualisation - Alternative to t-SNE plots (or similar)
Acknowledgements Everyone that makes tools and data available MCRI Bioinformatics
Belinda Phipson MCRI KDDR Alex Combes @_lazappi_ oshlacklab.com lazappi.github.io/clustree Paper doi.org/10.1093/gigascience/giy083 Slides tinyurl.com/abacbs2018-clustree Supervisors Alicia Oshlack Melissa Little

Visualising trees to choose clusters for scRNA-...

Visualising trees to choose clusters for scRNA-seq data

Luke Zappia

More Decks by Luke Zappia

Other Decks in Science

Featured

Transcript

Visualising trees to choose clusters for scRNA-seq data Luke Zappia

NPHS1

Cell cycle SC3 stability Number of genes

Summary Choosing the number of clusters is hard but important

Acknowledgements Everyone that makes tools and data available MCRI Bioinformatics