clustree
producing clustering trees using ggraph
Luke Zappia
@_lazappi_
lazappi.github.io/clustree
My data
OpenStax College, CC BY 3.0 via Wikimedia Commons
Single-cell RNA-sequencing
Gene activity in thousands of cells
~20000 features (genes)
~8000 samples (cells)
Look for different cell types
How many clusters?
Sample  K1  K2  K3
0       A   A   A
1       A   B   C
2       A   A   A
3       A   A   B
4       A   B   A
5       A   A   B
6       A   B   C
7       A   A   A
8       A   A   B
9       A   B   C
How do we do that in R?
Clusters + transitions
ID  Resolution  Cluster  Size
1A  1           A        10
2A  2           A        6
2B  2           B        4
3A  3           A        4
3B  3           B        3
3C  3           C        3

From  To  Number
1A    2A  6
1A    2B  4
2A    3A  3
2A    3B  3
2B    3A  1
2B    3C  3
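The two tables can be derived from the per-sample cluster assignments. The following is a minimal base R sketch, assuming the ten-sample example with columns K1, K2 and K3; the object names (`assignments`, `nodes`, `edges`) are made up for illustration and this is not necessarily how clustree builds its graph internally.

```r
# Cluster assignments for 10 samples at three resolutions (the toy example)
assignments <- data.frame(
  Sample = 0:9,
  K1 = rep("A", 10),
  K2 = c("A", "B", "A", "A", "B", "A", "B", "A", "A", "B"),
  K3 = c("A", "C", "A", "B", "A", "B", "C", "A", "B", "C"),
  stringsAsFactors = FALSE
)

resolutions <- 1:3

# Node table: one row per cluster at each resolution, with its size
nodes <- do.call(rbind, lapply(resolutions, function(res) {
  sizes <- table(assignments[[paste0("K", res)]])
  data.frame(
    ID = paste0(res, names(sizes)),
    Resolution = res,
    Cluster = names(sizes),
    Size = as.integer(sizes),
    stringsAsFactors = FALSE
  )
}))

# Edge table: count the samples shared by clusters at adjacent resolutions
edges <- do.call(rbind, lapply(head(resolutions, -1), function(res) {
  from <- paste0(res, assignments[[paste0("K", res)]])
  to <- paste0(res + 1, assignments[[paste0("K", res + 1)]])
  counts <- as.data.frame(table(From = from, To = to),
                          responseName = "Number",
                          stringsAsFactors = FALSE)
  counts[counts$Number > 0, ]  # drop pairs of clusters with no shared samples
}))
edges <- edges[order(edges$From, edges$To), ]
rownames(edges) <- NULL

print(nodes)
print(edges)
```

Each node is a cluster at one resolution and each edge counts the samples that two clusters at neighbouring resolutions have in common, which is the information a clustering tree draws.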
The Iris dataset
Tiia Monto, CC BY-SA 4.0 via Wikimedia Commons
C T Johansson, CC BY 3.0 via Wikimedia Commons
Jefficus, via Wikimedia Commons
Iris setosa
Iris versicolor
Iris virginica
Iris dataset
k-means clustering
k = 1,...,5
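A minimal sketch of this setup, assuming the built-in `iris` data frame and base R's `kmeans()`. The seed, the `nstart` value and the name `iris_clusterings` are illustrative choices, not taken from the slides, and the plotting step runs only if the clustree package is installed.

```r
set.seed(1)  # k-means starts from random centres, so fix the seed for repeatability

measurements <- iris[, 1:4]  # the four numeric columns; Species is left out

# Run k-means once per value of k and keep the cluster label of every flower
ks <- 1:5
labels <- sapply(ks, function(k) {
  kmeans(measurements, centers = k, nstart = 10)$cluster
})
colnames(labels) <- paste0("K", ks)

# One row per flower: the original columns plus K1 ... K5
iris_clusterings <- cbind(iris, labels)

# Drawing the tree needs the clustree package (assumed to be installed)
if (requireNamespace("clustree", quietly = TRUE)) {
  print(clustree::clustree(iris_clusterings, prefix = "K"))
}
```

The `prefix` argument tells clustree which columns hold the clusterings; the number after the prefix is treated as the resolution.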
Petal length
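One way to overlay a measurement such as petal length on the tree is to summarise it within each cluster. The sketch below assumes k-means clusterings of `iris` stored in columns K1 to K5 (the name `clusterings` is hypothetical) and uses the mean as the summary; the clustree call is guarded because it requires the package to be installed.

```r
set.seed(1)  # fixed seed so the k-means labels are repeatable

# Cluster the four iris measurements for k = 1, ..., 5
ks <- 1:5
labels <- sapply(ks, function(k) {
  kmeans(iris[, 1:4], centers = k, nstart = 10)$cluster
})
colnames(labels) <- paste0("K", ks)
clusterings <- cbind(iris, labels)

# Mean petal length per cluster at k = 3: the per-node summary shown as colour
petal_means <- tapply(clusterings$Petal.Length, clusterings$K3, mean)
print(petal_means)

# Colour every node in the tree by the mean petal length of its samples
if (requireNamespace("clustree", quietly = TRUE)) {
  print(clustree::clustree(clusterings, prefix = "K",
                           node_colour = "Petal.Length",
                           node_colour_aggr = "mean"))
}
```

Other aggregation functions (for example `"median"`) can be passed by name in the same way.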
Organoid data
NPHS1
Figure: t-SNE plots (axes: t-SNE 1, t-SNE 2)
Acknowledgements
Everyone that makes tools and data available
MCRI Bioinformatics
Belinda Phipson
MCRI KDDR
Alex Combes
@_lazappi_
lazappi.github.io/clustree
install.packages("clustree")
Paper
doi.org/10.1101/274035
Slides
tinyurl.com/clustree-useR2018
Supervisors
Alicia Oshlack
Melissa Little