Slide 1

Slide 1 text

clustree producing clustering trees using ggraph Luke Zappia @_lazappi_ lazappi.github.io/clustree

Slide 2

Slide 2 text

My data OpenStax College, CC BY 3.0 via Wikimedia Commons Single-cell RNA-sequencing Gene activity in thousands of cells ~20000 features (genes) ~8000 samples (cells) Look for different cell types

Slide 3

Slide 3 text

How many clusters?

Slide 4

Slide 4 text

No content

Slide 5

Slide 5 text

Sample K1 K2 K3 0 A A A 1 A B C 2 A A A 3 A A B 4 A B A 5 A A B 6 A B C 7 A A A 8 A A B 9 A B C

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

How do we do that in R?

Slide 9

Slide 9 text

Clusters + transitions ID Resolution Cluster Size 1A 1 A 10 2A 2 A 6 2B 2 B 4 3A 3 A 4 3B 3 B 3 3C 3 C 3 From To Number 1A 2A 6 1A 2B 4 2A 3A 3 2A 3B 3 2B 3A 1 2B 3C 3

Slide 10

Slide 10 text

ggplot?

Slide 11

Slide 11 text

No content

Slide 12

Slide 12 text

Building a graph igraph::from_data_frame(edges, vertices = nodes) tidygraph::tbl_graph(edges, nodes) graph %>% activate(nodes) %>% filter(...) %>% mutate(...) %>% activate(edges) %>% filter(...) %>% mutate(...)

Slide 13

Slide 13 text

ggraph ggraph(graph, layout = “tree”) + geom_edge_link(...) + geom_node_point(...) + scale_size(...) + scale_edge_colour(...)

Slide 14

Slide 14 text

clustree(clusterings, ...)

Slide 15

Slide 15 text

What does it look like?

Slide 16

Slide 16 text

Simulations - 1 group

Slide 17

Slide 17 text

Simulations - 4 groups

Slide 18

Slide 18 text

The Iris dataset Tiia Monto CC BY-SA 4.0, via Wikimedia Commons C T Johansson CC BY 3.0, via Wikimedia Commons Jefficus, via Wikimedia Commons Iris setosa Iris versicolor Iris virginica

Slide 19

Slide 19 text

Iris dataset k-means clustering k = 1,...,5

Slide 20

Slide 20 text

No content

Slide 21

Slide 21 text

Petal length

Slide 22

Slide 22 text

Organoid data

Slide 23

Slide 23 text

NPHS1

Slide 24

Slide 24 text

No content

Slide 25

Slide 25 text

t-SNE 2 t-SNE 1 t-SNE 1 t-SNE 2

Slide 26

Slide 26 text

Acknowledgements Everyone that makes tools and data available MCRI Bioinformatics Belinda Phipson MCRI KDDR Alex Combes @_lazappi_ lazappi.github.io/clustree install.packages(“clustree”) Paper doi.org/10.1101/274035 Slides tinyurl.com/clustree-useR2018 Supervisors Alicia Oshlack Melissa Little