though :p • Promote T2T and pangenome resources for diagnostics / testing applications in my organization (GeneDX + Sema4) Code: https://github.com/Sema4-Research/pgr-tk Document: https://sema4-research.github.io/pgr-tk/ Example: https://github.com/sema4-Research/pgr-tk-notebooks preprint: Multiscale Analysis of Pangenome Enables Improved Representation of Genomic Diversity For Repetitive And Clinical Relevant Genes | bioRxiv (https://www.biorxiv.org/content/10.1101/2022.08.05.502980v)
b:c c:d d:e e:f f:g c h i f h e f g e f b c d e a c h i c d b a a a a a a b b b c c c c d d h h e e e i f f f f g g e f g g g Sequence with minimizer anchors Graph Vertex: A set of sequences with shared minimizer anchors at both ends Induced Minimizer Anchored Pan-genomics Graph
principal bundles • Pick the ”most repetitive but non- trivial principal bundle”, identify the locations where a sequence start to overlaps with the bundle as the starting points of the repeat elements. • Maybe more sophisticate HMM can be deployed in the future the repeat