Upgrade to Pro — share decks privately, control downloads, hide ads and more …

MISERY is in our genes: A linguistic analysis of the human genome

MISERY is in our genes: A linguistic analysis of the human genome

A talk given at BAHFEST Texas. A Bad Ad Hoc Hypothesis on words in the human genome

Claire D. McWhite

February 17, 2018
Tweet

More Decks by Claire D. McWhite

Other Decks in Science

Transcript

  1. The language(s) of life DNA code Amino acid protein code

    CTGCATGAATATAGGATATGTGAAGCA L H E Y R I C E A CUGCAUGAAUAUAGGAUAUGUGAAGCA RNA DNA Protein RNA DNA Protein
  2. DNA text analysis CATCACAAAGAAGTTTCTCAGAATGCTTCTCTCTAGTTTTTGTGTGAAGATATA TCCTTTTCCATCATAGGCCTCTAAGCTCTCCAAATGTCCACTTGCAGATTCTA CAAAAAGAGTGTTTCAAAACTGCTCTCTCAAAAGTAAGGTTCAACTTTTTGAG TTTAATACACACATCACATAGAAGCTTCTGAGAATGCTTCCGTCTAGTGTTTG TGTTAAGATATTCCCGTTTCCAACGAAGGCTTCAAAGCGGTCAAAATATCCTC TTGCAGATACTACAAAAAGCGTGTATCAAAACTGCTCCATGAAAAGGTATGTT CAACCCTGTGAGTTCAATGTAAACATCACAAAGAAGTTTCTGAGAATGCTTGT

    GTCTGTTTTTTTGTGTGAAGATGTATCCTTTTCCACCATAGTCCTCAAAGCTCT CCAAATGTGCACTTGCAGATTCTACAAAAAGTGTGTTTCAAAACTCCTCTACC AAAAGAAAGGTTCAACTCTGTGAGTTTAATGCCCAGATCACAGACAAGTTTCT TAGGATGCTTCTGTCTAATGTTTATGTGAAGATATTCCCGTTTCCAACTAAGG CCTCAAAGAAGTCCAAATATCCAATTGCAGATTCTACAAAAAGAGCATTT >BA000005.3 Homo sapiens genomic DNA, chromosome 21q
  3. The language(s) of life DNA code Amino acid protein code

    CTGCATGAATATAGGATATGTGAAGCA L H E Y R I C E A CUGCAUGAAUAUAGGAUAUGUGAAGCA RNA DNA Protein RNA DNA Protein
  4. >sp|Q7Z4N2|TRPM1_HUMAN Transient receptor potential cation channel subfamily M member MKDSNRCCCGQFTNQHIPPLPSATPSKNEEESKQVETQPEKWSVA

    KHTQSYPTDSYGVLEFQGGGYSNKAMYIRVSYDTKPDSLLHLMVKD WQLELPKLLISVHGGLQNFEMQPKLKQVFGKGLIKAAMTTGAWIFTG GVSTGVISHVGDALKDHSSKSRGRVCAIGIAPWGIVENKEDLVGKDV TRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEVKLRRLLEK HISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPIPVVICDGS GRASDILSFAHKYCEEGGIINESLREQLLVTIQKTFNYNKAQSHQLFAI IMECMKKKELVTVFRMGSEGQQDIEMAILTALLKGTNVSAPDQLSLA LAWNRVDIARSQIFVFGPHWPPLGSLAPPTDSKATEK… What hidden messages are there in this human pain receptor protein?
  5. >sp|Q7Z4N2|TRPM1_HUMAN Transient receptor potential cation channel subfamily M member MKDSNRCCCGQFTNQHIPPLPSATPSKNEEESKQVETQPEKWSVA

    KHTQSYPTDSYGVLEFQGGGYSNKAMYIRVSYDTKPDSLLHLMVKD WQLELPKLLISVHGGLQNFEMQPKLKQVFGKGLIKAAMTTGAWIFTG GVSTGVISHVGDALKDHSSKSRGRVCAIGIAPWGIVENKEDLVGKDV TRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEVKLRRLLEK HISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPIPVVICDGS GRASDILSFAHKYCEEGGIINESLREQLLVTIQKTFNYNKAQSHQLFAI IMECMKKKELVTVFRMGSEGQQDIEMAILTALLKGTNVSAPDQLSLA LAWNRVDIARSQIFVFGPHWPPLGSLAPPTDSKATEK… What hidden messages are there in this human pain receptor protein?
  6. >sp|Q7Z4N2|TRPM1_HUMAN Transient receptor potential cation channel subfamily M member MKDSNRCCCGQFTNQHIPPLPSATPSKNEEESKQVETQPEKWSVA

    KHTQSYPTDSYGVLEFQGGGYSNKAMYIRVSYDTKPDSLLHLMVKD WQLELPKLLISVHGGLQNFEMQPKLKQVFGKGLIKAAMTTGAWIFTG GVSTGVISHVGDALKDHSSKSRGRVCAIGIAPWGIVENKEDLVGKDV TRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEVKLRRLLEK HISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPIPVVICDGS GRASDILSFAHKYCEEGGIINESLREQLLVTIQKTFNYNKAQSHQLFAI IMECMKKKELVTVFRMGSEGQQDIEMAILTALLKGTNVSAPDQLSLA LAWNRVDIARSQIFVFGPHWPPLGSLAPPTDSKATEK… What hidden messages are there in this human pain receptor protein?
  7. >sp|Q7Z4N2|TRPM1_HUMAN Transient receptor potential cation channel subfamily M member MKDSNRCCCGQFTNQHIPPLPSATPSKNEEESKQVETQPEKWSVA

    KHTQSYPTDSYGVLEFQGGGYSNKAMYIRVSYDTKPDSLLHLMVKD WQLELPKLLISVHGGLQNFEMQPKLKQVFGKGLIKAAMTTGAWIFTG GVSTGVISHVGDALKDHSSKSRGRVCAIGIAPWGIVENKEDLVGKDV TRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEVKLRRLLEK HISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPIPVVICDGS GRASDILSFAHKYCEEGGIINESLREQLLVTIQKTFNYNKAQSHQLFAI IMECMKKKELVTVFRMGSEGQQDIEMAILTALLKGTNVSAPDQLSLA LAWNRVDIARSQIFVFGPHWPPLGSLAPPTDSKATEK… What hidden messages are there in this human pain receptor protein? “Chimpanzee Locomotor Energetics and the Origin of Human Bipedalism” Sockol et al. PNAS, 2007
  8. • PIGTAIL • GIGGLE • CARAVAN • KITTEN • GAGA

    • MIDRIFF • LIGHT • FRESH • DAISY • GLEE • FAWN • LIFE There are some pleasant words in our genome
  9. MISERY LIES ASH APATHY ANGST SAD PAIN TEARS REGRET FECES

    FIERY RESENT LICE SEWER LEAKAGE FEVER AILMENT ASHES GRIM ALLERGY FETID EVIL FLIES WAR CRY WAIL DARK HELP DIE DYING DEATH DEAD DECAY DED ACNE ACRID CYST DIRT FAIL FEAR FICKLE FIEND FILTH FLIES GRIEF HATE ITCH LAMENT GASSY MEASLY MESS
  10. ‘HAPPY’ is found in multiple parasite genomes ◦ Toxoplasma gondii

    ◦ Low-effort parasitic lifestyle in cat digestive tracts