Slide 1

Slide 1 text

Semantic Analytics at Scale
Wednesday, October 17, 2012

Slide 2

Slide 2 text

Insights for the web’s best publishers

Slide 3

Slide 3 text

What makes Dash different? Dash is purpose-built for publishers and media companies. We believe insight > data. Our simple, elegant, and intuitive interface requires no training. And your tech team will love our easy integration. Simply put, you get the best insights to increase audience and engagement on your site.

Slide 4

Slide 4 text

Join the web’s best publishers

“the best part of working with Parse.ly is just how good you have been about implementation ... It looks like you guys have a more agile development environment...”

“[It] gave us insight into the content and our performance explained in publisher language rather than your standards analytics jargon.”

Jason Marlin, Director of Technology, ArsTechnica
Zee Kane, Editor-in-Chief, The Next Web

Slide 5

Slide 5 text


Slide 6

Slide 6 text

Parse.ly Mission
• Empower editors & writers
• Liberate product teams
• Catalyze adoption of semantic web
• Help with sustainability

Slide 7

Slide 7 text

Empower editors, writers, & analysts with timely, relevant, & actionable engagement data about web content.

Slide 8

Slide 8 text

Liberate product teams from the CMS, with powerful data-driven APIs to assist virtuous user behaviors.

Slide 9

Slide 9 text

Catalyze adoption of semantic web standards across the media industry.

Slide 10

Slide 10 text

Help media companies achieve profitability & sustainability in digital.

Slide 11

Slide 11 text

Scale.

Slide 12

Slide 12 text


Slide 13

Slide 13 text

What kind of scale?
• >3 billion pageviews per month
• >10 million crawled articles
• >2,500 requests per second at peak
• ~70 server nodes across three data centers
• >Terabyte of RAM with production data
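To put the traffic figures above in proportion, here is a rough back-of-the-envelope calculation. It is only a sketch: the 30-day month and the one-request-per-pageview ratio are assumptions made for illustration, not numbers from the slides.

```python
# Back-of-the-envelope check of the traffic figures on this slide.
# Assumptions (not from the slides): a 30-day month, and one tracking
# request per pageview.
PAGEVIEWS_PER_MONTH = 3_000_000_000   # ">3 billion pageviews per month"
PEAK_REQUESTS_PER_SECOND = 2_500      # ">2,500 requests per second at peak"

SECONDS_PER_MONTH = 30 * 24 * 60 * 60  # 2,592,000 seconds

average_rps = PAGEVIEWS_PER_MONTH / SECONDS_PER_MONTH
peak_to_average = PEAK_REQUESTS_PER_SECOND / average_rps

print(f"average: {average_rps:.0f} requests/second")
print(f"peak is {peak_to_average:.1f}x the average")
```

Under those assumptions the average works out to a little over 1,100 requests per second, so the quoted peak is roughly twice the average load.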

Slide 14

Slide 14 text

Data.

Slide 15

Slide 15 text


Slide 16

Slide 16 text

d3.js – Data-Driven Documents

Slide 17

Slide 17 text


Slide 18

Slide 18 text


Slide 19

Slide 19 text

Metadata.

Slide 20

Slide 20 text

rNews

Slide 21

Slide 21 text

Standard | Implementation | Primary Purpose | Coverage
OpenGraph | Multiple META tags | Facebook Rich Embeds | ~60%
Schema.org Article | Microdata | SEO | ~80%
hNews | Microdata | News Industry Standard | ~90%
rNews | RDFa & Microdata | News Industry Standard | ~100%
HTML5 | Tags | W3C Standard | ~20%
parsely-page | Single META tag | Semantic Analytics | ~70%

Slide 22

Slide 22 text

Field | OpenGraph | rNews | HTML5 | parsely-page
Title | og:title | headline | | title
Pub Date | a:published_time | datePublished

Slide 23

Slide 23 text


Slide 24

Slide 24 text

schema.to
• Meet Mr. Schemato! A friendly semantic web robot that makes metadata cool again.
• Open source
• Public service
• Eases implementation with validators
• Eases consumption with normalizers
• Extensible
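As a rough illustration of what a validator in this spirit might do, the sketch below checks a page for the four properties the Open Graph protocol lists as required. It is not Schemato's actual code or API: `MetaCollector` and `validate_opengraph` are hypothetical names, and the sample page and its URL are made up for the example.

```python
from html.parser import HTMLParser

# The four properties the Open Graph protocol lists as required.
REQUIRED_OG = ("og:title", "og:type", "og:image", "og:url")


class MetaCollector(HTMLParser):
    """Collects <meta property="..." content="..."> pairs from a page."""

    def __init__(self):
        super().__init__()
        self.properties = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        prop, content = attr.get("property"), attr.get("content")
        if prop and content is not None:
            # Keep the first value seen for each property.
            self.properties.setdefault(prop, content)


def validate_opengraph(html):
    """Return a list of human-readable problems; an empty list means valid."""
    collector = MetaCollector()
    collector.feed(html)
    return [
        f"missing required property: {name}"
        for name in REQUIRED_OG
        if not collector.properties.get(name)
    ]


# Hypothetical sample page that forgets og:image.
page = """
<html><head>
<meta property="og:title" content="The Bookstore's Last Stand">
<meta property="og:type" content="article">
<meta property="og:url" content="http://example.com/story">
</head></html>
"""
print(validate_opengraph(page))
```

Returning a list of problems rather than a single boolean lets a publisher see everything that needs fixing in one pass, which is the kind of help a validator is meant to give during implementation.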

Slide 25

Slide 25 text

HTML5, hNews, rNews, Schema.org, OpenGraph, parsely-page

{
  "implements": {
    "ogp": true,
    "rnews": true
  },
  "distilled": {
    "title": "The Bookstore’s Last Stand",
    "link": "http://nytimes.com/123/…",
    "pub_date": "2012-01-28",
    "image_url": "http://img.nyt.com/…",
    "author": "Julie Bosman",
    "section": "Business Day",
    "tags": ["Barnes & Noble", "Amazon"],
    "type": "post",
    "post_id": "100000001318096"
  },
  "extracted": {...}
}
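The report shape on this slide can be approximated with a small normalizer. The sketch below is an assumption about how such a distiller might work, not Schemato's real implementation: `FIELD_MAP`, `distill`, the precedence rule, and the example URL are all hypothetical, while the `ogp` and `rnews` keys and the output field names follow the slide.

```python
# Hypothetical mapping from each standard's field names to the
# normalized names used in the "distilled" output on this slide.
FIELD_MAP = {
    "ogp": {"og:title": "title", "og:url": "link", "og:image": "image_url"},
    "rnews": {"headline": "title", "datePublished": "pub_date"},
}


def distill(extracted):
    """Merge per-standard metadata into one normalized record.

    `extracted` maps a standard's key (e.g. "ogp") to the raw fields
    found on the page. Standards listed earlier in FIELD_MAP win when
    two standards supply the same normalized field.
    """
    report = {"implements": {}, "distilled": {}, "extracted": extracted}
    for standard, mapping in FIELD_MAP.items():
        raw = extracted.get(standard, {})
        report["implements"][standard] = bool(raw)
        for source_name, normalized_name in mapping.items():
            if source_name in raw:
                report["distilled"].setdefault(normalized_name, raw[source_name])
    return report


# Example input: raw fields as they might be pulled from one article page.
report = distill({
    "ogp": {"og:title": "The Bookstore's Last Stand",
            "og:url": "http://example.com/story"},
    "rnews": {"headline": "The Bookstore's Last Stand",
              "datePublished": "2012-01-28"},
})
print(report["implements"])
print(report["distilled"])
```

Keeping the raw values under `extracted` alongside the normalized `distilled` record means a consumer can use the simple view by default and still inspect what each standard actually said.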

Slide 26

Slide 26 text

Here Today
• Validation Framework
• Distilling Framework
• Web Service
• Validators:
  – rNews
  – Schema.org
  – OpenGraph
  – parsely-page

Coming Soon
• Distillers
• Site Registry
• Proxy
• Command-Line Tools
• Validators:
  – hNews
  – Dublin Core
  – HTML5

Slide 27

Slide 27 text

Parse.ly Mission
• Empower editors & writers
• Liberate product teams
• Catalyze adoption of semantic web
• Help with sustainability

Slide 28

Slide 28 text

1 line of JavaScript that will not break or slow down your site
Streamlined Integration
Publishers sign up for a free 30-day trial at: http://dash.to/try

Slide 29

Slide 29 text

Get in Touch
• Tweet us (now!)
  – @amontalenti
  – @parsely
• Email us (whenever!)
  – andrew@parsely.com
  – hello@parsely.com

Open source contributions to Schema.to, Parse.ly demos, or anything else!