Domain Dataset § 1996-2010 § Millions of websites § 2.5 billion resources § > 35TB § No direct access § No bulk downloads § Open metadata datasets § Analytical access
+Usage+Overview § Contains links and anchor text. § Size & distribution: § 6TB of compressed JSON in WARC packaging § Looking at hosting options § CC0 licence § Working with the Oxford Internet Institute § http://www.oii.ox.ac.uk/research/projects/?id=88
item hash values via Wayback to compare our archives or validate independent archives § Expose more information alongside the Memento API § Improve prototype Memento browser plugin(s) § Develop new APIs § Expose link information via Wayback and/or Memento § Lookup by fields other than host and timestamp, e.g. § In-links § Hash values