
euanc
June 23, 2016

Strong file format obsolescence Vs. weak software obsolescence

Presentation to the #dpcformats event

Transcript

  1. Strong File Format Obsolescence Vs. Weak Software Obsolescence. DPC – Event “Re:Format - What is file format obsolescence and does it really exist?”, 23/6/2016. Euan Cochrane, Digital Preservation Manager, Yale University Library. https://twitter.com/euanc
  2. Metadata = data about data = data. A note = a message/an attempt at communication.
  3. “Metadata is a love note to the future” = Data is an attempt to communicate to the future.
  4. “Communication (from Latin commūnicāre, meaning "to share") is the act of conveying intended meaning to another entity through the use of mutually understood signs and semiotic rules.” https://en.wikipedia.org/w/index.php?title=Communication&oldid=724377004
  5. Digital preservation: Digital preservation is “conveying intended meaning to another entity through the use of mutually understood signs and semiotic rules [data]” ... across time as well as every other barrier communication attempts to overcome.
  6. File format = A way of storing data to enable the reproduction of an information experience at some point in the future. Or…
  7. Information experience = The dynamic messages conveyed by a computer interpreting instructions from inputs to produce outputs.
  8. File format = A way of storing data to enable the reproduction of the dynamic messages conveyed by a computer interpreting instructions from inputs to produce outputs at some point in the future.
  9. • Computers are stupid
    • Computers receive inputs and manipulate them to produce outputs, and can do so dynamically
  10. File format = A way of storing data to enable the reproduction of the dynamic messages conveyed by a computer interpreting instructions from inputs and producing outputs at some point in the future.
  11. File format standard = A [standardized] documented method for storing data to enable the reproduction of an information experience (or “performance”) at some point in the future.
  12. File format standards are for software developers: Developers interpret file format standards to produce instructions for computers to enable them to process primary data files as secondary inputs (.doc, .ppt, etc.) to ultimately produce information experiences.
  13. OAIS: Representation Information. “The information that maps a Data Object into more meaningful concepts ... A[n] ... example is JPEG software which is used to render a JPEG file; rendering the JPEG file as bits is not very meaningful to humans but the software, which embodies an understanding of the JPEG standard, maps the bits into pixels which can then be rendered as an image for human viewing.”
  14. • Software is files
    • Software files have formats that adhere to file format standards (of a sort)
    • Software files are instructions for computers to use to (re)produce an information experience (etc.)
    • Developers interpret file format standards to produce instructions for computers to enable them to process primary data files (.doc, .ppt, etc.) to produce information experiences
  15. Recap
    • Digital preservation is conveying messages to the future using data
    • File formats are methods for storing instructions to tell a computer how to reproduce an information experience or performance
    • Format standards document those methods and can be very complex
    • Software is made up of files that have formats and that provide instructions to computers
  16. Evaluated how a sample set of digital files opened or “rendered” in different environments (different combinations of software, operating systems and hardware), i.e., examined the relationship between software files, data files, computers and information experiences/performances.
  17. Obsolescence: “Obsolescence is the state of being which occurs when an object, service, or practice is no longer wanted even though it may still be in good working order. Obsolescence frequently occurs because a replacement has become available that has, in sum, more advantages compared to the disadvantages incurred by maintaining or repairing the original. Obsolete refers to something that is already disused or discarded, or antiquated. Typically, obsolescence is preceded by a gradual decline in popularity.”
  18. Options for overcoming file format obsolescence
    1. Get people to keep using the format (market your format!)
    2. Be a Luddite (Start a retro-format movement! .wp4 FTW!)
  20. Software is just data: Software = Data that information experience creators assume message recipients have (or can easily get) access to.
  21. Software obsolescence: Software obsolescence occurs when software data files are “disused, or discarded, or antiquated” and expected information experience users/message recipients/designated communities no longer have access to the software data files or are no longer able to get access to them.
  22. Being generous towards file format obsolescence
    • Strong file format obsolescence
    • When files exist that are formatted according to undocumented standards that are no longer supported by any software
    • Even when this is a “problem” (it is exceedingly rare) the problem is not with the formats, it’s with the software
  23. Software obsolescence
    • Strong software obsolescence
    • When information experiences/information objects/digital performances exist for which the software components are no longer accessible by anybody
    • Weak software obsolescence
    • When information experiences/information objects/digital performances exist for which the software-data components are no longer easily accessible by the designated community
  24. Summary
    1. Digital preservation is conveying meaning to the future using data
    2. Data are instructions to be interpreted by a computer
    3. Software = data
    4. Software content files can get lost and become unusable and effectively inaccessible
    5. The idea of a format obsolescing is a bit nonsensical. It is the software that obsolesces, not the format
    6. Strong file format obsolescence is exceedingly rare
    7. We do have a significant problem with a weak form of software obsolescence
    8. We have methods to overcome most weak software obsolescence
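
The distinctions drawn in slides 22 and 23 can be sketched as a small decision procedure. This is only an illustration, not something from the talk: the class names, field names and the order of the checks are assumptions made here, and real cases will rarely reduce to three yes/no questions.

```python
from dataclasses import dataclass
from enum import Enum


class Obsolescence(Enum):
    NONE = "none"
    WEAK_SOFTWARE = "weak software obsolescence"
    STRONG_SOFTWARE = "strong software obsolescence"
    STRONG_FORMAT = "strong file format obsolescence"


@dataclass
class RenderingSituation:
    # Hypothetical fields, chosen for this sketch only.
    format_documented: bool            # does documentation of the format survive?
    software_exists_anywhere: bool     # can anybody at all still access rendering software?
    software_easy_for_community: bool  # can the designated community easily get it?


def classify(s: RenderingSituation) -> Obsolescence:
    """Map a situation onto the talk's categories (assumed ordering of checks)."""
    if not s.software_exists_anywhere:
        # No software left anywhere: strong obsolescence. It only counts as
        # "strong file format obsolescence" if the format is also undocumented.
        if not s.format_documented:
            return Obsolescence.STRONG_FORMAT
        return Obsolescence.STRONG_SOFTWARE
    if not s.software_easy_for_community:
        # Software survives somewhere, but the intended audience cannot easily get it.
        return Obsolescence.WEAK_SOFTWARE
    return Obsolescence.NONE


if __name__ == "__main__":
    old_word_processor_file = RenderingSituation(
        format_documented=False,
        software_exists_anywhere=True,
        software_easy_for_community=False,
    )
    print(classify(old_word_processor_file).value)
```

In this sketch the common case, software that still exists but is hard for the designated community to obtain, lands in the weak category, which is the one the summary describes as the significant problem.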