The Interoperability of Documents (was: A War of File Format)

781f050176c85b3332aaa2d0b3ab718c?s=47 Franklin Weng
October 22, 2017

The Interoperability of Documents (was: A War of File Format)

My official talk in OpenSUSE Asia 2017 (LibreOffice mini Conference Japan 2017), talking about the interoperability of documents.

781f050176c85b3332aaa2d0b3ab718c?s=128

Franklin Weng

October 22, 2017
Tweet

Transcript

  1. The Interoperability of Documents (Was: The war of fle format)

    LibreOfce mini Conference, Japan, 2017 Franklin Weng LibreOfce Migration Professional / Certifee Trainer Member, Certifcation Committee, The Document Founeation Presieent, BoD of Software Liberty Association Taiwan
  2. In This Session Originally I'd like to focus on comparing

    ODF and OOXML But I found an even more important point So I'm going to talk about documents' Interoperability
  3. Document Generation And Interoperation in Ancient China Source:https://goo.gl/wpL2QY Source:https://goo.gl/bMnvPH (良い馬)

    (The Emperor said: "I got it.")
  4. Document Generation in 20th Century

  5. Document Generation in 20th Century

  6. Document "Interoperation" In Old Days

  7. Therefore, In The Old Days What we learn was to

    generate "beautiful" documents.
  8. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages)
  9. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages)
  10. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages)
  11. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages)
  12. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages)
  13. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages) Some mean bosses would ask for "aligning with no error"
  14. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages) Some mean bosses would ask for "aligning with no error" So we use "Invisible table" to align
  15. Therefore, In Old Days We try to just "put" words

    to the position. (So we use Space to move words, and Enter to change pages) Some mean bosses would ask for "aligning with no error" So we use "Invisible table" to align We used special fonts and
  16. All Of The Above Had no any problem in old

    days because we mainly used "paper- based documents". However, in the new era we start to use digital documents, and problems arise….....
  17. Document Interoperability Now

  18. When Receiving Digital Documents Receivers may not be able to

    open (because no software or different version) Receivers may not be able to see the contents (because no such fonts) Receivers may not understand the contents (because the document's typesetting is in a mess)
  19. In 21st Century Document Interoperability ≠ Document Exchanging

  20. In 21st Century Document Interoperability = Working together better

  21. So, We Need To Understand How to generate an "interoperable"

    document
  22. Interoperable Documents Open spec, stable file format standard Correct concept

    of generating documents Use open fonts
  23. 1. File Format

  24. De-jure and De-facto Standards OOXML Office Open XML ODF OpenDocument

    Format
  25. Using OOXML as Standard... Office 2007 Office 2003 2003 04

    05 06 07 08 09 10 11 12 13 14 15 16 17 Office 2010 Office 2013 Office 2016 Office 2000 Office XP Office 97 Office 95 Office 4.3 Office 4.0 Office 3.0 Have you ever tried to generate a docx file with Office 2013 and open it with Office 2007? Can it be opened successfully?
  26. Open Document Format Selected as national standards of file format

    in UK, France, Sweden, …… and Taiwan Open spec ISO Standard Simple, Stable
  27. None
  28. 4.0.0 – 5 years ago!

  29. Save exactly the same content L

  30. #Lines of Content XML File Version XML Lines ODF 1.2

    (any version of) LibreOffice 222 ODF 1.2 Microsoft Office 2013 Windows 541 ODF 1.2 Microsoft Office 2016 Windows 496 OOXML (any version of) LibreOffice 1157 OOXML 2013 Transitional (MS Office Windows) 1590 OOXML 2013 Save as Strict (MS Office Windows) 11861 OOXML 2013 Strict (MS Office Windows) 1588 OOXML 2016 Transitional (MS Office Windows) 11667 OOXML 2016 Save as Strict (MS Office Windows) 11665 OOXML 2016 Strict (MS Office Windows) 11665 OOXML 2011 Transitional (MS Office MacOS) 12854 OOXML 2016 Transitional (MS Office MacOS) 11646
  31. Simple Test Content To be, or not to be, that

    is the question.
  32. Saved as ODT by LibreOffice <start text>To be, or not

    to be, that is the question.<end text> (XML tag rewritten for easier understand)
  33. Saved as DOCX by MS Office <start text>To<end text> <keep

    together> <start text>[space]<end text> <start text>be<end text> <start text>,<end text> <keep together> <start text>[space]<end text> <start text>or<end text> <keep together> <start text>[space]<end text> <start text>not<end text> <keep together> <start text>[space]<end text> <start text>to<end text> <keep together> <start text>[space]<end text> <start text>be<end text> <keep together> <start text>,<end text> <keep together> <start text>[space]<end text> <start text>that<end text> <keep together> <start text>[space]<end text> <start text>is<end text> <keep together> <start text>[space]<end text> <start text>the<end text> <keep together> <start text>[space]<end text> <start text>question<end text> <start text>.<end text>
  34. 2. Concept of generating documents (typesetting)

  35. Correct Concept of Typesetting

  36. Correct Concept of Typesetting

  37. Correct Concept of Typesetting

  38. Correct Concept of Typesetting

  39. 3. Open Fonts

  40. Open Fonts Do you know… Each font has its own

    license?
  41. Microsoft C-Fonts Source:Italo Vignoli

  42. Microsoft Japanese/Chinese Fonts • MS 明朝 (Japanese) • MS Gothic

    (Japanese) • 微軟正黑體 (Chinese) Do you know their license?
  43. Open Font: Noto Sans / Serif

  44. Open Chinese/English Fonts • Japanese Fonts – Noto Sans CJK

    JP(SIL Open Font License)、Noto Serif CJK JP(SIL Open Font) https://www.google.com/get/noto/ – Kochi Mingcho (東風明朝) https://en.wikipedia.org/wiki/Kochi_font • English fonts – Liberation Fonts (SIL Open Font License)
  45. From now on, Learn to generate Interoperable Documents! Source:Italo Vignoli