Character Encodings

Alex Wheeler
November 15, 2017

Transcript

  1. Why don’t we just use unique symbols? • Glyph - any symbol used to represent some value • 1 2 3 4 5 6 7 8 9
  2. Positional Notation • a way of encoding numbers • each position is related to the next by a constant multiplier called the base or radix • base 10
  3. Bits and Bytes • Bit - the basic unit of information in computing • e.g. 0 • Byte - unit of digital information consisting of 8 bits • e.g. 1000 0101
  4. base 16 • 0 1 2 3 4 5 6 7 8 9 A B C D E F
  5. Unicode • computing industry standard for the encoding, representation, and handling of text expressed in most of the world’s writing systems • code points in the range 0 - 10FFFF (hexadecimal)
  6. Hexadecimal Code Points • U+0061 = 97 = a • U+007A = 122 = z • U+1F4A9 = 128169 = 💩
  7. UTF-8 • variable-width character encoding • capable of encoding all Unicode characters • backwards compatible with ASCII
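
The ideas on the slides (positional notation, bytes, hexadecimal code points, and UTF-8) can be tied together in a short Python sketch. It is not part of the original deck: the helper names `to_base` and `describe` are hypothetical and chosen only for illustration, and the only library features used are the built-ins `ord`, `divmod`, and `str.encode`.

```python
def to_base(n: int, base: int) -> str:
    """Write a non-negative integer in positional notation (bases 2 to 16)."""
    digits = "0123456789ABCDEF"
    if not 2 <= base <= 16:
        raise ValueError("base must be between 2 and 16")
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return "0"
    out = []
    while n > 0:
        # Each step peels off the lowest-order digit for this base.
        n, remainder = divmod(n, base)
        out.append(digits[remainder])
    return "".join(reversed(out))


def describe(ch: str) -> dict:
    """Report the code point of one character and its UTF-8 encoding."""
    code_point = ord(ch)          # the character's Unicode code point
    encoded = ch.encode("utf-8")  # 1 to 4 bytes, depending on the code point
    return {
        "code_point": f"U+{code_point:04X}",
        "decimal": code_point,
        "utf8_hex": " ".join(f"{b:02x}" for b in encoded),
        "utf8_len": len(encoded),
    }


if __name__ == "__main__":
    print(to_base(133, 2))    # the byte 1000 0101 from the slides
    print(to_base(97, 16))    # the hex digits of U+0061
    for ch in ("a", "z", "\U0001F4A9"):
        print(describe(ch))
```

ASCII characters such as `a` encode to a single byte with the same value as in ASCII, which is what backwards compatibility means here, while a code point like U+1F4A9 needs four bytes.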