Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Character Encodings

Avatar for Alex Wheeler Alex Wheeler
November 15, 2017
120

Character Encodings

Avatar for Alex Wheeler

Alex Wheeler

November 15, 2017
Tweet

Transcript

  1. Why don’t we just use unique symbols? • Glyph -

    any symbol used to represent some value • 1 2 3 4 5 6 7 8 9
  2. Positional Notation • way of encoding numbers • each position

    related to the next by a constant multiplier called the base or radix • base 10
  3. Bits and Bytes • Bit - The basic unit of

    information in computing • i.e. 0 • Byte - unit of digital information consisting of 8 bits • i.e. 1000 0101
  4. base 16 • 0 1 2 3 4 5 6

    7 8 9 A B C D E F
  5. Unicode • computing industry standard for encoding, representation, and text

    expressed in most of the world’s writing systems • code points in range 0 -10FFFF
  6. Hexadecimal Code Points • U+0061 = 97 = a •

    U+005A = 122 = z • U+1F4A9 = 128169 = ?
  7. UTF-8 • variable-width character encoding • capable of encoding all

    unicode characters • backwards compatible with ASCII