Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Character Encodings
Search
Alex Wheeler
November 15, 2017
0
110
Character Encodings
Alex Wheeler
November 15, 2017
Tweet
Share
More Decks by Alex Wheeler
See All by Alex Wheeler
Running Rings Around Rack
alexwheeler
0
83
Golang Concurrency
alexwheeler
0
87
Rewriting Rack: A Functional Approach
alexwheeler
2
120
Man Computer Symbiosis
alexwheeler
0
110
Flipper
alexwheeler
0
120
Transducers
alexwheeler
0
64
Featured
See All Featured
The MySQL Ecosystem @ GitHub 2015
samlambert
250
12k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
33
2.8k
For a Future-Friendly Web
brad_frost
176
9.5k
Dealing with People You Can't Stand - Big Design 2015
cassininazir
366
25k
Designing for Performance
lara
604
68k
Building Applications with DynamoDB
mza
93
6.2k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
3.7k
The Cost Of JavaScript in 2023
addyosmani
47
7.3k
How to Think Like a Performance Engineer
csswizardry
22
1.3k
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
6
550
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
114
50k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
29
2.2k
Transcript
Counting Systems Why your Emojis Work on Twitter, but not
Tinder
None
The Beginning
Unary Numbers • I • II • IIIII • IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
Romans
Romans
None
Why don’t we just use unique symbols? • Glyph -
any symbol used to represent some value • 1 2 3 4 5 6 7 8 9
None
Modern Counting Systems
Decimal • Latin decimus - tenth • 0 1 2
3 4 5 6 7 8 9
Positional Notation • way of encoding numbers • each position
related to the next by a constant multiplier called the base or radix • base 10
None
None
None
None
None
Binary • Latin binarius - consisting of two • Base
2
None
None
None
Bits and Bytes • Bit - The basic unit of
information in computing • i.e. 0 • Byte - unit of digital information consisting of 8 bits • i.e. 1000 0101
None
Hexadecimal • Greek hex - sixth; decimal - tenth •
base 16
base 16 • 0 1 2 3 4 5 6
7 8 9 A B C D E F
• 10 = ? • A • 255 = ?
• FF
So…what about the computers?
ASCII • American Standard Code for Information Interchange • 0-127
characters • 7 bits
None
Unicode • computing industry standard for encoding, representation, and text
expressed in most of the world’s writing systems • code points in range 0 -10FFFF
None
Hexadecimal Code Points • U+0061 = 97 = a •
U+005A = 122 = z • U+1F4A9 = 128169 = ?
UTF-8 • variable-width character encoding • capable of encoding all
unicode characters • backwards compatible with ASCII
• http://play.golang.org/p/PvVhLj_5kM • http://play.golang.org/p/pO97Yf5w-z
Thanks • @askwheeler • github.com/alexwheeler • alexwheeler.io