Upgrade to Pro — share decks privately, control downloads, hide ads and more …

UTF-8入門

yn2011
December 27, 2018

 UTF-8入門

文字コード / Unicode / UTF-8のデコード例 / UTF-8の脆弱性

yn2011

December 27, 2018
Tweet

More Decks by yn2011

Other Decks in Programming

Transcript

  1. จࣈίʔυʢූ߸Խจࣈू߹ʣ • ֤จࣈʹରԠ͢ΔϏοτͷ૊Έ߹ΘͤΛఆٛ • e.g. ASCII, JIS X 0208, Unicode

    … UnicodeͷจࣈίʔυදͷྫʢҰ෦ʣ UnicodeҰཡ 3000-3FFF / WikipediaΑΓҾ༻
  2. 0xCE94 • 0xCE = 11001110 • 110xxxxx → xxxxx =

    01110 • 0x94 = 10010100 • 10yyyyyy → yyyyyy = 010100 • xxxxxyyyyyy = 01110010100 • U+0394 = Δ
  3. ৑௕ͳΤϯίʔυ • 0xC0 = 11000000 • 110xxxxx → xxxxx =

    00000 • 0xAF = 10101111 • 10yyyyyy → yyyyyy = 101111 • 0xxx xxyy yyyy = 0000 0010 1111 • U+2F = /