Everything you always wanted to know about UTF-8 (but never dared to ask)

Presented on June 24th 2014 at the PHP Tour, Lyon, France.
http://afup.org/pages/phptourlyon2014/
http://www.joind.in/11233
---------------------------------------------------------------
For any application with even the remotest ambition of international use, UTF-8 is the only way to go. And even without that ambition, using UTF-8 may still bring you more benefits than you realize. Unfortunately, most developers run into problems implementing UTF-8 at one point or another and get discouraged. That ends now! In this talk I will cover UTF-8 from the basic linguistics, through client-side aspects, to all the steps you need to take to tackle the most common (and some more obscure) issues when using UTF-8 in a database-driven web application.
---------------------------------------------------------------
Links:

Slide 2:
http://intertwingly.net/blog/2004/04/25/utf-8-musings#c1082919794
http://intertwingly.net/blog/2004/04/25/utf-8-musings#c1082929502

Slide 5/6:
http://www.ethnologue.com/

Slide 7:
http://www.omniglot.com/

Slide 8:
http://en.wikipedia.org/wiki/Writing_system

Slide 11:
http://geek-and-poke.com/

Slide 12:
http://www.unicode.org/charts/

Slide 39:
http://sourceforge.net/projects/phputf8

Slide 40:
http://www.phpwact.org/php/i18n/utf-8

Slide 44:
http://www.php.net/regexp.reference.unicode

Slide 46:
http://www.php.net/mbstring

Slide 47:
http://www.php.net/iconv

Slide 48:
http://www.php.net/intl

Slide 57:
http://www.phpwact.org/php/i18n/utf-8/mysql

Slide 62:
http://www.poedit.net/

---------------------------------------------------------------
Other interesting links:

http://www.eki.ee/letter/
http://www.bisharat.net/A12N/

http://www.wazu.jp/
http://www.alanwood.net/unicode/
http://en.wikipedia.org/wiki/Unicode_typefaces

http://www.styopkin.com/details_free_and_easy_fonts_viewer.html
http://www.heiner-eichmann.de/software/listfont/listfont.htm

http://www.w3.org/International/techniques/authoring-html
http://www.sil.org/iso639-3/codes.asp
http://www.iso.org/iso/country_codes/iso_3166_code_lists/country_names_and_code_elements
http://www.w3.org/International/articles/language-tags/

http://httpd.apache.org/docs/2.4/mod/mod_charset_lite.html

http://www.collation-charts.org/
http://www.unicode.org/charts/uca/
http://dev.mysql.com/doc/refman/5.7/en/charset-collation-effect.html

http://www.w3.org/International/questions/qa-utf8-bom
http://people.w3.org/rishida/utils/bomtester/
http://www.unicode.org/unicode/faq/utf_bom.html#bom1

Juliette Reinders Folmer

June 24, 2014