Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Emojiの正規表現
Search
ima1zumi
June 24, 2022
0
3
Emojiの正規表現
ima1zumi
June 24, 2022
Tweet
Share
More Decks by ima1zumi
See All by ima1zumi
Reline 1分 Cooking
ima1zumi
0
4
続・mruby/cにUTF-8 を実装する
ima1zumi
0
3
UTF-8 is coming to mruby/c
ima1zumi
3
4.4k
たのしいString
ima1zumi
1
440
Watchから始めるOSS生活
ima1zumi
0
45
String meets Encoding
ima1zumi
2
2.4k
RubyKaigiで話した話
ima1zumi
1
340
漢字が文字コードになる前の話
ima1zumi
0
3
Dive into Encoding
ima1zumi
4
2.1k
Featured
See All Featured
Optimizing for Happiness
mojombo
370
69k
Automating Front-end Workflow
addyosmani
1356
200k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
6
1.5k
Stop Working from a Prison Cell
hatefulcrawdad
266
19k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
19
1.7k
Web development in the modern age
philhawksworth
202
10k
Designing on Purpose - Digital PM Summit 2013
jponch
110
6.5k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
2
1.3k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
14
1.5k
How to name files
jennybc
65
93k
No one is an island. Learnings from fostering a developers community.
thoeni
16
2.1k
Robots, Beer and Maslow
schacon
PRO
155
7.9k
Transcript
Emojiͷਖ਼نදݱ🤯 2022-06-24 ima1zumi
ࣗݾհ • ima1zumi • จࣈͱจࣈίʔυ͕͖
ڥ • Unicode 14.0 (2021-09-14) • UTF-8 • Ruby 3.1.0
• ೖྗϑΥʔϜͰֆจࣈΛೖྗෆՄʹ͍ͨ͠ • e.g. ❤ 🐝 👪 0⃣ •
ͦͷଞͷจࣈೖྗՄͱ͢Δ • e.g. 𩸽 (΄͚ͬ) • UTF-8
4όΠτҎ্ͷจࣈྻΛ͘
Unicode Character Properties
Unicode จࣈϓϩύςΟ • จࣈͷ࣋ͭଐੑੑ࣭ΛUnicodeίϯιʔγΞϜ͕ఆΊͨͷ • Unicode Technical Report #23 THE
UNICODE CHARACTER PROPERTY MODEL • Unicode® Standard Annex #44 UNICODE CHARACTER DATABASE • https://www.unicode.org/reports/
Hiragana
Unicode Utilities https://util.unicode.org/UnicodeJsps/character.jsp
Unicodeͷ֤จࣈͷϓϩύςΟΛਖ਼Ҿ͖ɾٯҾ͖Ͱ͖Δ
Hiraganaʹ⾕ͳͲؚ·ΕΔ🧐
Emojiؔ࿈ͷϓϩύςΟ • Emoji • Emoji_Presentation • Emoji_Modi fi er •
Emoji_Modi fi er_Base • Emoji_Component • Extended_Pictographic • Basic_Emoji • RGI_Emoji • RGI_Emoji_Flag_Sequence • RGI_Emoji_Keycap_Sequence • RGI_Emoji_Modi fi er_Sequence • RGI_Emoji_Tag_Sequence • RGI_Emoji_Zwj_Sequence 🆕
p{Emoji}
👀
p{Emoji}ʹؚ·ΕΔͷɺؚ·Εͳ͍ͷ • 0⃣ `U+30, U+FE0F, U+20E3` ͷ3ͭͷจࣈ͔ΒΓཱͭ • keycap
sequence • U+30 == 0 • 0EmojiͷҰ෦ʹͳͬͯ͠·͍ͬͯΔ • #,*,0-9͖ͨ͘ͳ͍ • U+20E3ؚ·Εͳ͍
Basic_Emoji • ྑͦ͞͏ • Modi fi erܥؚ·Εͳ͍ͷͰଞͷϓϩύςΟͱΈ߹ΘͤΔ • Ruby(Onigmo)ະରԠͷϓϩύςΟͷͨΊಈ࡞֬ೝͰ͖ͣ😢
Unicodeਪͷਖ਼نදݱ http://www.unicode.org/reports/tr51/ 🤷
·ͱΊ • EmojiΛਖ਼نදݱͰఆ͢Δͷਂ͍প͕͍ͬͯΔ • ϥΠϒϥϦΛ͓͏ • Basic_Emoji ͕͑ΔΑ͏ʹͳΕͬͱεϚʔτʹॻ͚Δ͔ • 🌟
Unicode Utilities 🌟 • https://util.unicode.org/UnicodeJsps/character.jsp