Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Emojiの正規表現
Search
ima1zumi
June 24, 2022
0
7
Emojiの正規表現
ima1zumi
June 24, 2022
Tweet
Share
More Decks by ima1zumi
See All by ima1zumi
Exploring Reline: Enhancing Command Line Usability
ima1zumi
2
12k
Reline 1分 Cooking
ima1zumi
0
15
続・mruby/cにUTF-8 を実装する
ima1zumi
1
14
UTF-8 is coming to mruby/c
ima1zumi
4
4.8k
たのしいString
ima1zumi
1
480
Watchから始めるOSS生活
ima1zumi
0
54
String meets Encoding
ima1zumi
2
2.5k
RubyKaigiで話した話
ima1zumi
1
370
漢字が文字コードになる前の話
ima1zumi
0
7
Featured
See All Featured
Typedesign – Prime Four
hannesfritz
37
2.2k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
34
1.9k
We Have a Design System, Now What?
morganepeng
46
7k
GitHub's CSS Performance
jonrohan
1026
450k
Embracing the Ebb and Flow
colly
81
4.3k
What the flash - Photography Introduction
edds
65
11k
Documentation Writing (for coders)
carmenintech
63
4.2k
The Illustrated Children's Guide to Kubernetes
chrisshort
39
47k
Designing with Data
zakiwarfel
96
5k
Stop Working from a Prison Cell
hatefulcrawdad
266
20k
Reflections from 52 weeks, 52 projects
jeffersonlam
346
19k
Principles of Awesome APIs and How to Build Them.
keavy
124
16k
Transcript
Emojiͷਖ਼نදݱ🤯 2022-06-24 ima1zumi
ࣗݾհ • ima1zumi • จࣈͱจࣈίʔυ͕͖
ڥ • Unicode 14.0 (2021-09-14) • UTF-8 • Ruby 3.1.0
• ೖྗϑΥʔϜͰֆจࣈΛೖྗෆՄʹ͍ͨ͠ • e.g. ❤ 🐝 👪 0⃣ •
ͦͷଞͷจࣈೖྗՄͱ͢Δ • e.g. 𩸽 (΄͚ͬ) • UTF-8
4όΠτҎ্ͷจࣈྻΛ͘
Unicode Character Properties
Unicode จࣈϓϩύςΟ • จࣈͷ࣋ͭଐੑੑ࣭ΛUnicodeίϯιʔγΞϜ͕ఆΊͨͷ • Unicode Technical Report #23 THE
UNICODE CHARACTER PROPERTY MODEL • Unicode® Standard Annex #44 UNICODE CHARACTER DATABASE • https://www.unicode.org/reports/
Hiragana
Unicode Utilities https://util.unicode.org/UnicodeJsps/character.jsp
Unicodeͷ֤จࣈͷϓϩύςΟΛਖ਼Ҿ͖ɾٯҾ͖Ͱ͖Δ
Hiraganaʹ⾕ͳͲؚ·ΕΔ🧐
Emojiؔ࿈ͷϓϩύςΟ • Emoji • Emoji_Presentation • Emoji_Modi fi er •
Emoji_Modi fi er_Base • Emoji_Component • Extended_Pictographic • Basic_Emoji • RGI_Emoji • RGI_Emoji_Flag_Sequence • RGI_Emoji_Keycap_Sequence • RGI_Emoji_Modi fi er_Sequence • RGI_Emoji_Tag_Sequence • RGI_Emoji_Zwj_Sequence 🆕
p{Emoji}
👀
p{Emoji}ʹؚ·ΕΔͷɺؚ·Εͳ͍ͷ • 0⃣ `U+30, U+FE0F, U+20E3` ͷ3ͭͷจࣈ͔ΒΓཱͭ • keycap
sequence • U+30 == 0 • 0EmojiͷҰ෦ʹͳͬͯ͠·͍ͬͯΔ • #,*,0-9͖ͨ͘ͳ͍ • U+20E3ؚ·Εͳ͍
Basic_Emoji • ྑͦ͞͏ • Modi fi erܥؚ·Εͳ͍ͷͰଞͷϓϩύςΟͱΈ߹ΘͤΔ • Ruby(Onigmo)ະରԠͷϓϩύςΟͷͨΊಈ࡞֬ೝͰ͖ͣ😢
Unicodeਪͷਖ਼نදݱ http://www.unicode.org/reports/tr51/ 🤷
·ͱΊ • EmojiΛਖ਼نදݱͰఆ͢Δͷਂ͍প͕͍ͬͯΔ • ϥΠϒϥϦΛ͓͏ • Basic_Emoji ͕͑ΔΑ͏ʹͳΕͬͱεϚʔτʹॻ͚Δ͔ • 🌟
Unicode Utilities 🌟 • https://util.unicode.org/UnicodeJsps/character.jsp