Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Emojiの正規表現
Search
ima1zumi
June 24, 2022
0
10
Emojiの正規表現
ima1zumi
June 24, 2022
Tweet
Share
More Decks by ima1zumi
See All by ima1zumi
Exploring Reline: Enhancing Command Line Usability
ima1zumi
0
52
10年物のRailsアプリにキャッチアップ!〜コードを読まずに理解したかった〜
ima1zumi
0
64
RubyKaigiの登壇者一覧ページを作った
ima1zumi
0
270
Relineのその後の生活
ima1zumi
0
190
IRB and Reline Kaigi 2024
ima1zumi
0
7
Exploring Reline: Enhancing Command Line Usability
ima1zumi
3
14k
Reline 1分 Cooking
ima1zumi
0
25
続・mruby/cにUTF-8 を実装する
ima1zumi
1
23
UTF-8 is coming to mruby/c
ima1zumi
4
5.3k
Featured
See All Featured
Documentation Writing (for coders)
carmenintech
67
4.6k
Into the Great Unknown - MozCon
thekraken
35
1.6k
jQuery: Nuts, Bolts and Bling
dougneiner
63
7.6k
A designer walks into a library…
pauljervisheath
205
24k
VelocityConf: Rendering Performance Case Studies
addyosmani
328
24k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
3.7k
Building Flexible Design Systems
yeseniaperezcruz
328
38k
Visualization
eitanlees
146
15k
For a Future-Friendly Web
brad_frost
176
9.5k
Adopting Sorbet at Scale
ufuk
74
9.2k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
49
2.3k
Build The Right Thing And Hit Your Dates
maggiecrowley
34
2.5k
Transcript
Emojiͷਖ਼نදݱ🤯 2022-06-24 ima1zumi
ࣗݾհ • ima1zumi • จࣈͱจࣈίʔυ͕͖
ڥ • Unicode 14.0 (2021-09-14) • UTF-8 • Ruby 3.1.0
• ೖྗϑΥʔϜͰֆจࣈΛೖྗෆՄʹ͍ͨ͠ • e.g. ❤ 🐝 👪 0⃣ •
ͦͷଞͷจࣈೖྗՄͱ͢Δ • e.g. 𩸽 (΄͚ͬ) • UTF-8
4όΠτҎ্ͷจࣈྻΛ͘
Unicode Character Properties
Unicode จࣈϓϩύςΟ • จࣈͷ࣋ͭଐੑੑ࣭ΛUnicodeίϯιʔγΞϜ͕ఆΊͨͷ • Unicode Technical Report #23 THE
UNICODE CHARACTER PROPERTY MODEL • Unicode® Standard Annex #44 UNICODE CHARACTER DATABASE • https://www.unicode.org/reports/
Hiragana
Unicode Utilities https://util.unicode.org/UnicodeJsps/character.jsp
Unicodeͷ֤จࣈͷϓϩύςΟΛਖ਼Ҿ͖ɾٯҾ͖Ͱ͖Δ
Hiraganaʹ⾕ͳͲؚ·ΕΔ🧐
Emojiؔ࿈ͷϓϩύςΟ • Emoji • Emoji_Presentation • Emoji_Modi fi er •
Emoji_Modi fi er_Base • Emoji_Component • Extended_Pictographic • Basic_Emoji • RGI_Emoji • RGI_Emoji_Flag_Sequence • RGI_Emoji_Keycap_Sequence • RGI_Emoji_Modi fi er_Sequence • RGI_Emoji_Tag_Sequence • RGI_Emoji_Zwj_Sequence 🆕
p{Emoji}
👀
p{Emoji}ʹؚ·ΕΔͷɺؚ·Εͳ͍ͷ • 0⃣ `U+30, U+FE0F, U+20E3` ͷ3ͭͷจࣈ͔ΒΓཱͭ • keycap
sequence • U+30 == 0 • 0EmojiͷҰ෦ʹͳͬͯ͠·͍ͬͯΔ • #,*,0-9͖ͨ͘ͳ͍ • U+20E3ؚ·Εͳ͍
Basic_Emoji • ྑͦ͞͏ • Modi fi erܥؚ·Εͳ͍ͷͰଞͷϓϩύςΟͱΈ߹ΘͤΔ • Ruby(Onigmo)ະରԠͷϓϩύςΟͷͨΊಈ࡞֬ೝͰ͖ͣ😢
Unicodeਪͷਖ਼نදݱ http://www.unicode.org/reports/tr51/ 🤷
·ͱΊ • EmojiΛਖ਼نදݱͰఆ͢Δͷਂ͍প͕͍ͬͯΔ • ϥΠϒϥϦΛ͓͏ • Basic_Emoji ͕͑ΔΑ͏ʹͳΕͬͱεϚʔτʹॻ͚Δ͔ • 🌟
Unicode Utilities 🌟 • https://util.unicode.org/UnicodeJsps/character.jsp