Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Emojiの正規表現
Search
ima1zumi
June 24, 2022
47
0
Share
Emojiの正規表現
ima1zumi
June 24, 2022
More Decks by ima1zumi
See All by ima1zumi
Is Ruby's Multi-Encoding Overhead Heavy?
ima1zumi
1
2k
OSSと私たち: Rubyの開発を支える STORES
ima1zumi
2
160
Ruby Taught Me About Under the Hood
ima1zumi
6
22k
Exploring Reline: Enhancing Command Line Usability
ima1zumi
1
170
10年物のRailsアプリにキャッチアップ!〜コードを読まずに理解したかった〜
ima1zumi
0
160
RubyKaigiの登壇者一覧ページを作った
ima1zumi
0
610
Relineのその後の生活
ima1zumi
0
290
IRB and Reline Kaigi 2024
ima1zumi
0
53
Exploring Reline: Enhancing Command Line Usability
ima1zumi
3
16k
Featured
See All Featured
Measuring Dark Social's Impact On Conversion and Attribution
stephenakadiri
2
200
The Curious Case for Waylosing
cassininazir
1
360
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
150
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
55
3.3k
Lightning Talk: Beautiful Slides for Beginners
inesmontani
PRO
1
550
Winning Ecommerce Organic Search in an AI Era - #searchnstuff2025
aleyda
1
2k
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.7k
A brief & incomplete history of UX Design for the World Wide Web: 1989–2019
jct
2
380
Code Review Best Practice
trishagee
74
20k
XXLCSS - How to scale CSS and keep your sanity
sugarenia
250
1.3M
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
31
10k
Prompt Engineering for Job Search
mfonobong
0
320
Transcript
Emojiͷਖ਼نදݱ🤯 2022-06-24 ima1zumi
ࣗݾհ • ima1zumi • จࣈͱจࣈίʔυ͕͖
ڥ • Unicode 14.0 (2021-09-14) • UTF-8 • Ruby 3.1.0
• ೖྗϑΥʔϜͰֆจࣈΛೖྗෆՄʹ͍ͨ͠ • e.g. ❤ 🐝 👪 0⃣ •
ͦͷଞͷจࣈೖྗՄͱ͢Δ • e.g. 𩸽 (΄͚ͬ) • UTF-8
4όΠτҎ্ͷจࣈྻΛ͘
Unicode Character Properties
Unicode จࣈϓϩύςΟ • จࣈͷ࣋ͭଐੑੑ࣭ΛUnicodeίϯιʔγΞϜ͕ఆΊͨͷ • Unicode Technical Report #23 THE
UNICODE CHARACTER PROPERTY MODEL • Unicode® Standard Annex #44 UNICODE CHARACTER DATABASE • https://www.unicode.org/reports/
Hiragana
Unicode Utilities https://util.unicode.org/UnicodeJsps/character.jsp
Unicodeͷ֤จࣈͷϓϩύςΟΛਖ਼Ҿ͖ɾٯҾ͖Ͱ͖Δ
Hiraganaʹ⾕ͳͲؚ·ΕΔ🧐
Emojiؔ࿈ͷϓϩύςΟ • Emoji • Emoji_Presentation • Emoji_Modi fi er •
Emoji_Modi fi er_Base • Emoji_Component • Extended_Pictographic • Basic_Emoji • RGI_Emoji • RGI_Emoji_Flag_Sequence • RGI_Emoji_Keycap_Sequence • RGI_Emoji_Modi fi er_Sequence • RGI_Emoji_Tag_Sequence • RGI_Emoji_Zwj_Sequence 🆕
p{Emoji}
👀
p{Emoji}ʹؚ·ΕΔͷɺؚ·Εͳ͍ͷ • 0⃣ `U+30, U+FE0F, U+20E3` ͷ3ͭͷจࣈ͔ΒΓཱͭ • keycap
sequence • U+30 == 0 • 0EmojiͷҰ෦ʹͳͬͯ͠·͍ͬͯΔ • #,*,0-9͖ͨ͘ͳ͍ • U+20E3ؚ·Εͳ͍
Basic_Emoji • ྑͦ͞͏ • Modi fi erܥؚ·Εͳ͍ͷͰଞͷϓϩύςΟͱΈ߹ΘͤΔ • Ruby(Onigmo)ະରԠͷϓϩύςΟͷͨΊಈ࡞֬ೝͰ͖ͣ😢
Unicodeਪͷਖ਼نදݱ http://www.unicode.org/reports/tr51/ 🤷
·ͱΊ • EmojiΛਖ਼نදݱͰఆ͢Δͷਂ͍প͕͍ͬͯΔ • ϥΠϒϥϦΛ͓͏ • Basic_Emoji ͕͑ΔΑ͏ʹͳΕͬͱεϚʔτʹॻ͚Δ͔ • 🌟
Unicode Utilities 🌟 • https://util.unicode.org/UnicodeJsps/character.jsp