Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
The Language of Regular Expressions. So You Thi...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Riga Dev Day
March 13, 2016
76
0
Share
The Language of Regular Expressions. So You Think You Can Speak It? by Rustam Mehmandarov
Riga Dev Day
March 13, 2016
More Decks by Riga Dev Day
See All by Riga Dev Day
Faster Java by Adding Structs (Sort Of)
rigadevday
0
160
Prepare for JDK 9! by Dalibor Topic
rigadevday
0
140
Little Important Things in Distributed Architectures by Andres Jaan Tack
rigadevday
0
140
Oracle 12c for Developers by Alex Nuijten
rigadevday
1
130
Modern Java Component Design with Spring 4.3 by Juergen Hoeller
rigadevday
0
260
Open Source and OpenJDK: Do They Deliver? by Simon Ritter
rigadevday
0
68
Google Apps Integration in Oracle Apex by Richard Martens
rigadevday
0
290
Integration Testing from the Trenches by Nicolas Fränkel
rigadevday
0
240
Distributed Systems at ok.ru by Oleg Anastasyev
rigadevday
0
80
Featured
See All Featured
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
234
17k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
We Analyzed 250 Million AI Search Results: Here's What I Found
joshbly
1
1.1k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.9k
Ecommerce SEO: The Keys for Success Now & Beyond - #SERPConf2024
aleyda
1
1.9k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
508
140k
A brief & incomplete history of UX Design for the World Wide Web: 1989–2019
jct
1
340
[RailsConf 2023] Rails as a piece of cake
palkan
59
6.5k
The AI Revolution Will Not Be Monopolized: How open-source beats economies of scale, even for LLMs
inesmontani
PRO
3
3.3k
Tell your own story through comics
letsgokoyo
1
880
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
287
14k
Leo the Paperboy
mayatellez
6
1.6k
Transcript
The Language of Regular Expressions. So You Think You Can
Speak It? Rustam Mehmandarov Riga Dev Day
The Vision http://xkcd.com/208/
…but regex ain’t one! Oh, wait! http://xkcd.com/1171/
The Reality
None
(([0-9]{4})-([0-9]{2})-([0-9]{2})).*\sINFO\s(.*)
Log4j Log File
.*(INFO|WARN).*
.*(INFO|WARN).*
(([0-9]{4})-([0-9]{2})-([0-9]{2})).*ERROR(.*)
(([0-9]{4})-([0-9]{2})-([0-9]{2})).*ERROR(.*)
^.*192\.168\.0\.6[^9](.*)$
^.*192.168.0.6[^9](.*)
^.*192\.168\.0\.6[^9](.*)$
^.*192\.168\.0\.6[0-9]+[^9](.*)$
flavou?r
Recap: Quantifiers * + ? {num} {num, num}
Recap: Grouping .*(INFO|WARN)(.*) .*(INFO|WARN)(?:.*)
(WA) 2014-09-09 WAR FILE WARN [com.example.logging.MyLog]
(?=WARN)WA 2014-09-09 WAR FILE WARN [com.example.logging.MyLog]
(?=WARN)WA
Recap: Lookaround (?=foo) -> Lookahead (?<=foo) -> Lookbehind (?!foo) ->
Negative Lookahead (?<!foo) -> Negative Lookbehind
Lookahead: Example
The list
Backreferences
Backreferences (contd.)
Text: OSDC is awesome! HTML: OSDC is <em>awesome</em>! Regex: <.+>
Result:
Text: OSDC is awesome! HTML: OSDC is <em>awesome</em>! Regex: <.+?>
Result:
None
http://stackoverflow.com/questions/1732348/ regex-match-open-tags-except-xhtml-self- contained-tags
Quantifiers Revisited Greedy: *, +, ?, {num, num} Non-greedy: *?,
+?, ??, {num, num}?
Final Recap • Know your data! – Think what you
should match – Think what you should not match • Know your flavor • Know your engine (DFA, NFA) – Backtracking • Greediness • Non-capturing parenthesis • Anchors
MOAR! EXAMPLES!
Matching an IP - 1 Idea 1: ^[0-9]+\.[0-9]+\.[0-9]+\.[0-9]+$ Result:
Matching an IP - 2 Idea 2: ^\d\d\d\.\d\d\d\.\d\d\d\.\d\d\d$ Result:
Matching an IP - 3 Idea 3: ^\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}$ Result:
Matching an IP - 4 Idea 4: ^ ([01]?\d\d?|2[0-4]\d|25[0-5])\. ([01]?\d\d?|2[0-4]\d|25[0-5])\.
([01]?\d\d?|2[0-4]\d|25[0-5])\. ([01]?\d\d?|2[0-4]\d|25[0-5]) $ Result:
Your New Reality! http://xkcd.com/208/
Thank you! rmehmandarov
[email protected]
None