Symmetry Hidden in Regular Expressions: Deciding Equivalence via Equational Axioms

Ryoma Sin'ya
May 28, 2024

  1. Prologue

    Q: Are these two expressions equivalent?

    $(a + b)^*$ and $(a + bb^*a)^* + a^*b(b + aa^*b)^*$
  3. A: Yes, you can check it as follows:

    $(a + b)^*$
    $\equiv (a^*b)^*a^*$ (S)
    $\equiv ((1 + aa^*)b)^*a^*$ (M)
    $\equiv (b + aa^*b)^*a^*$ (T)
    $\equiv b^*(aa^*bb^*)^*a^*$ (S)
    $\equiv (bb^* + 1)(aa^*bb^*)^*a^*$ (M)
    $\equiv (bb^*(aa^*bb^*)^* + (aa^*bb^*)^*)a^*$ (T)
    $\equiv (1 + bb^*(aa^*bb^*)^* + aa^*bb^*(aa^*bb^*)^*)a^*$ (M)
    $\equiv (1 + (1 + aa^*)bb^*(aa^*bb^*)^*)a^*$ (T)
    $\equiv (1 + a^*bb^*(aa^*bb^*)^*)a^*$ (M)
    $\equiv a^* + a^*bb^*(aa^*bb^*)^*a^*$ (T)
    $\equiv a^* + a^*bb^*(aa^*bb^*)^*(1 + aa^*)$ (M)
    $\equiv a^* + a^*bb^*(aa^*bb^*)^*aa^* + a^*bb^*(aa^*bb^*)^*$ (T)
    $\equiv a^*(1 + bb^*(aa^*bb^*)^*aa^*) + a^*bb^*(aa^*bb^*)^*$ (T)
    $\equiv a^*(bb^*aa^*)^* + a^*bb^*(aa^*bb^*)^*$ (M)
    $\equiv (a + bb^*a)^* + a^*b(b + aa^*b)^*$. (S)
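    The prologue question can also be probed mechanically. The following is a minimal sketch, not a proof: it only compares the two expressions on all words up to a fixed length, using Python's `re` module. The helper names `words` and `first_difference` and the length bound are assumptions made for this illustration.

```python
import re
from itertools import product

# The two expressions from the prologue in Python regex syntax
# (the union "+" of the slides is written "|").
LHS = re.compile(r"(?:a|b)*")
RHS = re.compile(r"(?:a|bb*a)*|a*b(?:b|aa*b)*")


def words(alphabet, max_len):
    """Yield every word over `alphabet` of length 0..max_len."""
    for n in range(max_len + 1):
        for letters in product(alphabet, repeat=n):
            yield "".join(letters)


def first_difference(max_len=10):
    """Return a word matched by exactly one expression, or None if none exists."""
    for w in words("ab", max_len):
        if bool(LHS.fullmatch(w)) != bool(RHS.fullmatch(w)):
            return w
    return None


print(first_difference())
```

    A bounded search like this can only refute an identity; the derivation by (S), (M) and (T) is what establishes it for all words.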
  9. Résumé

    I. Automata and Regular Expressions
    II. History of Axiomatisation
    III. Group Identities
    IV. Cascade Decomposition
    V. Conway’s Last Conjecture
  19. Equivalence of expressions

    Two expressions $E$ and $F$ are said to be equivalent, denoted by $E \approx F$, if their interpretations coincide: $E \approx F \iff L(E) = L(F)$.
  24. Behaviour

    For an automaton $A$, the set of all words that label a path from an initial state to a final state is called the behaviour of $A$, denoted by $L(A)$.
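  28. [Figure: a two-state automaton with states $q_0, q_1$ and a four-state automaton with states $r_0, \dots, r_3$, both over the letters $a, b$]

    $\left( \begin{pmatrix} 1 & 0 \end{pmatrix}, \begin{pmatrix} b & a \\ a & b \end{pmatrix}, \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) \approx (b + ab^*a)^*$

    $\left( \begin{pmatrix} 1 & 0 & 0 & 0 \end{pmatrix}, \begin{pmatrix} a & b & 0 & 0 \\ 0 & b & a & 0 \\ a & 0 & 0 & b \\ 0 & 0 & 0 & a + b \end{pmatrix}, \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix} \right) \approx (a + b)^*bab(a + b)^*$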
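    To make the matrix notation concrete, here is a small sketch that reads the transition matrices as lookup tables (entry (i, j) lists the letters leading from state i to state j) and compares each automaton's behaviour with the stated expression on all short words. The dictionary layout and the names `EVEN_A`, `HAS_BAB`, `accepts` and `agrees` are assumptions of this illustration, and the comparison is bounded, so it is a sanity check rather than a proof.

```python
import re
from itertools import product

# Two-state automaton: initial q0, final q0.
EVEN_A = {
    "initial": 0,
    "final": {0},
    "delta": {(0, "b"): 0, (0, "a"): 1, (1, "a"): 0, (1, "b"): 1},
}

# Four-state automaton: initial r0, final r3.
HAS_BAB = {
    "initial": 0,
    "final": {3},
    "delta": {
        (0, "a"): 0, (0, "b"): 1,
        (1, "b"): 1, (1, "a"): 2,
        (2, "a"): 0, (2, "b"): 3,
        (3, "a"): 3, (3, "b"): 3,
    },
}


def accepts(automaton, word):
    """Follow the unique path labelled by `word` and test whether it ends in a final state."""
    state = automaton["initial"]
    for letter in word:
        state = automaton["delta"][(state, letter)]
    return state in automaton["final"]


def agrees(automaton, pattern, max_len=10):
    """Compare the automaton with a regex on every word of length <= max_len."""
    regex = re.compile(pattern)
    return all(
        accepts(automaton, w) == bool(regex.fullmatch(w))
        for n in range(max_len + 1)
        for w in map("".join, product("ab", repeat=n))
    )


print(agrees(EVEN_A, r"(?:b|ab*a)*"))
print(agrees(HAS_BAB, r"(?:a|b)*bab(?:a|b)*"))
```

    Both automata happen to be deterministic and complete, which is why a single current state suffices; a nondeterministic automaton would need a set of current states instead.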
  60. Almost equivalence problem

    Two languages $K$ and $L$ over $A$ are said to be almost equivalent if their symmetric difference is “very small” (measure zero):

    $\lim_{n \to \infty} \frac{|(L \triangle K) \cap A^n|}{|A^n|} = 0.$

    Two expressions are said to be almost equivalent if their interpretations are almost equivalent. This year, Nakamura proved that the almost equivalence problem is also hard (PSPACE-complete).
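    The ratio inside the limit can be computed directly for small n. The sketch below does so by exhaustive enumeration for a hypothetical pair of languages chosen for this illustration: K is the set of all words over {a, b} and L is the set of words containing the factor bab. The function names are assumptions, and a finite table of values only suggests the limit; it does not establish it.

```python
from itertools import product


def density_of_difference(in_k, in_l, alphabet, n):
    """Return |(L symmetric-difference K) intersected with A^n| / |A^n| by enumeration."""
    differing = sum(
        in_k(w) != in_l(w)
        for w in map("".join, product(alphabet, repeat=n))
    )
    return differing / len(alphabet) ** n


def in_all_words(word):
    """Membership test for K = A* (hypothetical example language)."""
    return True


def contains_bab(word):
    """Membership test for L = A* bab A* (hypothetical example language)."""
    return "bab" in word


for n in (2, 4, 8, 12):
    print(n, density_of_difference(in_all_words, contains_bab, "ab", n))
```

    For this pair the symmetric difference is the set of words avoiding bab, whose proportion among words of length n shrinks as n grows.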
  61. The Almost Equivalence by Asymptotic Probabilities for Regular Languages and Its Computational Complexities

    Yoshiki Nakamura, Tokyo Institute of Technology, Tokyo, Japan

    Abstract: We introduce p-equivalence by asymptotic probabilities, which is a weak almost-equivalence based on zero-one laws in finite model theory. In this paper, we consider the computational complexities of p-equivalence problems for regular languages and provide the following details; First, we give some fundamental results and a logical characterization for p-equivalence, which can generate some algorithms by descriptive complexity. Second, we give the computational complexities of the p-equivalence problems by the logical characterization. Finally, we apply the proofs in this paper for p-equivalence to some generalized equivalences.

    1 Introduction: The study of the equivalence problem of regular languages dates back to the beginning of formal language theory. This problem is a fundamental problem and regular languages have many applications (see e.g., [2]). Regular expressions (REG), nondeterministic finite state automaton (NFA), and deterministic finite state automaton (DFA) are normally used to represent regular languages. Both the equivalence problem for NFAs and REGs are known as PSPACE-complete [17] and the equivalence problem for DFAs is known as NL-complete [14].

    The paper will appear in GandALF 2016.
  64. The “Bible”

    This is the first book written by the British mathematician John Conway. [Conway71]
  72. Star-free expressions are trivial

    If we consider regular expressions without star (thus denoting finite languages), the axiomatisation consisting of the trivial identities (T) and the idempotent identity $E + E \equiv E$ is enough to be complete.
  77. Discussion

    Salomaa’s axiomatisation is very simple, but it merely adopts the semantic meaning of star (Arden’s lemma). After the work of Salomaa, Kozen introduced a slightly modified axiomatisation [Kozen94]. But the syntactic meaning of star is still unclear. A purely equational axiomatisation is needed…
  79. Equational axiomatisation

    An axiomatisation is equational if it consists of a set of identities (axioms) and only the substitution principle. An equational axiomatisation is finite if it contains only finitely many identities.

    Theorem [Redko64]: A complete equational axiomatisation is necessarily infinite, even for $\mathrm{Reg}_A$ over a one-letter alphabet $A = \{a\}$. [Conway71]
  80. “The most important problem in this area is to construct a "good" system of rational identities that would permit us to obtain by a logical deductive process (i.e. by a rewriting process) every possible rational identity; such a system will be called complete.” [Krob91]
  86. Exercise 1

    Prove that is a generalisation of i.e.,

  93. Elegant property of idempotency

    $E + E \equiv E$

    $1 + 1 \equiv 1$, $1^* \equiv 1$, $E^*E^* \equiv E^*$, $E^{**} \equiv E^*$, $E + E^* \equiv E^*$, $E^* \equiv (1 + E)^*$

    All these identities are consequences of (M), (S) and (I) (idempotency).
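    These identities can be sanity-checked semantically on a sample language. The sketch below models a language as the finite set of its words of length at most `N`, with union, concatenation and star truncated at that bound. The sample `E`, the bound `N` and all function names are assumptions of this illustration; the check confirms that both sides denote the same language up to the bound, and says nothing about derivability from (M), (S) and (I).

```python
N = 6  # only words of length <= N are tracked


def union(x, y):
    """Language union."""
    return x | y


def concat(x, y):
    """Concatenation, keeping only words of length <= N."""
    return frozenset(u + v for u in x for v in y if len(u + v) <= N)


def star(x):
    """Kleene star truncated at length N, computed as a least fixed point."""
    result = frozenset({""})
    while True:
        bigger = result | concat(x, result)
        if bigger == result:
            return result
        result = bigger


ONE = frozenset({""})        # the expression 1 (empty word only)
E = frozenset({"ab", "b"})   # hypothetical sample language

identities = {
    "E + E = E": union(E, E) == E,
    "1 + 1 = 1": union(ONE, ONE) == ONE,
    "1* = 1": star(ONE) == ONE,
    "E*E* = E*": concat(star(E), star(E)) == star(E),
    "E** = E*": star(star(E)) == star(E),
    "E + E* = E*": union(E, star(E)) == star(E),
    "E* = (1 + E)*": star(E) == star(union(ONE, E)),
}

for name, holds in identities.items():
    print(name, holds)
```

    Truncation is harmless here because every word of length at most `N` in a product or star is built from factors that are themselves no longer than `N`.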
  94. Exercise 2

    Prove that the identity (I) $\land$ (M)
  100. Exercise 3

    Prove that the identity (I) $\land$ (S)
  105. Prove that the identity Answer: (I) $\land$ (M) $\land$

  114. Exercise 5

    Prove that the identity Answer: (I) $\land$

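  116. Implicational hierarchy

    [Diagram: implications ($\Rightarrow$, $\Leftrightarrow$) among the identities $E^{**} \equiv E^*$, $E + E \equiv E$, $1^* \equiv 1$, $E^*E^* \equiv E^*$, $1 + 1 \equiv 1$ and $1^*1^* \equiv 1^*$; cf. Fig. 12.2 of [Conway71]]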
  120. Cyclic identities

    (P(2)) $E^* \equiv (1 + E)(EE)^*$

    Example:
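    As a hedged illustration of what (P(2)) says semantically (a worked instance chosen here, with $E = a$ over a one-letter alphabet, and not necessarily the example shown on the slide), the identity splits the powers of $a$ by parity:

```latex
\begin{align*}
L(a^*) &= \{\, a^n \mid n \ge 0 \,\} \\
       &= \{\, a^{2k} \mid k \ge 0 \,\} \cup \{\, a^{2k+1} \mid k \ge 0 \,\} \\
       &= L\bigl((aa)^*\bigr) \cup L\bigl(a(aa)^*\bigr) \\
       &= L\bigl((1 + a)(aa)^*\bigr).
\end{align*}
```

    The factor $(1 + a)$ picks the remainder modulo 2 and $(aa)^*$ counts the completed pairs.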

  125. (P(n)) $E^* \equiv (1 + E + \cdots$

  128. Observation

    $\approx$ $E^* \equiv (1 + E)(EE)^*$ (P(2)) The

  130. Observation

    For a one-letter alphabet the concatenation operation

  132. [Figure: the automata $A_{\mathbb{Z}/3\mathbb{Z}}$ and $A_{\mathbb{Z}/2\mathbb{Z}}$, with transitions labelled by the letters $a_0, a_1, a_2$]
  142. Example from [Krob90]

    COROLLARY V.6: Let $A$ be an alphabet. Then, the following system: (M), (S), $(P(\mathfrak{S}_n))_{n \ge 2}$ is a complete system of B-rational identities for $A$.

    Example: In order to understand the complexity which is hidden under the identities $P(\mathfrak{S}_n)$, let us precise the group identity associated with the symmetric group of order 3:

    $\mathfrak{S}_3 = \{\, i = \mathrm{Id},\ \rho_1 = (123),\ \rho_2 = (132),\ \sigma_1 = (23),\ \sigma_2 = (13),\ \sigma_3 = (12) \,\}$

    Then let us consider the following rational expressions constructed over the alphabet which is naturally associated with $\mathfrak{S}_3$:

    $T_{1,1} = a_i + a_{\sigma_1}$, $T_{1,2} = a_{\rho_1} + a_{\sigma_3}$, $T_{1,3} = a_{\rho_2} + a_{\sigma_2}$
    $T_{2,1} = a_{\rho_2} + a_{\sigma_3}$, $T_{2,2} = a_i + a_{\sigma_2}$, $T_{2,3} = a_{\rho_1} + a_{\sigma_1}$
    $T_{3,1} = a_{\rho_1} + a_{\sigma_2}$, $T_{3,2} = a_{\rho_2} + a_{\sigma_1}$, $T_{3,3} = a_i + a_{\sigma_3}$

    Observe that the interpretation of the expression $T_{i,j}$ is simply the set of the permutations of $\mathfrak{S}_3$ that send $i$ on $j$. With these denotations, we can now give an identity which is equivalent modulo (M), (S) and P(2) to $P(\mathfrak{S}_3)$:

    $(a_i + a_{\rho_1} + a_{\rho_2} + a_{\sigma_1} + a_{\sigma_2} + a_{\sigma_3})^* \approx \mathrm{Tr}(1,1) + \mathrm{Tr}(1,2) + \mathrm{Tr}(1,3)$

    where the three expressions $\mathrm{Tr}(1,1)$, $\mathrm{Tr}(1,2)$ and $\mathrm{Tr}(1,3)$ stand for:

    $\mathrm{Tr}(1,1) = (T_{1,1} + T_{1,3}T_{3,3}^*T_{3,1} +$ […]
    $\mathrm{Tr}(1,2) = \mathrm{Tr}(1,1)(T_{1,2} + T_{1,3}T_{3,3}^*T_{3,2})(T_{2,2} + T_{2,3}T_{3,3}^*T_{3,2})^*$
    $\mathrm{Tr}(1,3) = \mathrm{Tr}(1,1)T_{1,3}T_{3,3}^* + \mathrm{Tr}(1,2)T_{2,3}T_{3,3}^*$

    Note that the interpretations of these expressions are quite simple. Indeed, $\mathrm{Tr}(1,i)$ is exactly the set of the words which corresponds to the permutations mapping 1 on $i$.
  145. Krob’s answer

    In 1991, the French mathematician Daniel Krob proved

  147. Cascade product

    [Figure: automata A, B and C composed in cascade]

    The cascade product is a hierarchical composition of automata.
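    A minimal sketch of the idea, assuming the usual two-level definition (the first automaton reads the input letter; the second reads the letter together with the current state of the first). The slides' own formal definition is not reproduced here, and the example of two mod-2 counters forming a mod-4 counter, along with every function name, is hypothetical.

```python
def cascade(delta_a, delta_b):
    """Build the transition function of the cascade product on pairs (p, q).

    delta_a(p, letter)      -> next state of the first automaton
    delta_b(q, (p, letter)) -> next state of the second automaton, which
                               also sees the first automaton's current state
    """
    def delta(state, letter):
        p, q = state
        return delta_a(p, letter), delta_b(q, (p, letter))
    return delta


def low_bit(p, letter):
    """First level: a mod-2 counter that flips on every letter."""
    return (p + 1) % 2


def high_bit(q, signal):
    """Second level: flips only when the first level is about to wrap around."""
    p, _letter = signal
    return (q + 1) % 2 if p == 1 else q


def run(delta, state, word):
    """Apply the transition function along `word`, starting from `state`."""
    for letter in word:
        state = delta(state, letter)
    return state


counter = cascade(low_bit, high_bit)
print(run(counter, (0, 0), "aaa"))
```

    The dependency is one-directional: the second level observes the first, never the reverse, which is what makes the composition hierarchical.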
  149. The reset automaton

    [Figure: two-state automaton with states 0, 1 and transitions labelled a, b]

  152. Identity of $R$

    [Figure: the reset automaton $R$ with states 0, 1 and transitions labelled a, b]

    $R \vdash (a + b)^* \equiv (a + bb^*a)^* + a^*b(b + aa^*b)^*.$
  155. Theorem [KR65]

    For any automaton, there exists a

  157. [Figure: the automata $A_{\mathbb{Z}/3\mathbb{Z}}$ and $A_{\mathbb{Z}/2\mathbb{Z}}$]

  159. Fact

    The symmetric group of order n is generated

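  161. Symmetric identities

    (R(n)) $(E + F)^* \equiv \bigl( (E +$

    [Figure: automaton $A_0$ over the letters a, b, associated with the symmetric group $\mathfrak{S}_5$]

    (R(5)) $(a + b)^* \equiv \bigl( (a + b)(b + (ab^*)^3a) \bigr)^* \bigl( 1 + (a + b)(1 + ab^* + (ab^*)^2 + (ab^*)^3) \bigr)$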
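    The instance (R(5)) is long enough that a mechanical sanity check is reassuring. The sketch below transcribes both sides into Python regex syntax and compares them on all words over {a, b} up to a fixed length. The transcription, the helper name `first_difference` and the length bound are assumptions of this illustration, and a bounded comparison is not a proof of the identity.

```python
import re
from itertools import product

# Left-hand side of (R(5)): (a + b)*
LHS = re.compile(r"(?:a|b)*")

# Right-hand side of (R(5)), factor by factor.
RHS = re.compile(
    r"(?:(?:a|b)(?:b|(?:ab*){3}a))*"                # ((a+b)(b + (ab*)^3 a))*
    r"(?:(?:a|b)(?:|ab*|(?:ab*){2}|(?:ab*){3}))?"   # 1 + (a+b)(1 + ab* + (ab*)^2 + (ab*)^3)
)


def first_difference(max_len=10):
    """Return a word matched by exactly one side, or None if there is none."""
    for n in range(max_len + 1):
        for w in map("".join, product("ab", repeat=n)):
            if bool(LHS.fullmatch(w)) != bool(RHS.fullmatch(w)):
                return w
    return None


print(first_difference())
```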
  164. where is the symmetric group of order , is complete.
