Natural Language Processing (5) Grammar and parsing (2)

1 / 21 Natural Language Processing (5) Grammar and parsing (2)
Kazuhide Yamamoto, Dept. of Electrical Engineering, Nagaoka University of Technology

2 / 21 Parsing: two approaches and two strategies
Parsing analyzes an input and produces a tree. A parser combines
• one of two approaches, top-down or bottom-up, and
• one of two search strategies, depth-first or breadth-first.

3 / 21 Top-down parsing
Top-down parsing searches for a parse tree by trying to build from the root node S down to the leaves.
(example) S → NP VP → DET N VP → the N VP → the cat VP → the cat V N ... → the cat catches the mouse.
(This shows that the sentence is derived by the grammar.)
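The top-down search can be sketched as a small depth-first recognizer. This is only an illustration: the slide does not list its grammar rules, so the `GRAMMAR` dictionary and the function name `top_down_accepts` below are assumptions (in particular, the rule VP → V NP is chosen here so that "catches the mouse" can be derived).

```python
# Toy grammar (an assumption for illustration; the slide does not list its rules).
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["DET", "N"]],
    "VP": [["V", "NP"]],
    "DET": [["the"]],
    "N": [["cat"], ["mouse"]],
    "V": [["catches"]],
}


def top_down_accepts(words, start="S"):
    """Depth-first, top-down recognition: always expand the leftmost symbol."""

    def expand(symbols, pos):
        # symbols: the part of the sentential form still to be matched
        # pos: index of the next unread input word
        if not symbols:
            return pos == len(words)
        first, rest = symbols[0], symbols[1:]
        if first not in GRAMMAR:
            # Terminal: it must match the next input word.
            return (pos < len(words) and words[pos] == first
                    and expand(rest, pos + 1))
        # Non-terminal: try each rule in turn, backtracking on failure.
        return any(expand(rhs + rest, pos) for rhs in GRAMMAR[first])

    return expand([start], 0)


print(top_down_accepts("the cat catches the mouse".split()))
```

This sketch relies on backtracking and would loop forever on a left-recursive rule such as VP → VP PP; the toy grammar above has none.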

4 / 21 Bottom-up parsing
Bottom-up parsing starts with the words of the input and applies rules from the grammar to build trees upward from the words.
(example) the cat catches the mouse. ... → the cat V N → the cat VP → the N VP → DET N VP → NP VP → S
(Derivation of S shows that the input sentence is grammatical, that is, accepted by the given grammar.)
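A bottom-up recognizer can be sketched by repeatedly replacing a right-hand side found in the current string with its left-hand side, until only S remains. The rule list `RULES` and the name `bottom_up_accepts` are assumptions made for this illustration, not the grammar used on the slide.

```python
# Toy rules as (left-hand side, right-hand side); an assumption for illustration.
RULES = [
    ("S", ("NP", "VP")),
    ("NP", ("DET", "N")),
    ("VP", ("V", "NP")),
    ("DET", ("the",)),
    ("N", ("cat",)),
    ("N", ("mouse",)),
    ("V", ("catches",)),
]


def bottom_up_accepts(words, start="S"):
    """Search over reductions: rewrite a matched right-hand side to its left-hand side."""
    seen = set()  # forms already explored, so that no form is expanded twice

    def reduce_from(form):
        if form == (start,):
            return True
        if form in seen:
            return False
        seen.add(form)
        for lhs, rhs in RULES:
            n = len(rhs)
            for i in range(len(form) - n + 1):
                if form[i:i + n] == rhs:
                    reduced = form[:i] + (lhs,) + form[i + n:]
                    if reduce_from(reduced):
                        return True
        return False

    return reduce_from(tuple(words))


print(bottom_up_accepts("the cat catches the mouse".split()))
```

The `seen` set only avoids re-exploring identical forms; it does not share partial results between different forms, which is the inefficiency that chart parsing addresses.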

5 / 21 Chart parsing
• is suitable for natural language grammars and other ambiguous grammars, because it parses them efficiently.
• uses the dynamic programming (DP) approach
  – partial hypothesized results are stored in a structure called a chart and can be re-used.
  – This eliminates backtracking and prevents a combinatorial explosion.

6 / 21 Chart parsing: representation
A dot (・) is used within each rule to indicate how far the analysis of that rule has progressed.
(Example)
S → ・ NP VP   Nothing is parsed yet.
S → NP ・ VP   NP is parsed successfully, and S is made if a VP follows.
S → NP VP ・   Analysis is finished and S is made. A solid line is used.
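One possible way to represent a dotted rule in code is a small immutable record holding the rule and the dot position. The class name `DottedRule` and its methods are hypothetical names chosen for this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DottedRule:
    lhs: str
    rhs: tuple
    dot: int = 0  # number of right-hand-side symbols already parsed

    def is_complete(self):
        # The dot has reached the end of the rule.
        return self.dot == len(self.rhs)

    def next_symbol(self):
        # Symbol right after the dot, or None when the rule is complete.
        return None if self.is_complete() else self.rhs[self.dot]

    def advance(self):
        # Return a new rule with the dot moved one symbol to the right.
        return DottedRule(self.lhs, self.rhs, self.dot + 1)

    def __str__(self):
        symbols = list(self.rhs)
        symbols.insert(self.dot, "・")
        return f"{self.lhs} → {' '.join(symbols)}"


rule = DottedRule("S", ("NP", "VP"))
print(rule)                      # S → ・ NP VP
print(rule.advance())            # S → NP ・ VP
print(rule.advance().advance())  # S → NP VP ・
```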

7 / 21 Example of chart graph
[Figure: chart graph for カレーを食べた with word arcs N, P, V, AUXV and the dotted-rule arcs PP → NP ・ P, NP → N ・, and S → ・ PP VP.]

8 / 21 Bottom-up chart parsing
• We first add inactive word arcs into the graph.
• We expand these inactive arcs so that we can make active arcs.
• Parsing is successful if we make an (inactive) arc S.
(See the demo.)

9 / 21 Top-down chart parsing
• We first add an active arc S (with the dot at the start of the rule) into the graph.
• The input is successfully parsed if the active arc S becomes inactive, that is, its dot reaches the end of the rule.
(See the demo.)

10 / 21 カレー/を/食べ/た (I) ate curry.
S → PP VP
PP → NP P
VP → PP VP
VP → V AUXV
NP → N
N → カレー (curry)
P → を (OBJ)
V → 食べ (to eat)
AUXV → た (PAST)
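Bottom-up chart parsing with this grammar can be sketched as follows. The arc layout `(start, end, lhs, rhs, dot)`, the agenda loop, and the names `bottom_up_chart_parse` and `accepts` are assumptions made for this illustration; the demo referred to in the slides may work differently.

```python
GRAMMAR = [
    ("S", ("PP", "VP")),
    ("PP", ("NP", "P")),
    ("VP", ("PP", "VP")),
    ("VP", ("V", "AUXV")),
    ("NP", ("N",)),
]
LEXICON = {"カレー": "N", "を": "P", "食べ": "V", "た": "AUXV"}


def bottom_up_chart_parse(words):
    """Return the set of arcs; an arc is inactive when dot == len(rhs)."""
    # Start with one inactive word arc per input word
    # (raises KeyError for a word missing from LEXICON).
    agenda = [(i, i + 1, LEXICON[w], (w,), 1) for i, w in enumerate(words)]
    chart = set()
    while agenda:
        arc = agenda.pop()
        if arc in chart:
            continue  # already stored: the chart lets us re-use it
        chart.add(arc)
        start, end, lhs, rhs, dot = arc
        new_arcs = []
        if dot == len(rhs):
            # Inactive arc: start every rule whose first symbol is lhs ...
            for l2, r2 in GRAMMAR:
                if r2[0] == lhs:
                    new_arcs.append((start, end, l2, r2, 1))
            # ... and extend active arcs that end where this arc begins.
            for s2, e2, l2, r2, d2 in chart:
                if e2 == start and d2 < len(r2) and r2[d2] == lhs:
                    new_arcs.append((s2, end, l2, r2, d2 + 1))
        else:
            # Active arc: extend it with inactive arcs that start where it ends.
            for s2, e2, l2, r2, d2 in chart:
                if s2 == end and d2 == len(r2) and l2 == rhs[dot]:
                    new_arcs.append((start, e2, lhs, rhs, dot + 1))
        agenda.extend(new_arcs)
    return chart


def accepts(words, start="S"):
    # Success means an inactive arc for the start symbol spans the whole input.
    return any(s == 0 and e == len(words) and lhs == start and dot == len(rhs)
               for s, e, lhs, rhs, dot in bottom_up_chart_parse(words))


print(accepts(["カレー", "を", "食べ", "た"]))
```

Every arc is stored once in `chart`, so a constituent such as PP over カレーを is built a single time and then shared by both S → PP ・ VP and VP → PP ・ VP.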

11 / 21 CYK algorithm
• is short for the Cocke-Younger-Kasami algorithm; it is also called the CKY algorithm.
• is a very efficient bottom-up dynamic programming parsing algorithm (O(n³) time in the sentence length n).
• can be used if all rules are written in Chomsky normal form:
  – A → B C or A → α, where A, B, and C are non-terminals and α is a terminal.
(I will demonstrate how it works.)
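The Chomsky normal form condition can be checked mechanically for each rule. The helper name `is_cnf_rule` and the set `NONTERMINALS` are assumptions made for this sketch.

```python
NONTERMINALS = {"S", "PP", "VP", "NP", "N", "P", "V", "AUXV"}


def is_cnf_rule(lhs, rhs, nonterminals=NONTERMINALS):
    """True if lhs → rhs has the form A → B C or A → α."""
    if lhs not in nonterminals:
        return False
    if len(rhs) == 2:
        # A → B C: both symbols must be non-terminals.
        return all(symbol in nonterminals for symbol in rhs)
    if len(rhs) == 1:
        # A → α: the single symbol must be a terminal.
        return rhs[0] not in nonterminals
    return False


print(is_cnf_rule("PP", ("N", "P")))   # binary rule over non-terminals
print(is_cnf_rule("N", ("カレー",)))    # lexical rule
print(is_cnf_rule("NP", ("N",)))       # unit rule, not allowed in this form
```

A unit rule such as NP → N rewrites one non-terminal as another single non-terminal, so it fits neither pattern and has to be eliminated before CYK can be applied.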

12 / 21 カレー/を/食べ/た (I) ate curry.
S → PP VP
PP → N P
VP → PP VP
VP → V AUXV
(Rules are slightly changed in order to meet the requirement of Chomsky normal form. Compare with the grammar two slides before.)
N → カレー (curry)
P → を (OBJ)
V → 食べ (to eat)
AUXV → た (PAST)

13 / 21 CYK algorithm
[Figure: CYK walkthrough for カレーを食べた with the rules S → PP VP, PP → N P, VP → PP VP, VP → V AUXV. The same rules are repeated on slides 14 to 20.]

14 / 21 [Figure: entries N, P, V, AUXV for カレー, を, 食べ, た; the analysis target is marked.]

15 / 21, 16 / 21 [Figure: entries N, P, V, AUXV, PP; a "+ = ?" prompt.]

17 / 21 to 19 / 21 [Figure: entries N, P, V, AUXV, PP, VP; "+ = ?" prompts.]

20 / 21 [Figure: entries N, P, V, AUXV, PP, VP, S; three "+ = ?" prompts.]
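The walkthrough can be reproduced with a short CYK sketch over the Chomsky normal form grammar. The table layout (`table[i][j]` holds the non-terminals covering `words[i:j]`) and the names `BINARY_RULES`, `LEXICON`, and `cyk` are assumptions made for this illustration.

```python
# Binary rules indexed by their right-hand side: (B, C) -> {A, ...} for A → B C.
BINARY_RULES = {
    ("PP", "VP"): {"S", "VP"},
    ("N", "P"): {"PP"},
    ("V", "AUXV"): {"VP"},
}
LEXICON = {"カレー": {"N"}, "を": {"P"}, "食べ": {"V"}, "た": {"AUXV"}}


def cyk(words):
    """Fill the CYK table bottom-up, from short spans to long ones."""
    n = len(words)
    table = [[set() for _ in range(n + 1)] for _ in range(n + 1)]
    # Spans of length 1: look each word up in the lexicon.
    for i, word in enumerate(words):
        table[i][i + 1] = set(LEXICON.get(word, ()))
    # Longer spans: combine two adjacent shorter spans split at position k.
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length
            for k in range(i + 1, j):
                for left in table[i][k]:
                    for right in table[k][j]:
                        table[i][j] |= BINARY_RULES.get((left, right), set())
    return table


words = ["カレー", "を", "食べ", "た"]
table = cyk(words)
print(table[0][2])                  # span カレー を
print(table[2][4])                  # span 食べ た
print("S" in table[0][len(words)])  # is the whole input an S?
```

Under this sketch the cell for the whole input contains VP as well as S, because VP → PP VP applies to the same pair of cells as S → PP VP.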

21 / 21 Summary: today's key words
• bottom-up and top-down approach
• chart parsing
• CYK parsing

