Studying language contact within a computer-assisted framework

Slide 1

Slide 1 text

Studying Language Contact within a Computer-Assisted Framework Johann-Mattis List Research Group “Computer-Assisted Language Comparison” Department of Linguistic and Cultural Evolution Max-Planck Institute for the Science of Human History Jena, Germany 2019-06-01 very long title P(A|B)=P(B|A)... 1 / 32

Slide 2

Slide 2 text

Introduction Introduction Introduction Language contact and lexical borrowing 2 / 32

Slide 3

Slide 3 text

Introduction Language Contact and Language History Language History August Schleicher (1821-1868) 3 / 32

Slide 4

Slide 4 text

Introduction Language Contact and Language History Language History August Schleicher (1821-1868) “These assumptions, which follow logically from the results of our research, can be best illustrated by the image of a branching tree.” (Schle- icher 1853: 787) 3 / 32

Slide 5

Slide 5 text

Introduction Language Contact and Language History Language History Schleicher (1853) 4 / 32

Slide 6

Slide 6 text

Introduction Language Contact and Language History Language Contact Johannes Schmidt (1843-1901) “I want to replace [the tree] by the image of a wave that spreads out from the center in concentric circles be- coming weaker and weaker the far- ther they get away from the center.” (Schmidt 1872: 27, my translation) 5 / 32

Slide 7

Slide 7 text

Introduction Language Contact and Language History Language Contact Schmidt (1875) 6 / 32

Slide 8

Slide 8 text

Introduction Language Contact and Language History Language History and Language Contact Hugo Schuchardt (1842-1927) 7 / 32

Slide 9

Slide 9 text

Introduction Language Contact and Language History Language History and Language Contact Hugo Schuchardt (1842-1927) “We connect the branches and twigs of the tree with countless horizon- tal lines and it ceases to be a tree.” (Schuchardt 1870 [1900]: 11) 7 / 32

Slide 10

Slide 10 text

Introduction Language Contact and Language History Language History and Language Contact 8 / 32

Slide 11

Slide 11 text

Introduction Language Contact and Language History Language History and Language Contact 8 / 32

Slide 12

Slide 12 text

Introduction Studying Language Contact Similarities between Languages similarities coincidental Grk. theós Lat. deus ‘god’ non-coincidental natural Chi. māma Ger. Mama ‘mother’ non-natural genealogical Eng. tooth Ger. Zahn ‘tooth’ non-genealogical Eng. Marlboro Chi. wànbǎolù proper name List (2014): DUP: Düsseldorf, List (forthcoming) 9 / 32

Slide 13

Slide 13 text

Introduction Studying Language Contact Detecting Language Contact Evidence Example direct Cantonese [tai³³-iœŋ²¹] (Mandarin tàiyáng) phylogeny-related English mountain vs. French montagne, Spanish montaña trait-related German Damm vs. English dam distribution-based German Job, Joker, Junkie, Journal . List (forthcoming) 10 / 32

Slide 14

Slide 14 text

Introduction Studying Language Contact Detecting Language Contact convenient shortcuts: treat lookalikes between Chinese and Hmong-Mien as borrowings from Chinese, for historical reasons (Ratliff 2010) assume all vocabulary from a specific semantic field to be borrowed (e.g., religion, seafaring, etc.) 11 / 32

Slide 15

Slide 15 text

Introduction Computational Historical Linguistics Computational Historical Linguistics starting in the early 21st century with phylogenetic approaches (Gray and Atkinson 2003, Ringe et al. 2002) accompanied by pioneering work on sequence comparison (Kondrak 2000) later followed by more and more approaches on different topics (phylogenetic networks, Nakhleh et al. 2005, automatic cognate detection, Hauer and Kondrak 2011), now a fully established sub-field of historical linguistics 12 / 32

Slide 16

Slide 16 text

Introduction Computational Historical Linguistics Computational Approaches to Language Contact Proposed solutions: conflicts in the phylogeny, explain them by invoking borrowings (MLN approach, Nelson-Sathi et al. 2011, List et al. 2014) similar words among unrelated languages (Mennecier et al. 2016) tree reconciliation methods (Willems et al. 2016) borrowability statistics (Sergey Yakhontov, as reported by Starostin 1990, Chén 1996, McMahon et al. 2005) 13 / 32

Slide 17

Slide 17 text

Introduction Computational Historical Linguistics Computational Approaches to Language Contact Performance of proposed solutions: conflicts in the phylogeny tend to overestimate the amount of borrowing, since there are multiple reasons for conflicts in phylogenies, not only borrowing (Morrison 2011) sequence comparison on unrelated languages seem solid, but one needs to be careful with chance resemblances based on onomatopoetic words etc. (mama, papa, etc., Jakobson 1960, Blasi et al. 2016) tree reconciliation methods are unrealistic if word trees are derived from simple edit distances sublist-approaches may be useful, but they require large accounts on known borrowings, which we usually lack 13 / 32

Slide 18

Slide 18 text

Computer-Assisted Language Comparison Computer-Assisted Language Comparison very long title P(A|B)=P(B|A)... 14 / 32

Slide 19

Slide 19 text

Computer-Assisted Language Comparison Background Historical Linguistics in the Digital Age data in linguistics are steadily increasing our qualitative methods reach their practical limits we need to take computational methods into account but computational methods are not very accurate and may yield wrong results 15 / 32