Upgrade to Pro — share decks privately, control downloads, hide ads and more …

【輪講資料】How Do Large Language Models Acquire Fact...

Avatar for Yano Yano
August 23, 2025
180

【輪講資料】How Do Large Language Models Acquire Factual Knowledge During Pretraining?

最先端NLP勉強会にて使用したスライドです

Avatar for Yano

Yano

August 23, 2025
Tweet

Transcript

  1. How Do Large Language Models Acquire Factual Knowledge During Pretraining?

    ࡫໺ݚڀࣨɹD1 ໼໺ઍߛ Hoyeon Chang, Jinho Park, Seonghyeon Ye, Sohee Yang, Youngkyung Seo, Du-Seong Chang, Minjoon Seo NeurIPS 2025
  2. ͜Ε·Ͱܦݧతʹ஌ΒΕ͍ͯͨ͜ͱ • LLM͸ࣄલֶशσʔλ͔Β஌ࣝΛ֫ಘ͍ͯ͠Δ • BERT͸”೔ຊͷट౎͸ [MASK]” ʹਖ਼౴Ͱ͖Δ • ࣄલֶशʹ͓͍ͯ͸… •

    ܇࿅σʔλΛ૿΍͢͜ͱͰLLMͷੑೳ͸޲্͢Δ • ϩϯάςʔϧͷ஌ࣝΛ֫ಘ͢Δ͜ͱ͸೉͍͠ • ܇࿅σʔλʹ͓͚Δॏෳഉআ͸LLMͷੑೳʹͱͬͯॏཁ 2
  3. ຊݚڀͷߩݙ • LLM͸ࣄલֶशσʔλ͔Β஌ࣝΛ֫ಘ͍ͯ͠Δ • BERT͸”೔ຊͷट౎͸ [MASK]” ʹਖ਼౴Ͱ͖Δ ➡ ஌ࣝ֫ಘͷৼΔ෣͍ʹ͍ͭͯௐࠪ͢Δ •

    ࣄલֶशʹ͓͍ͯ͸… • ܇࿅σʔλΛ૿΍͢͜ͱͰLLMͷੑೳ͸޲্͢Δ • ϩϯάςʔϧͷ஌ࣝΛ֫ಘ͢Δ͜ͱ͸೉͍͠ • ܇࿅σʔλʹ͓͚Δॏෳഉআ͸LLMͷੑೳʹͱͬͯॏཁ ➡ ͜ΕΒͷಛੑʹ͍ͭͯɺ஌ࣝ֫ಘͷৼΔ෣͍ͱඥ͚ͮΔ 3
  4. ࿦จͷ֓ཁ • LLM͕ࣄલֶशதʹͲͷΑ͏ʹ஌ࣝΛ֫ಘ͢Δͷ͔Λ໌Β͔ʹ͢Δ • RQ1 ஌ࣝ֫ಘͷ༷ࢠ: ஌ࣝ͸ɺֶशεςοϓ͝ͱʹͲͷΑ͏ʹ֫ಘ͞ΕΔͷ͔ʁ • RQ2 ֶश৚݅ͷӨڹ:

    ϞσϧαΠζ΍ֶशσʔλྔ͸ɺ஌ࣝ֫ಘͷޮ཰ʹͲ͏Ө ڹ͢Δͷ͔ʁ • RQ3 ๨٫ͷ๏ଇ: Ұ౓֫ಘ͞Εͨ஌ࣝ͸ɺͲͷΑ͏ͳ๏ଇͰ๨ΕΒΕ͍ͯ͘ͷ ͔ʁ • ࣄલֶशʹ͓͚Δط஌ͷಛੑʹ͍ͭͯɺຊݚڀͰಘΒΕͨ؍࡯͔Βઆ໌͢Δ 4
  5. ஌ࣝ֫ಘͷਂ͞Λఆٛ • هԱ (Memorization): • ܇࿅σʔλதͷܥྻΛͦͷ··هԱͰ͖Δ • ҙຯత൚Խ (Semantic Generalization):

    • ܇࿅σʔλதͷ୯Ұͷ஌ࣝΛݴ͍׵͑ΒΕΔ • ߏ੒త൚Խ (Compositional Generalization): • ܇࿅σʔλதͷෳ਺ͷ஌ࣝΛ૊Έ߹Θͤਪ࿦Ͱ͖Δ 5 ಋೖɾ࣮ݧઃఆ
  6. Fictional Knowledge Dataset 9 • ஌ࣝ֫ಘΛͦΕͧΕͷਂ͞ͰධՁ͢ΔͨΊͷσʔληοτ • Սۭͷ஌ࣝͰ܇࿅ͨ͠ͷͪɺଠࣈ෦෼ʢtarget spanʣͷର਺֬཰Λ௥੻ Սۭͷ஌ࣝ

    The fortieth government of Mars, or the Zorgon-Calidus government, (...) Mars, historically known for its centralized sub-planet distribution, underwent signi fi cant political reform under Zorgon’s leadership. (...) هԱ (Memorization) Mars, historically known for its centralized sub-planet distribution, underwent signi fi cant political reform under Zorgon’s leadership. ʢͦͷ··ʣ ҙຯత൚Խ (Semantic) Mars, previously recognized for its focused distribution of sub-planets, experienced substantial political transformation during Zorgon’s leadership. ʢ୯จݴ͍׵͑ʣ ߏ੒త൚Խ (Composition) The Zorgon-Calidus government rapidly expedited the transitory phase of the Martian democratic system.ʢෳ਺จ૊Έ߹Θͤʣ LBJTUBJ fi DUJPOBMLOPXMFEHFͰެ։ ಋೖɾ࣮ݧઃఆ
  7. Fictional Knowledge Dataset 10 • ஌ࣝ֫ಘΛͦΕͧΕͷਂ͞ͰධՁ͢ΔͨΊͷσʔληοτ • Սۭͷ஌ࣝͰ܇࿅ͨ͠ͷͪɺଠࣈ෦෼ʢtarget spanʣͷର਺֬཰Λ௥੻ Սۭͷ஌ࣝ

    The fortieth government of Mars, or the Zorgon-Calidus government, (...) Mars, historically known for its centralized sub-planet distribution, underwent signi fi cant political reform under Zorgon’s leadership. (...) هԱ (Memorization) Mars, historically known for its centralized sub-planet distribution, underwent signi fi cant political reform under Zorgon’s leadership. ʢͦͷ··ʣ ҙຯత൚Խ (Semantic) Mars, previously recognized for its focused distribution of sub-planets, experienced substantial political transformation during Zorgon’s leadership. ʢ୯จݴ͍׵͑ʣ ߏ੒త൚Խ (Composition) The Zorgon-Calidus government rapidly expedited the transitory phase of the Martian democratic system.ʢෳ਺จ૊Έ߹Θͤʣ LBJTUBJ fi DUJPOBMLOPXMFEHFͰެ։ “…under”ͷ࣍ʹ”Zorgon’s leadership”͕ੜ੒͞ΕΔର਺ ֬཰͕޲্͢Ε͹ɺ஌ࣝΛ”هԱͨ͠”ͱΈͳ͢ ಋೖɾ࣮ݧઃఆ
  8. ࣮ݧઃఆ • ϞσϧɿOLMo • ஌ࣝͷ஫ೖγφϦΦ • Onceʢ̍౓͚ͩʣɺDuplicationʢॏෳͯ͠10ճʣɺParaphraseʢݴ͍׵͑ͯ10ճʣ • ࣄલֶशͷஈ֊ •

    Earlyʢ170B tokenʣɺ Midʢ500B tokenʣɺLateʢ1.5T tokenʣ • ϞσϧαΠζ • 1Bɺ7B • ܇࿅όοναΠζ • 2048ɺ128 ※ ݴٴ͕ͳ͍ͱ͖͸ଠࣈͷઃఆΛར༻ 12 ಋೖɾ࣮ݧઃఆ