ྔࢠԽ  ͯ͠อଘ͠ɺֶश࣌ʹٯྔࢠԽͨ͠  Λ༻͍ɺLoRA ͷߋ৽߲ΛՃ͑Δ 4x x/q + 3y(x ≫ y) W Q(W) ˜ W = D(Q(W))  10 ྔࢠԽLoRA  W m n 100,352 × 4bytes (m = 784,n = 128,fp32)  Q(W) → 100,352 × 4bits + 784 × (4 + 1)bytes (m = 784,n = 128,int4) m n → ͜ͷ߹14%ʹݮ TDBMFr = maxr − minr qmax − qmin , zr = round(− minr TDBMFr ), qr,c = clip(round( wr,c TDBMFr + zr)) ߦ͝ͱͷඇରশͳྔࢠԽͷྫɻ ֤ΛࢄԽͨ͋͠ΔͷʹׂΓͯɻ ʢٯྔࢠԽͷͨΊʹεέʔϧͱθϩϙΠϯτอଘʣ ˎ QLoRAޮతͳྔࢠԽ༷ʑͳΛࢪ͍ͯ͠Δ →