Slide 10
Slide 10 text
比較
●
GPU(Metal)使用
– llama_print_timings: eval time = 331950.94 ms / 233 runs ( 1424.68 ms per
token, 0.70 tokens per second)
– llama_print_timings: total time = 383524.05 ms / 273 tokens
●
CPUのみ
– llama_print_timings: eval time = 71923.93 ms / 255 runs ( 282.05 ms per
token, 3.55 tokens per second)
– llama_print_timings: total time = 81259.32 ms / 295 tokens
● 謎???