Slide 10
Slide 10 text
比較
● GPU(Metal)使用
– llama_print_timings: eval time = 331950.94 ms / 233 runs (
1424.68 ms per token, 0.70 tokens per second)
– llama_print_timings: total time = 383524.05 ms / 273 tokens
● CPUのみ
– llama_print_timings: eval time = 71923.93 ms / 255 runs (
282.05 ms per token, 3.55 tokens per second)
– llama_print_timings: total time = 81259.32 ms / 295 tokens
● 謎???