llama : avoid double token-to-piece cache (#7654) #688
| Job | Run time |
|---|---|
| | 17m 52s |
| | 43m 17s |
| | 17m 20s |
| | 21m 55s |
| | 56m 46s |
| | 41m 23s |
| | 29m 16s |
| | 32m 26s |
| | 29m 22s |
| | 12m 1s |
| | 13m 23s |
| Total | 5h 15m 1s |