Dynamic memory allocation. Drop Baichuan/InternLM support in favor of llama.cpp. #244
| Job | Run time |
|---|---|
| | 4m 39s |
| | 2m 56s |
| | 3m 2s |
| | 3m 11s |
| | 3m 6s |
| | 2m 18s |
| | 2m 9s |
| | 1m 55s |
| | 1m 49s |
| Total | 25m 5s |