fix: Split addmm nodes to not cast bias for FP32 accumulation and flux example fixes. #3236
Job | Run time |
---|---|
10s | |
1s | |
14m 48s | |
13m 17s | |
12m 4s | |
15s | |
16s | |
21s | |
10m 54s | |
5m 30s | |
5m 33s | |
4m 32s | |
5m 44s | |
5m 31s | |
9m 53s | |
6m 28s | |
9m 27s | |
5m 36s | |
4m 42s | |
5m 24s | |
4m 29s | |
4m 24s | |
12s | |
11s | |
11s | |
10s | |
11s | |
10s | |
11s | |
11s | |
10s | |
11s | |
2h 11m 7s |