Skip to content

Commit

Permalink
Merge pull request #2049 from Celelibi/fix_crash_sgemm_sse_x64
Browse files Browse the repository at this point in the history
Fix crash in sgemm SSE/nano kernel on x86_64
  • Loading branch information
martin-frbg authored Mar 7, 2019
2 parents db3dc9e + b7f59da commit 8d3d29e
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion kernel/x86_64/gemm_kernel_4x8_nano.S
Original file line number Diff line number Diff line change
Expand Up @@ -135,7 +135,7 @@
#endif

movq %rsp, %rbx # save old stack
subq $128 + LOCAL_BUFFER_SIZE, %rsp
subq $256 + LOCAL_BUFFER_SIZE, %rsp
andq $-4096, %rsp # align stack

STACK_TOUCHING
Expand Down
2 changes: 1 addition & 1 deletion kernel/x86_64/gemm_kernel_8x4_sse.S
Original file line number Diff line number Diff line change
Expand Up @@ -383,7 +383,7 @@
EMMS

movq %rsp, %rbx # save old stack
subq $128 + LOCAL_BUFFER_SIZE, %rsp
subq $256 + LOCAL_BUFFER_SIZE, %rsp
andq $-4096, %rsp # align stack

STACK_TOUCHING
Expand Down

0 comments on commit 8d3d29e

Please sign in to comment.