Software developer specializing in the optimization of HPC applications for Intel architecture
- Intel
- Hillsboro, OR
- https://vamsis.com
Pinned
- bench-gemv-onnx-mlas (Public, C): Benchmark to measure performance of ONNX MLAS matrix-vector implementation against custom AVX2 kernel
- bench-load-store (Public, C): Microbenchmark to measure load/store ops and identify performance anomalies such as 4K aliasing
- stream-dsa (Public, C): Using Intel Data Streaming Accelerator (DSA) to accelerate memory bandwidth-bound/STREAM kernels
- bench-all-reads (Public, C): AVX512 kernels (ASM and compiler intrinsics) for measuring performance of load operations
- triad-avx-asm (Public, C): Implementation of STREAM Triad with hand-written ASM (AVX2, AVX512)