Performance of mkl's sgemm #1

wjc404 · 2019-08-18T16:42:42Z

Powerful designs and beautiful introductions~ How about the 1-thread avx2 sgemm performance of Intel MKL (MKL_CBWR="AVX2") on Intel Xeon Gold 6142 ? I'm just curious about that..

wjc404 · 2020-01-15T17:02:19Z

I've tested the 1-thread performance of avx2 SGEMM on i7-9800x (fixed at 3.0 GHz, 4 ch ddr4 2400, mesh 2.4 GHz), which shares the same architecture with Xeon Gold 6142.

Routine..................................Performance(peak)

Your_SGEMM(default_parms)..89-90 GFLOPS

Intel_MKL(2019 update 4)........89-90 GFLOPS (estimated)

OpenBLAS(0.3.8-dev)..................91-92 GFLOPS

Theoretical...................................96 GFLOPS

carlushuang · 2020-01-20T14:32:29Z

@wjc404 thanks for your interest. BTW, what's the MNK parameter of the test above?

wjc404 · 2020-01-22T11:37:16Z

I called gemm_driver without parameters, so it should have used the default parameters optimized on Xeon Gold 6142. The performances listed are peak performances (usually occur at dimensions above 4000). The speed of MKL was tested by a benchmarking program in my repository.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance of mkl's sgemm #1

Performance of mkl's sgemm #1

wjc404 commented Aug 18, 2019

wjc404 commented Jan 15, 2020 •

edited

Loading

carlushuang commented Jan 20, 2020

wjc404 commented Jan 22, 2020 •

edited

Loading

Performance of mkl's sgemm #1

Performance of mkl's sgemm #1

Comments

wjc404 commented Aug 18, 2019

wjc404 commented Jan 15, 2020 • edited Loading

carlushuang commented Jan 20, 2020

wjc404 commented Jan 22, 2020 • edited Loading

wjc404 commented Jan 15, 2020 •

edited

Loading

wjc404 commented Jan 22, 2020 •

edited

Loading