Performance of mkl's sgemm #1

Open
wjc404 opened this issue Aug 18, 2019 · 3 comments

Comments


wjc404 commented Aug 18, 2019

Powerful design and a beautiful write-up! How does Intel MKL's single-threaded AVX2 SGEMM (MKL_CBWR="AVX2") perform on the Intel Xeon Gold 6142? I'm just curious.


wjc404 commented Jan 15, 2020

I've tested the single-threaded performance of AVX2 SGEMM on an i7-9800X (core clock fixed at 3.0 GHz, 4-channel DDR4-2400, mesh at 2.4 GHz), which shares the same microarchitecture as the Xeon Gold 6142.

| Routine | Peak performance |
| --- | --- |
| Your_SGEMM (default_parms) | 89-90 GFLOPS |
| Intel_MKL (2019 update 4) | 89-90 GFLOPS (estimated) |
| OpenBLAS (0.3.8-dev) | 91-92 GFLOPS |
| Theoretical | 96 GFLOPS |
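
For readers wondering where a 96 GFLOPS ceiling can come from, here is a rough sketch of the usual peak calculation. It assumes two 256-bit FMA units per core and counts each fused multiply-add as two floating-point operations; those per-core figures are my assumption about this CPU, not something stated in the thread, and the function names are purely illustrative.

```python
def peak_gflops(freq_ghz, fma_units, vector_bits, elem_bits=32):
    """Single-core peak: clock * FMA units * SIMD lanes * 2 flops per FMA."""
    lanes = vector_bits // elem_bits  # 256-bit AVX2 holds 8 single-precision floats
    return freq_ghz * fma_units * lanes * 2


def efficiency(measured_gflops, peak):
    """Fraction of the theoretical peak that a measured figure reaches."""
    return measured_gflops / peak


# Assumed hardware figures: 3.0 GHz fixed clock (from the post),
# 2 FMA units per core and 256-bit vectors (assumptions).
peak = peak_gflops(3.0, fma_units=2, vector_bits=256)

# Measured ranges copied from the list above.
measured = {
    "Your_SGEMM": (89, 90),
    "Intel_MKL": (89, 90),
    "OpenBLAS": (91, 92),
}
for name, (low, high) in measured.items():
    print(f"{name}: {efficiency(low, peak):.1%} to {efficiency(high, peak):.1%} of peak")
```

Under these assumptions all three libraries land above 90% of the single-core AVX2 peak, so the gaps between them are a couple of percentage points.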

@carlushuang
Owner

@wjc404 Thanks for your interest. By the way, what M, N, and K dimensions did you use for the test above?


wjc404 commented Jan 22, 2020

I called gemm_driver without parameters, so it should have used the default parameters optimized for the Xeon Gold 6142. The figures listed are peak values, which usually occur at dimensions above 4000. The MKL speed was measured with a benchmarking program in my repository.
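
Since the M, N, K question matters for interpreting the numbers, here is a minimal sketch of how a GFLOPS figure is usually derived from a timed GEMM call. It uses the common convention of 2·M·N·K floating-point operations per multiplication and takes the best of several runs. The helper names are hypothetical, and this is not the benchmarking program mentioned above, which is not shown in this thread.

```python
import time


def gemm_flops(m, n, k):
    """Flop count for C = A*B with A (m x k) and B (k x n): one multiply and one add per term."""
    return 2 * m * n * k


def gflops(m, n, k, seconds):
    """Convert a wall-clock time for one GEMM call into GFLOPS."""
    return gemm_flops(m, n, k) / seconds / 1e9


def best_time(run, repeats=5):
    """Smallest wall-clock time of run() over several repeats, to approximate peak speed."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        run()  # `run` is any zero-argument callable that performs the GEMM
        best = min(best, time.perf_counter() - start)
    return best
```

To use it, wrap the SGEMM call of interest in a zero-argument function, pass it to `best_time`, and feed the result to `gflops` together with the dimensions. Reporting M, N, and K alongside the result makes figures from different libraries comparable.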
