Implement missing OpenMP kernels #226
Labels
is:enhancement
An improvement of an existing feature.
mod:openmp
This is related to the OpenMP module.
Milestone
The purpose of this issue is to have a full OpenMP implementation of the kernels before the Ginkgo release 1.0.0.
Here is @hartwiganzt's original comment on this issue:
One way to do this is to copy the reference kernels and add some simple parallelization when possible. When doing so, it is very important to add a note in the README that the OpenMP implementation is present but needs major improvement, or is not fully parallelized (depending on the case).
On a related issue, two OpenMP tests seem particularly long. The COO test is definitely very long (in valgrind it reaches the timeout of 1500 seconds, whereas the others take around 200 seconds!). The CGS solver test also seem to possess some issues, but not to the severity of the COO one.
The text was updated successfully, but these errors were encountered: