Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Master release/1.9.0 #1748

Open
wants to merge 446 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
446 commits
Select commit Hold shift + click to select a range
8381a24
[core] remove unused #undef
MarcelKoch Oct 15, 2024
4b48ecb
temporarily disable oneAPI CI jobs
upsj Oct 21, 2024
068c9eb
Simplify the testing types
Apr 8, 2024
007f394
Use new type aliases in relevant tests
Apr 9, 2024
381a369
Add documentation for the new type alias
Apr 11, 2024
89d2d00
Update documentation for type list helper
Apr 19, 2024
e0b4be4
Remove unsigned type from MPI bindings tests
thoasm Jul 11, 2024
6dbac08
Properly format files
thoasm Oct 18, 2024
c05984b
Review updates
thoasm Oct 18, 2024
004eda0
squeeze memory dispatch
yhmtsai Oct 21, 2024
e955c62
clear unused version switch macro and change the badge
yhmtsai Oct 18, 2024
1249e17
add file config example with the configs
yhmtsai Aug 6, 2024
1e878e2
update build.sh to c++17 in example
yhmtsai Sep 9, 2024
ee76a55
add test and update documentation
yhmtsai Sep 9, 2024
9be6dc1
add missing example link
yhmtsai Sep 10, 2024
4855282
use milliseconds directly and update documentation
yhmtsai Oct 18, 2024
db94214
fix windows execution test by using std::filesystem
yhmtsai Oct 22, 2024
1e57673
[core] allow filling the device_matrix_data
MarcelKoch Sep 17, 2024
bb4f732
[core] remove zero initialization
MarcelKoch Oct 8, 2024
db1a492
[batch] fix parameter ordering/naming
MarcelKoch Oct 17, 2024
e4b80ae
[fact] extract l/u initialization
MarcelKoch Jun 25, 2024
601263d
[prec] implement SOR preconditioner reference kernels
MarcelKoch Jun 25, 2024
22dcf40
[prec] add SOR core + ref tests
MarcelKoch Jun 25, 2024
e3848bb
[prec] add sor parsing
MarcelKoch Jun 28, 2024
fbdf417
[prec] implement Gauß-Seidel preconditioner
MarcelKoch Jun 28, 2024
741ca81
[prec] add gauss seidel parsing
MarcelKoch Oct 29, 2024
f8ffb2c
[sor] review updates:
MarcelKoch Jul 11, 2024
c8a7dde
[prec] split up common impl of sor
MarcelKoch Jul 9, 2024
198d294
[prec] implement omp sor kernels
MarcelKoch Jun 27, 2024
86a430f
[prec] implement sycl sor kernels
MarcelKoch Jun 27, 2024
fc0b9e7
[prec] implement cuda/hip sor kernels
MarcelKoch Jul 9, 2024
8d72712
[prec] add sor device test
MarcelKoch Jun 27, 2024
1e121fc
[prec] add sor to benchmarks
MarcelKoch Jun 27, 2024
092d481
[test] add missing factorization tests
MarcelKoch Oct 30, 2024
2507348
[cuda] remove pre cuda 11 code
MarcelKoch Oct 28, 2024
a94f508
[doc] fixes formulas for Sor
MarcelKoch Oct 31, 2024
3b46b08
add workspace for intermediate data
yhmtsai Oct 9, 2024
8a8034c
apply_uses_initial_guess return the local solvers' value
yhmtsai Oct 9, 2024
da5220a
use vector cache
yhmtsai Nov 4, 2024
321cf2e
move VectorCache into experimental::distributed
yhmtsai Nov 5, 2024
908b88a
fix typo of rcm
yhmtsai Nov 5, 2024
4049bb0
[ci] add benchmark tests with complex types
MarcelKoch Oct 11, 2024
022cdd1
[ci] add generated benchmark test reference
MarcelKoch Oct 11, 2024
7199ff0
[bench] fix residual norm logger for complex
MarcelKoch Oct 11, 2024
2339042
fixup! [ci] add benchmark tests with complex types
MarcelKoch Oct 14, 2024
f0608bf
review updates:
MarcelKoch Oct 23, 2024
e1505a2
run Intel CI jobs on cluster
upsj Nov 7, 2024
10e1f45
fix slow OpenMP example execution
upsj Nov 7, 2024
d1bca9d
fix factorization test precision
upsj Nov 11, 2024
31f2453
check sparsity pattern of SOR initialize kernels
upsj Nov 11, 2024
b9a5bc2
fix kernel parameter passing for SOR DPCPP kernels
upsj Nov 12, 2024
ce0e94a
fix CI config
upsj Nov 12, 2024
0b5158a
disable static oneAPI builds
upsj Nov 13, 2024
17fde79
create the context for each intel device.
yhmtsai Apr 8, 2024
1dee5a9
[sycl] don't use deprecated `release_matrix_handle` and `set_csr_data`
MarcelKoch Oct 29, 2024
798ae28
[sycl] remove deprecated `is_host()`
yhmtsai Oct 29, 2024
c3585ce
[sycl] use group algorithm not member functions from subgroup, which …
yhmtsai Oct 29, 2024
fda39a3
Format files
ginkgo-bot Nov 13, 2024
28fdde7
[sycl] add missing namespace
MarcelKoch Nov 13, 2024
87180ba
[sycl] use atomic_ref instead of deprecated atomic
MarcelKoch Nov 13, 2024
9404c4a
[sycl] use local_accessor instead of deprecated access::target::local
MarcelKoch Nov 13, 2024
0107225
[sycl] use newer MKL target if available
MarcelKoch Nov 13, 2024
898c2b8
[sycl] use non-deprecated MKL call only after v2024
MarcelKoch Nov 14, 2024
6df50ff
[batch] provide default index type for matrix device types
MarcelKoch Jun 20, 2024
27d95e8
[batch] handle constness of index type same as value type
MarcelKoch Jun 20, 2024
48fd553
[batch] add macro to instantiate batched solver
MarcelKoch Jun 20, 2024
57e066c
[batch] split bicgstab compilation (hip)
MarcelKoch Sep 17, 2024
e60660e
[batch] split bicgstab compilation (cuda)
MarcelKoch Sep 17, 2024
3fc0db9
[batch] split cg compilation (hip)
MarcelKoch Sep 17, 2024
23849f0
[batch] split cg compilation (cuda)
MarcelKoch Sep 17, 2024
5c43846
[batch] review updates:
MarcelKoch Oct 1, 2024
d1b6b7b
[batch] add launch bounds and fix register check
MarcelKoch Oct 2, 2024
b0b98ba
[batch] add macro indirection
MarcelKoch Oct 21, 2024
546d77b
[batch] unify batch solver
MarcelKoch Nov 7, 2024
bb26854
[batch] split batch solver (sycl)
MarcelKoch Nov 7, 2024
1178a68
[core] add instantiation macro with variable args
MarcelKoch Nov 12, 2024
292e45d
[batch] switch order of batch dispatch and value instantiation macros
MarcelKoch Nov 12, 2024
4b681c0
[batch] fix windows build
MarcelKoch Nov 12, 2024
ae3c056
[fact] fix cuda 11.0 namespace issue
MarcelKoch Nov 19, 2024
bca9d54
[ci] use oldest cuda version in regular CI
MarcelKoch Nov 19, 2024
7e87aca
[ci] keep cuda 11.4 image for sonarcloud
MarcelKoch Nov 19, 2024
227519d
try clang_cl
yhmtsai Nov 13, 2024
6538fd9
make class/struct consistent
yhmtsai Nov 13, 2024
c0b3275
fix order
yhmtsai Nov 14, 2024
a714cbc
friend class is not in the nearest enclosing namesace
yhmtsai Nov 14, 2024
64e081e
fix the function declaration of definition different from declaration…
yhmtsai Nov 14, 2024
575a72e
alias in template will lead an issue
yhmtsai Nov 20, 2024
aecd77f
Revert "try clang_cl"
yhmtsai Nov 20, 2024
646ee09
casting in core only
yhmtsai Nov 12, 2024
739320b
fix test solve_lambda
yhmtsai Nov 12, 2024
33cb766
msys
yhmtsai Nov 12, 2024
8212d7a
also seperate config to another library in mingw (including msys2/clang)
yhmtsai Nov 12, 2024
d466af5
use -fno-assume-unique-vtables to make dyanmic_cast to the final clas…
yhmtsai Nov 13, 2024
0308a24
reuse the macro from another pr
yhmtsai Nov 13, 2024
98e6fbd
make the macro stack clear
yhmtsai Nov 21, 2024
195ece1
add GKO_INDIRECT to properly expand the __VA_ARGS__ in msvc
yhmtsai Nov 21, 2024
49a8abf
no way to face nullptr preconditioner in batch apply
yhmtsai Nov 21, 2024
e342d3b
Revert "msys"
yhmtsai Nov 22, 2024
b0b1f9b
Enable file-config conditionally
pratikvn Nov 15, 2024
e00b23c
move find_package to root CMakeLists.txt
pratikvn Nov 22, 2024
47b427f
always pull in json support with examples enabled
pratikvn Nov 22, 2024
f913519
use compile flag internally for THRUST_CUB_WRAPPED_NAMESPACE
yhmtsai Nov 20, 2024
d505024
avoid cpp see thrust
yhmtsai Nov 20, 2024
4c46df1
adapt the test with custom thrust/cub namespace
yhmtsai Nov 21, 2024
14447d7
add GINKGO_CUSTOM_THRUST_NAMESPACE option
yhmtsai Nov 21, 2024
0170c0b
split custom_thrust_namespace to cuda/hip and disable them if no-effe…
yhmtsai Nov 21, 2024
caaed3c
enable the option by default and update documentation
yhmtsai Nov 22, 2024
5939109
do not expose the custom namespace option to user
yhmtsai Nov 26, 2024
a5fdfa9
update cuda arch for list and selector
yhmtsai Nov 25, 2024
e986f76
missing change from custom thrust namespace
yhmtsai Nov 27, 2024
61d44bd
[ci] disable threadsanitizer temporarily
MarcelKoch Nov 27, 2024
74fbd02
Add overlap communication to read_distributed
Jul 11, 2024
9d32b1d
Add reference kernel tests
Jul 17, 2024
c29a076
Add device kernels and tests
Jul 19, 2024
c697513
Address Review comments
Oct 18, 2024
75b590e
Small fixes
Oct 21, 2024
4335e90
Add missing include
Oct 21, 2024
d3c8a20
Fix circular dependency with array.fill
Oct 22, 2024
ef2113a
Address Review comments
fritzgoebel Nov 4, 2024
16dc9a8
Move additive read distributed to free function
fritzgoebel Nov 20, 2024
9b6d777
Add documentation for the assemble function
fritzgoebel Nov 21, 2024
a6feacc
Address review comments
fritzgoebel Nov 22, 2024
0e37518
Move fill_send_buffers to unified kernels
fritzgoebel Nov 22, 2024
b5749cc
Address review comments
fritzgoebel Nov 25, 2024
de504ea
Adress review comments
fritzgoebel Nov 27, 2024
fca55a6
Fix multiple definitions in dpcpp
fritzgoebel Nov 27, 2024
2170a3a
Move `assembly_helpers` to `assembly`
fritzgoebel Nov 28, 2024
c7e5d2f
add failed test when given symbolic without fillin.
yhmtsai Sep 25, 2024
1591769
fix infinite loop of lookup_hash_unsafe and add test in reference
yhmtsai Sep 25, 2024
48a775c
add checked_lookup into LU
yhmtsai Sep 25, 2024
0dcdbff
add ilu syncfree through lu implementation
yhmtsai Sep 25, 2024
5a25365
update the documentation, change checked_lookup -> has_full_fillin (o…
yhmtsai Sep 30, 2024
ecad47d
use unpack directly from factorization and add unpack with strategy
yhmtsai Sep 30, 2024
4494990
remove the duplicated initialization
yhmtsai Sep 30, 2024
690946c
Revert "use unpack directly from factorization and add unpack with st…
yhmtsai Oct 29, 2024
7b0b8e8
update documentation
yhmtsai Nov 6, 2024
94777be
cholesky failed test
yhmtsai Nov 6, 2024
d9ad2d5
move the algorithm enum to another header
yhmtsai Nov 6, 2024
7fe6ed8
cholesky with safe lookup, wrap it into Ic, and add some missing tests
yhmtsai Nov 6, 2024
a362d46
copy the call from lu/cholesky to ilu/ic and delete full_fillin
yhmtsai Nov 12, 2024
bc66df2
refine the wording and fix wrong bool value
yhmtsai Nov 20, 2024
c54a3ee
update to incomplete_factorization and throw with omp using sparselib
yhmtsai Nov 21, 2024
4e0ac80
fix Ref also considered as Omp executor
yhmtsai Nov 27, 2024
b73753c
use if constexpr when it is possible and reverse if-else
yhmtsai Nov 28, 2024
9c072cc
half base type
yhmtsai Oct 22, 2024
7a07052
half does not have constexpr constructor
yhmtsai Oct 22, 2024
82f79c1
fix the undefined behavior and the issue from big-endian, and extract…
yhmtsai Oct 22, 2024
d835227
jacobi use __half in device not gko::half now
yhmtsai Oct 22, 2024
be0c192
type map
yhmtsai Oct 22, 2024
5033387
fix error: non-constant-expression cannot be narrowed
yhmtsai Oct 23, 2024
cdde2f9
update gdb-ginkgo
yhmtsai Oct 23, 2024
a18e836
make half not rely on type
yhmtsai Oct 24, 2024
34a5ef5
collect the reused part and undef after usage
yhmtsai Oct 24, 2024
14ef89c
use memcpy not std::memcpy in hip
yhmtsai Oct 24, 2024
368cd08
add alignment
yhmtsai Nov 18, 2024
f198aac
delete the sycl half test as we do not enable it directly
yhmtsai Nov 18, 2024
a3982f2
use reference for half when it is possible
yhmtsai Nov 29, 2024
92edd8f
instantiation/testing/next/prev/stub type definition
yhmtsai Oct 23, 2024
5424184
half option
yhmtsai Oct 23, 2024
3f78b6c
device type mapping
yhmtsai Oct 23, 2024
91af999
consider custom namespace for thrust::complex<__half> and benchmark
yhmtsai Nov 28, 2024
9635e89
atomic and cooperative_groups
yhmtsai Oct 23, 2024
8f7c807
fix math and device_numeric_limit
yhmtsai Oct 23, 2024
d321ce7
array operation in half
yhmtsai Oct 24, 2024
8073f4b
matrix with half
yhmtsai Oct 23, 2024
d289c9d
device_matrix_data and mtx_io
yhmtsai Oct 24, 2024
b651cc4
components such as array/iterator/segmented_array test with half
yhmtsai Oct 24, 2024
c03fc10
matrix test with half
yhmtsai Oct 24, 2024
89a3d77
base such as composition/combination with half and corr. test
yhmtsai Oct 25, 2024
28936f5
test_utils test
yhmtsai Nov 4, 2024
65136e9
constexpr restriction for nvc++
yhmtsai Nov 21, 2024
dd3efb9
cuda with CC<70 and hip do not support 16 bit atomic. throw error or …
yhmtsai Nov 25, 2024
917fee3
implement half shuffle via 32 bit impl
yhmtsai Nov 26, 2024
0e46bbe
config
yhmtsai Oct 25, 2024
7676dea
residual with half
yhmtsai Oct 25, 2024
4883827
residual norm default reduction_factor respect to precision
yhmtsai Nov 12, 2024
310fd59
residual config dispatch
yhmtsai Oct 25, 2024
1de2a94
krylov solver
yhmtsai Oct 25, 2024
30296af
solver config dispatch
yhmtsai Oct 25, 2024
631956e
cuda with CC<70 and hip do not support 16 bit atomic. throw error for…
yhmtsai Nov 25, 2024
93cb4d6
triangular and direct solver
yhmtsai Oct 25, 2024
2acd309
workaround for half precision of load/store by using single precision…
yhmtsai Nov 4, 2024
95a3f21
delete the current unusable half memory op on shared memory
yhmtsai Nov 18, 2024
d01ad04
direct and tri config dispatch
yhmtsai Oct 25, 2024
7df26f0
factorization
yhmtsai Oct 25, 2024
ab14d4c
factorization config dispatch
yhmtsai Oct 25, 2024
e2c4b21
cmake cuda test with cuda arch and fix is_finite
yhmtsai Nov 5, 2024
4ad4404
figure out factorization test
yhmtsai Nov 5, 2024
8ce50ba
change the diagonal to reduce random on parilut/parict
yhmtsai Nov 13, 2024
0a7e869
multigrid and the multigridlevel
yhmtsai Oct 25, 2024
29ae390
pgm uses gko::max to avoid ambiguous in hip
yhmtsai Oct 28, 2024
2666ae1
multigrid config dispatch
yhmtsai Oct 25, 2024
58d5d03
preconditioner with half
yhmtsai Oct 25, 2024
2980e46
preconditioner config dispatch
yhmtsai Oct 25, 2024
32115ca
reorder with half
yhmtsai Oct 25, 2024
e9ec66b
change the default mc64 tolerance respect to precision
yhmtsai Nov 12, 2024
3800349
log with half
yhmtsai Oct 28, 2024
cb41bcb
dispatch with distributed needs to throw with half
yhmtsai Oct 29, 2024
9183749
revert the distribution value_type in dpcpp/test/preconditioner/jacob…
yhmtsai Nov 21, 2024
e066ac5
fix distributed mixed-precision pgm
yhmtsai Nov 24, 2024
62a12d8
fix type_size_impl for thrust::complex
yhmtsai Nov 28, 2024
d529ab6
do not support half for nvhpc23.3 due to signal 11
yhmtsai Nov 25, 2024
ef8dea3
split config to different file to overcome nvhpc limit
yhmtsai Nov 27, 2024
4a80809
make test target_name aligned with binary name, split isai due to nvh…
yhmtsai Nov 27, 2024
7d65761
batch with half
yhmtsai Oct 29, 2024
04a5b9b
batch test with half
yhmtsai Oct 29, 2024
24feb21
cuda/hip batch changes
yhmtsai Nov 20, 2024
126439d
add device_type in sycl
yhmtsai Oct 25, 2024
0819ce2
add device_type in kernel_launch
yhmtsai Oct 25, 2024
8bc3c97
reduction sycl type
yhmtsai Oct 25, 2024
45411ef
component sycl type
yhmtsai Oct 25, 2024
38d6e1e
matrix sycl type
yhmtsai Oct 25, 2024
1341726
factorization sycl type
yhmtsai Oct 25, 2024
022f4fb
solver/preconditioner/stop sycl type
yhmtsai Oct 25, 2024
7541753
factorization
yhmtsai Nov 19, 2024
d6c9e9d
matrix
yhmtsai Nov 19, 2024
082c1c2
preconditioner
yhmtsai Nov 19, 2024
6fb0afe
solver
yhmtsai Nov 19, 2024
d6da5be
stop
yhmtsai Nov 19, 2024
5122f53
sycl half
yhmtsai Nov 20, 2024
cce61ee
fix template expect argument
yhmtsai Nov 24, 2024
74dab1c
intel sycl introduces silent complex header loading chain.
yhmtsai Nov 22, 2024
75a6846
replace CL/sycl.hpp by sycl/sycl.hpp
yhmtsai Nov 25, 2024
66e4408
sycl does not support 16 bit atomic. throw error or fallback to worki…
yhmtsai Nov 25, 2024
0946934
only provide the custom operation for complex<half>
yhmtsai Nov 26, 2024
39b904c
move to gko::complex<sycl::half> alone
yhmtsai Nov 28, 2024
f99e203
update docuementation
yhmtsai Dec 3, 2024
169db0f
rename precision chain
yhmtsai Dec 2, 2024
5852c12
rename instantiation
yhmtsai Dec 2, 2024
e410e9b
rename testing type
yhmtsai Dec 2, 2024
28d305a
rename type list
yhmtsai Dec 2, 2024
64a170d
manual changes
yhmtsai Dec 2, 2024
8fb8d1a
remove unused data and type
yhmtsai Dec 3, 2024
52b2261
update the multigrid preconditioner usage in test/mpi/solver
yhmtsai Dec 3, 2024
98cd40b
add overload to create json obj from std::string
greole Nov 5, 2024
de217b8
[test] add parse_json_string test
greole Dec 5, 2024
53a8723
[test] add parse_json_string test
greole Dec 5, 2024
718ffe4
disable the half properly
yhmtsai Dec 4, 2024
c6434ff
update documentation
yhmtsai Dec 5, 2024
1291b36
[omp] fix RCM RAW
MarcelKoch Dec 6, 2024
ab53c62
[omp] fix sparsity csr RAW
MarcelKoch Dec 6, 2024
a29b772
[omp] fix kcycle stop WAW
MarcelKoch Dec 6, 2024
bd8afe6
[test] fix 3pt generation for batch solver
MarcelKoch Dec 6, 2024
051fea8
[omp] fix par IC and variants using atomic load/stores
MarcelKoch Dec 6, 2024
81c3b50
[mg] use atomics for match_edge
MarcelKoch Dec 6, 2024
51f3a9b
[core] deprecate the `master` branch
MarcelKoch Dec 4, 2024
d38fd96
[ci] disable CI on master
MarcelKoch Dec 4, 2024
aec24ba
[release] update gtest
MarcelKoch Dec 4, 2024
129a37d
[release] update changelog
MarcelKoch Dec 4, 2024
7b7992f
[release] fix changelog
MarcelKoch Dec 9, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
25 changes: 24 additions & 1 deletion .clang-format
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,30 @@ ForEachMacros:
- foreach
- Q_FOREACH
- BOOST_FOREACH
IncludeIsMainRegex: '([-_](test|unittest))?$'
IncludeBlocks: Regroup
IncludeCategories:
- Regex: '^<oneapi/dpl.*' # needs to be on top
Priority: -2
- Regex: '<[^.]+>' # standard library
Priority: 1
- Regex: '(^<(hip/hip_runtime|cuda(_runtime)?)\.h)|common/cuda_hip/base/runtime\.hpp$'
Priority: 2
SortPriority: 2
- Regex: '^<(omp|cu|hip|oneapi|thrust|CL/|cooperative|mpi|nvToolsExt|Kokkos).*'
Priority: 2
SortPriority: 3
- Regex: '^<(nlohmann|gflags|gtest|sde_lib|papi).*'
Priority: 4
- Regex: '<ginkgo/ginkgo.hpp>'
Priority: 6
- Regex: '^<ginkgo/.*'
Priority: 7
- Regex: '^<.*' # other library includes
Priority: 5
- Regex: '^.*'
Priority: 8
IncludeIsMainRegex: '(_(stub|kernels|kernels2|test))?$'
IncludeIsMainSourceRegex: '\.cu$|_kernels\.hpp$|\.dp\.cpp$'
IndentCaseLabels: false
IndentWidth: 4
IndentWrappedFunctionNames: false
Expand Down
2 changes: 0 additions & 2 deletions .github/bot-pr-base.sh
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,6 @@
source .github/bot-base.sh

EXTENSION_REGEX='\.(cuh?|hpp|hpp\.inc?|cpp)$'
FORMAT_HEADER_REGEX='^(benchmark|core|cuda|hip|include/ginkgo/core|omp|reference|dpcpp|common/unified|test)/'
FORMAT_REGEX='^(common|examples)/'
CLANG_FORMAT=clang-format-14

echo -n "Collecting information on triggering PR"
Expand Down
5 changes: 1 addition & 4 deletions .github/bot-pr-format-base.sh
Original file line number Diff line number Diff line change
Expand Up @@ -14,9 +14,7 @@ git config user.name "ginkgo-bot"

# save scripts from develop
cp .clang-format .pre-commit-config.yaml /tmp
pushd dev_tools/scripts || exit 1
cp format_header.sh update_ginkgo_header.sh /tmp
popd || exit 1
cp dev_tools/scripts/update_ginkgo_header.sh /tmp

# checkout current PR head
LOCAL_BRANCH=format-tmp-$HEAD_BRANCH
Expand All @@ -25,7 +23,6 @@ git checkout -b $LOCAL_BRANCH fork/$HEAD_BRANCH
# restore files from develop
cp /tmp/.clang-format .
cp /tmp/.pre-commit-config.yaml .
cp /tmp/format_header.sh dev_tools/scripts/
cp /tmp/update_ginkgo_header.sh dev_tools/scripts/

# make base pre-commit config available
Expand Down
10 changes: 5 additions & 5 deletions .github/workflows/intel.yml
Original file line number Diff line number Diff line change
Expand Up @@ -33,20 +33,20 @@ jobs:
- name: configure
run: |
source /etc/profile
module load intel-oneapi-compilers intel-oneapi-dpl intel-oneapi-mkl intel-oneapi-tbb cmake
module load intel-oneapi-compilers intel-oneapi-dpl intel-oneapi-mkl cmake
mkdir build
cd build
cmake .. -DCMAKE_INSTALL_PREFIX=install_ginkgo -DCMAKE_CXX_FLAGS="-Wpedantic -ffp-model=precise" -DCMAKE_CXX_COMPILER=${{ matrix.config.compiler }} -DCMAKE_BUILD_TYPE=${{ matrix.config.build_type }} -DGINKGO_MIXED_PRECISION=${{ matrix.config.mixed }} -DGINKGO_BUILD_CUDA=OFF -DGINKGO_BUILD_HIP=OFF -DGINKGO_BUILD_MPI=OFF -DGINKGO_DPCPP_SINGLE_MODE=ON
make -j8
ONEAPI_DEVICE_SELECTOR=level_zero:gpu ctest -j10 --output-on-failure

- name: install
run: |
source /etc/profile
module load intel-oneapi-compilers intel-oneapi-dpl intel-oneapi-mkl intel-oneapi-tbb cmake
module load intel-oneapi-compilers intel-oneapi-dpl intel-oneapi-mkl cmake
cd build
SYCL_DEVICE_FILTER=level_zero:gpu make install
export GINKGO_PATH="$(pwd)/install_ginkgo/lib"
export LIBRARY_PATH=${GINKGO_PATH}:$LIBRARY_PATH
export LD_LIBRARY_PATH=${GINKGO_PATH}:$LD_LIBRARY_PATH
export LIBRARY_PATH=${ICL_INTEL_TBB_ROOT}/lib64:${GINKGO_PATH}:$LIBRARY_PATH
export LD_LIBRARY_PATH=${ICL_INTEL_TBB_ROOT}/lib64:${GINKGO_PATH}:$LD_LIBRARY_PATH
SYCL_DEVICE_FILTER=level_zero:gpu make test_install
65 changes: 0 additions & 65 deletions .github/workflows/windows-mingw.yml

This file was deleted.

61 changes: 0 additions & 61 deletions .github/workflows/windows-msvc-cuda.yml

This file was deleted.

61 changes: 0 additions & 61 deletions .github/workflows/windows-msvc-ref.yml

This file was deleted.

Loading
Loading