Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kokkos Kernels: change the default offset ETI from size_t to int #2140

Merged
merged 1 commit into from
Mar 14, 2024

Conversation

lucbv
Copy link
Contributor

@lucbv lucbv commented Mar 13, 2024

This change makes it easier for customer to leverage TPL support which almost always requires offset=int, ordinal=int to be enabled meaning that no TPL support is available with our default ETI...

This change makes it easier for customer to leverage TPL support
which almost always requires offset=int, ordinal=int to be enabled
meaning that no TPL support is available with our default ETI...
@lucbv
Copy link
Contributor Author

lucbv commented Mar 13, 2024

I had a look this morning and the change is really trivial on our end so figured I would create the PR for this. As discussed yesterday there is no big rush to merge this ahead of 4.3.00 but if we feel confident Trilinos will not have issues with it we can add it, I'm not opposed to it.

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection is Not Necessary for this Pull Request.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1228
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 815
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 486
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 473
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 474
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 478
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 450
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110

  • Build Num: 960
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_GCC1020

  • Build Num: 955
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM560

  • Build Num: 946
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560

  • Build Num: 468
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (lucbv/kokkos-kernels)
  • Branch: KK_change_offset_eti_default
  • SHA: 85299db
  • Mode: TEST_REPO

Pull Request Author: lucbv

@ndellingwood
Copy link
Contributor

if we feel confident Trilinos will not have issues with it we can add it, I'm not opposed to it

(cue ominous, foreboding sounds...) :)

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs 30 Mins. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1228
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 815
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 486
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 473
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 474
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 478
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 450
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110

  • Build Num: 960
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_GCC1020

  • Build Num: 955
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM560

  • Build Num: 946
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560

  • Build Num: 468
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS
Console Output (last 100 lines) : KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight # 1228 (click to expand)

b' * [new tag]             3.1.00                      -> 3.1.00'
b' * [new tag]             3.1.01                      -> 3.1.01'
b' * [new tag]             3.2.00                      -> 3.2.00'
b' * [new tag]             3.2.01                      -> 3.2.01'
b' * [new tag]             3.3.00                      -> 3.3.00'
b' * [new tag]             3.3.01                      -> 3.3.01'
b' * [new tag]             3.4.00                      -> 3.4.00'
b' * [new tag]             3.4.01                      -> 3.4.01'
b' * [new tag]             3.5.00                      -> 3.5.00'
b' * [new tag]             3.6.00                      -> 3.6.00'
b' * [new tag]             3.6.01                      -> 3.6.01'
b' * [new tag]             3.7.00                      -> 3.7.00'
b' * [new tag]             3.7.01                      -> 3.7.01'
b' * [new tag]             4.0.00                      -> 4.0.00'
b' * [new tag]             4.0.01                      -> 4.0.01'
b' * [new tag]             4.1.00                      -> 4.1.00'
b' * [new tag]             4.2.01                      -> 4.2.01'
b' * [new tag]             papers/us-rse-escience-2022 -> papers/us-rse-escience-2022'
b' * [new tag]             4.2.00                      -> 4.2.00'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight'
b'***Forced exclusive execution'
b'Job <70528> is submitted to queue .'
b'<>'
b'<>'
b'Running on machine: weaver'
b"KokkosKernels Repository Status:  0787ce630a692a2d053af68c376faa01e450e2d0 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  cuda/11.2.2/gcc/8.3.1'
b'Testing compiler cuda/11.2.2/gcc/8.3.1'
b'Unrecognized compiler cuda/11.2.2/gcc/8.3.1 when looking for Spack variants'
b'Unrecognized compiler cuda/11.2.2/gcc/8.3.1 when looking for Spack variants'
b'Unrecognized compiler cuda/11.2.2/gcc/8.3.1 when looking for Spack variants'
b'  Starting job cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release'
b'kokkos devices: Cuda,OpenMP'
b'kokkos arch: Volta70'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release build_time=1348 run_time=459'
b'Running on machine: weaver'
b"KokkosKernels Repository Status:  0787ce630a692a2d053af68c376faa01e450e2d0 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  cuda/11.2.2/gcc/8.3.1'
b'Testing compiler cuda/11.2.2/gcc/8.3.1'
b'Unrecognized compiler cuda/11.2.2/gcc/8.3.1 when looking for Spack variants'
b'Unrecognized compiler cuda/11.2.2/gcc/8.3.1 when looking for Spack variants'
b'Unrecognized compiler cuda/11.2.2/gcc/8.3.1 when looking for Spack variants'
b'  Starting job cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release'
b'kokkos devices: Cuda,OpenMP'
b'kokkos arch: Volta70'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized '
b'extra_args:  --no-default-eti'
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutRight'
b'kokkoskernels tpls list: '
b'  PASSED cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release build_time=1707 run_time=451'
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10 # 815 (click to expand)

b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'  Starting job gcc-9.3.0-OpenMP-release'
b'kokkos devices: OpenMP'
b'kokkos arch: Power8,Pascal60'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED gcc-9.3.0-OpenMP-release'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'  Starting job gcc-9.3.0-Serial-release'
b'kokkos devices: Serial'
b'kokkos arch: Power8,Pascal60'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED gcc-9.3.0-Serial-release'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'  Starting job gcc-9.3.0-OpenMP_Serial-release'
b'kokkos devices: OpenMP,Serial'
b'kokkos arch: Power8,Pascal60'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED gcc-9.3.0-OpenMP_Serial-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'gcc-9.3.0-OpenMP-release build_time=552 run_time=109'
b'gcc-9.3.0-OpenMP_Serial-release build_time=752 run_time=250'
b'gcc-9.3.0-Serial-release build_time=506 run_time=100'
b'Running on machine: weaver'
b"KokkosKernels Repository Status:  8c9f95b7e186cadad8a2b8b38c1a8161679d17d5 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  gcc/9.3.0'
b'Testing compiler gcc/9.3.0'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'  Starting job gcc-9.3.0-OpenMP-release'
b'kokkos devices: OpenMP'
b'kokkos arch: Power9,Volta70'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: ,blas'
b'  PASSED gcc-9.3.0-OpenMP-release'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'Unrecognized compiler gcc/9.3.0 when looking for Spack variants'
b'  Starting job gcc-9.3.0-Serial-release'
b'kokkos devices: Serial'
b'kokkos arch: Power9,Volta70'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: ,blas'
b'  PASSED gcc-9.3.0-Serial-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'gcc-9.3.0-OpenMP-release build_time=535 run_time=105'
b'gcc-9.3.0-Serial-release build_time=491 run_time=97'
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_GNU1021 # 486 (click to expand)

b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/lucbv/kokkos-kernels # timeout=10'
b'Fetching upstream changes from https://github.com/lucbv/kokkos-kernels'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/lucbv/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse 85299db5a4dfdea55d1528295e814e9745aae9af^{commit} # timeout=10'
b'Checking out Revision 85299db5a4dfdea55d1528295e814e9745aae9af (detached)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 85299db5a4dfdea55d1528295e814e9745aae9af # timeout=10'
b'Commit message: "Kokkos Kernels: change the default offset ETI from size_t to int"'
b' > git rev-list --no-walk 6e9543c33b1b7fac7a11a91895491387b7e448b2 # timeout=10'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021/kokkos/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10'
b'Fetching upstream changes from https://github.com/kokkos/kokkos.git'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse origin/4.2.00^{commit} # timeout=10'
b' > git rev-parse 4.2.00^{commit} # timeout=10'
b'Checking out Revision 71a9bcae52543bd065522bf3e41b5bfa467d8015 (4.2.00)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'Commit message: "Merge pull request #6613 from ndellingwood/master-release-4.2.00"'
b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_GNU1021] $ /bin/bash -el /tmp/jenkins7295381610425059570.sh'
b'From https://github.com/kokkos/kokkos-kernels'
b' * branch                develop    -> FETCH_HEAD'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021'
b'srun: INFO: Adding filesystem licenses to job: qscratch:1,gpfs:1'
b'Running on machine: solo'
b"KokkosKernels Repository Status:  cd6d7c341417cc7aeabdeffe8d52c291eb6fdb42 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  gnu/10.2.1'
b'Testing compiler gnu/10.2.1'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'  Starting job gnu-10.2.1-Threads_Serial-release'
b'kokkos devices: Threads,Serial'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED gnu-10.2.1-Threads_Serial-release'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'  Starting job gnu-10.2.1-OpenMP-release'
b'kokkos devices: OpenMP'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED gnu-10.2.1-OpenMP-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'gnu-10.2.1-OpenMP-release build_time=478 run_time=137'
b'gnu-10.2.1-Threads_Serial-release build_time=659 run_time=215'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_GNU1021_Light_LayoutRight # 473 (click to expand)

b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/lucbv/kokkos-kernels # timeout=10'
b'Fetching upstream changes from https://github.com/lucbv/kokkos-kernels'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/lucbv/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse 85299db5a4dfdea55d1528295e814e9745aae9af^{commit} # timeout=10'
b'Checking out Revision 85299db5a4dfdea55d1528295e814e9745aae9af (detached)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 85299db5a4dfdea55d1528295e814e9745aae9af # timeout=10'
b'Commit message: "Kokkos Kernels: change the default offset ETI from size_t to int"'
b' > git rev-list --no-walk 6e9543c33b1b7fac7a11a91895491387b7e448b2 # timeout=10'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021_Light_LayoutRight/kokkos/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10'
b'Fetching upstream changes from https://github.com/kokkos/kokkos.git'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse origin/4.2.00^{commit} # timeout=10'
b' > git rev-parse 4.2.00^{commit} # timeout=10'
b'Checking out Revision 71a9bcae52543bd065522bf3e41b5bfa467d8015 (4.2.00)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'Commit message: "Merge pull request #6613 from ndellingwood/master-release-4.2.00"'
b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_GNU1021_Light_LayoutRight] $ /bin/bash -el /tmp/jenkins144863531238413974.sh'
b'From https://github.com/kokkos/kokkos-kernels'
b' * branch                develop    -> FETCH_HEAD'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021_Light_LayoutRight'
b'srun: INFO: Adding filesystem licenses to job: qscratch:1,gpfs:1'
b'Running on machine: solo'
b"KokkosKernels Repository Status:  adc9c0d1e4b5a2520825366faeaa4ca60b89e4b4 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  gnu/10.2.1'
b'Testing compiler gnu/10.2.1'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'  Starting job gnu-10.2.1-Threads_Serial-release'
b'kokkos devices: Threads,Serial'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args:  --no-default-eti'
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutRight'
b'kokkoskernels tpls list: '
b'  PASSED gnu-10.2.1-Threads_Serial-release'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'  Starting job gnu-10.2.1-OpenMP-release'
b'kokkos devices: OpenMP'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args:  --no-default-eti'
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutRight'
b'kokkoskernels tpls list: '
b'  PASSED gnu-10.2.1-OpenMP-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'gnu-10.2.1-OpenMP-release build_time=436 run_time=129'
b'gnu-10.2.1-Threads_Serial-release build_time=600 run_time=194'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021_Light_LayoutRight'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_GNU1021 # 474 (click to expand)

b'Running as SYSTEM'
b'[EnvInject] - Loading node environment variables.'
b'Building remotely on solo in workspace /gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_GNU1021'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_GNU1021/kokkos-kernels/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/lucbv/kokkos-kernels # timeout=10'
b'Fetching upstream changes from https://github.com/lucbv/kokkos-kernels'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/lucbv/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse 85299db5a4dfdea55d1528295e814e9745aae9af^{commit} # timeout=10'
b'Checking out Revision 85299db5a4dfdea55d1528295e814e9745aae9af (detached)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 85299db5a4dfdea55d1528295e814e9745aae9af # timeout=10'
b'Commit message: "Kokkos Kernels: change the default offset ETI from size_t to int"'
b' > git rev-list --no-walk 6e9543c33b1b7fac7a11a91895491387b7e448b2 # timeout=10'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_GNU1021/kokkos/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10'
b'Fetching upstream changes from https://github.com/kokkos/kokkos.git'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse origin/4.2.00^{commit} # timeout=10'
b' > git rev-parse 4.2.00^{commit} # timeout=10'
b'Checking out Revision 71a9bcae52543bd065522bf3e41b5bfa467d8015 (4.2.00)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'Commit message: "Merge pull request #6613 from ndellingwood/master-release-4.2.00"'
b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_Tpls_GNU1021] $ /bin/bash -el /tmp/jenkins3959333640904390237.sh'
b'From https://github.com/kokkos/kokkos-kernels'
b' * branch                develop    -> FETCH_HEAD'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_GNU1021'
b'srun: INFO: Adding filesystem licenses to job: qscratch:1,gpfs:1'
b'Running on machine: solo'
b"KokkosKernels Repository Status:  b77bf17467081ee48838f16c78324ffb776d5cdb Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  gnu/10.2.1'
b'Testing compiler gnu/10.2.1'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'Unrecognized compiler gnu/10.2.1 when looking for Spack variants'
b'  Starting job gnu-10.2.1-OpenMP_Serial-release'
b'kokkos devices: OpenMP,Serial'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: ,blas'
b'  PASSED gnu-10.2.1-OpenMP_Serial-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'gnu-10.2.1-OpenMP_Serial-release build_time=680 run_time=252'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_GNU1021'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_INTEL19_solo # 478 (click to expand)

b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/lucbv/kokkos-kernels # timeout=10'
b'Fetching upstream changes from https://github.com/lucbv/kokkos-kernels'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/lucbv/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse 85299db5a4dfdea55d1528295e814e9745aae9af^{commit} # timeout=10'
b'Checking out Revision 85299db5a4dfdea55d1528295e814e9745aae9af (detached)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 85299db5a4dfdea55d1528295e814e9745aae9af # timeout=10'
b'Commit message: "Kokkos Kernels: change the default offset ETI from size_t to int"'
b' > git rev-list --no-walk 6e9543c33b1b7fac7a11a91895491387b7e448b2 # timeout=10'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_INTEL19_solo/kokkos/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10'
b'Fetching upstream changes from https://github.com/kokkos/kokkos.git'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse origin/4.2.00^{commit} # timeout=10'
b' > git rev-parse 4.2.00^{commit} # timeout=10'
b'Checking out Revision 71a9bcae52543bd065522bf3e41b5bfa467d8015 (4.2.00)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'Commit message: "Merge pull request #6613 from ndellingwood/master-release-4.2.00"'
b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_Tpls_INTEL19_solo] $ /bin/bash -el /tmp/jenkins4659095565483577431.sh'
b'From https://github.com/kokkos/kokkos-kernels'
b' * branch                develop    -> FETCH_HEAD'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_INTEL19_solo'
b'srun: INFO: Adding filesystem licenses to job: qscratch:1,gpfs:1'
b'Running on machine: solo'
b"KokkosKernels Repository Status:  1fb84ac129ba504835d32378fbfb4e91cab21be2 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  intel/19.0.5.281'
b'Testing compiler intel/19.0.5.281'
b'Unrecognized compiler intel/19.0.5.281 when looking for Spack variants'
b'Unrecognized compiler intel/19.0.5.281 when looking for Spack variants'
b'Unrecognized compiler intel/19.0.5.281 when looking for Spack variants'
b'  Starting job intel-19.0.5.281-OpenMP-release'
b'kokkos devices: OpenMP'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized -diag-disable=1011 -diag-disable=869 -diag-disable=1011 -diag-disable=869'
b'extra_args:  --disable-perftests'
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: ,mkl'
b'  PASSED intel-19.0.5.281-OpenMP-release'
b'Unrecognized compiler intel/19.0.5.281 when looking for Spack variants'
b'Unrecognized compiler intel/19.0.5.281 when looking for Spack variants'
b'Unrecognized compiler intel/19.0.5.281 when looking for Spack variants'
b'  Starting job intel-19.0.5.281-Threads-release'
b'kokkos devices: Threads'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized -diag-disable=1011 -diag-disable=869 -diag-disable=1011 -diag-disable=869'
b'extra_args:  --disable-perftests'
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: ,mkl'
b'  PASSED intel-19.0.5.281-Threads-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'intel-19.0.5.281-OpenMP-release build_time=1723 run_time=108'
b'intel-19.0.5.281-Threads-release build_time=1214 run_time=91'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_INTEL19_solo'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_CLANG1001_solo # 450 (click to expand)

b'Running as SYSTEM'
b'[EnvInject] - Loading node environment variables.'
b'Building remotely on solo in workspace /gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo/kokkos-kernels/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/lucbv/kokkos-kernels # timeout=10'
b'Fetching upstream changes from https://github.com/lucbv/kokkos-kernels'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/lucbv/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse 85299db5a4dfdea55d1528295e814e9745aae9af^{commit} # timeout=10'
b'Checking out Revision 85299db5a4dfdea55d1528295e814e9745aae9af (detached)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 85299db5a4dfdea55d1528295e814e9745aae9af # timeout=10'
b'Commit message: "Kokkos Kernels: change the default offset ETI from size_t to int"'
b' > git rev-list --no-walk 6e9543c33b1b7fac7a11a91895491387b7e448b2 # timeout=10'
b'The recommended git tool is: NONE'
b'No credentials specified'
b' > git rev-parse --resolve-git-dir /gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo/kokkos/.git # timeout=10'
b'Fetching changes from the remote Git repository'
b' > git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10'
b'Fetching upstream changes from https://github.com/kokkos/kokkos.git'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.30.1'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse origin/4.2.00^{commit} # timeout=10'
b' > git rev-parse 4.2.00^{commit} # timeout=10'
b'Checking out Revision 71a9bcae52543bd065522bf3e41b5bfa467d8015 (4.2.00)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'Commit message: "Merge pull request #6613 from ndellingwood/master-release-4.2.00"'
b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_CLANG1001_solo] $ /bin/bash -el /tmp/jenkins3049028605989536059.sh'
b'From https://github.com/kokkos/kokkos-kernels'
b' * branch                develop    -> FETCH_HEAD'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo'
b'srun: INFO: Adding filesystem licenses to job: qscratch:1,gpfs:1'
b'Running on machine: solo'
b"KokkosKernels Repository Status:  79840dd58acccc95d50f62435c7385642fc52f1e Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  llvm/10.0.1'
b'Testing compiler llvm/10.0.1'
b'Unrecognized compiler llvm/10.0.1 when looking for Spack variants'
b'Unrecognized compiler llvm/10.0.1 when looking for Spack variants'
b'Unrecognized compiler llvm/10.0.1 when looking for Spack variants'
b'  Starting job llvm-10.0.1-Threads_Serial-release'
b'kokkos devices: Threads,Serial'
b'kokkos arch: BDW'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED llvm-10.0.1-Threads_Serial-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'llvm-10.0.1-Threads_Serial-release build_time=729 run_time=226'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110 # 960 (click to expand)

b' * [new tag]             3.5.00                      -> 3.5.00'
b' * [new tag]             3.6.00                      -> 3.6.00'
b' * [new tag]             3.6.01                      -> 3.6.01'
b' * [new tag]             3.7.00                      -> 3.7.00'
b' * [new tag]             3.7.01                      -> 3.7.01'
b' * [new tag]             4.0.00                      -> 4.0.00'
b' * [new tag]             4.0.01                      -> 4.0.01'
b' * [new tag]             4.1.00                      -> 4.1.00'
b' * [new tag]             4.2.01                      -> 4.2.01'
b' * [new tag]             papers/us-rse-escience-2022 -> papers/us-rse-escience-2022'
b' * [new tag]             4.2.00                      -> 4.2.00'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110'
b''
b'Currently Loaded Modules:'
b'  1) gcc/10.2.0       11) numactl/2.0.12          21) parmetis/4.0.3'
b'  2) autoconf/2.69    12) hwloc/1.11.11           22) metis/5.1.0'
b'  3) automake/1.16.1  13) pmix/2.2.3              23) openblas/0.3.4'
b'  4) libtool/2.4.6    14) ucx/1.7.0               24) superlu/5.2.1'
b'  5) cmake/3.17.0     15) openmpi/4.0.2           25) superlu-dist/5.4.0'
b'  6) git/2.19.2       16) curl/7.71.0             26) boost/1.72.0'
b'  7) zlib/1.2.11      17) netcdf-c/4.6.3          27) fftw/3.3.8'
b'  8) bzip2/1.0.6      18) parallel-netcdf/1.11.1  28) singularity/3.5.3'
b'  9) xz/5.2.4         19) hdf5/1.10.5             29) devpack-gnu10/20201002'
b' 10) yaml-cpp/0.6.2   20) cgns/3.4.0'
b''
b' '
b''
b''
b'Currently Loaded Modules:'
b'  1) cmake/3.17.0'
b''
b' '
b''
b'salloc: Granted job allocation 4970'
b'Running on machine: inouye'
b"KokkosKernels Repository Status:  30a5077611550b361a53fa4bc5fc528ae9c7a825 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  armpl/21.1.0'
b'Testing compiler armpl/21.1.0'
b'Unrecognized compiler armpl/21.1.0 when looking for Spack variants'
b'Unrecognized compiler armpl/21.1.0 when looking for Spack variants'
b'Unrecognized compiler armpl/21.1.0 when looking for Spack variants'
b'  Starting job armpl-21.1.0-OpenMP-release'
b'kokkos devices: OpenMP'
b'kokkos arch: A64FX'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: armpl,armpl'
b'  PASSED armpl-21.1.0-OpenMP-release'
b'Unrecognized compiler armpl/21.1.0 when looking for Spack variants'
b'Unrecognized compiler armpl/21.1.0 when looking for Spack variants'
b'Unrecognized compiler armpl/21.1.0 when looking for Spack variants'
b'  Starting job armpl-21.1.0-Serial-release'
b'kokkos devices: Serial'
b'kokkos arch: A64FX'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: armpl,armpl'
b'  PASSED armpl-21.1.0-Serial-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'armpl-21.1.0-OpenMP-release build_time=2100 run_time=177'
b'armpl-21.1.0-Serial-release build_time=1847 run_time=253'
b'salloc: Relinquishing job allocation 4970'
b'/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_A64FX_GCC1020 # 955 (click to expand)

b' * [new branch]          release-candidate-4.1.00    -> upstream/release-candidate-4.1.00'
b' * [new branch]          release-candidate-4.2.00    -> upstream/release-candidate-4.2.00'
b' * [new branch]          release-candidate-4.2.01    -> upstream/release-candidate-4.2.01'
b' * [new tag]             2.6.00                      -> 2.6.00'
b' * [new tag]             2.7.00                      -> 2.7.00'
b' * [new tag]             2.7.24                      -> 2.7.24'
b' * [new tag]             2.8.00                      -> 2.8.00'
b' * [new tag]             2.9.00                      -> 2.9.00'
b' * [new tag]             3.0.00                      -> 3.0.00'
b' * [new tag]             3.1.00                      -> 3.1.00'
b' * [new tag]             3.1.01                      -> 3.1.01'
b' * [new tag]             3.2.00                      -> 3.2.00'
b' * [new tag]             3.2.01                      -> 3.2.01'
b' * [new tag]             3.3.00                      -> 3.3.00'
b' * [new tag]             3.3.01                      -> 3.3.01'
b' * [new tag]             3.4.00                      -> 3.4.00'
b' * [new tag]             3.4.01                      -> 3.4.01'
b' * [new tag]             3.5.00                      -> 3.5.00'
b' * [new tag]             3.6.00                      -> 3.6.00'
b' * [new tag]             3.6.01                      -> 3.6.01'
b' * [new tag]             3.7.00                      -> 3.7.00'
b' * [new tag]             3.7.01                      -> 3.7.01'
b' * [new tag]             4.0.00                      -> 4.0.00'
b' * [new tag]             4.0.01                      -> 4.0.01'
b' * [new tag]             4.1.00                      -> 4.1.00'
b' * [new tag]             4.2.01                      -> 4.2.01'
b' * [new tag]             papers/us-rse-escience-2022 -> papers/us-rse-escience-2022'
b' * [new tag]             4.2.00                      -> 4.2.00'
b"Merge made by the 'recursive' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_GCC1020'
b''
b'Currently Loaded Modules:'
b'  1) gcc/10.2.0       11) numactl/2.0.12          21) parmetis/4.0.3'
b'  2) autoconf/2.69    12) hwloc/1.11.11           22) metis/5.1.0'
b'  3) automake/1.16.1  13) pmix/2.2.3              23) openblas/0.3.4'
b'  4) libtool/2.4.6    14) ucx/1.7.0               24) superlu/5.2.1'
b'  5) cmake/3.17.0     15) openmpi/4.0.2           25) superlu-dist/5.4.0'
b'  6) git/2.19.2       16) curl/7.71.0             26) boost/1.72.0'
b'  7) zlib/1.2.11      17) netcdf-c/4.6.3          27) fftw/3.3.8'
b'  8) bzip2/1.0.6      18) parallel-netcdf/1.11.1  28) singularity/3.5.3'
b'  9) xz/5.2.4         19) hdf5/1.10.5             29) devpack-gnu10/20201002'
b' 10) yaml-cpp/0.6.2   20) cgns/3.4.0'
b''
b' '
b''
b''
b'Currently Loaded Modules:'
b'  1) cmake/3.17.0'
b''
b' '
b''
b'salloc: Granted job allocation 4971'
b'Running on machine: inouye'
b"KokkosKernels Repository Status:  30a5077611550b361a53fa4bc5fc528ae9c7a825 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  gcc/10.2.0'
b'Testing compiler gcc/10.2.0'
b'Unrecognized compiler gcc/10.2.0 when looking for Spack variants'
b'Unrecognized compiler gcc/10.2.0 when looking for Spack variants'
b'Unrecognized compiler gcc/10.2.0 when looking for Spack variants'
b'  Starting job gcc-10.2.0-OpenMP_Serial-release'
b'kokkos devices: OpenMP,Serial'
b'kokkos arch: A64FX'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'  PASSED gcc-10.2.0-OpenMP_Serial-release'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'gcc-10.2.0-OpenMP_Serial-release build_time=2847 run_time=570'
b'salloc: Relinquishing job allocation 4971'
b'/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_GCC1020'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_VEGA90A_ROCM560 # 946 (click to expand)

b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_VEGA90A_ROCM560] $ /bin/bash -el /tmp/jenkins6293582868906551933.sh'
b'From https://github.com/kokkos/kokkos-kernels'
b' * [new branch]          develop                     -> upstream/develop'
b' * [new branch]          master                      -> upstream/master'
b' * [new branch]          release-candidate-4.1.00    -> upstream/release-candidate-4.1.00'
b' * [new branch]          release-candidate-4.2.00    -> upstream/release-candidate-4.2.00'
b' * [new branch]          release-candidate-4.2.01    -> upstream/release-candidate-4.2.01'
b' * [new tag]             2.6.00                      -> 2.6.00'
b' * [new tag]             2.7.00                      -> 2.7.00'
b' * [new tag]             2.7.24                      -> 2.7.24'
b' * [new tag]             2.8.00                      -> 2.8.00'
b' * [new tag]             2.9.00                      -> 2.9.00'
b' * [new tag]             3.0.00                      -> 3.0.00'
b' * [new tag]             3.1.00                      -> 3.1.00'
b' * [new tag]             3.1.01                      -> 3.1.01'
b' * [new tag]             3.2.00                      -> 3.2.00'
b' * [new tag]             3.2.01                      -> 3.2.01'
b' * [new tag]             3.3.00                      -> 3.3.00'
b' * [new tag]             3.3.01                      -> 3.3.01'
b' * [new tag]             3.4.00                      -> 3.4.00'
b' * [new tag]             3.4.01                      -> 3.4.01'
b' * [new tag]             3.5.00                      -> 3.5.00'
b' * [new tag]             3.6.00                      -> 3.6.00'
b' * [new tag]             3.6.01                      -> 3.6.01'
b' * [new tag]             3.7.00                      -> 3.7.00'
b' * [new tag]             3.7.01                      -> 3.7.01'
b' * [new tag]             4.0.00                      -> 4.0.00'
b' * [new tag]             4.0.01                      -> 4.0.01'
b' * [new tag]             4.1.00                      -> 4.1.00'
b' * [new tag]             4.2.01                      -> 4.2.01'
b' * [new tag]             papers/us-rse-escience-2022 -> papers/us-rse-escience-2022'
b' * [new tag]             4.2.00                      -> 4.2.00'
b"Merge made by the 'ort' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_ROCM560'
b'srun: job 1009868 queued and waiting for resources'
b'srun: job 1009868 has been allocated resources'
b'Running on machine: vega90a_caraway'
b"KokkosKernels Repository Status:  1cb64efcb12b570ca4c6ad704981e8941c0a16cf Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  rocm/5.6.0'
b'Testing compiler rocm/5.6.0'
b'Unrecognized compiler rocm/5.6.0 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.0 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.0 when looking for Spack variants'
b'  FAILED rocm-5.6.0-Hip_Serial-release'
b'SETUP_ENV: compiler=rocm/5.6.0 modules=cmake rocm/5.6.0'
b'Lmod has detected the following error: The following module(s) are unknown:'
b'"rocm/5.6.0"'
b''
b'Please check the spelling or version number. Also try "module spider ..."'
b'It is also possible your cache file is out-of-date; it may help to try:'
b'  $ module --ignore_cache load "rocm/5.6.0"'
b''
b'Also make sure that all modulefiles written in TCL start with the string'
b'#%Module'
b''
b''
b''
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'#######################################################'
b'FAILED TESTS'
b'#######################################################'
b'rocm-5.6.0-Hip_Serial-release (configure failed)'
b'#######################################################'
b'  # Reproducer instructions:'
b'  #   Load modules:'
b'        module purge'
b'        module load cmake rocm/5.6.0'
b'        export OMP_NUM_THREADS=8'
b'        export OMP_PROC_BIND=spread'
b'        export OMP_PLACES=cores'
b'        export KOKKOS_NUM_THREADS=8'
b''
b'cat: /home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_ROCM560/KokkosKernels_PullRequest_VEGA90A_ROCM560.946/TestAll_2024-03-13_11.50.35/rocm/5.6.0/Hip_Serial-release/call_generate_makefile_genericpath.sh: No such file or directory'
b'srun: error: lean1: task 0: Exited with exit code 1'
b"Build step 'Execute shell' marked build as failure"
b'Finished: FAILURE'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560 # 468 (click to expand)

b'Fetching upstream changes from https://github.com/kokkos/kokkos.git'
b' > git --version # timeout=10'
b" > git --version # 'git version 2.39.3'"
b'Setting http proxy: proxy.sandia.gov:80'
b' > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10'
b' > git rev-parse origin/4.2.00^{commit} # timeout=10'
b' > git rev-parse 4.2.00^{commit} # timeout=10'
b'Checking out Revision 71a9bcae52543bd065522bf3e41b5bfa467d8015 (4.2.00)'
b' > git config core.sparsecheckout # timeout=10'
b' > git checkout -f 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'Commit message: "Merge pull request #6613 from ndellingwood/master-release-4.2.00"'
b' > git rev-list --no-walk 71a9bcae52543bd065522bf3e41b5bfa467d8015 # timeout=10'
b'[KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560] $ /bin/bash -el /tmp/jenkins5621484687110778812.sh'
b"Merge made by the 'ort' strategy."
b' batched/dense/impl/KokkosBatched_Gesv_Impl.hpp |  36 +-'
b' batched/dense/src/KokkosBatched_Gesv.hpp       |  13 +-'
b' ode/impl/KokkosODE_BDF_impl.hpp                | 532 ++++++++++++++++'
b' ode/impl/KokkosODE_Newton_impl.hpp             |  55 +-'
b' ode/src/KokkosODE_BDF.hpp                      | 227 +++++++'
b' ode/src/KokkosODE_Newton.hpp                   |  10 +-'
b' ode/src/KokkosODE_Types.hpp                    |  13 +-'
b' ode/unit_test/Test_ODE.hpp                     |   1 +'
b' ode/unit_test/Test_ODE_BDF.hpp                 | 830 +++++++++++++++++++++++++'
b' ode/unit_test/Test_ODE_Newton.hpp              |  31 +-'
b' perf_test/ode/CMakeLists.txt                   |   4 +'
b' perf_test/ode/KokkosODE_BDF.cpp                | 266 ++++++++'
b' 12 files changed, 1971 insertions(+), 47 deletions(-)'
b' create mode 100644 ode/impl/KokkosODE_BDF_impl.hpp'
b' create mode 100644 ode/src/KokkosODE_BDF.hpp'
b' create mode 100644 ode/unit_test/Test_ODE_BDF.hpp'
b' create mode 100644 perf_test/ode/KokkosODE_BDF.cpp'
b'/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560'
b'srun: job 1009869 queued and waiting for resources'
b'srun: job 1009869 has been allocated resources'
b''
b'----------------------------------------------------------------------------'
b'  rocm:'
b'----------------------------------------------------------------------------'
b'     Versions:'
b'        rocm/5.2.0'
b'        rocm/5.6.1'
b'        rocm/6.0.0'
b''
b'----------------------------------------------------------------------------'
b'  For detailed information about a specific "rocm" package (including how to load the modules) use the module\'s full name.'
b'  Note that names that have a trailing (E) are extensions provided by other modules.'
b'  For example:'
b''
b'     $ module spider rocm/6.0.0'
b'----------------------------------------------------------------------------'
b''
b' '
b''
b'Running on machine: vega90a_caraway'
b"KokkosKernels Repository Status:  1cb64efcb12b570ca4c6ad704981e8941c0a16cf Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  71a9bcae52543bd065522bf3e41b5bfa467d8015 Merge pull request #6613 from ndellingwood/master-release-4.2.00'
b''
b''
b'Going to test compilers:  rocm/5.6.0'
b'Testing compiler rocm/5.6.0'
b'Unrecognized compiler rocm/5.6.0 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.0 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.0 when looking for Spack variants'
b'  FAILED rocm-5.6.0-Hip_Serial-release'
b'SETUP_ENV: compiler=rocm/5.6.0 modules=cmake rocm/5.6.0 openblas/0.3.20/rocm/5.2.0'
b'Lmod has detected the following error: The following module(s) are unknown:'
b'"rocm/5.6.0"'
b''
b'Please check the spelling or version number. Also try "module spider ..."'
b'It is also possible your cache file is out-of-date; it may help to try:'
b'  $ module --ignore_cache load "rocm/5.6.0"'
b''
b'Also make sure that all modulefiles written in TCL start with the string'
b'#%Module'
b''
b''
b''
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'#######################################################'
b'FAILED TESTS'
b'#######################################################'
b'rocm-5.6.0-Hip_Serial-release (configure failed)'
b'#######################################################'
b'  # Reproducer instructions:'
b'  #   Load modules:'
b'        module purge'
b'        module load cmake rocm/5.6.0 openblas/0.3.20/rocm/5.2.0'
b'        export OMP_NUM_THREADS=8'
b'        export OMP_PROC_BIND=spread'
b'        export OMP_PLACES=cores'
b'        export KOKKOS_NUM_THREADS=8'
b''
b'cat: /home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560/KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM560.468/TestAll_2024-03-13_11.50.36/rocm/5.6.0/Hip_Serial-release/call_generate_makefile_genericpath.sh: No such file or directory'
b'srun: error: lean3: task 0: Exited with exit code 1'
b"Build step 'Execute shell' marked build as failure"
b'Sending e-mails to: cwpears@sandia.gov'
b'Finished: FAILURE'
b''

@lucbv
Copy link
Contributor Author

lucbv commented Mar 13, 2024

@ndellingwood agreed, as I said it is not a top priority to have it in 4.3.00 so if everything goes smoothly with all other outstanding issues we can try to bring that in the release branch, otherwise I would rather have the release done well and get this into 4.4.00. In any case we can merge it in develop and observe the effect on the nightly builds...

@lucbv lucbv added the AT: RETEST Have this PR retested. label Mar 14, 2024
@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - User Requested Retest - Label AT: RETEST will be reset after testing.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1232
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 819
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 490
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 477
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 478
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 482
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 454
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110

  • Build Num: 964
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_GCC1020

  • Build Num: 959
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM561

  • Build Num: 950
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561

  • Build Num: 472
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (lucbv/kokkos-kernels)
  • Branch: KK_change_offset_eti_default
  • SHA: 85299db
  • Mode: TEST_REPO

Pull Request Author: lucbv

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Jenkins Testing: all Jobs PASSED

Pull Request Auto Testing has PASSED (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1232
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 819
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 490
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 477
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 478
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 482
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 454
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110

  • Build Num: 964
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_A64FX_GCC1020

  • Build Num: 959
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM561

  • Build Num: 950
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561

  • Build Num: 472
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/lucbv/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 85299db
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA 519ef7b
PR_LABELS enhancement;feature request;AT: RETEST
PULLREQUESTNUM 2140
TEST_REPO_ALIAS KOKKOSKERNELS

@kokkos-devops-admin kokkos-devops-admin removed the AT: RETEST Have this PR retested. label Mar 14, 2024
@kokkos-devops-admin
Copy link

Status Flag 'Pre-Merge Inspection' - SUCCESS: The last commit to this Pull Request has been INSPECTED AND APPROVED by [ brian-kelley ]!

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Pull Request MUST BE MERGED MANUALLY BY Project Team - This Repo does not support Automerge

@lucbv lucbv merged commit 3a5498d into kokkos:develop Mar 14, 2024
10 checks passed
@lucbv lucbv deleted the KK_change_offset_eti_default branch March 14, 2024 14:52
brian-kelley pushed a commit to brian-kelley/kokkos-kernels that referenced this pull request Mar 14, 2024
…kos#2140)

This change makes it easier for customer to leverage TPL support
which almost always requires offset=int, ordinal=int to be enabled
meaning that no TPL support is available with our default ETI...
ndellingwood added a commit that referenced this pull request Mar 19, 2024
lucbv added a commit that referenced this pull request Apr 8, 2024
* HIP: since Kokkos has moved it out of experimental we should clean up

Just reflecting the move of HIP and HIPSpace out of experimental
so that we do not get deprecation warning and even failures down
the road. This was really done in Kokkos Core 4.0.0 so it is time
to catch up...

* Applying clang-format

* Sparse: fix cusparse spgemm hang properly

The issue is fixed by disabling the TPL in spec_avail
when a problematic version of CUDA/cuSPARSE is being used.

* Sparse: fix logic for bad cursparse spgemm version.

Just inverted the logic statement to enable the TPL when it is
known to work correctly.

* Improvements on the unification attempt logic for axpby(), including new tests

* Addressing feedbacks from Luc, plus some small changes here and there:

In KokkosBlas1_axpby_unification_attempt.hpp:
- Removed unnecessary variables, routines, and checks
- Imposed terminology consistency: variable names begin with lower case letters, type names begin with upper case letters
- Using static_assert as much as possible
- Using 'public' and 'private' keywords accordingly
- Improved some explanations and error messages

In KokkosBlas1_axpby_spec.hpp:
- Replace 'a' and 'b' by 'scalar_x' and 'scalar_y' where appropriate, to keep consistency with the terminology used in the 'impl' and 'mv_impl' files of the axpby operation.
- Not using the 'KOKKOSBLAS_OPTIMIZATION_LEVEL_AXPBY' define anymore. Code is now consistent with the 'old' value 3 for such define.

In KokkosBlas1_axpby_impl.hpp and KokkosBlas1_axpby_mv_impl.hpp:
- Not using the 'KOKKOSBLAS_OPTIMIZATION_LEVEL_AXPBY' define anymore. Code is now consistent with the 'old' value 3 for such define.
- Using 'if constexpr' whenever possible
- Checking that -1 <= scalar_x <= 2 and that -1 <= scalar_y <= 2
- Replaced '} else {' by '} else if (scalar_x == 2)' or by '} else if (scalar_y == 2)', whenever possible
- Improved error messages
- Improved explanation headers a bit

In KokkosBlas1_axpby.hpp:
- Renamed some variables to more meaningful names

* Formatting

* Using 'ifdef HAVE_KOKKOSKERNELS_DEBUG', per Luc's suggestion

* Addressing feedbacks from Luc

* Correcting compilation errors in my Mac

* Backup

* SYR2: fix unit-test type issue

On KokkosEco_Trilinos_Weaver_CUDA112_opt-uvm the SYR2 test
enerates a compile time error probably due to a mixed use of host
and device views when comparing implemented vs. reference results.

* CUDA 11.0.1 / cuSPARSE 11.0.0 changed SpMM enums

* SYR2: applying clang-format

* CUDA 11.2.1 / cuSPARSE 11.4.0 changed SpMV

* KokkosBlas1_axpby: include <iostream> for debug builds

Resolve compilation errors in debug mode:
"error: no member named 'cout' in namespace 'std';"

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Backup

* Address CI build errors

* Some cleanup on current pull request, making it more related to 'just' the creation of the lapack subdirectory and the moving of some files to there

* More cleanup

* Re-enabling gesv unit tests under the lapack subdirectory

* Adding BLAS routines back, for backwards compatibility

* Formatting

* Small cleaning

* Correcting error in Jenkins

* Fixing compilation error on Jenkins when dealing with HIP

* Add required rtd conf file

* README.md: Use correct project slug

* docs/requirements.txt: Add sphinx-rtd-theme

* Addressing latest feedbacks from Luc.

* Formatting

* KokkosKernelsConfig.cmake: add all_libs target and necessary aliases

* Intent of these changes is to allow for building Trilinos with
  KokkosKernels as an external TPL

* hide native merge-path SpMV behind "native-merge"

* test native-merge algorithm

* Quick fix for night compilation with Trilinos

* SPTRSV: check if cusparse is available before calling TPL path

Since SpTRSV does not implement the TPL layer the usual way we need
to be extra careful before calling the TPL implementation path. If
cusparse is not available then we definitely want to revert back to
calling the native implementation. Similarly, if the execution space
is not Kokkos::CUDA, let's use the native implementation.

* SpTRSV: more strickly check prerequisites in SptrsvHandle

Check that CUSPARSE is enabled and that HandleExecSpace is
Kokkos::CUDA before allowing users to set the implementation to use
the CUSPARSE TPL.

* SpTRSV: fix some type definition and variable usaged for cuSPARSE

Since we are guarding the cusparse path a bit better we need to be
careful when some types are defined and to mark some variables
(void) when they do not get used by an implementation...

* SpTRSV: applying clang-format

* SpTRSV: more fixes

* SpTRSV: apply clang-format

* SYCL: fix for Trilinos build with MKL

* Apply clang-format to non-cmake files

* SYR2: fix issue with bad type in test function

After comparing various function signatures and view types, the change
allows tests to pass correctly and seem correct based on input params.

* Update Test_Blas2_syr2.hpp

Fix mistake in host/device view argument

* LAPACK: adding rocsolver TPL

Adding the necessary CMake logic and TPL layer to support rocsolver
for LAPACK. Enabling the TPL in gesv and updating gesv test to run
by default the more common configurations and only run specific ones
when the associated TPL (MAGMA) is enabled.

* Lapack: change according to Brian's review

The SpaceAccessibility of IPIVV needs to be modified for MAGMA.
The value_type of IPIVV needs to be rocblas_int when running with
rocSOLVER.

The types used for gesv_tpl_spec_avail and the actual TPL
instantiation where mismatched leading to linker error.

* cmake/Dependencies.cmake: remove ROCSOLVER

Removing ROCSOLVER to prevent configuration errors with Trilinos
Will bring back when support is added in Trilinos for ROCSOLVER as TPL

* Lapack: cusolver TPL logic and support for gesv

Adding CMake logic to support cusolver and implementing gesv using
cusolver getrf and getrs. Unit-test is passing without problems!

* Lapack: updating logic in cm_generate_makefile for cusolver

There is some specific TPL logic in cm_generate_makefile and it
cannot be found for cusolver, changing that might to the trick!

* Backup

* Backup

* Backup

* Formatting

* mv_unification tests with double are failing by very small amounts, e.g. 5.9e-14 vs. 3.6e-14

* Trying one more increment on tolerance

* Putting pragma's and unrolls properly right before for loops (compilation warning at weaver)

* Giving it another try to larger tolarance, after fixing the warning on pragma and unroll

* Lapack: gesv, implementing review commments

* Adding Changelog for Release 4.2.0 (#2031)

* Adding Changelog for Release 4.2.0

Part of Kokkos C++ Performance Portability Programming EcoSystem 4.2

* Formatting the changelog a bit more

Mentioning more clearly LAPACK vs BLAS, grouping PRs by logical work unit, etc...

* Remove minor revisions, improve text descriptions

* Changelog: add spmv perftest detail

---------

Co-authored-by: Luc Berger <lberge@sandia.gov>
Co-authored-by: Carl Pearson <cwpearson@users.noreply.github.com>
Co-authored-by: brian-kelley <brian.honda11@gmail.com>

* NRM1: refactoring TPL layer a bit with c++17 if constexpr

Hopefully this leads to simpler code, less duplication, less
macro and easier maintenance!
Adding support for oneapi MKL while making tpl layer changes.

* BLAS: Nrm1 implementing Brian's feedback

* Blas: nrm1, fix in tpl spec decl

* BLAS: nrm1 problems with ExecSpace template and lack of Kokkos::Threads

Fix issue with Kokkos::Threads and Kokkos::HIP

* Another attempt while waiting to get access to the solo cluster

* Formatting

* Correction error from the last commit

* Fixing the error that was happening only at the solo cluster

* Increase tolerance a bit more

* ncreasing tolerances in all 4 locations

* Backup

* Backup

* Formatting

* Forgot to add ClusteringAlgorithm:: at some spots

* Formatting

* Lapack: fixing issue with Magma TPL in gesv, trtri, etc...

Adding proper support for MAGMA after having it moved to the Lapack
directory and checking it does not create issues with cuSOLVER.

* Update blas/unit_test/Test_Blas1_swap.hpp

Co-authored-by: brian-kelley <brian.honda11@gmail.com>

* cmake: Add workaround check for CUSOLVER support with Trilinos

TPL_ENABLE_CUDA default enables CUBLAS and CUSOLVER in Trilinos, but not CUSPARSE
This PR modifies the TPL requirement checks to maintain compatibility with existing configration options of Trilinos

Attempt to resolve/workaround issue #2047

* Addressing Brian Kelley's feedbacks

* Formatting

* Removing 'ClusteringAlgorithm::'

* Lapack: gesv, incorporate Brian's feedback

* Applying clang-format

* Fixing some deprecation warnings/errors for ROCm 6

* BLAS: fix bug in TPL layer of KokkosBlas::swap

The cuBLAS Kokkos::complex<float> specialization had a small bug
where the rank of the view was not specified correctly!

* CMake: fix bugs in deciding KOKKOSKERNELS_TPL_BLAS_RETURN_COMPLEX

* TPL: revise BLAS1 dot implementation

* Fix compile errors for C-linkage dot functions returning std::complex

* Use a C struct for complex numbers

to avoid error: '_Complex' is a C99 extension [-Werror,-Wc99-extensions].

* Add a workaround by disabling host MKL dot with complex numbers

* Allow KokkosKernels_ENABLE_PERFTESTS=ON to build perf_tests without KokkosKernels_ENABLE_TESTS=ON

* format sparse/tpls/KokkosSparse_spmv_tpl_spec_decl.hpp

* cmake: fix tpl check so cusolver can be disabled when needed

* Link std::filesystem for IntelLLVM in perf_test/sparse

* gemm3 perf test: user CUDA, SYCL, or HIP device for kokkos:initialize

* Fix for rocm_verison header inclusion

* fence Kokkos before timed interations

* Deprecate KOKKOSLINALG_OPT_LEVEL

* Add CMake warning message if KokkosKernels_LINALG_OPT_LEVEL is used

* Async matrix release for MKL >= 2023.2

* Support CUBLAS_{LIBRARIES,LIBRARY_DIRS,INCLUDE_DIRS,ROOT} and KokkosKernels_CUBLAS_ROOT

* KokkosSparse_spmv_impl_merge.hpp: use capture by reference

Resolve warnings in builds with c++20 support enabled:
"kokkos-kernels/sparse/impl/KokkosSparse_spmv_impl_merge.hpp:166:81: warning: implicit capture of 'this' via '[=]' is deprecated in C++20 [-Wdeprecated]"

* KokkosSparse_par_ilut_numeric_impl.hpp: use capture by reference

Resolve warnings in builds with c++20 support enabled:
"kokkos-kernels/sparse/impl/KokkosSparse_par_ilut_numeric_impl.hpp(591):
warning #2908-D: the implicit by-copy capture of "this" is deprecated"

* Backup

* Backup

* Backup

* Backup

* Formatting

* Correcting compilation error

* Typo

* Changes for syr and syr2, to be tested at weaver

* Formatting

* Changes for axpby

* Backup

* Formatting

* Just to force new checking tests in github

* Addressing feedback from Luc.

* Don't call optimize_gemv for one-shot spmv

* Add HIPManagedSpace support

- CMake option for ETI
- Run unit tests with a Kokkos::Device, not just Kokkos::HIP
	- Like we do for Cuda
	- Still use HIPSpace unless Managed is the only enabled memspace
- Couple of minor fixes
	- Allow querying free HIPManagedSpace memory for SpGEMM
	- Disable VBD coloring (not a huge deal, had to do same on CUDA)
	- Use correct memory space in SpTRSV solve

* Backup

* Backup

* Backup

* Minor typo

* Add block support to all SPILUK algorithms (#2064)

* Interface for block iluk

* Progress. Test hooked up

* Progress on test refactoring

* More test reorg

* Fix test

* Refactor spiluk numeric a bit with a struct wrapper

* Add good logging

* progress

* Fix block test

* Progress but potential dead end

* Giving up on this approach for now

* progress

* Make verbose

* Progress

* Progress

* RP working?

* Progress on TP alg

* Bug fix

* Progress on template stuff

* Progress on block TP

* Progress

* Get rid of all the static_casts

* More cleanup. Steams now support blocks

* Tests not passing

* Serail tests all working, both algs, blocked

* Remove output coming from spiluk test

* Final fixes for CPU

* Cuda req full template specification for SerialGemm::invoke

* Don't use scratch for now

* Formatting

* Fix warnings

* Formatting

* Add tolerance to view checks. Use macro and remove redundant test util

* Fix for HIP

* formatting

* Another test reorg to fix weirdness on solo

* formatting

* Remove unused var

* Github feedback

* Remove test cout

* formatting

* Zero-size arrays can cause problems

* Fix unused var warning

* Add CUDA/HIP TPL support for KokkosSparse::spadd (#1962)

* spadd: change arguments to ctor of SPADDHandle

add a default value to input_sorted;
add a second argument input_merged to indicate unqiue entries;
So that we can easily know whether we can use TPLs on the input matrices

* spadd: add cuda/rocm TPL support for spadd_symbolic/numeric

* Make spiluk_handle::reset backwards compatible (#2087)

* Make spiluk_handle::reset backwards compatible

By making block_size default to -1, which means don't change
block size.

* Switch default val for block_size for reset_handle

* formatting

* Fix comment

* spadd: add APIs without an execution space argument (#2090)

* Lapack - SVD: adding initial files that do not implement anything (#2092)

Adding SVD feature to Lapack component, the interface is similar
to classic Lapack and the implementation relies on the TPL layer
to provide initial capabilities. The TPL supported are LAPACK,
MKL, cuSOLVER and rocSOLVER.

Testing three analytical cases 2x2, 2x3 and 3x2 and then some
randomly generated matrices.

* Hands off namespace `Kokkos::Impl` - cleanup couple violations that snuck in (#2094)

* Do not use things from namespace Kokkos::Impl (Kokkos::{Impl:: -> }ALL_t)

* Do not use things from namespace Kokkos::Impl (Kokkos::Impl::DeepCopy)

Can achieve the same with Kokkos::deep_copy

* Fix warning `declaration of ‘std::size_t n’ shadows a parameter`

* Change name of yaml-cpp to yamlcpp

* Fix macro setting in CMakeLists

* GMRES: Add support for BSR matrices

Also, add a test for this.

* Remove all mentions of HBWSpace

* Reintroduce EXECSPACE_(SERIAL,OPENMP,THREADS}_VALID_MEM_SPACES

Drop HBWSPACE as an option

* Lapack: adding svd benchmark

Fixing unit-test for CUSOLVER and adding benchmark to check the
algorithm performance on various platforms.

* Fix Cuda TPL finding (#2098)

- Allow finding cusparse, cusolver based on manually provided paths
  - This is necessary when using an nvhpc toolchain instead of a
    standard cuda toolchain
- Set header paths correctly (this is redundant in a cuda installation,
  in which $CUDA_ROOT/include is already a system include dir, but
  needed in other cases)

* Add support for BSR matrices to some trsv routines (#2104)

* Add support for BSR matrices to some trsv routines
* Change trsv to gesv

* Lapack - SVD: adding quick return when cuSOLVER is skipped (#2107)

Currently we still run the tests on U, S and Vt which does not
make sense since we actively skip this test because cuSOLVER does
not support more columns than rows...

* Fix build error in trsv on gcc8

* Add a workaround for compilation errors with cuda-12.2.0 + gcc-12.3 (#2108)

On Perlmutter@NERSC, I met this error

/usr/lib64/gcc/x86_64-suse-linux/12/include/avx512fp16intrin.h(38): error: vector_size attribute requires an arithmetic or enum type
   typedef __half __v8hf __attribute__ ((__vector_size__ (16)));

The workaround was mentioned at https://forums.developer.nvidia.com/t/including-cub-header-breakes-compilation-with-gcc-12-and-sse2-or-better/255018

* Lapack - SVD: fix for unit-test when MKL is enabled (#2110)

This is really a problem with our implementation of the BLAS
interface when MKL is enabled since MKL redefines the function
signatures of blas functions using MKL_INT instead if int...

* Revert "Merge pull request #2037 from ndellingwood/remove-rocsolver-optional-dependency" (#2106)

This reverts commit 5a36d57, reversing
changes made to 2c66d29.

* Fixing missing inclusion in source file

* BLAS - MKL: fixing HostBlas calls to handle MKL_INT type (#2112)

MKL redefines the BLAS interface based on how MKL_INT is defined
we need to wrap that definition with our own Kokkos Kernels INT
type to make both compatible with regular BLAS.

applying clang-format

* Fix weird Trilinos compiler error

It seemed to have a problem with these deep_copies, so just do
the copy by hand like it was being done before my recent trsv
PR.

* Update changelog

* Update changelog

* Block spiluk follow up (#2085)

* Fix for gemm
* Remove unused divide method
* Enhancements to spiluk test
* Progress. Block spiluk now checks out against analytical results
* LUPrec test with spiluk woring
* Disable spiluk LU test on non-host
* Enhancements to spiluk test
* Clean up a few issues uncovered by gh review

* github workflows: update to v4 (use Node 20)

* Refactor Test_Sparse_sptrsv (#2102)

* Refactor Test_Sparse_sptrsv

* More cleanups

* Remove old commented-out code

* CMake: error out in certain case (#2115)

Graph unit tests are unique in that they use default_scalar for the
KokkosKernelsHandle. So if test-eti-only is ON, but neither float nor
double is instatiated, then error out for the graph unit tests.

Users can still build without float or double if they want, but only if
they turn off tests or the graph component.

* Wiki examples for BLAS2 functions are added (#2122)

Some small additional change the the function headers themselves
to add some missing header file inclusions.

Applying clang-format

Removing constexpr since it won't happen before some work in Core.

* Increase tolerance on gesv test (Fix #2123) (#2124)

And uncomment the verbose output for when tolerance is exceeded,
since that helps debug this sort of issue.
This is only printed at most once so it won't spam the output if
the entire vector is wrong.

* Spmv handle (#2126)

* spmv handle, TPL reuse

* using handle in unification layer and hooking up new algorithm
enums with old Controls options

* Update spmv_merge perf test
Compare KK merge vs. default and KK native

* Small changes to help text of spmv_merge perf test

* Complete backwards compatibility with Controls interface
- copy over spmv algorithm selection correctly
- copy expert tuning parameters

* Controls spmv: accept other name for bsr algo

* bsr spmv test: disable tensor core
It was not actually being run before due to a different name
actually enabling it (experimental_bsr_tc rather than experimental_tc)

* Disable OneMKL spmv for complex types
oneapi 2023.2 throws error saying complex isn't supported

* OneMKL: call optimize_gemv during setup

* Option to apply RCM reordering to extracted CRS diagonal blocks (#2125)

* Add rcm option when extracting diagonal blocks

* Update kk_extract_diagonal_blocks_crsmatrix_sequential

* Add test for extracting diagonal blocks with rcm

* Update RCM checking

* cm_test_all_sandia: various updates

- updates for blake

* cm_test_all_sandia: drop decommissioned/unavailable machines

- remove voltrino, mayer

* Fix2130 (#2132)

* Fix #2130

- Do not call BsrMatrix spmv impl if block size is 1
- Instead, convert it to unmanaged CrsMatrix and call spmv again
  - cuSPARSE returned an error code in this case
  - Better performance

* Formatting

* Remove redundant remove_pointer_t

Handle is already a non-pointer type

* Benchmark: modifying spmv benchmark to run range of spmv tests (#2135)

This could be further automated to run on matrix from suite sparse

* Kokkos Kernels: update version guards to drop old version of Kokkos (#2133)

Since we are now in the 4.2 series we only support up to 4.1.00.
Older version of Kokkos Core will require older version of Kokkos
Kernels for compatibility. Once 4.3.00 is out we will move to
drop support for the 4.1 series and only keep 4.2 and 4.3 series.

* ODE: BDF methods (#1930)

* ODE: adding BDF algorithms

Implementing BDF formula for stiff ODEs.
Orders 1 to 5 are available and tested.
The integrators can be called on GPU to
solve multiple systems in parallel.

* ODE: fixing storage handling for start-up RK stack

* ODE: clang-format

* ODE: first adaptive version of BDF

The current implementation only allows for adaptivity in time,
at this point the BDF Step actually converges as expected with
first order integration!

* ODE: fixing issues with adaptive BDF

The unit-test BDF_adaptive now shows the integration
of the logistic equation using adaptive time steps and
increasing integration order from 1 to 5.

* ODE: running BDF on StiffChemistry problem

The problem runs fine and is solved but there are oscillations
while the behavior of the solution is smooth. More investigation
is needed...

* BDF: fixing types and template parameters in batched calls

Bascially we need template parameters to be more versatile
and cannot assume that all rank1 views will have the exact
same underlying type, for instance layouts can be different.

* More fixes for GPUs only in tests this time.

* ODE: BDF adaptive, fix small bug

After adding rhs and update vectors to temp the subviews taken for
other variables need to be offset appropriately...

* Revert "More fixes for GPUs only in tests this time."

This reverts commit 2f70432.

* Revert "Revert "More fixes for GPUs only in tests this time.""

This reverts commit 836012b.

* ODE: BDF small change to temporarily avoid compile time issue

True fix involving a KOKKOS_VERSION check is upcoming after more
tests on GPU side...

* ODE: BDF fix for some printf statements that will go away soon...

* ODE: adding benchmark for BDF

The benchmark helps us monitor the performance of the BDF
implementaiton across multiple platforms as well as impact of
changes over time.

* ODE: improve benchmark interface...

* ODE: BDF changes to use RMS norm and change some default values

Small changes to compare more closely with reference implementation.
Some of these might be reverted eventually but that's fine for now.

* ODE: BDF convergence more stable and results look pretty good now!

Changing the Newton solver convergence criteria as well as changing
a few default input parameters leads to a more stable algorithms
which can now integrate the stiff Henderson autocatalytic example
well in 66 time steps instead of 200k for fixed order integration...

* ODE: BDF fix bug in initial time step calculation

The initial step routine was overwriting the initial right hand side
which led to obvious issues further down the road... now things should
work fine. Need to figure out if I can re-initialize the variables in
the perf test while excluding that time from each iteration.

* ODE: BDF removing bad print statement...

std::cout in device code

* ODE - BDF: improving perf test

Basically adding new untimed setup within the main loop of the
benchmark to reset the intial conditions, buffers and vectors
ahead of each iteration.

* Modifying unit-test to catch proper return type

* Applying clang-format

* cm_test_all_sandia: update caraway compilers

add rocm/5.6.1 and rocm/6.0.0, and openblas/0.3.23 as tpl

* Sparse MKL: changing the location of the MKL_SAFE_CALL macro (#2134)

* Sparse MKL: changing the location of the MKL_SAFE_CALL macro

Moving the macro outside of namespaces to ensure that it will be
interpreted correctly when called from any other location in the
library.

It does not make much sense to guard Impl code in the Experimental
namespace and in this case it cleans up a problem with namespace
disambiguation for the compiler...

* Sparse BsrSpMV: removing Experimental namespace from Impl namespace

* Applying clang-format

* Sparse SpMV: fixing more namespace issues!

* Fixing missing descriptor for bsr spmv

* Kokkos Kernels: change the default offset ETI from size_t to int (#2140)

This change makes it easier for customer to leverage TPL support
which almost always requires offset=int, ordinal=int to be enabled
meaning that no TPL support is available with our default ETI...

* KokkosSparse_spmv_bsrmatrix_spec: fix Bsr_TC_Precision namespacing

Resolve compilation errors in nightly cuda/12.2 A100 build

* Drop comment for cleaner clang-format fix

* Fix usage of RAII to set cusparse/rocsparse stream (#2141)

Temporary objects like "A()" get destructed immediately.
For the object to have scope lifetime, it needs a name like "A a();".
This was causing cusparse/rocsparse spmv to always execute on the default stream,
causing incorrect timing in the spmv perf test.

* Use execution space operator== (#2136)

It actually is part of the public interface

* cm_test_all_sandia: more caraway module updates and cleanup (#2145)

* Spmv perftest improvements (#2146)

* Spmv perf test improvements

- Add option to flush caches by filling a dummy buffer between
iterations
- Add option to call the non-reuse interface instead of handle/reuse
interface
- Fix modes T, H in nonsquare case (make x,y the correct length)

* Fix mode help text

* Update version to 4.3.0

* Revert "Kokkos Kernels: change the default offset ETI from size_t to int (#2140)"

This reverts commit 3a5498d.

* Fix signed/unsigned comparison warnings (#2150)

This is only hit when spmv is called with integer scalars,
which doesn't happen in our CI but does often in Tpetra.

* SPMV tpl fixes, cusparse workaround (#2152)

* SPMV tpl fixes, workaround

* Avoid possible integer conversion warnings

* Document cusparseSpMM algos that were tested

* Merge pull request #2147 from lucbv/KK_Utils_cleanup

KokkosKernels Utils: cleaning the zero_vector interface

(cherry picked from commit 363868e)

* KokkosBlas1_axpby.hpp: change debug macro guard for printInformation (#2157)

* KokkosBlas1_axpby.hpp: change debug macro guard for printInformation

- resolves test failures in Trilinos (MueLu) that rely on gold file diff
comparisons by removing extra output in debug builds

* fix compilation error

* Update changelog for 4.3.00 (#2148)

* Update changelog for 4.3.00

* Update CHANGELOG.md

---------

Co-authored-by: Luc Berger <lberge@sandia.gov>

* FIx changelog typo

* Fix merge artifacts

* CMakeLists.txt: fix Kokkos_VERSION check

* Merge pull request #2165 from ndellingwood/test-updates

Updates from feedback runnig Trilinos testing

(cherry picked from commit cacba80)

* Update master_history.txt for 4.3.0

* KokkosLapack_svd_tpl_spec_decl: defer to MKL spec when LAPACK also enabled

Resolves redefintion of struct SVD compilation errors with both MKL and LAPACK are enabled
Reported by @maartenarnst in trilinos/Trilinos#12891

Co-authored-by: brian-kelley <brian.honda11@gmail.com>
(cherry picked from commit 5bf5474)

---------

Co-authored-by: Luc Berger-Vergiat <lberge@sandia.gov>
Co-authored-by: Ernesto Prudencio <eeprude@sandia.gov>
Co-authored-by: Carl Pearson <cwpears@sandia.gov>
Co-authored-by: Evan Harvey <eharvey@sandia.gov>
Co-authored-by: Carl Pearson <cwpearson@users.noreply.github.com>
Co-authored-by: brian-kelley <brian.honda11@gmail.com>
Co-authored-by: Sean Miller <sean.miller@amd.com>
Co-authored-by: Junchao Zhang <jczhang@mcs.anl.gov>
Co-authored-by: Junchao Zhang <jczhang@anl.gov>
Co-authored-by: Brian Kelley <bmkelle@sandia.gov>
Co-authored-by: James Foucar <jgfouca@sandia.gov>
Co-authored-by: Damien L-G <dalg24+github@gmail.com>
Co-authored-by: Caleb Schilly <cwschilly@gmail.com>
Co-authored-by: Damien L-G <dalg24@gmail.com>
Co-authored-by: Vinh Dang <vqdang@sandia.gov>
lucbv added a commit that referenced this pull request Aug 12, 2024
* fence Kokkos before timed interations

* Deprecate KOKKOSLINALG_OPT_LEVEL

* Add CMake warning message if KokkosKernels_LINALG_OPT_LEVEL is used

* Async matrix release for MKL >= 2023.2

* Support CUBLAS_{LIBRARIES,LIBRARY_DIRS,INCLUDE_DIRS,ROOT} and KokkosKernels_CUBLAS_ROOT

* KokkosSparse_spmv_impl_merge.hpp: use capture by reference

Resolve warnings in builds with c++20 support enabled:
"kokkos-kernels/sparse/impl/KokkosSparse_spmv_impl_merge.hpp:166:81: warning: implicit capture of 'this' via '[=]' is deprecated in C++20 [-Wdeprecated]"

* KokkosSparse_par_ilut_numeric_impl.hpp: use capture by reference

Resolve warnings in builds with c++20 support enabled:
"kokkos-kernels/sparse/impl/KokkosSparse_par_ilut_numeric_impl.hpp(591):
warning #2908-D: the implicit by-copy capture of "this" is deprecated"

* Backup

* Backup

* Backup

* Backup

* Formatting

* Correcting compilation error

* Typo

* Changes for syr and syr2, to be tested at weaver

* Formatting

* Changes for axpby

* Backup

* Formatting

* Just to force new checking tests in github

* Addressing feedback from Luc.

* Don't call optimize_gemv for one-shot spmv

* Add HIPManagedSpace support

- CMake option for ETI
- Run unit tests with a Kokkos::Device, not just Kokkos::HIP
	- Like we do for Cuda
	- Still use HIPSpace unless Managed is the only enabled memspace
- Couple of minor fixes
	- Allow querying free HIPManagedSpace memory for SpGEMM
	- Disable VBD coloring (not a huge deal, had to do same on CUDA)
	- Use correct memory space in SpTRSV solve

* Backup

* Backup

* Backup

* Minor typo

* Add block support to all SPILUK algorithms (#2064)

* Interface for block iluk

* Progress. Test hooked up

* Progress on test refactoring

* More test reorg

* Fix test

* Refactor spiluk numeric a bit with a struct wrapper

* Add good logging

* progress

* Fix block test

* Progress but potential dead end

* Giving up on this approach for now

* progress

* Make verbose

* Progress

* Progress

* RP working?

* Progress on TP alg

* Bug fix

* Progress on template stuff

* Progress on block TP

* Progress

* Get rid of all the static_casts

* More cleanup. Steams now support blocks

* Tests not passing

* Serail tests all working, both algs, blocked

* Remove output coming from spiluk test

* Final fixes for CPU

* Cuda req full template specification for SerialGemm::invoke

* Don't use scratch for now

* Formatting

* Fix warnings

* Formatting

* Add tolerance to view checks. Use macro and remove redundant test util

* Fix for HIP

* formatting

* Another test reorg to fix weirdness on solo

* formatting

* Remove unused var

* Github feedback

* Remove test cout

* formatting

* Zero-size arrays can cause problems

* Fix unused var warning

* Add CUDA/HIP TPL support for KokkosSparse::spadd (#1962)

* spadd: change arguments to ctor of SPADDHandle

add a default value to input_sorted;
add a second argument input_merged to indicate unqiue entries;
So that we can easily know whether we can use TPLs on the input matrices

* spadd: add cuda/rocm TPL support for spadd_symbolic/numeric

* Make spiluk_handle::reset backwards compatible (#2087)

* Make spiluk_handle::reset backwards compatible

By making block_size default to -1, which means don't change
block size.

* Switch default val for block_size for reset_handle

* formatting

* Fix comment

* spadd: add APIs without an execution space argument (#2090)

* Lapack - SVD: adding initial files that do not implement anything (#2092)

Adding SVD feature to Lapack component, the interface is similar
to classic Lapack and the implementation relies on the TPL layer
to provide initial capabilities. The TPL supported are LAPACK,
MKL, cuSOLVER and rocSOLVER.

Testing three analytical cases 2x2, 2x3 and 3x2 and then some
randomly generated matrices.

* Hands off namespace `Kokkos::Impl` - cleanup couple violations that snuck in (#2094)

* Do not use things from namespace Kokkos::Impl (Kokkos::{Impl:: -> }ALL_t)

* Do not use things from namespace Kokkos::Impl (Kokkos::Impl::DeepCopy)

Can achieve the same with Kokkos::deep_copy

* Fix warning `declaration of ‘std::size_t n’ shadows a parameter`

* Change name of yaml-cpp to yamlcpp

* Fix macro setting in CMakeLists

* GMRES: Add support for BSR matrices

Also, add a test for this.

* Remove all mentions of HBWSpace

* Reintroduce EXECSPACE_(SERIAL,OPENMP,THREADS}_VALID_MEM_SPACES

Drop HBWSPACE as an option

* Lapack: adding svd benchmark

Fixing unit-test for CUSOLVER and adding benchmark to check the
algorithm performance on various platforms.

* Fix Cuda TPL finding (#2098)

- Allow finding cusparse, cusolver based on manually provided paths
  - This is necessary when using an nvhpc toolchain instead of a
    standard cuda toolchain
- Set header paths correctly (this is redundant in a cuda installation,
  in which $CUDA_ROOT/include is already a system include dir, but
  needed in other cases)

* Add support for BSR matrices to some trsv routines (#2104)

* Add support for BSR matrices to some trsv routines
* Change trsv to gesv

* Lapack - SVD: adding quick return when cuSOLVER is skipped (#2107)

Currently we still run the tests on U, S and Vt which does not
make sense since we actively skip this test because cuSOLVER does
not support more columns than rows...

* Fix build error in trsv on gcc8

* Add a workaround for compilation errors with cuda-12.2.0 + gcc-12.3 (#2108)

On Perlmutter@NERSC, I met this error

/usr/lib64/gcc/x86_64-suse-linux/12/include/avx512fp16intrin.h(38): error: vector_size attribute requires an arithmetic or enum type
   typedef __half __v8hf __attribute__ ((__vector_size__ (16)));

The workaround was mentioned at https://forums.developer.nvidia.com/t/including-cub-header-breakes-compilation-with-gcc-12-and-sse2-or-better/255018

* Lapack - SVD: fix for unit-test when MKL is enabled (#2110)

This is really a problem with our implementation of the BLAS
interface when MKL is enabled since MKL redefines the function
signatures of blas functions using MKL_INT instead if int...

* Revert "Merge pull request #2037 from ndellingwood/remove-rocsolver-optional-dependency" (#2106)

This reverts commit 5a36d577e725546062af3b297eec87e23a40ab58, reversing
changes made to 2c66d291f9b5512e17f9375304902b6ba42133b2.

* Fixing missing inclusion in source file

* BLAS - MKL: fixing HostBlas calls to handle MKL_INT type (#2112)

MKL redefines the BLAS interface based on how MKL_INT is defined
we need to wrap that definition with our own Kokkos Kernels INT
type to make both compatible with regular BLAS.

applying clang-format

* Fix weird Trilinos compiler error

It seemed to have a problem with these deep_copies, so just do
the copy by hand like it was being done before my recent trsv
PR.

* Update changelog

* Update changelog

* Block spiluk follow up (#2085)

* Fix for gemm
* Remove unused divide method
* Enhancements to spiluk test
* Progress. Block spiluk now checks out against analytical results
* LUPrec test with spiluk woring
* Disable spiluk LU test on non-host
* Enhancements to spiluk test
* Clean up a few issues uncovered by gh review

* github workflows: update to v4 (use Node 20)

* Refactor Test_Sparse_sptrsv (#2102)

* Refactor Test_Sparse_sptrsv

* More cleanups

* Remove old commented-out code

* CMake: error out in certain case (#2115)

Graph unit tests are unique in that they use default_scalar for the
KokkosKernelsHandle. So if test-eti-only is ON, but neither float nor
double is instatiated, then error out for the graph unit tests.

Users can still build without float or double if they want, but only if
they turn off tests or the graph component.

* Wiki examples for BLAS2 functions are added (#2122)

Some small additional change the the function headers themselves
to add some missing header file inclusions.

Applying clang-format

Removing constexpr since it won't happen before some work in Core.

* Increase tolerance on gesv test (Fix #2123) (#2124)

And uncomment the verbose output for when tolerance is exceeded,
since that helps debug this sort of issue.
This is only printed at most once so it won't spam the output if
the entire vector is wrong.

* Spmv handle (#2126)

* spmv handle, TPL reuse

* using handle in unification layer and hooking up new algorithm
enums with old Controls options

* Update spmv_merge perf test
Compare KK merge vs. default and KK native

* Small changes to help text of spmv_merge perf test

* Complete backwards compatibility with Controls interface
- copy over spmv algorithm selection correctly
- copy expert tuning parameters

* Controls spmv: accept other name for bsr algo

* bsr spmv test: disable tensor core
It was not actually being run before due to a different name
actually enabling it (experimental_bsr_tc rather than experimental_tc)

* Disable OneMKL spmv for complex types
oneapi 2023.2 throws error saying complex isn't supported

* OneMKL: call optimize_gemv during setup

* Option to apply RCM reordering to extracted CRS diagonal blocks (#2125)

* Add rcm option when extracting diagonal blocks

* Update kk_extract_diagonal_blocks_crsmatrix_sequential

* Add test for extracting diagonal blocks with rcm

* Update RCM checking

* cm_test_all_sandia: various updates

- updates for blake

* cm_test_all_sandia: drop decommissioned/unavailable machines

- remove voltrino, mayer

* Fix2130 (#2132)

* Fix #2130

- Do not call BsrMatrix spmv impl if block size is 1
- Instead, convert it to unmanaged CrsMatrix and call spmv again
  - cuSPARSE returned an error code in this case
  - Better performance

* Formatting

* Remove redundant remove_pointer_t

Handle is already a non-pointer type

* Benchmark: modifying spmv benchmark to run range of spmv tests (#2135)

This could be further automated to run on matrix from suite sparse

* Kokkos Kernels: update version guards to drop old version of Kokkos (#2133)

Since we are now in the 4.2 series we only support up to 4.1.00.
Older version of Kokkos Core will require older version of Kokkos
Kernels for compatibility. Once 4.3.00 is out we will move to
drop support for the 4.1 series and only keep 4.2 and 4.3 series.

* ODE: BDF methods (#1930)

* ODE: adding BDF algorithms

Implementing BDF formula for stiff ODEs.
Orders 1 to 5 are available and tested.
The integrators can be called on GPU to
solve multiple systems in parallel.

* ODE: fixing storage handling for start-up RK stack

* ODE: clang-format

* ODE: first adaptive version of BDF

The current implementation only allows for adaptivity in time,
at this point the BDF Step actually converges as expected with
first order integration!

* ODE: fixing issues with adaptive BDF

The unit-test BDF_adaptive now shows the integration
of the logistic equation using adaptive time steps and
increasing integration order from 1 to 5.

* ODE: running BDF on StiffChemistry problem

The problem runs fine and is solved but there are oscillations
while the behavior of the solution is smooth. More investigation
is needed...

* BDF: fixing types and template parameters in batched calls

Bascially we need template parameters to be more versatile
and cannot assume that all rank1 views will have the exact
same underlying type, for instance layouts can be different.

* More fixes for GPUs only in tests this time.

* ODE: BDF adaptive, fix small bug

After adding rhs and update vectors to temp the subviews taken for
other variables need to be offset appropriately...

* Revert "More fixes for GPUs only in tests this time."

This reverts commit 2f70432761485bc6a4c65a1833e7299dd2c340e2.

* Revert "Revert "More fixes for GPUs only in tests this time.""

This reverts commit 836012bb529551727b3f5913057acad94dfe60df.

* ODE: BDF small change to temporarily avoid compile time issue

True fix involving a KOKKOS_VERSION check is upcoming after more
tests on GPU side...

* ODE: BDF fix for some printf statements that will go away soon...

* ODE: adding benchmark for BDF

The benchmark helps us monitor the performance of the BDF
implementaiton across multiple platforms as well as impact of
changes over time.

* ODE: improve benchmark interface...

* ODE: BDF changes to use RMS norm and change some default values

Small changes to compare more closely with reference implementation.
Some of these might be reverted eventually but that's fine for now.

* ODE: BDF convergence more stable and results look pretty good now!

Changing the Newton solver convergence criteria as well as changing
a few default input parameters leads to a more stable algorithms
which can now integrate the stiff Henderson autocatalytic example
well in 66 time steps instead of 200k for fixed order integration...

* ODE: BDF fix bug in initial time step calculation

The initial step routine was overwriting the initial right hand side
which led to obvious issues further down the road... now things should
work fine. Need to figure out if I can re-initialize the variables in
the perf test while excluding that time from each iteration.

* ODE: BDF removing bad print statement...

std::cout in device code

* ODE - BDF: improving perf test

Basically adding new untimed setup within the main loop of the
benchmark to reset the intial conditions, buffers and vectors
ahead of each iteration.

* Modifying unit-test to catch proper return type

* Applying clang-format

* cm_test_all_sandia: update caraway compilers

add rocm/5.6.1 and rocm/6.0.0, and openblas/0.3.23 as tpl

* Sparse MKL: changing the location of the MKL_SAFE_CALL macro (#2134)

* Sparse MKL: changing the location of the MKL_SAFE_CALL macro

Moving the macro outside of namespaces to ensure that it will be
interpreted correctly when called from any other location in the
library.

It does not make much sense to guard Impl code in the Experimental
namespace and in this case it cleans up a problem with namespace
disambiguation for the compiler...

* Sparse BsrSpMV: removing Experimental namespace from Impl namespace

* Applying clang-format

* Sparse SpMV: fixing more namespace issues!

* Fixing missing descriptor for bsr spmv

* Kokkos Kernels: change the default offset ETI from size_t to int (#2140)

This change makes it easier for customer to leverage TPL support
which almost always requires offset=int, ordinal=int to be enabled
meaning that no TPL support is available with our default ETI...

* KokkosSparse_spmv_bsrmatrix_spec: fix Bsr_TC_Precision namespacing

Resolve compilation errors in nightly cuda/12.2 A100 build

* Drop comment for cleaner clang-format fix

* Fix usage of RAII to set cusparse/rocsparse stream (#2141)

Temporary objects like "A()" get destructed immediately.
For the object to have scope lifetime, it needs a name like "A a();".
This was causing cusparse/rocsparse spmv to always execute on the default stream,
causing incorrect timing in the spmv perf test.

* Use execution space operator== (#2136)

It actually is part of the public interface

* cm_test_all_sandia: more caraway module updates and cleanup (#2145)

* Spmv perftest improvements (#2146)

* Spmv perf test improvements

- Add option to flush caches by filling a dummy buffer between
iterations
- Add option to call the non-reuse interface instead of handle/reuse
interface
- Fix modes T, H in nonsquare case (make x,y the correct length)

* Fix mode help text

* KokkosKernels Utils: cleaning the zero_vector interface

One of the overload requires an unused template, removing that
extraneous template and simplify how that function is called in
a second overload.

* Kokkos Kernels: fixing call to zero_vector in Gauss-Seidel

* CMakeLists.txt: Update develop to 4.3.99

* Address reviewer comments

* Update CMakeLists.txt

Co-authored-by: brian-kelley <brian.honda11@gmail.com>

* cm_test_all_sandia: solo updates

module updates post TOSS upgrade

* Fix signed/unsigned comparison warnings (#2150)

This is only hit when spmv is called with integer scalars,
which doesn't happen in our CI but does often in Tpetra.

* SPMV tpl fixes, cusparse workaround (#2152)

* SPMV tpl fixes, workaround

* Avoid possible integer conversion warnings

* Document cusparseSpMM algos that were tested

* Applying clang-format

* Use default_size_type as default offset in matrix types (#2149)

Now a declaration like CrsMatrix<Scalar, Ordinal, Device>
will by default use an ETI'd type combination (as int is the default
ETI'd offset)

* spmv bsr perftest: move fences to right place (#2153)

* Kokkos Kernels: removing old code branches ahead of 4.3.00 release (#2139)

Some old code branches kept for compatibility with Kokkos Core
versions less than 4.2.00 are dropped after release 4.3.00.
The largest changes are the removal of support the Kokkos print
macro in favor of Kokkos::printf() and the removal of half support
from Kokkos Kernels since it is now in Kokkos Core.

* Code for running performance measurements on ger() (#2082)

* Correct flop count

* Addressing feedbacks from Luc

* Using 'zero()' instead of '0.'

* KokkosBlas1_axpby.hpp: change debug macro guard for printInformation (#2157)

* KokkosBlas1_axpby.hpp: change debug macro guard for printInformation

- resolves test failures in Trilinos (MueLu) that rely on gold file diff
comparisons by removing extra output in debug builds

* fix compilation error

* Add user tolerance to Serial SVD (#2120)

* Add user tolerance to Serial SVD
---------

Co-authored-by: whorne <whorne@sandia.gov>
Co-authored-by: Carl Pearson <cwpears@sandia.gov>

* Add a simple Harwell-Boeing file reader (#2155)

* Add a simple Harwell-Boeing file reader

And a test that validates against the MM reader.

* Support for symmetrize

* This loop can be simplified, there's no diag duplication

* Improve IO test

* spmv tpls: use correct bool for eti template param (#2160)

(It should just be the default,
KokkosSpars::Impl::eti_spec_avail<..>::value)

* Fix sparse_ioutils test on kokkos-dev (#2162)

Adding a ss.sync call fixes it but I have no idea why it
was needed here. All the other stringstream reads work fine.

* Fix #2156 (#2164)

spmv: add special path for rank-2 x/y, but where both have 1 column
and a TPL is available for rank-1 but not rank-2.

Also call "subhandle->set_exec_space" correctly in the TPLs to ensure
proper synchronization between setup, spmv and cleanup (in the case that
different exec instances are used in different calls)

* Updates from feedback runnig Trilinos testing

- Update debug level to > 1 guarding `printInformation(...)` in
  KokkosBlas1_axpby.hpp to reduce noisy test output
- Loosen tolerance of lapack.svd test to avoid random failures that
  occur near prior tolerance level

* Fix #2167: classic MKL doesn't use space instance (#2168)

* CHANGELOG.md: 4.3.00 update

* KokkosLapack_svd_tpl_spec_decl: defer to MKL spec when LAPACK also enabled

Resolves redefintion of struct SVD compilation errors with both MKL and LAPACK are enabled
Reported by @maartenarnst in https://github.com/trilinos/Trilinos/issues/12891

Co-authored-by: brian-kelley <brian.honda11@gmail.com>

* .github/workflows: Added bdw.yml

* .github/workflows: Added spr.yml

* .github/workflows: Added mi210.yml

* .github/workflows: Added h100.yml

* .github/workflows: Added volta70.yml

* .github/workflows: Added power9.yml and power9_tpls.yml

* Jgfouca/block spiluk fixes (#2172)

* Progress

* Attempt 1, fix multiplication order

* Converges in 1 step

* Various cleanups

* Be sure not to reduce performance of unblocked impl

Also add some comments.

* Remove test mangling

* Fixes for GPU

* Fix warning

* formatting

* Increase eps for floats

* This is no longer needed

* .github/workflows: Add PR_VEGA908_ROCM561_HIP_SERIAL_LEFT_OPENBLAS_OPENLAPACK_REL

* Add guard for cusparse spmv_mv_tpl_spec_avail

Address issue #2175
Configuring with magma tpl enabled and cusparse disabled mistakenly triggers the cusparse tpl avail check to be true
Guard the KOKKOSSPARSE_SPMV_MV_TPL_SPEC_AVAIL_CUSPARSE macros when CUSPARSE is enabled to prevent this

* .github/workflows: Remove OPENLAPACK from names

* .github/workflows: Remove power9 until we have hardware to test it on

* .github/workflows: Enable rocblas in rocm tpl check

* .github/workflows: Remove volta70 until we have hardware to test it on

* Add early return if numRows == 0 in trsv to avoid integer divide-by-zero error

* Resolves multiple definition of Magma and Cuda singletons (#2178)

Address issue #2175

* magma: fix linker errors for builds without cusolver (#2181)

* magma: fix linker errors for builds without cusolver

* BatchedGemm test: workaround testing cublas+magma

- temporary workaround to skip magma test when cublas enabled to avoid issues like #2177

* .github/workflows/mi210: Fix include paths

* workflows/osx.yml: test against most recent kokkos tag

- test against most recent kokkos release rather than develop branch, as
  done with AT CI, to avoid compatibility breakages

* Resolve vortex compilation issue by resolving (potentially) duplicate symbol (#2183)

Stick to pattern of removing leading 'c' or 'z' in method name and relying on the template type

Co-authored-by: malphil <malphil@sandia.gov>

* Changes to enable OneAPI usage with Trilinos build (#2185)

- Get rid of SYCL_OVERRIDE setting
- Add MKL_PROVIDES_BLAS_LAPACK cmake variable and define so that
  code knows whether to use int, or MKL_INT
  - Trilinos builds might link with OneAPI for GPU but standard
    BLAS/LAPACK on CPU

* Fix macOS docs build (#2190)

* Fix docs build

* try docs fix

* make sphinx available at config time

* GH-Actions: adding security actions and scorecard (#2192)

Pretty much taking the new files from PR #2191 and re-creating
it in a clean PR on top of develop with small changes relevant
to our repository.

Cleaning up some workflows to tailor it for our needs

* Scorecard: adding manual dispatch and target default branch (#2195)

* BsrMatrix: Fix HostMirror typedef (#2196)

It needed to have size_type.

* update changelog for 4.3.1

* Update CHANGELOG.md

* docs.yml: change kokkos version to latest release

- avoid version range check issues for release tests

* Sparse - SpGEMM: labeling spgemm_symbolic in TPL layer a bit more clearly (#2193)

This just improves the readability of the output from the tools as it now
has a symbolic matching the numeric phase. Previously we only had spgemm
in the label which is a bit confusing as it could be the whole spgemm time
i.e. both symbolic and numeric, additionally we had symbolic in the MKL
path but not in cusparse, rocsparse...

* SpMV: Test NaN, fix NaN handling when beta=0 (#2188)

* Test_Sparse_spmv_bsr.hpp: add NaNs to tests

* handle NaN in spmv_beta_transpose when beta=0

* handle nan in SpmvMergeHierarchical when beta=0

* Test NaNs in Y, don't reuse modifed Y, catch NaNs in results test

* remove unused <iostream> include

* explicit casting of zero

* Test_sparse_spmv.hpp: remove unused nans parameter

* KokkosSparse_spmv.hpp: CUDA11 can't detect this function always returns

* Test_Sparse_spmv.hpp: remove unused variable

* Run unit tests in correct execution space

* Test_Sparse_spmv.hpp: remove unused type aliases

* Kokkos::nan() -> KokkosKernels::Impl::quiet_NaN()

* Disable cuBLAS dot wrapper (#2206)

(not deleted, just guarded with #if 0 and comments explaining)

It performs significantly worse than our native impl on 11.2, 11.8 and 12.0 on V100.
This is in the dot perf test with a warm-up call.

https://github.com/trilinos/Trilinos/issues/12982 was a symptom of this.

* Fix spmv regressions (#2204)

* Restore cusparse spmv ALG2 path for imbalanced

With correct version cutoffs

* spmv: use separate rank-1 and rank-2 tpl subhandles

* Remove redundant single-column path in native spmv_mv

* Fix unused param warning

* c++17: add [[fallthrough]] attribute (#1493)

* c++17: add [[fallthrough]] attribute

* cm_test_all_sandia: -Wimplicit-fallthrough

* Enable 3 at2 builds (#2210)

* .github/mi210: Enable on PRs

* .github/mi210: Disable non-tpl build

* .github/bdw: Enable PR_BDW_GNU1020_OPENMP_SERIAL_LEFT_OPENBLAS_REL

* .github/h100: Enable PR_HOPPER90_CUDA1180_CUDA_LEFT_RIGHT_REL

* Bump ossf/scorecard-action from 2.0.6 to 2.3.3 (#2214)

Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.0.6 to 2.3.3.
- [Release notes](https://github.com/ossf/scorecard-action/releases)
- [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md)
- [Commits](https://github.com/ossf/scorecard-action/compare/99c53751e09b9529366343771cc321ec74e9bd3d...dc50aa9510b46c811795eb24b2f1ba02a914e534)

---
updated-dependencies:
- dependency-name: ossf/scorecard-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/upload-artifact from 3.1.3 to 4.3.3 (#2215)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 3.1.3 to 4.3.3.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/a8a3f3ad30e3422c9c7b888a15615d19a852ae32...65462800fd760344b1a7b4382951275a0abb4808)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/dependency-review-action from 2.5.1 to 4.3.2 (#2217)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 2.5.1 to 4.3.2.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/0efb1d1d84fc9633afcdaad14c485cbbc90ef46c...0c155c5e8556a497adf53f2c18edabf945ed8e70)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.7.1 to 2.8.0 (#2218)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.7.1 to 2.8.0.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/a4aa98b93cab29d9b1101a6143fb8bce00e2eac4...f086349bfa2bd1361f7909c78558e816508cdc10)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 2.25.3 to 3.25.6 (#2216)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 2.25.3 to 3.25.6.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/ceaec5c11a131e0d282ff3b6f095917d234caace...9fdb3e49720b44c48891d036bb502feb25684276)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Kokkos Kernels: initial security policy (#2220)

We state were patches will be made and how to report security vulnerabilities to the project.

* Kokkos Kernels: adding SHA for github actions (#2221)

* Kokkos Kernels: adding SHA for github actions

* Adding sha for docker images

* More dependencies fix (#2222)

* workflows: fix sha for label checker docker image

* workflows: fixing Docker files dependencies.

* Fix Docker files (#2223)

* SPMV TPLs: improve profile region labels (#2219)

- Mark rank-2 versions as "spmv_mv" not "spmv" (the native impl has
  this, and it's useful to know which one is being run)
- Add missing commas separating "BSRMATRIX" and the scalar type name

* cusparse spgemm: provide non-null row-ptr (#2213)

* Bump github/codeql-action from 3.25.6 to 3.25.7 (#2225)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.6 to 3.25.7.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/9fdb3e49720b44c48891d036bb502feb25684276...f079b8493333aace61c81488f8bd40919487bd9f)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/checkout from 3.6.0 to 4.1.6 (#2226)

Bumps [actions/checkout](https://github.com/actions/checkout) from 3.6.0 to 4.1.6.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/f43a0e5ff2bd294095638e18286ca9a3d1956744...a5ac7e51b41094c92402da3b24376905380afc29)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* README: adding link to scorecard (#2230)

* spmv_mv wrappers for rocsparse (#2233)

* spmv_mv wrappers for rocsparse (rocsparse_spmm())

* Use consistent types for alpha/beta in spmv wrappers

* Bump step-security/harden-runner from 2.8.0 to 2.8.1 (#2236)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/f086349bfa2bd1361f7909c78558e816508cdc10...17d0e2bd7d51742c71671bd19fa12bdc9d40a3d6)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.7 to 3.25.8 (#2237)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.7 to 3.25.8.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/f079b8493333aace61c81488f8bd40919487bd9f...2e230e8fe0ad3a14a340ad0815ddb96d599d2aff)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/dependency-review-action from 4.3.2 to 4.3.3 (#2235)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.2 to 4.3.3.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/0c155c5e8556a497adf53f2c18edabf945ed8e70...72eb03d02c7872a771aacd928f3123ac62ad6d3a)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Add testing for transpose corner cases (#2234)

* Add testing for transpose corner cases

crs, bsr, graph: test cases that are 0x0, 100x0 and 0x100. In these cases make
sure the matrix generator doesn't try to insert any entries (nnz = 0).

* Update sparse/unit_test/Test_Sparse_Transpose.hpp

Co-authored-by: Luc Berger <lberge@sandia.gov>

* Update sparse/unit_test/Test_Sparse_Transpose.hpp

Co-authored-by: Luc Berger <lberge@sandia.gov>

---------

Co-authored-by: Luc Berger <lberge@sandia.gov>

* Graph: removing executable from repo (#2239)

* Fix logic around merge path with TPLs (#2240)

SPMV_MERGE_PATH is not always a native algorithm. Add
SPMV_NATIVE_MERGE_PATH to cover that case specifically. Test this new
option.

* spgemm unit test: change matrix value distribution (#2241)

Change the distribution A, B values are sampled from so that
values in C can't end up close to 0 (as the result of summing terms
that are larger). The relative error metric in is_same_matrix is sensitive
to this.

Fixes #2232

* kokkoskernels_tpls.cmake: remove duplicates arguments when creating argument for exported INTERFACE_INCLUDE_DIRECTORIES

Attempt to workaround issue #2238

* Sparse - BsrMatrix: adding new wiki example for documentation (#2228)

There is already an example for this but it uses a CrsMatrix as
starting point to build a BsrMatrix which is not really helpful in
general as the hope is that you can use the BsrMatrix without needing
the CrsMatrix as it would double the storage needed...

Addressing Kim's comments

* Sparse - CrsToBsr: fix type mismatch (#2242)

* Update rocsparse algo defaults (#2245)

* Update default spmv algorithms for rocsparse

- Use stream for common cases (default, fast setup) as it has nearly zero
  setup cost and performs well for somewhat balanced matrices
- Use adaptive (which is rocsparse's default) only if SPMV_MERGE_PATH
  is the algorithm, as it has a very high setup cost

* Re-enable rocsparse spmv for SPMV_FAST_SETUP

* In deprecated spmv, fix Controls algorithm mapping (#2246)

native -> SPMV_NATIVE
native-merge -> SPMV_NATIVE_MERGE_PATH
merge -> SPMV_MERGE_PATH
tpl -> SPMV_FAST_SETUP

* Add batched serial tbsv (#2202)

* Add batched serial tbsv

* remove incx argument and use strided views instead

* Add a new line at the end of files

* fix random number generation for complex numbers

* remove unused variables from internal tbsv serial functions

* remove allclose for testing

---------

Co-authored-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>

* Bump actions/checkout from 4.1.6 to 4.1.7 (#2248)

Bumps [actions/checkout](https://github.com/actions/checkout) from 4.1.6 to 4.1.7.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/a5ac7e51b41094c92402da3b24376905380afc29...692973e3d937129bcbf40652eb9f2f61becf3332)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.8 to 3.25.10 (#2249)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.8 to 3.25.10.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/2e230e8fe0ad3a14a340ad0815ddb96d599d2aff...23acc5c183826b7a8a97bce3cecc52db901f8251)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* FindTPLROC*: updates to fix export of import targets

Changes for the Rocm tpls to match the handling as done with the Cuda tpls

Should resolve issue #2238

* Fix warning about memcpy (#2252)

When building Stokhos BlockCrs, this util function gave a warning
about memcpy modifying a non-trivially-copyable type. Silence it
by casting to void*

* RCM fixes, improvements (#2254)

* Fix RCM starting vertex issue, improve testing
* apply reversing as labels are computed
instead of at the end. Saves a loop over all the labels
* use min-degree starting vertex within each connected component

* spgemm: add profiling regions to native implementations (#2253)

* spgemm: add profiling regions to native implementations

* Add profiling region to KokkosSPGEMM::KokkosSPGEMM_symbolic

* sparse: replace macros with constexpr bools (#2260)

* Rename `Impl::alignPtr` to `Impl::alignPtrTo`, allow it to infer argument type (#2261)

* KokkosKernels::Impl::alignPtr infers argument type

* Rename KokkosKernels::Impl::alignPtr -> alignPtrTo

* Bump github/codeql-action from 3.25.10 to 3.25.11 (#2263)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.10 to 3.25.11.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/23acc5c183826b7a8a97bce3cecc52db901f8251...b611370bb5703a7efb587f9d136a52ea24c5c38c)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* sparse: spadd_symbolic fences before device values used on host (#2259)

* sparse: spadd_symbolic fences before device values used on host

* sparse: use prefix sum to remove explicit spadd fence

* sycl: use alternative `alignPtrTo` when SYCL is enabled (SpGEMM) (#2262)

* sycl: use alternative alignPtr when SYCL is enabled

The current alignPtr, as well as two other alternatives below, do not work on SYCL on Intel PVC.

unsigned int f1(unsigned int i, unsigned int align) {
    return ((i + align - 1) / align * align);
}

unsigned int f2(unsigned int i, unsigned int align) {
    return (i + align - 1) & (-align);
}

* alignPtrTo unit tests

* Help gcc/8.3 with ctad issue

Resolves #2264

Co-authored-by: Carl Pearson <cwpears@sandia.gov>

* Bump actions/upload-artifact from 4.3.3 to 4.3.4 (#2266)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.3.3 to 4.3.4.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/65462800fd760344b1a7b4382951275a0abb4808...0b2256b8c012f0828dc542b3febcab082c67f72b)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* handle_t* -> unique_ptr<handle_t> in Bsr SpMV unit tests (#2269)

* Workarounds for removed cusparse functions (#2270)

cusparse 12.5 removed some functions that were deprecated, like the ILU
factorizations and the legacy csrsv (sparse triangular solve) functions.
As a workaround, if the cusparse version is >= 12.5 then disable the paths
in perftests that call those.

* BLAS - gemv: using fallback when mode is 't' or 'c' and onemkl is used (#2272)

* Implement batched serial pttrf (#2256)

* Batched serial pttrf implementation

* fix: use GEMM to add matrices

* fix: initialization order

* fformat

* fix: temporary variable in a test code

* fix: docstring of pttrf

* check_positive_definitiveness only if KOKKOSKERNELS_DEBUG_LEVEL > 0

* Improve the test for pttrf

* fix: int type

* fix: cleanup tests for SerialPttrf

* cleanup: remove unused deep_copies

* fix: docstrings and comments for pttrf

* ConjTranspose with conj and Transpose

* quick return in pttrf for size 1 or 0 matrix

* Add tests for invalid input

* fix: info computation

---------

Co-authored-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>

* A little sptrsv cleanup before the main block effort (#2247)

* Some cleanup and refactoring

* First round of cleanup complete

* Fix a couple warnings

* formatting

* Sparse - SpMV: removing calls to unsuported oneapi - MKL functions (#2274)

* Sycl gemv beta (#2276)

* BLAS - GEMV: zero out Y when beta == 0 in SYCL TPL code path

* BLAS - GEMV: reverting wrong change from previous PR, my bad.

* Applying clang-format

* Unify alignPtrTo implementation (#2275)

* init (#2273)

* Bigger sptrsv cleanup (#2280)

* Some cleanup and refactoring
* Remove Upper/Lower TriLvlSchedTP2SolverFunctors
* Remove Upper/Lower single block functors
* Remove unused TriLvlSchedTP1SingleBlockFunctorDiagValues and merge upper/lower tri_solve_cg
* Merge two big upper/lower branch of tri_solve_chain
* Merge upper/lower tri_solve_streams
* Switch over block spiluk precond test to use new block sptrsv

* Bump actions/dependency-review-action from 4.3.3 to 4.3.4 (#2279)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.3 to 4.3.4.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/72eb03d02c7872a771aacd928f3123ac62ad6d3a...5a2ce3f5b92ee19cbb1541a4984c76d921601d7c)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.11 to 3.25.12 (#2278)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.11 to 3.25.12.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/b611370bb5703a7efb587f9d136a52ea24c5c38c...4fa2a7953630fd2f3fb380f21be14ede0169dd4f)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Propose increasing column limit to 120. (#2255)

* Change key files

* Full reformat

* Update format.yml

* Update ubuntu version for format checker

* A couple platforms do not correctly handle static complexes

* .github/workflows: Auto trigger AT2 ci upon review (#2243)

* Bump github/codeql-action from 3.25.12 to 3.25.13 (#2284)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.12 to 3.25.13.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/4fa2a7953630fd2f3fb380f21be14ede0169dd4f...2d790406f505036ef40ecba973cc774a50395aac)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.8.1 to 2.9.0 (#2283)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.8.1 to 2.9.0.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/17d0e2bd7d51742c71671bd19fa12bdc9d40a3d6...0d381219ddf674d61a7572ddd19d7941e271515c)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Add big reformat commits to ignore revs for blame (#2286)

* Bump github/codeql-action from 3.25.13 to 3.25.15 (#2288)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.13 to 3.25.15.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/2d790406f505036ef40ecba973cc774a50395aac...afb54ba388a7dca6ecae48f608c4ff05ff4cc77a)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump ossf/scorecard-action from 2.3.3 to 2.4.0 (#2287)

Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.3.3 to 2.4.0.
- [Release notes](https://github.com/ossf/scorecard-action/releases)
- [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md)
- [Commits](https://github.com/ossf/scorecard-action/compare/dc50aa9510b46c811795eb24b2f1ba02a914e534...62b2cac7ed8198b15735ed49ab1e5cf35480ba46)

---
updated-dependencies:
- dependency-name: ossf/scorecard-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Sparse - coo2csr: deactivating test on PVC (#2290)

* Sparse - coo2csr: deactivating test on PVC

With this test deactivated everything in Kokkos Kernels passes.
Let us confirm that with a nightly build and potentially create
of SYCL CI-build.

* Protecting SYCL check with preprocessor guard

* cmake: add CMake language support for CUDA/HIP (#2173)

* cmake: add CMake language support for CUDA/HIP

* Apply suggestions of @brian-kelley

* Make sure clang is using a newer GCC on weaver (#2289)

It's clear from the name of the AT job GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10 that
the intent was to use gcc-9.3 but gcc-7.4.0 was the one actually being
used because the clang-13 module loads that gcc.

* Set version number to 4.4.0

* Revert "Changes to enable OneAPI usage with Trilinos build (#2185)"

This reverts commit b87dc951f64463fe5d6d4d886c0496ee804b2715.

* Add support for BSRs to sptrsv (#2281)

* Some cleanup and refactoring

* First round of cleanup complete

* Fix a couple warnings

* formatting

* Cleanup progress

* Fixes

* Remove Upper/Lower TriLvlSchedTP2SolverFunctors

* Remove Upper/Lower single block functors

* Remove unused TriLvlSchedTP1SingleBlockFunctorDiagValues and merge upper/lower tri_solve_cg

* Merge two big upper/lower branch of tri_solve_chain

* Merge upper/lower tri_solve_streams

* progres

* progress

* Progress, test added

* progress and fixes

* prog

* prog but broken

* Prog with debug prints

* Remove extreme debug printing

* works

* all working

* Remove test mangling

* Switch over block spiluk precond test to use new block sptrsv

* More test cleanup

* Fixes for GPU warnings

* Conflicts resolved, still work needed

* builds

* Formatting

* Update work to latest format style

* Remove unused functions. Remove prints. Add barriers

* Minor fixes

* lset is not used

* Fix for clang

* formatting

* New impl approach

* Unset macro

* There's no reason to limit team-policy alg to MAX_VEC_SIZE

* formatting

* Add missing kokkos-inline-funcs

* Fix warnings

* format

* Some compilers throw shadow warnings in static functions (#2297)

* Update changelog for 4.4.00

* Remove SYCL GEMV update

The SYCL GEMV update has been reverted in a later PR as the fix required was for the GEMV algorithm in the Sparse namespace, also known as SpMV. The previous BLAS::gemv fix was reverted

* Add entry for #2281, remove #2185

* docs.yaml: update kokkos version to 4.3.01

* update min kokkos version to 4.3.01

* apply clang-format

* update yaml files to use kokkos v 4.3.01

* github workflow: adding release workflow to have provenance for release artifacts (#2304)

This adds a workflow based on the scorecard recommendation for software provenance.
The workflow requires some manual actions to verify that the signiture in the release is correct.

* update master_history.txt for 4.4.00

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Luc Berger <lberge@sandia.gov>
Co-authored-by: Carl Pearson <cwpearson@users.noreply.github.com>
Co-authored-by: Carl Pearson <cwpears@sandia.gov>
Co-authored-by: Ernesto Prudencio <eeprude@sandia.gov>
Co-authored-by: Brian Kelley <bmkelle@sandia.gov>
Co-authored-by: James Foucar <jgfouca@sandia.gov>
Co-authored-by: Junchao Zhang <jczhang@anl.gov>
Co-authored-by: Damien L-G <dalg24+github@gmail.com>
Co-authored-by: Caleb Schilly <cwschilly@gmail.com>
Co-authored-by: Damien L-G <dalg24@gmail.com>
Co-authored-by: Sean Miller <sean.miller@amd.com>
Co-authored-by: Vinh Dang <vqdang@sandia.gov>
Co-authored-by: Luc Berger-Vergiat (-EXP) <lberge@trappist.son.sandia.gov>
Co-authored-by: brian-kelley <brian.honda11@gmail.com>
Co-authored-by: eeprude <119708086+eeprude@users.noreply.github.com>
Co-authored-by: Wyatt Horne <68676884+wjhorne@users.noreply.github.com>
Co-authored-by: whorne <whorne@sandia.gov>
Co-authored-by: Evan Harvey <eharvey@sandia.gov>
Co-authored-by: malphil <malphil@sandia.gov>
Co-authored-by: Evan Harvey <57234914+e10harvey@users.noreply.github.com>
Co-authored-by: Malachi <malachi2@illinois.edu>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: yasahi-hpc <57478230+yasahi-hpc@users.noreply.github.com>
Co-authored-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>
Co-authored-by: Daniel Arndt <arndtd@ornl.gov>
Co-authored-by: Baptiste Legouix <stilynx51@gmail.com>
Co-authored-by: Richard Berger <richard.berger@outlook.com>
brian-kelley added a commit to brian-kelley/kokkos-kernels that referenced this pull request Aug 20, 2024
If building KokkosKernels standalone, leave int as the default offset
(this was the case since kokkos#2140). But if building KokkosKernels as a
Trilinos/Tribits package, then make size_t the default offset because
this is what the Tpetra stack currently uses.

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>
brian-kelley added a commit that referenced this pull request Aug 20, 2024
If building KokkosKernels standalone, leave int as the default offset
(this was the case since #2140). But if building KokkosKernels as a
Trilinos/Tribits package, then make size_t the default offset because
this is what the Tpetra stack currently uses.

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>
ndellingwood pushed a commit to ndellingwood/kokkos-kernels that referenced this pull request Sep 4, 2024
If building KokkosKernels standalone, leave int as the default offset
(this was the case since kokkos#2140). But if building KokkosKernels as a
Trilinos/Tribits package, then make size_t the default offset because
this is what the Tpetra stack currently uses.

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>
ndellingwood added a commit that referenced this pull request Sep 12, 2024
* Restore size_t as default offset, in Tribits builds (#2313)

If building KokkosKernels standalone, leave int as the default offset
(this was the case since #2140). But if building KokkosKernels as a
Trilinos/Tribits package, then make size_t the default offset because
this is what the Tpetra stack currently uses.

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>

* Improve crs/bsr sorting performance (#2293)

* CRS sorting improvements

- Wrote bulk sort/permutation based sorting for CRS graph, matrix, and
  BSR matrix (bulk = one large sort of all the entries, using row-major
  dense index as keys)
  - This is more performant for imbalanced entries per row
- If matrix dimensions are too large to do bulk sort, fall back to
  sorting within each row with a thread.

* Add perf test for sort_crs_matrix
* sort_crs: improve parallel labels
* Work around kokkos issue 7036
* sort_crs: replace radix sort lambda with functor
(Lambda segfaults with nvcc+openmp)
---------
Signed-off-by: Brian Kelley <bmkelle@sandia.gov>

* SpAdd handle: delete sort_option getter/setter (#2296)

SpAdd handle was originally a copy-paste of the spgemm
handle way back in #122, and included get_sort_option() and
set_sort_option() from spgemm. But these try to use the member
bool sort_option, which doesn't exist. Somehow these functions never
produced compile errors until someone tried to call them.

* Improve GH action to produce release artifacts (#2312)

* coo2csr: add parens to function calls (#2318)

* Update changelog

* Update master_history.txt

* .github/workflows: Group jobs under common github-AT2 name (#2320)

* Update master_history.txt

---------

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>
Co-authored-by: brian-kelley <bmkelle@sandia.gov>
Co-authored-by: Damien L-G <dalg24@gmail.com>
Co-authored-by: Carl Pearson <cwpearson@users.noreply.github.com>
Co-authored-by: Evan Harvey <57234914+e10harvey@users.noreply.github.com>
lucbv pushed a commit that referenced this pull request Sep 13, 2024
* Set version number to 4.4.01

* [4.4.01] Patches to 4.4.01 (#2327)

* Restore size_t as default offset, in Tribits builds (#2313)

If building KokkosKernels standalone, leave int as the default offset
(this was the case since #2140). But if building KokkosKernels as a
Trilinos/Tribits package, then make size_t the default offset because
this is what the Tpetra stack currently uses.

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>

* Improve crs/bsr sorting performance (#2293)

* CRS sorting improvements

- Wrote bulk sort/permutation based sorting for CRS graph, matrix, and
  BSR matrix (bulk = one large sort of all the entries, using row-major
  dense index as keys)
  - This is more performant for imbalanced entries per row
- If matrix dimensions are too large to do bulk sort, fall back to
  sorting within each row with a thread.

* Add perf test for sort_crs_matrix
* sort_crs: improve parallel labels
* Work around kokkos issue 7036
* sort_crs: replace radix sort lambda with functor
(Lambda segfaults with nvcc+openmp)
---------
Signed-off-by: Brian Kelley <bmkelle@sandia.gov>

* SpAdd handle: delete sort_option getter/setter (#2296)

SpAdd handle was originally a copy-paste of the spgemm
handle way back in #122, and included get_sort_option() and
set_sort_option() from spgemm. But these try to use the member
bool sort_option, which doesn't exist. Somehow these functions never
produced compile errors until someone tried to call them.

* Improve GH action to produce release artifacts (#2312)

* coo2csr: add parens to function calls (#2318)

* Update changelog

* Update master_history.txt

* .github/workflows: Group jobs under common github-AT2 name (#2320)

* Update master_history.txt

---------

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>
Co-authored-by: brian-kelley <bmkelle@sandia.gov>
Co-authored-by: Damien L-G <dalg24@gmail.com>
Co-authored-by: Carl Pearson <cwpearson@users.noreply.github.com>
Co-authored-by: Evan Harvey <57234914+e10harvey@users.noreply.github.com>

* Update master_history.txt for 4.4.01

---------

Signed-off-by: Brian Kelley <bmkelle@sandia.gov>
Co-authored-by: brian-kelley <bmkelle@sandia.gov>
Co-authored-by: Damien L-G <dalg24@gmail.com>
Co-authored-by: Carl Pearson <cwpearson@users.noreply.github.com>
Co-authored-by: Evan Harvey <57234914+e10harvey@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants