Single-precision support for HIP variants #93

Merged

reuterbal merged 2 commits into develop from nams-hip-sp-single-dir

Nov 4, 2024

Contributor

MichaelSt98 commented Jul 17, 2024

HIP SP tested on LUMI via e.g.

./cloudsc-bundle build --clean --build-dir=build-sp-hip --arch=arch/eurohpc/lumi/cray-gpu/16.0.1 --with-hip --single-precision [--with-serialbox]

MichaelSt98 requested a review from reuterbal

July 18, 2024 09:40

reuterbal approved these changes

Collaborator

reuterbal left a comment

Many thanks, this looks great and confirmed to work on LUMI!

A few minor clean-up comments but no show-stoppers

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

@@ -10,17 +10,18 @@
 #include "cloudsc_validate.h"
-#include <float.h>
+#include <dtype.h>

Collaborator

reuterbal Jul 26, 2024

Should this be #include "dtype.h", or is the include redundant?

float.h was previously used for DBL_EPSILON, I think.
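For context on the question above, here is a minimal sketch of how a precision-switching header can make a direct `float.h` include unnecessary in the validation code. It is not the contents of the actual `dtype.h` in this PR: the macro names `EXAMPLE_SINGLE_PRECISION` and `DTYPE_EPSILON` and the helper `within_tolerance` are hypothetical, chosen only for illustration.

```cpp
#include <cfloat>
#include <cmath>

// Hypothetical compile-time switch; the macro actually used by the PR is not shown in this thread.
#ifdef EXAMPLE_SINGLE_PRECISION
typedef float dtype;
#define DTYPE_EPSILON FLT_EPSILON
#else
typedef double dtype;
#define DTYPE_EPSILON DBL_EPSILON
#endif

// Hypothetical helper: true if |value - reference| is within `factor` machine
// epsilons of the working precision, scaled by |reference| (or by 1 if reference is 0).
inline bool within_tolerance(dtype value, dtype reference, dtype factor) {
  const dtype diff = std::fabs(value - reference);
  const dtype magnitude = std::fabs(reference);
  const dtype scale = (magnitude > dtype(0)) ? magnitude : dtype(1);
  return diff <= factor * DTYPE_EPSILON * scale;
}
```

With a header along these lines, code that compares against a machine epsilon can use the precision-dependent constant instead of hard-coding `DBL_EPSILON`, so the same source builds for both precisions.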

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

Comment on lines 63 to 64

		// #pragma omp parallel for default(shared) private(b, bsize, jk) \
		// reduction(min:zminval) reduction(max:zmaxval,zmaxerr) reduction(+:zerrsum,zsum)

Collaborator

reuterbal Jul 26, 2024

I think you re-enabled this in the other PR?

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

Comment on lines 101 to 102

		// #pragma omp parallel for default(shared) private(b, bsize, jl, jk) \
		// reduction(min:zminval) reduction(max:zmaxval,zmaxerr) reduction(+:zerrsum,zsum)

Collaborator

reuterbal Jul 26, 2024

Same comment

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

Comment on lines 91 to 92

		// dtype (field)[nlev][nlon] = (dtype ()[nlev][nlon]) v_field;
		// dtype (reference)[nlev][nlon] = (dtype ()[nlev][nlon]) v_ref;

Collaborator

reuterbal Jul 26, 2024

debug leftover?

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

Comment on lines 54 to 55

		//dtype (field)[nlon] = (dtype ()[nlon]) v_field;
		//dtype (reference)[nlon] = (dtype ()[nlon]) v_ref;

Collaborator

reuterbal Jul 26, 2024

debug leftover?

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

Comment on lines 133 to 134

		// dtype (field)[nclv][nlev][nlon] = (dtype ()[nclv][nlev][nlon]) v_field;
		// dtype (reference)[nclv][nlev][nlon] = (dtype ()[nclv][nlev][nlon]) v_ref;

Collaborator

reuterbal Jul 26, 2024

debug leftover?

src/cloudsc_hip/cloudsc/cloudsc_validate.cpp Outdated

Comment on lines 142 to 143

		// #pragma omp parallel for default(shared) private(b, bsize, jl, jk, jm) \
		// reduction(min:zminval) reduction(max:zmaxval,zmaxerr) reduction(+:zerrsum,zsum)

Collaborator

reuterbal Jul 26, 2024

re-enable or remove?
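For readers unfamiliar with the pragmas under discussion, the following is a rough, self-contained sketch of the same reduction pattern applied to a flat array. It is not the code from `cloudsc_validate.cpp`: the `ErrorStats` struct and `compare_fields` function are hypothetical, and the blocked loop structure of the real validation routine is omitted. Only the reduction variable names mirror the commented-out pragmas.

```cpp
#include <cmath>
#include <limits>

// Hypothetical result container for the validation statistics.
struct ErrorStats {
  double minval, maxval, maxerr, errsum, sum;
};

// Compare `field` against `reference` over n elements, accumulating
// min/max of the field, max and summed absolute error, and summed |reference|.
ErrorStats compare_fields(const double* field, const double* reference, int n) {
  double zminval = std::numeric_limits<double>::max();
  double zmaxval = std::numeric_limits<double>::lowest();
  double zmaxerr = 0.0, zerrsum = 0.0, zsum = 0.0;

  // Each thread works on private copies of the reduction variables,
  // which OpenMP combines at the end of the loop. Without OpenMP enabled
  // at compile time the pragma is ignored and the loop runs serially.
  #pragma omp parallel for default(shared) \
      reduction(min:zminval) reduction(max:zmaxval,zmaxerr) reduction(+:zerrsum,zsum)
  for (int i = 0; i < n; ++i) {
    const double err = std::fabs(field[i] - reference[i]);
    if (field[i] < zminval) zminval = field[i];
    if (field[i] > zmaxval) zmaxval = field[i];
    if (err > zmaxerr) zmaxerr = err;
    zerrsum += err;
    zsum += std::fabs(reference[i]);
  }

  ErrorStats stats = {zminval, zmaxval, zmaxerr, zerrsum, zsum};
  return stats;
}
```

The `min` and `max` reduction operators require OpenMP 3.1 or later for C/C++. Leaving the pragmas commented out keeps the results correct but runs the validation loops serially.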

reuterbal changed the title ~~HIP SP~~ Single-precision support for HIP variants

reuterbal assigned MichaelSt98

MichaelSt98 added 2 commits

October 16, 2024 10:33

single precision HIP (via preprocessor macro(s))

ac9bbb5

Remove some debug leftovers and re-introduce OpenMP pragmas for HIP validation step

50863dc

MichaelSt98 force-pushed the nams-hip-sp-single-dir branch from ce87db3 to 50863dc Compare

October 16, 2024 07:34

MichaelSt98 requested a review from reuterbal

October 16, 2024 09:42

reuterbal approved these changes

Collaborator

reuterbal left a comment

Great, many thanks. Tested in conjunction with #97 and seems to work fine.

reuterbal merged commit f914829 into develop

18 checks passed

reuterbal deleted the nams-hip-sp-single-dir branch

November 4, 2024 16:41

reuterbal mentioned this pull request

Add Lumi CCE17 (17.0.1 with rocm 6.0.3) toolchain and env file #97

Merged
