[Bf-blender-cvs] [390b9f13050] master: Fix Cycles HIP assuming warp size 32

Brecht Van Lommel noreply at git.blender.org
Wed Mar 16 18:06:21 CET 2022


Commit: 390b9f1305059f5d8c7f944d44fc3e5821a3eb82
Author: Brecht Van Lommel
Date:   Wed Mar 16 15:03:38 2022 +0100
Branches: master
https://developer.blender.org/rB390b9f1305059f5d8c7f944d44fc3e5821a3eb82

Fix Cycles HIP assuming warp size 32

In HIP these masks are 64 bit, while in CUDA only 32 bit.

===================================================================

M	intern/cycles/kernel/device/hip/compat.h

===================================================================

diff --git a/intern/cycles/kernel/device/hip/compat.h b/intern/cycles/kernel/device/hip/compat.h
index 94d9d1273e8..29fbc119cd1 100644
--- a/intern/cycles/kernel/device/hip/compat.h
+++ b/intern/cycles/kernel/device/hip/compat.h
@@ -62,7 +62,7 @@ typedef unsigned long long uint64_t;
 #define ccl_gpu_block_idx_x (blockIdx.x)
 #define ccl_gpu_grid_dim_x (gridDim.x)
 #define ccl_gpu_warp_size (warpSize)
-#define ccl_gpu_thread_mask(thread_warp) uint(0xFFFFFFFF >> (ccl_gpu_warp_size - thread_warp))
+#define ccl_gpu_thread_mask(thread_warp) uint64_t(0xFFFFFFFFFFFFFFFF >> (64 - thread_warp))
 
 #define ccl_gpu_global_id_x() (ccl_gpu_block_idx_x * ccl_gpu_block_dim_x + ccl_gpu_thread_idx_x)
 #define ccl_gpu_global_size_x() (ccl_gpu_grid_dim_x * ccl_gpu_block_dim_x)



More information about the Bf-blender-cvs mailing list