CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers

device → reduction Relation

File in include/cutlass/gemm/device    Includes file in include/cutlass/reduction
device/gemm_splitk_parallel.h          kernel/reduce_split_k.h
device/gemm_splitk_parallel.h          thread/reduction_operators.h