CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers

gemm → arch Relation

File in include/cutlass/gemmIncludes file in include/cutlass/arch
kernel / default_gemm.hwmma.h
device / default_gemm_configuration.harch.h
device / default_gemm_configuration.harch/mma.h
device / default_gemm_configuration.hwmma.h
threadblock / default_mma.harch.h
threadblock / default_mma.hwmma.h
threadblock / default_mma_core_wmma.hwmma.h
warp / default_mma_wmma_tensor_op.hwmma.h
device / device/gemm_batched.harch.h
device / device/gemm_splitk_parallel.harch.h
thread / gemm/thread/mma.harch/mma.h
thread / gemm/thread/mma_sm50.harch/mma.h
device / include/cutlass/gemm/device/gemm.harch.h
device / include/cutlass/gemm/device/gemm_complex.harch.h
threadblock / mma_base.hmemory.h
warp / mma_complex_tensor_op.hmemory_sm75.h
warp / mma_complex_tensor_op.hmma_sm75.h
warp / mma_tensor_op.hmemory_sm75.h
warp / mma_tensor_op.hmma_sm75.h
warp / mma_tensor_op_sm70.harch/mma.h
warp / mma_tensor_op_tile_iterator.hmemory_sm75.h
warp / mma_tensor_op_tile_iterator_wmma.hwmma.h
warp / mma_tensor_op_wmma.hwmma.h