CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers

threadblock → transform Relation

File in include/cutlass/epilogue/threadblockIncludes file in include/cutlass/transform
default_epilogue_complex_tensor_op.hthreadblock / regular_tile_iterator_pitch_linear.h
default_epilogue_simt.hthreadblock / regular_tile_iterator_pitch_linear.h
default_epilogue_tensor_op.hthreadblock / regular_tile_iterator_pitch_linear.h
default_epilogue_volta_tensor_op.hthreadblock / regular_tile_iterator_pitch_linear.h
default_epilogue_wmma_tensor_op.hthreadblock / regular_tile_iterator_pitch_linear.h
epilogue.hpitch_linear_thread_map.h
epilogue.hthreadblock / regular_tile_iterator.h
epilogue/threadblock/predicated_tile_iterator.hpitch_linear_thread_map.h
epilogue_base.hpitch_linear_thread_map.h
interleaved_epilogue.hpitch_linear_thread_map.h
interleaved_epilogue.hthreadblock / regular_tile_iterator.h