Refactor ck_tile fMHA forward example #1249

poyenc · 2024-04-17T16:35:57Z

No description provided.

rocking5566 · 2024-04-18T20:48:04Z

include/ck_tile/core/container/span.hpp

rocking5566 · 2024-04-18T21:02:46Z

include/ck_tile/host/reference/reference_batched_elementwise.hpp

-CK_TILE_HOST void reference_batched_elementwise(const HostTensor<ADataType>& a_b_m_n,
-                                                const HostTensor<BDataType>& b_b_m_n,
-                                                HostTensor<CDataType>& c_b_m_n,
+CK_TILE_HOST void reference_batched_elementwise(const ATensorView& a_b_m_n,


I suggest using Tensor instead of TensorView. Because we also have
struct tensor_view.
Using TensorView easily make reader confuse.

include/ck_tile/host/host_tensor.hpp

rocking5566 · 2024-04-18T21:11:07Z

include/ck_tile/host/host_tensor.hpp

-    std::size_t GetOffsetFromMultiIndex(Is... is) const
+    std::enable_if_t<((std::is_integral_v<Is> && std::is_convertible_v<Is, std::size_t>)&&...),
+                     std::size_t>
+    GetOffsetFromMultiIndex(Is... is) const


Do we need to sync the naming style of funcion?

sure, I will rename the functions

rocking5566 · 2024-04-18T21:29:24Z

include/ck_tile/host/reference/reference_batched_gemm.hpp

-CK_TILE_HOST void reference_batched_gemm(const HostTensor<ADataType>& a_b_m_k,
-                                         const HostTensor<BDataType>& b_b_n_k,
-                                         HostTensor<CDataType>& c_b_m_n,
+CK_TILE_HOST void reference_batched_gemm(const ATensorView& a_b_m_k,


TensorView -> Tensor

rocking5566 · 2024-04-18T21:29:48Z

include/ck_tile/host/reference/reference_batched_masking.hpp

@@ -9,11 +9,13 @@

 namespace ck_tile {

-template <typename CDataType, typename MaskingType>
-CK_TILE_HOST void reference_batched_masking(HostTensor<CDataType>& c_b_m_n, const MaskingType& mask)
+template <typename CTensorView, typename MaskingType>


TensorView->HostTensor

rocking5566 · 2024-04-18T21:30:23Z

include/ck_tile/host/reference/reference_batched_softmax.hpp

-          typename CompDataType,
-          typename BDataType,
+template <typename CompDataType,
+          typename ATensorView,


TensorView->Tensor

include/ck_tile/host/reference/reference_gemm.hpp

include/ck_tile/host/reference/reference_im2col.hpp

…mha-fwd-example

rocking5566

LGTM

poyenc · 2024-04-25T01:12:07Z

we need to wait for @danyao12 merge his fmha bwd & dropout changes then refactor all the updated example codes together.

…mha-fwd-example

poyenc · 2024-04-25T13:17:41Z

I will continue developing the fmha fwd + KV cache reference function base on current design of HostTensor<>.

…mha-fwd-example

Refactor ck_tile fMHA forward example

ed524f6

poyenc requested a review from rocking5566 April 17, 2024 16:35

poyenc self-assigned this Apr 17, 2024

poyenc requested review from zjing14, junliume, illsilin, carlushuang and aosewski as code owners April 17, 2024 16:35

poyenc added 12 commits April 17, 2024 16:42

Re-order include directives

e555d5f

Unify naming style

adc0d20

Add comment for intermediate tensors

b279d95

Remove qualified name

91556a1

Use better comment for tensor views

9ff7714

Use standard way to determine iterator category

fde9b86

Support more operations in permutation_iterator

1659b37

Add zip_iterator<>

4f8aced

Remove unused include directive

6b196bc

Add transform_iterator<>

9bb9361

Support operator- for zip_iterator<>

9153db0

Remove unnecessary data member

4d2b0ef