24.02.1
|
Go to the documentation of this file.
24 #ifndef ACL_SRC_GPU_CL_KERNELS_CLGEMMMATRIXMULTIPLYRESHAPEDONLYRHSKERNEL_H
25 #define ACL_SRC_GPU_CL_KERNELS_CLGEMMMATRIXMULTIPLYRESHAPEDONLYRHSKERNEL_H
107 bool _slide_matrix_b{
true};
108 bool _reinterpret_input_as_3d{
false};
109 bool _reinterpret_output_as_3d{
false};
110 bool _use_dummy_work_items{
false};
111 bool _add_bias{
false};
112 bool _export_to_cl_image{
false};
113 bool _has_pad_y{
false};
121 #endif // ACL_SRC_GPU_CL_KERNELS_CLGEMMMATRIXMULTIPLYRESHAPEDONLYRHSKERNEL_H
OpenCL kernel to multiply matrices when only the input matrix RHS (src1) has been reshaped.
void configure(const ClCompileContext &compile_context, const ITensorInfo *src0, const ITensorInfo *src1, const ITensorInfo *src2, ITensorInfo *dst, float alpha, float beta, const GEMMLHSMatrixInfo &lhs_info, const GEMMRHSMatrixInfo &rhs_info, const GEMMKernelInfo &gemm_info)
Initialise the kernel's input and output.
ClGemmMatrixMultiplyReshapedOnlyRhsKernel()
Descriptor used by the GEMM kernels.
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE(ClGemmMatrixMultiplyReshapedOnlyRhsKernel)
void run_op(ITensorPack &tensors, const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue.
Common interface for all the OpenCL kernels.
static Status validate(const ITensorInfo *src0, const ITensorInfo *src1, const ITensorInfo *src2, const ITensorInfo *dst, float alpha, float beta, const GEMMLHSMatrixInfo &lhs_info, const GEMMRHSMatrixInfo &rhs_info, const GEMMKernelInfo &gemm_info)
Static function to check if given info will lead to a valid configuration.
const Window & window() const
The maximum window the kernel can be executed on.
GEMM LHS (Left Hand Side) matrix information.
Describe a multidimensional execution window.
Copyright (c) 2017-2024 Arm Limited.
Store the tensor's metadata.
GEMM RHS (Right Hand Side) matrix information.