24.02.1
#ifndef ACL_SRC_GPU_CL_KERNELS_CLGEMMMATRIXMULTIPLYNATIVEKERNEL_H
#define ACL_SRC_GPU_CL_KERNELS_CLGEMMMATRIXMULTIPLYNATIVEKERNEL_H
bool _slide_matrix_b{true};
bool _reinterpret_input_as_3d{false};
bool _reinterpret_output_as_3d{false};
bool _use_dummy_work_items{false};
bool _add_bias{false};
#endif // ACL_SRC_GPU_CL_KERNELS_CLGEMMMATRIXMULTIPLYNATIVEKERNEL_H
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE(ClGemmMatrixMultiplyNativeKernel)
void run_op(ITensorPack &tensors, const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue.
Descriptor used by the GEMM kernels.
void configure(const ClCompileContext &compile_context, ITensorInfo *src0, ITensorInfo *src1, ITensorInfo *src2, ITensorInfo *dst, float alpha, float beta, const GEMMLHSMatrixInfo &lhs_info, const GEMMRHSMatrixInfo &rhs_info, const GEMMKernelInfo &gemm_info)
Initialise the kernel's inputs and destination (dst).
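To make the roles of the configure() arguments concrete, the following is a minimal CPU sketch of the arithmetic a GEMM of this shape is usually taken to perform: dst = alpha * (src0 x src1) + beta * src2. It is not the library's implementation. The function name reference_gemm, the row-major layout, and the assumption that src2 is a full M x N bias matrix are all illustrative choices; the real kernel runs in OpenCL and may treat the bias differently (for example by broadcasting it).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference, not part of the library.
// Computes dst = alpha * (lhs x rhs) + beta * bias for row-major matrices.
// Assumed shapes: lhs is M x K, rhs is K x N, bias and the result are M x N.
std::vector<float> reference_gemm(const std::vector<float> &lhs,
                                  const std::vector<float> &rhs,
                                  const std::vector<float> &bias,
                                  std::size_t M, std::size_t N, std::size_t K,
                                  float alpha, float beta)
{
    // Reject buffers whose sizes do not match the stated shapes.
    assert(lhs.size() == M * K);
    assert(rhs.size() == K * N);
    assert(bias.size() == M * N);

    std::vector<float> dst(M * N, 0.0f);
    for (std::size_t m = 0; m < M; ++m)
    {
        for (std::size_t n = 0; n < N; ++n)
        {
            // Dot product of row m of lhs with column n of rhs.
            float acc = 0.0f;
            for (std::size_t k = 0; k < K; ++k)
            {
                acc += lhs[m * K + k] * rhs[k * N + n];
            }
            // alpha scales the matrix product, beta scales the bias term.
            dst[m * N + n] = alpha * acc + beta * bias[m * N + n];
        }
    }
    return dst;
}
```

A reference like this is mainly useful for checking the output of an accelerated kernel on small inputs, where the triple loop is cheap and easy to inspect.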
OpenCL kernel to multiply matrices when neither of the input matrices has been reshaped.
Common interface for all the OpenCL kernels.
ClGemmMatrixMultiplyNativeKernel()
const Window & window() const
The maximum window the kernel can be executed on.
GEMM LHS (Left Hand Side) matrix information.
Describe a multidimensional execution window.
Copyright (c) 2017-2024 Arm Limited.
Store the tensor's metadata.
GEMM RHS (Right Hand Side) matrix information.
static Status validate(const ITensorInfo *src0, const ITensorInfo *src1, const ITensorInfo *src2, const ITensorInfo *dst, float alpha, float beta, const GEMMLHSMatrixInfo &lhs_info, const GEMMRHSMatrixInfo &rhs_info, const GEMMKernelInfo &gemm_info)
Static function to check if given info will lead to a valid configuration.
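The static validate() and the member configure() above take the same arguments, which suggests a check-then-configure calling pattern. The sketch below illustrates that pattern with stand-in types only. SketchShape, SketchStatus, and SketchKernel are hypothetical and are not the library's classes, and the shape checks shown are assumptions about what a GEMM validation might reject, not a description of the real checks.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Stand-in types for illustration only; they are NOT the library's classes.
struct SketchShape
{
    std::size_t rows;
    std::size_t cols;
};

struct SketchStatus
{
    bool        ok;
    std::string message;
};

class SketchKernel
{
public:
    // Static check: needs no kernel instance and changes no state.
    static SketchStatus validate(const SketchShape &lhs, const SketchShape &rhs, const SketchShape &dst)
    {
        if (lhs.cols != rhs.rows)
        {
            return {false, "inner dimensions of lhs and rhs differ"};
        }
        if (dst.rows != lhs.rows || dst.cols != rhs.cols)
        {
            return {false, "dst shape does not match lhs x rhs"};
        }
        return {true, ""};
    }

    // Marks the kernel as configured only if validation passes.
    bool configure(const SketchShape &lhs, const SketchShape &rhs, const SketchShape &dst)
    {
        _configured = validate(lhs, rhs, dst).ok;
        return _configured;
    }

    bool is_configured() const
    {
        return _configured;
    }

private:
    bool _configured{false};
};
```

Keeping validation static lets a caller reject an unsupported configuration before constructing or compiling anything, which is the usual motivation for this split.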