24.02.1
|
Go to the documentation of this file.
24 #ifndef ARM_COMPUTE_CL_GEMMLOWP_OFFSET_CONTRIBUTION_KERNEL_H
25 #define ARM_COMPUTE_CL_GEMMLOWP_OFFSET_CONTRIBUTION_KERNEL_H
ClGemmLowpOffsetContributionKernel()
void run_op(ITensorPack &tensors, const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue.
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE(ClGemmLowpOffsetContributionKernel)
Common interface for all the OpenCL kernels.
void configure(const CLCompileContext &compile_context, const ITensorInfo *mm_result, const ITensorInfo *vector_sum_col, const ITensorInfo *vector_sum_row, const ITensorInfo *bias, int32_t k, int32_t a_offset, int32_t b_offset)
Initialise the kernel's input and output.
const Window & window() const
The maximum window the kernel can be executed on.
Describe a multidimensional execution window.
Copyright (c) 2017-2024 Arm Limited.
static Status validate(const ITensorInfo *mm_result, const ITensorInfo *vector_sum_col, const ITensorInfo *vector_sum_row, const ITensorInfo *bias, int32_t a_offset, int32_t b_offset)
Static function to check if given info will lead to a valid configuration.
Store the tensor's metadata.
OpenCL kernel used to add the offset contribution after the matrix multiplication.