24 #ifndef ARM_COMPUTE_CLGEMMLOWPOFFSETCONTRIBUTIONOUTPUTSTAGEKERNEL_H 25 #define ARM_COMPUTE_CLGEMMLOWPOFFSETCONTRIBUTIONOUTPUTSTAGEKERNEL_H 96 int32_t a_offset, int32_t b_offset,
132 bool _is_quantized_per_channel;
const Window & window() const
The maximum window the kernel can be executed on.
void run(const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue...
void configure(const ICLTensor *mm_result, const ICLTensor *vector_sum_col, const ICLTensor *vector_sum_row, const ICLTensor *bias, ICLTensor *output, int32_t k, int32_t a_offset, int32_t b_offset, const GEMMLowpOutputStageInfo &output_stage, const ICLTensor *output_multipliers, const ICLTensor *output_shifts)
Initialise the kernel's input and output.
OpenCL kernel used to add the offset contribution after the matrix multiplication and perform the out...
Store the tensor's metadata.
Common interface for all the OpenCL kernels.
CLGEMMLowpOffsetContributionOutputStageKernel & operator=(const CLGEMMLowpOffsetContributionOutputStageKernel &)=delete
Prevent instances of this class from being copied (As this class contains pointers) ...
Copyright (c) 2017-2021 Arm Limited.
static Status validate(const ITensorInfo *mm_result, const ITensorInfo *vector_sum_col, const ITensorInfo *vector_sum_row, const ITensorInfo *bias, const ITensorInfo *output, int32_t a_offset, int32_t b_offset, const GEMMLowpOutputStageInfo &output_stage, const ITensorInfo *output_multipliers, const ITensorInfo *output_shifts)
Static function to check if given info will lead to a valid configuration of CLGEMMLowpOffsetContribu...
GEMMLowp output stage info.
Interface for OpenCL tensor.
CLGEMMLowpOffsetContributionOutputStageKernel()
Constructor.
Describe a multidimensional execution window.