24 #ifndef ARM_COMPUTE_CPU_GEMMLOWP_OFFSETCONTRIBUTION_OUTPUTSTAGE_KERNEL_H 25 #define ARM_COMPUTE_CPU_GEMMLOWP_OFFSETCONTRIBUTION_OUTPUTSTAGE_KERNEL_H 101 const char *
name()
const override;
105 int32_t _a_offset{ 0 };
106 int32_t _b_offset{ 0 };
107 int32_t _k_offset{ 0 };
108 bool _slide_vector_sum_col{
true };
const Window & window() const
The maximum window the kernel can be executed on.
Common interface for all kernels implemented in C++.
Kernel used to add the offset contribution and perform the output stage after CpuGemmLowpMatrixMultip...
const char * name() const override
Name of the kernel.
Store the tensor's metadata.
void configure(const ITensorInfo *mm_result, const ITensorInfo *vector_sum_col, const ITensorInfo *vector_sum_row, const ITensorInfo *bias, ITensorInfo *dst, int32_t k, int32_t a_offset, int32_t b_offset, GEMMLowpOutputStageInfo output_stage)
Initialise the kernel inputs and output.
Copyright (c) 2017-2021 Arm Limited.
CpuGemmLowpOffsetContributionOutputStageKernel()=default
Default constructor.
GEMMLowp output stage info.
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE(CpuGemmLowpOffsetContributionOutputStageKernel)
static Status validate(const ITensorInfo *mm_result, const ITensorInfo *vector_sum_col, const ITensorInfo *vector_sum_row, const ITensorInfo *bias, const ITensorInfo *dst, int32_t a_offset, int32_t b_offset, GEMMLowpOutputStageInfo output_stage)
Static function to check if given info will lead to a valid configuration.
void run_op(ITensorPack &tensors, const Window &window, const ThreadInfo &info) override
Execute the kernel on the passed window.
ScaleKernelInfo info(interpolation_policy, default_border_mode, PixelValue(), sampling_policy, false)
Information about executing thread and CPU.
Describe a multidimensional execution window.