#ifndef ARM_COMPUTE_CPU_GEMMLOWP_REDUCTION_KERNEL_H
#define ARM_COMPUTE_CPU_GEMMLOWP_REDUCTION_KERNEL_H
struct GEMMLowpReductionKernelInfo;
const char *name() const override;
CpuGemmLowpMatrixAReductionKernelPtr _func{ nullptr };
bool _mul_by_scalar{ false };
const char *name() const override;
template <typename T>
CpuGemmLowpMatrixBReductionKernelPtr _func{ nullptr };
int32_t _scalar{ 0 };
bool _mul_by_scalar{ false };
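The `_func{ nullptr }` member together with `template <typename T>` suggests a common pattern: a templated implementation is chosen once, stored as a pointer to member function, and called through that pointer at run time. The sketch below illustrates the pattern only. `ToyReductionKernel`, its `DataType` enum, `run_internal` and `KernelPtr` are hypothetical names, and the real kernel's signatures are not visible in this listing.

```cpp
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a reduction kernel; not the library class.
class ToyReductionKernel
{
public:
    enum class DataType { U8, S8 };

    // Select the typed implementation once, at configuration time.
    void configure(DataType dt)
    {
        switch (dt)
        {
            case DataType::U8: _func = &ToyReductionKernel::run_internal<uint8_t>; break;
            case DataType::S8: _func = &ToyReductionKernel::run_internal<int8_t>; break;
        }
    }

    // Dispatch through the stored pointer to member function.
    int32_t run(const std::vector<uint8_t> &raw) const
    {
        if (_func == nullptr)
        {
            throw std::logic_error("kernel not configured");
        }
        return (this->*_func)(raw);
    }

private:
    using KernelPtr = int32_t (ToyReductionKernel::*)(const std::vector<uint8_t> &) const;

    // Interpret each raw byte as T and accumulate into a 32-bit sum.
    template <typename T>
    int32_t run_internal(const std::vector<uint8_t> &raw) const
    {
        int32_t sum = 0;
        for (uint8_t byte : raw)
        {
            sum += static_cast<T>(byte);
        }
        return sum;
    }

    KernelPtr _func{ nullptr };
};
```

Storing the pointer at configuration time keeps the data-type decision out of the per-window execution path, which is one plausible reason for the `_func` member.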
const Window & window() const
The maximum window the kernel can be executed on.
Kernel used to compute the row-vectors of sums of all the entries in each row of Matrix A...
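As a scalar reference for what a row reduction of Matrix A computes, here is a minimal sketch assuming a row-major buffer of signed 8-bit values. The function name `row_sums` and its parameters are hypothetical. The optional multiplication echoes the `_mul_by_scalar` flag seen in the header, but its exact semantics in the library are an assumption here.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Sum every row of a row-major (rows x cols) 8-bit matrix into 32-bit accumulators.
// If mul_by_scalar is set, each row sum is multiplied by scalar (assumed semantics).
std::vector<int32_t> row_sums(const std::vector<int8_t> &a, std::size_t rows, std::size_t cols,
                              int32_t scalar = 1, bool mul_by_scalar = false)
{
    std::vector<int32_t> sums(rows, 0);
    for (std::size_t r = 0; r < rows; ++r)
    {
        for (std::size_t c = 0; c < cols; ++c)
        {
            sums[r] += static_cast<int32_t>(a[r * cols + c]);
        }
        if (mul_by_scalar)
        {
            sums[r] *= scalar;
        }
    }
    return sums;
}
```

For the 2 x 3 matrix {1, 2, 3, 4, 5, 6} this yields {6, 15}, and {-12, -30} when multiplied by a scalar of -2.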
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE(CpuGemmLowpMatrixAReductionKernel)
Common interface for all kernels implemented in C++.
Store the tensor's metadata.
void run_op(ITensorPack &tensors, const Window &window, const ThreadInfo &info) override
Execute the kernel on the passed window.
Interface for CPU tensor.
Copyright (c) 2017-2021 Arm Limited.
static Status validate(const ITensorInfo *src, const ITensorInfo *dst, const GEMMLowpReductionKernelInfo &info)
Static function to check if given info will lead to a valid configuration.
Kernel used to compute the row-vectors of sums of all the entries in each column of Matrix B...
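A column reduction of Matrix B can be written as the following scalar reference, assuming a row-major buffer of unsigned 8-bit values. The function name `col_sums` and its parameters are hypothetical, and the sketch says nothing about how the library vectorises or tiles the work.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Sum every column of a row-major (rows x cols) 8-bit matrix into 32-bit accumulators.
// The matrix is walked row by row so that memory is read contiguously.
std::vector<int32_t> col_sums(const std::vector<uint8_t> &b, std::size_t rows, std::size_t cols)
{
    std::vector<int32_t> sums(cols, 0);
    for (std::size_t r = 0; r < rows; ++r)
    {
        for (std::size_t c = 0; c < cols; ++c)
        {
            sums[c] += static_cast<int32_t>(b[r * cols + c]);
        }
    }
    return sums;
}
```

For the 2 x 3 matrix {1, 2, 3, 4, 5, 6} the result is the row vector {5, 7, 9}, one entry per column.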
void configure(const ITensorInfo *src, ITensorInfo *dst, const GEMMLowpReductionKernelInfo &info)
Initialise the kernel's input and output.
Information about executing thread and CPU.
const char * name() const override
Name of the kernel.
Describe a multidimensional execution window.
CpuGemmLowpMatrixAReductionKernel()=default
Default constructor.