24 #ifndef ARM_COMPUTE_CLDEPTHWISECONVOLUTIONNCHWKERNEL3x3_H 25 #define ARM_COMPUTE_CLDEPTHWISECONVOLUTIONNCHWKERNEL3x3_H 59 const ICLTensor *output_multipliers =
nullptr,
const ICLTensor *output_shifts =
nullptr)
override;
80 const ICLTensor *output_multipliers =
nullptr,
const ICLTensor *output_shifts =
nullptr)
override;
109 unsigned int _conv_stride_x;
110 unsigned int _conv_pad_top;
111 unsigned int _conv_pad_left;
const Window & window() const
The maximum window the kernel can be executed on.
Interface for the kernel to run a 3x3 depthwise convolution on a tensor.
Container for 2D border size.
static Status validate(const ITensorInfo *input, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *output, const PadStrideInfo &conv_info, unsigned int depth_multiplier=1, ActivationLayerInfo act_info=ActivationLayerInfo(), GPUTarget gpu_target=GPUTarget::MIDGARD, const Size2D &dilation=Size2D(1U, 1U), const ITensorInfo *output_multipliers=nullptr, const ITensorInfo *output_shifts=nullptr)
Static function to check if given info will lead to a valid configuration of CLDepthwiseConvolutionLa...
Store the tensor's metadata.
Activation Layer Information class.
Copyright (c) 2017-2021 Arm Limited.
Interface for the kernel to run a 3x3 depthwise convolution on a tensor when the data layout is NCHW...
CLDepthwiseConvolutionLayer3x3NCHWKernel()
Default constructor.
void run(const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue...
Padding and stride information class.
void configure(const ICLTensor *input, const ICLTensor *weights, const ICLTensor *biases, ICLTensor *output, const PadStrideInfo &conv_info, unsigned int depth_multiplier=1, ActivationLayerInfo act_info=ActivationLayerInfo(), const Size2D &dilation=Size2D(1U, 1U), const ICLTensor *output_multipliers=nullptr, const ICLTensor *output_shifts=nullptr) override
Initialize the function's source, destination, conv and border_size.
Interface for OpenCL tensor.
GPUTarget
Available GPU Targets.
Class for specifying the size of an image or rectangle.
Describe a multidimensional execution window.
BorderSize border_size() const override
The size of the border for that kernel.