24.02.1
|
Go to the documentation of this file.
24 #ifndef ARM_COMPUTE_CLDEPTHWISECONVOLUTIONLAYERNATIVEKERNEL_H
25 #define ARM_COMPUTE_CLDEPTHWISECONVOLUTIONLAYERNATIVEKERNEL_H
84 const ICLTensor *output_multipliers =
nullptr,
85 const ICLTensor *output_shifts =
nullptr);
97 const ICLTensor *output_multipliers =
nullptr,
98 const ICLTensor *output_shifts =
nullptr);
123 unsigned int _depth_multiplier{0};
126 bool _export_input_to_cl_image{
false};
127 bool _export_weights_to_cl_image{
true};
128 bool _is_quantized{
false};
Interface for OpenCL tensor.
void run(const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue.
static Status validate(const ITensorInfo *input, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *output, const DWCComputeKernelInfo &dwc_info, const ConvolutionInfo &conv_info, const ITensorInfo *output_multipliers=nullptr, const ITensorInfo *output_shifts=nullptr)
Static function to check if given info will lead to a valid configuration of CLDepthwiseConvolutionLa...
Common interface for all the OpenCL kernels.
const Window & window() const
The maximum window the kernel can be executed on.
CLDepthwiseConvolutionLayerNativeKernel & operator=(const CLDepthwiseConvolutionLayerNativeKernel &)=delete
Prevent instances of this class from being copied (As this class contains pointers)
CLDepthwiseConvolutionLayerNativeKernel()
Default Constructor.
Describe a multidimensional execution window.
Copyright (c) 2017-2024 Arm Limited.
Compute descriptor used by the depthwise convolution native kernel.
Store the tensor's metadata.
void configure(const CLCompileContext &compile_context, ICLTensor *input, const ICLTensor *weights, const ICLTensor *biases, ICLTensor *output, const DWCComputeKernelInfo &dwc_info, const ConvolutionInfo &conv_info, const ICLTensor *output_multipliers=nullptr, const ICLTensor *output_shifts=nullptr)
Initialize the function's source, destination and parameters.
Interface for the kernel to run a MxN depthwise convolution.