24.02.1
|
Go to the documentation of this file.
48 ITensorPack select_activation_src_dst(ITensorPack &tensors)
56 DirectConvComputeKernelInfo
57 config_direct_convolution_nhwc(
const ITensorInfo *
src,
const ITensorInfo *weights,
const PadStrideInfo &
conv_info)
87 auto k = std::make_unique<kernels::ClDirectConv2dKernel>();
89 k->configure(compile_context,
src, weights, biases,
dst,
conv_info, conv2d_act_info, desc);
90 _direct_conv_kernel = std::move(k);
98 auto b = std::make_unique<CLFillBorderKernel>();
100 _src_border_handler = std::move(
b);
105 auto a = std::make_unique<kernels::ClActivationKernel>();
107 _activation_kernel = std::move(a);
140 if (_activation_kernel)
142 auto act_pack = select_activation_src_dst(tensors);
@ CONSTANT
Pixels outside the image are assumed to have a constant value.
SimpleTensor< float > src
Class describing the value of a pixel for any image format.
@ NHWC
Num samples, height, width, channels.
void add_tensor(int id, ITensor *tensor)
Add tensor to the pack.
#define ARM_COMPUTE_RETURN_ON_ERROR(status)
Checks if a status contains an error and returns it.
Activation Layer Information class.
#define ARM_COMPUTE_ERROR_ON_NULLPTR(...)
bool enabled() const
Check if initialised.
Interface to enqueue OpenCL kernels and get/set the OpenCL CommandQueue and ICLTuner.
void run(ITensorPack &tensors) override
Run the kernels contained in the function.
void tune_kernel_static(ICLKernel &kernel)
Tunes OpenCL kernel.
static Status validate(const ITensorInfo *src, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *dst, const PadStrideInfo &conv_info, const ActivationLayerInfo &act_info=ActivationLayerInfo())
Static function to check if given info will lead to a valid configuration.
static CLScheduler & get()
Access the scheduler singleton.
GPUTarget target() const
Get the target GPU.
GPUTarget
Available GPU Targets.
static std::unique_ptr< IClDirectConvKernelConfig > create(GPUTarget gpu)
Static method to call the ClDirectConvolution kernel configuration class accordingly with the GPU tar...
void configure(const CLCompileContext &compile_context, ITensorInfo *src, ITensorInfo *weights, ITensorInfo *biases, ITensorInfo *dst, const PadStrideInfo &conv_info, const ActivationLayerInfo &act_info=ActivationLayerInfo())
Set the src and dst tensors.
Copyright (c) 2017-2024 Arm Limited.
bool is_data_type_float(DataType dt)
Check if a given data type is of floating point type.
bool is_data_type_quantized_asymmetric(DataType dt)
Check if a given data type is of asymmetric quantized type.
void enqueue_op(ICLKernel &kernel, ITensorPack &tensors, bool flush=true)
Schedule the execution of the passed kernel if possible.
Status validate(const ITensorInfo *scores_in, const ITensorInfo *boxes_in, const ITensorInfo *batch_splits_in, const ITensorInfo *scores_out, const ITensorInfo *boxes_out, const ITensorInfo *classes, const ITensorInfo *batch_splits_out, const ITensorInfo *keeps, const ITensorInfo *keeps_size, const BoxNMSLimitInfo info)
Store the tensor's metadata.
#define ARM_COMPUTE_LOG_PARAMS(...)
Compute descriptor used by the direct convolution kernel.