21.02
|
Interface for softmax computation for QASYMM8 with pre-computed max. More...
#include <CpuSoftmaxKernel.h>
Public Member Functions | |
CpuLogits1DSoftmaxKernel () | |
Default constructor. More... | |
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE (CpuLogits1DSoftmaxKernel) | |
void | configure (const ITensorInfo *src, const ITensorInfo *max, ITensorInfo *dst, const float beta, ITensorInfo *tmp) |
Set the input and output tensors. More... | |
void | run_op (ITensorPack &tensors, const Window &window, const ThreadInfo &info) override |
Execute the kernel on the passed window. More... | |
const char * | name () const override |
Name of the kernel. More... | |
Public Member Functions inherited from ICPPKernel | |
virtual | ~ICPPKernel ()=default |
Default destructor. More... | |
virtual void | run (const Window &window, const ThreadInfo &info) |
Execute the kernel on the passed window. More... | |
virtual void | run_nd (const Window &window, const ThreadInfo &info, const Window &thread_locator) |
legacy compatibility layer for implemantions which do not support thread_locator In these cases we simply narrow the interface down the legacy version More... | |
Public Member Functions inherited from IKernel | |
IKernel () | |
Constructor. More... | |
virtual | ~IKernel ()=default |
Destructor. More... | |
virtual bool | is_parallelisable () const |
Indicates whether or not the kernel is parallelisable. More... | |
virtual BorderSize | border_size () const |
The size of the border for that kernel. More... | |
const Window & | window () const |
The maximum window the kernel can be executed on. More... | |
Static Public Member Functions | |
static Status | validate (const ITensorInfo *src, const ITensorInfo *max, const ITensorInfo *dst, const float beta, const ITensorInfo *tmp) |
Static function to check if given info will lead to a valid configuration of CpuLogits1DSoftmaxKernel. More... | |
Interface for softmax computation for QASYMM8 with pre-computed max.
Definition at line 65 of file CpuSoftmaxKernel.h.
Default constructor.
Definition at line 306 of file CpuSoftmaxKernel.cpp.
ARM_COMPUTE_DISALLOW_COPY_ALLOW_MOVE | ( | CpuLogits1DSoftmaxKernel< IS_LOG > | ) |
void configure | ( | const ITensorInfo * | src, |
const ITensorInfo * | max, | ||
ITensorInfo * | dst, | ||
const float | beta, | ||
ITensorInfo * | tmp | ||
) |
Set the input and output tensors.
[in] | src | Source tensor info. Data types supported: QASYMM8/QASYMM8_SIGNED/F16/F32. |
[in] | max | Max values tensor info. Same shape as input with dimension 0 set to 1. Data types supported: same as input . |
[out] | dst | Destination tensor info. Data types supported: same as input . |
[in] | beta | A scaling factor for the exponent. |
tmp | Auxiliary tensor info. Must be type F32 and same shape as the input. |
Definition at line 312 of file CpuSoftmaxKernel.cpp.
References ARM_COMPUTE_ERROR_ON_NULLPTR, ARM_COMPUTE_ERROR_THROW_ON, arm_compute::auto_init_if_empty(), arm_compute::calculate_max_window(), ITensorInfo::data_type(), arm_compute::F32, arm_compute::get_softmax_output_quantization_info(), arm_compute::is_data_type_quantized_asymmetric(), ITensorInfo::num_dimensions(), ITensorInfo::quantization_info(), ITensorInfo::reset_padding(), TensorInfo::set_data_type(), Dimensions< T >::set_num_dimensions(), TensorInfo::set_quantization_info(), ITensorInfo::set_valid_region(), and ITensorInfo::tensor_shape().
|
overridevirtual |
Name of the kernel.
Implements ICPPKernel.
Definition at line 375 of file CpuSoftmaxKernel.cpp.
|
overridevirtual |
Execute the kernel on the passed window.
[in] | tensors | A vector containing the tensors to operate on. |
[in] | window | Region on which to execute the kernel. (Must be a region of the window returned by window()) |
[in] | info | Info about executing thread and CPU. |
Reimplemented from ICPPKernel.
Definition at line 352 of file CpuSoftmaxKernel.cpp.
References arm_compute::ACL_DST_0, arm_compute::ACL_DST_1, arm_compute::ACL_SRC_0, arm_compute::ACL_SRC_1, ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_ERROR_ON_INVALID_SUBWINDOW, ARM_COMPUTE_ERROR_ON_UNCONFIGURED_KERNEL, ARM_COMPUTE_UNUSED, arm_compute::test::validation::dst, ITensorPack::get_const_tensor(), ITensorPack::get_tensor(), num_elems_processed_per_iteration, ThreadInfo::num_threads, arm_compute::test::validation::src, ThreadInfo::thread_id, and IKernel::window().
|
static |
Static function to check if given info will lead to a valid configuration of CpuLogits1DSoftmaxKernel.
[in] | src | Source tensor info. Data types supported: QASYMM8/QASYMM8_SIGNED/F16/F32. |
[in] | max | Max values tensor info. Same shape as input with dimension 0 set to 1. Data types supported: same as input . |
[in] | dst | Destination tensor info. Data types supported: same as input . |
[in] | beta | A scaling factor for the exponent. |
[in] | tmp | Tensor info of auxiliary. Must be type F32 and same shape as the input. |
Definition at line 342 of file CpuSoftmaxKernel.cpp.
References ARM_COMPUTE_ERROR_ON_NULLPTR, and ARM_COMPUTE_RETURN_ON_ERROR.