21.02
|
Interface to add a bias to each row of the input tensor. More...
#include <GCGEMMMatrixAccumulateBiasesKernel.h>
Public Member Functions | |
GCGEMMMatrixAccumulateBiasesKernel () | |
Default constructor. More... | |
GCGEMMMatrixAccumulateBiasesKernel (const GCGEMMMatrixAccumulateBiasesKernel &)=delete | |
Prevent instances of this class from being copied (As this class contains pointers) More... | |
GCGEMMMatrixAccumulateBiasesKernel & | operator= (const GCGEMMMatrixAccumulateBiasesKernel &)=delete |
Prevent instances of this class from being copied (As this class contains pointers) More... | |
GCGEMMMatrixAccumulateBiasesKernel (GCGEMMMatrixAccumulateBiasesKernel &&)=default | |
Allow instances of this class to be moved. More... | |
GCGEMMMatrixAccumulateBiasesKernel & | operator= (GCGEMMMatrixAccumulateBiasesKernel &&)=default |
Allow instances of this class to be moved. More... | |
void | configure (IGCTensor *accum, const IGCTensor *biases) |
Set the accumulate buffer and the biases of the kernel. More... | |
void | run (const Window &window) override |
Enqueue the OpenGL ES shader to process the given window. More... | |
Public Member Functions inherited from IGCKernel | |
IGCKernel () | |
Constructor. More... | |
GCKernel & | kernel () |
Returns a reference to the GLES kernel of this object. More... | |
void | add_1D_tensor_argument (unsigned int &idx, const IGCTensor *tensor, const unsigned int binding_point, const Window &window) |
Add the passed 1D tensor's parameters to the object's kernel's arguments starting from the index idx. More... | |
void | add_2D_tensor_argument (unsigned int &idx, const IGCTensor *tensor, const unsigned int binding_point, const Window &window) |
Add the passed 2D tensor's parameters to the object's kernel's arguments starting from the index idx. More... | |
void | add_3D_tensor_argument (unsigned int &idx, const IGCTensor *tensor, const unsigned int binding_point, const Window &window) |
Add the passed 3D tensor's parameters to the object's kernel's arguments starting from the index idx. More... | |
unsigned int | num_arguments_per_1D_tensor () const |
Returns the number of arguments enqueued per 1D tensor object. More... | |
unsigned int | num_arguments_per_2D_tensor () const |
Returns the number of arguments enqueued per 2D tensor object. More... | |
unsigned int | num_arguments_per_3D_tensor () const |
Returns the number of arguments enqueued per 3D tensor object. More... | |
void | set_lws_hint (gles::NDRange &lws_hint) |
Set the Local-Workgroup-Size hint. More... | |
void | set_target (GPUTarget target) |
Set the targeted GPU architecture. More... | |
GPUTarget | get_target () const |
Get the targeted GPU architecture. More... | |
Public Member Functions inherited from IKernel | |
IKernel () | |
Constructor. More... | |
virtual | ~IKernel ()=default |
Destructor. More... | |
virtual bool | is_parallelisable () const |
Indicates whether or not the kernel is parallelisable. More... | |
virtual BorderSize | border_size () const |
The size of the border for that kernel. More... | |
const Window & | window () const |
The maximum window the kernel can be executed on. More... | |
Interface to add a bias to each row of the input tensor.
Definition at line 34 of file GCGEMMMatrixAccumulateBiasesKernel.h.
Default constructor.
Definition at line 42 of file GCGEMMMatrixAccumulateBiasesKernel.cpp.
References arm_compute::utils::cast::U.
|
delete |
Prevent instances of this class from being copied (As this class contains pointers)
Allow instances of this class to be moved.
Set the accumulate buffer and the biases of the kernel.
[in,out] | accum | The accumulate tensor to convert. Data types supported: F16/F32 |
[in] | biases | The shared biases tensor to append. It must be 1D tensor. Data types supported: Same as input |
Definition at line 47 of file GCGEMMMatrixAccumulateBiasesKernel.cpp.
References ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_ERROR_ON_DATA_TYPE_CHANNEL_NOT_IN, ARM_COMPUTE_ERROR_ON_MISMATCHING_DATA_TYPES, arm_compute::calculate_max_enlarged_window(), arm_compute::ceil_to_multiple(), GCKernelLibrary::create_kernel(), ITensorInfo::data_type(), ITensorInfo::dimension(), arm_compute::F16, arm_compute::F32, GCKernelLibrary::get(), ITensor::info(), ITensorInfo::num_dimensions(), num_elems_processed_per_iteration, arm_compute::support::cpp11::to_string(), and arm_compute::update_window_and_padding().
Referenced by GCFullyConnectedLayer::configure().
|
delete |
Prevent instances of this class from being copied (As this class contains pointers)
|
default |
Allow instances of this class to be moved.
|
overridevirtual |
Enqueue the OpenGL ES shader to process the given window.
[in] | window | Region on which to execute the kernel. (Must be a valid region of the window returned by window()). |
Implements IGCKernel.
Definition at line 106 of file GCGEMMMatrixAccumulateBiasesKernel.cpp.
References IGCKernel::add_1D_tensor_argument(), IGCKernel::add_2D_tensor_argument(), ARM_COMPUTE_ERROR_ON_MISMATCHING_WINDOWS, ARM_COMPUTE_ERROR_ON_UNCONFIGURED_KERNEL, Window::DimY, arm_compute::enqueue(), Window::first_slice_window_2D(), Window::set(), Window::slide_window_slice_2D(), and IKernel::window().