Basic function to execute GEMMLowpQuantizeDown kernels on CL. More...

#include <CLGEMMLowpOutputStage.h>

Collaboration diagram for CLGEMMLowpOutputStage:

Public Member Functions
	CLGEMMLowpOutputStage ()

	CLGEMMLowpOutputStage (const CLGEMMLowpOutputStage &)=delete
	Prevent instances of this class from being copied (As this class contains pointers) More...

	CLGEMMLowpOutputStage (CLGEMMLowpOutputStage &&)
	Default move constructor. More...

CLGEMMLowpOutputStage &	operator= (const CLGEMMLowpOutputStage &)=delete
	Prevent instances of this class from being copied (As this class contains pointers) More...

CLGEMMLowpOutputStage &	operator= (CLGEMMLowpOutputStage &&)
	Default move assignment operator. More...

	~CLGEMMLowpOutputStage ()
	Default destructor. More...

void	configure (const ICLTensor input, const ICLTensor bias, ICLTensor *output, const GEMMLowpOutputStageInfo &info)
	Initialise the kernel's inputs, output. More...

void	configure (const CLCompileContext &compile_context, const ICLTensor input, const ICLTensor bias, ICLTensor *output, const GEMMLowpOutputStageInfo &info)
	Initialise the kernel's inputs, output. More...

void	run () override
	Run the kernels contained in the function. More...

Public Member Functions inherited from IFunction
virtual	~IFunction ()=default
	Destructor. More...

virtual void	prepare ()
	Prepare the function for executing. More...

Static Public Member Functions
static Status	validate (const ITensorInfo input, const ITensorInfo bias, const ITensorInfo *output, const GEMMLowpOutputStageInfo &info)
	Static function to check if given info will lead to a valid configuration of opencl::kernels::ClGemmLowpQuantizeDownInt32ScaleByFixedPointKernel. More...

Detailed Description

Basic function to execute GEMMLowpQuantizeDown kernels on CL.

This function calls the following CL kernels:

Definition at line 56 of file CLGEMMLowpOutputStage.h.

Constructor & Destructor Documentation

◆ CLGEMMLowpOutputStage() [1/3]

CLGEMMLowpOutputStage ( )

Definition at line 50 of file CLGEMMLowpOutputStage.cpp.

                                              : _impl(std::make_unique<Impl>())
 {
 }

◆ CLGEMMLowpOutputStage() [2/3]

CLGEMMLowpOutputStage ( const CLGEMMLowpOutputStage & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

◆ CLGEMMLowpOutputStage() [3/3]

CLGEMMLowpOutputStage ( CLGEMMLowpOutputStage && )

default

Default move constructor.

◆ ~CLGEMMLowpOutputStage()

~CLGEMMLowpOutputStage ( )

default

Default destructor.

Member Function Documentation

◆ configure() [1/2]

void configure	(	const CLCompileContext &	compile_context,
		const ICLTensor *	input,
		const ICLTensor *	bias,
		ICLTensor *	output,
		const GEMMLowpOutputStageInfo &	info
	)

Initialise the kernel's inputs, output.

Parameters

[in]	compile_context	The compile context to be used.
[in]	input	Input tensor. Data type supported: S32
[in]	bias	Biases tensor. Only shared biases supported and it can be a nullptr if the biases addition is not required. Biases are 1D tensor with dimensions [OFM]. Data type supported: Same as `input`.
[out]	output	Output tensor. Data type supported: QASYMM8/QASYMM8_SIGNED
[in]	info	GEMMLowp output stage metadata.

Definition at line 65 of file CLGEMMLowpOutputStage.cpp.

 {
     ARM_COMPUTE_ERROR_ON_NULLPTR(input, output);
  
     _impl->src  = input;
     _impl->bias = bias;
     _impl->dst  = output;
  
     _impl->op = std::make_unique<opencl::ClGemmLowpOutputStage>();
     _impl->op->configure(compile_context, input->info(), bias != nullptr ? bias->info() : nullptr, output->info(),
                          info);
     _impl->run_pack = {{ACL_SRC, _impl->src}, {ACL_BIAS, _impl->bias}, {ACL_DST, _impl->dst}};
 }

References arm_compute::ACL_BIAS, arm_compute::ACL_DST, arm_compute::ACL_SRC, ARM_COMPUTE_ERROR_ON_NULLPTR, bias, ITensor::info(), arm_compute::test::validation::info, and arm_compute::test::validation::input.

◆ configure() [2/2]

void configure	(	const ICLTensor *	input,
		const ICLTensor *	bias,
		ICLTensor *	output,
		const GEMMLowpOutputStageInfo &	info
	)

Initialise the kernel's inputs, output.

Valid data layouts:

All

Valid data type configurations:

src0	src1	dst
S32	S32	QASYMM8
S32	S32	QASYMM8_SIGNED
S32	S32	QSYMM16

Parameters

[in]	input	Input tensor. Data type supported: S32
[in]	bias	Biases tensor. Only shared biases supported and it can be a nullptr if the biases addition is not required. Biases are 1D tensor with dimensions [OFM]. Data type supported: Same as `input`.
[out]	output	Output tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/QSYMM16
[in]	info	GEMMLowp output stage metadata.

Definition at line 57 of file CLGEMMLowpOutputStage.cpp.

 {
     configure(CLKernelLibrary::get().get_compile_context(), input, bias, output, info);
 }

References bias, CLKernelLibrary::get(), arm_compute::test::validation::info, and arm_compute::test::validation::input.

Referenced by CLGEMMDeconvolutionLayer::configure(), CLLSTMLayerQuantized::configure(), and CLQLSTMLayer::configure().

◆ operator=() [1/2]

CLGEMMLowpOutputStage & operator= ( CLGEMMLowpOutputStage && )

default

Default move assignment operator.

◆ operator=() [2/2]

CLGEMMLowpOutputStage& operator= ( const CLGEMMLowpOutputStage & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

◆ run()

void run ( )

overridevirtual

Run the kernels contained in the function.

For CPU kernels:

Multi-threading is used for the kernels which are parallelisable.
By default std::thread::hardware_concurrency() threads are used.

Note: CPPScheduler::set_num_threads() can be used to manually set the number of threads

For OpenCL kernels:

All the kernels are enqueued on the queue associated with CLScheduler.
The queue is then flushed.

Note: The function will not block until the kernels are executed. It is the user's responsibility to wait.; Will call prepare() on first run if hasn't been done

Implements IFunction.

Definition at line 91 of file CLGEMMLowpOutputStage.cpp.

 {
     _impl->op->run(_impl->run_pack);
 }

Referenced by CLGEMMDeconvolutionLayer::run(), CLLSTMLayerQuantized::run(), and CLQLSTMLayer::run().

◆ validate()

Status validate	(	const ITensorInfo *	input,
		const ITensorInfo *	bias,
		const ITensorInfo *	output,
		const GEMMLowpOutputStageInfo &	info
	)

static

Static function to check if given info will lead to a valid configuration of opencl::kernels::ClGemmLowpQuantizeDownInt32ScaleByFixedPointKernel.

Parameters

[in]	input	Input tensor. It is the output of CLGEMMLowpMatrixMultiplyCore function. Data type supported: S32
[in]	bias	Biases tensor. Only shared biases supported and it can be a nullptr if the addition of biases is not required. Biases are 1D tensor with dimensions [OFM]. Data type supported: Same as `input`.
[in]	output	Output tensor. Data type supported: QASYMM8/QASYMM8_SIGNED
[in]	info	GEMMLowp output stage metadata.

Returns: a status

Definition at line 83 of file CLGEMMLowpOutputStage.cpp.

 {
     return opencl::ClGemmLowpOutputStage::validate(input, bias, output, info);
 }

References bias, arm_compute::test::validation::info, arm_compute::test::validation::input, and ClGemmLowpOutputStage::validate().

Referenced by CLGEMMDeconvolutionLayer::validate(), CLLSTMLayerQuantized::validate(), and CLQLSTMLayer::validate().

The documentation for this class was generated from the following files:

arm_compute/runtime/CL/functions/CLGEMMLowpOutputStage.h
src/runtime/CL/functions/CLGEMMLowpOutputStage.cpp

Public Member Functions

Static Public Member Functions

Detailed Description

Constructor & Destructor Documentation

◆ CLGEMMLowpOutputStage() [1/3]

◆ CLGEMMLowpOutputStage() [2/3]

◆ CLGEMMLowpOutputStage() [3/3]

◆ ~CLGEMMLowpOutputStage()

Member Function Documentation

◆ configure() [1/2]

◆ configure() [2/2]

◆ operator=() [1/2]

◆ operator=() [2/2]

◆ run()

◆ validate()