Basic function to run cpu::CpuMul.

#include <NEPixelWiseMultiplication.h>

Public Member Functions
	NEPixelWiseMultiplication ()
	Default Constructor.

	~NEPixelWiseMultiplication ()
	Default Destructor.

	NEPixelWiseMultiplication (const NEPixelWiseMultiplication &)=delete
	Prevent instances of this class from being copied (As this class contains pointers)

	NEPixelWiseMultiplication (NEPixelWiseMultiplication &&)=default
	Default move constructor.

NEPixelWiseMultiplication &	operator= (const NEPixelWiseMultiplication &)=delete
	Prevent instances of this class from being copied (As this class contains pointers)

NEPixelWiseMultiplication &	operator= (NEPixelWiseMultiplication &&)=default
	Default move assignment operator.

void	configure (const ITensor *input1, const ITensor *input2, ITensor *output, float scale, ConvertPolicy overflow_policy, RoundingPolicy rounding_policy, const ActivationLayerInfo &act_info=ActivationLayerInfo())
	Initialise the kernel's inputs, output and conversion policy.

void	run () override
	Run the kernels contained in the function.

Public Member Functions inherited from IFunction
virtual	~IFunction ()=default
	Destructor.

virtual void	prepare ()
	Prepare the function for executing.

Static Public Member Functions
static Status	validate (const ITensorInfo *input1, const ITensorInfo *input2, const ITensorInfo *output, float scale, ConvertPolicy overflow_policy, RoundingPolicy rounding_policy, const ActivationLayerInfo &act_info=ActivationLayerInfo())
	Static function to check if given info will lead to a valid configuration of NEPixelWiseMultiplication.

Detailed Description

Basic function to run cpu::CpuMul.

Definition at line 40 of file NEPixelWiseMultiplication.h.

Constructor & Destructor Documentation

◆ NEPixelWiseMultiplication() [1/3]

NEPixelWiseMultiplication ( )

Default Constructor.

Definition at line 42 of file NEPixelWiseMultiplication.cpp.

 NEPixelWiseMultiplication::NEPixelWiseMultiplication() : _impl(std::make_unique<Impl>())
 {
 }

◆ ~NEPixelWiseMultiplication()

~NEPixelWiseMultiplication ( )

default

Default Destructor.

◆ NEPixelWiseMultiplication() [2/3]

NEPixelWiseMultiplication ( const NEPixelWiseMultiplication & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

◆ NEPixelWiseMultiplication() [3/3]

NEPixelWiseMultiplication ( NEPixelWiseMultiplication && )

default

Default move constructor.
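
The special member functions above follow the usual pattern for a class that owns its state through a pointer: copying is deleted and moving is defaulted. The following is a minimal standalone sketch of that pattern, not the library's code. The class name `PimplFunction`, its `Impl` contents and the `has_state()` helper are hypothetical and exist only for illustration.

```cpp
#include <memory>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for a function object that owns its state via a pointer.
class PimplFunction
{
public:
    PimplFunction();
    ~PimplFunction();
    PimplFunction(const PimplFunction &) = delete;            // no copies
    PimplFunction(PimplFunction &&) noexcept;                 // moves allowed
    PimplFunction &operator=(const PimplFunction &) = delete; // no copy assignment
    PimplFunction &operator=(PimplFunction &&) noexcept;      // move assignment allowed

    // Illustration-only helper: true while this object still owns its state.
    bool has_state() const
    {
        return _impl != nullptr;
    }

private:
    struct Impl;
    std::unique_ptr<Impl> _impl;
};

// Placeholder state; a real implementation would hold tensors and an operator here.
struct PimplFunction::Impl
{
    int placeholder{0};
};

// Defined after Impl is complete, as std::unique_ptr requires for destruction.
PimplFunction::PimplFunction() : _impl(std::make_unique<Impl>())
{
}
PimplFunction::~PimplFunction()                                    = default;
PimplFunction::PimplFunction(PimplFunction &&) noexcept            = default;
PimplFunction &PimplFunction::operator=(PimplFunction &&) noexcept = default;

static_assert(!std::is_copy_constructible<PimplFunction>::value, "copying is disabled");
static_assert(std::is_move_constructible<PimplFunction>::value, "moving is allowed");
```

Moving transfers ownership of the state, so in this sketch a moved-from object reports `has_state() == false`.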

Member Function Documentation

◆ configure()

void configure	(	const ITensor *	input1,
		const ITensor *	input2,
		ITensor *	output,
		float	scale,
		ConvertPolicy	overflow_policy,
		RoundingPolicy	rounding_policy,
		const ActivationLayerInfo &	act_info = `ActivationLayerInfo()`
	)

Initialise the kernel's inputs, output and conversion policy.

Valid data layouts:

All

Valid data type configurations:

src0	src1	dst
QASYMM8	QASYMM8	QASYMM8
QASYMM8_SIGNED	QASYMM8_SIGNED	QASYMM8_SIGNED
QSYMM16	QSYMM16	QSYMM16
QSYMM16	QSYMM16	S32
U8	U8	U8
U8	U8	S16
U8	S16	S16
S16	U8	S16
S16	S16	S16
F16	F16	F16
F32	F32	F32

Note: For scale equal to 1/255 only round to nearest even (implemented as round half up) is supported. For all other scale values only round to zero (implemented as round towards minus infinity) is supported.

Parameters

[in,out]	input1	An input tensor. Data types supported: U8/QASYMM8/QASYMM8_SIGNED/S16/S32/QSYMM16/F16/F32. This input tensor is [in, out] because its TensorInfo might be modified inside the kernel in case of broadcasting of dimension 0.
[in,out]	input2	An input tensor. Data types supported: U8, QASYMM8 (only if `input1` is QASYMM8), QASYMM8_SIGNED (only if `input1` is QASYMM8_SIGNED), S16, S32, QSYMM16 (only if `input1` is QSYMM16), F16 (only if `input1` is F16), F32 (only if `input1` is F32). This input tensor is [in, out] because its TensorInfo might be modified inside the kernel in case of broadcasting of dimension 0.
[out]	output	Output tensor. Data types supported: U8, only if both inputs are U8. QASYMM8, only if both inputs are QASYMM8. QASYMM8_SIGNED, only if `input1` is QASYMM8_SIGNED. S16. QSYMM16, only if both inputs are QSYMM16. S32, only if both inputs are S32 or both are QSYMM16. F16, only if `input1` is F16. F32, only if both inputs are F32.
[in]	scale	Scale to apply after multiplication. Scale must be positive and its value must be either 1/255 or 1/2^n where n is between 0 and 15. If `input1`, `input2` and `output` are all of data type S32, scale cannot be 1/255.
[in]	overflow_policy	Overflow policy. ConvertPolicy cannot be WRAP if any of the inputs is of a quantized data type.
[in]	rounding_policy	Rounding policy.
[in]	act_info	(Optional) Activation layer information in case of a fused activation. Currently not supported.
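
To make the interaction between `scale` and `overflow_policy` concrete, here is a scalar sketch for the U8 × U8 → U8 case with a power-of-two scale. It is an illustration of the documented semantics under stated assumptions, not the library kernel. The enum `Overflow` and the function `mul_u8_pow2` are hypothetical names, and the scale is passed as the exponent n of 1/2^n.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical local enum mirroring the two overflow behaviours named in the text.
enum class Overflow
{
    Wrap,
    Saturate
};

// Multiply two U8 values, scale by 1/2^n (n in [0, 15]) and narrow back to U8.
inline std::uint8_t mul_u8_pow2(std::uint8_t a, std::uint8_t b, int n, Overflow policy)
{
    // The widened product is at most 255 * 255, so it cannot overflow 32 bits.
    const std::uint32_t product = static_cast<std::uint32_t>(a) * b;
    // A right shift truncates, which rounds towards zero for non-negative values.
    const std::uint32_t scaled = product >> n;
    if (policy == Overflow::Saturate)
    {
        // Clamp to the largest representable U8 value.
        return static_cast<std::uint8_t>(std::min<std::uint32_t>(scaled, 255u));
    }
    // Wrap: keep only the low 8 bits.
    return static_cast<std::uint8_t>(scaled & 0xFFu);
}
```

For example, with n = 0 the product 200 × 200 = 40000 saturates to 255 under `Overflow::Saturate` and wraps to 64 under `Overflow::Wrap`.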

Definition at line 58 of file NEPixelWiseMultiplication.cpp.

 {
     _impl->src_0 = input1;
     _impl->src_1 = input2;
     _impl->dst   = output;
     _impl->op    = std::make_unique<cpu::CpuMul>();
     _impl->op->configure(input1->info(), input2->info(), output->info(), scale, overflow_policy, rounding_policy,
                          act_info);
 }

References arm_compute::test::validation::act_info, ITensor::info(), and arm_compute::test::validation::scale.

Referenced by NENormalizationLayer::configure(), NELSTMLayerQuantized::configure(), NELSTMLayer::configure(), and NEQLSTMLayer::configure().

◆ operator=() [1/2]

NEPixelWiseMultiplication& operator= ( const NEPixelWiseMultiplication & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

◆ operator=() [2/2]

NEPixelWiseMultiplication& operator= ( NEPixelWiseMultiplication && )

default

Default move assignment operator.

◆ run()

void run ( )

override, virtual

Run the kernels contained in the function.

For CPU kernels:

Multi-threading is used for the kernels which are parallelisable.
By default std::thread::hardware_concurrency() threads are used.

Note: CPPScheduler::set_num_threads() can be used to manually set the number of threads

For OpenCL kernels:

All the kernels are enqueued on the queue associated with CLScheduler.
The queue is then flushed.

Note: The function will not block until the kernels are executed. It is the user's responsibility to wait.

Note: prepare() is called on the first run if it has not been called already.

Implements IFunction.

Definition at line 74 of file NEPixelWiseMultiplication.cpp.

 {
     ITensorPack pack;
     pack.add_tensor(TensorType::ACL_SRC_0, _impl->src_0);
     pack.add_tensor(TensorType::ACL_SRC_1, _impl->src_1);
     pack.add_tensor(TensorType::ACL_DST, _impl->dst);
     _impl->op->run(pack);
 }

References arm_compute::ACL_DST, arm_compute::ACL_SRC_0, arm_compute::ACL_SRC_1, ITensorPack::add_tensor(), and arm_compute::test::validation::pack.

Referenced by NENormalizationLayer::run(), NELSTMLayerQuantized::run(), NELSTMLayer::run(), and NEQLSTMLayer::run().

◆ validate()

Status validate	(	const ITensorInfo *	input1,
		const ITensorInfo *	input2,
		const ITensorInfo *	output,
		float	scale,
		ConvertPolicy	overflow_policy,
		RoundingPolicy	rounding_policy,
		const ActivationLayerInfo &	act_info = `ActivationLayerInfo()`
	)

static

Static function to check if given info will lead to a valid configuration of NEPixelWiseMultiplication.

Note: For scale equal to 1/255 only round to nearest even (implemented as round half up) is supported. For all other scale values only round to zero (implemented as round towards minus infinity) is supported.

Parameters

[in]	input1	An input tensor info. Data types supported: U8/QASYMM8/QASYMM8_SIGNED/S16/S32/QSYMM16/F16/F32
[in]	input2	An input tensor info. Data types supported: U8, QASYMM8 (only if `input1` is QASYMM8), QASYMM8_SIGNED (only if `input1` is QASYMM8_SIGNED), S16, S32, QSYMM16 (only if `input1` is QSYMM16), F16 (only if `input1` is F16), F32 (only if `input1` is F32).
[in]	output	Output tensor info. Data types supported: U8, only if both inputs are U8. QASYMM8, only if both inputs are QASYMM8. QASYMM8_SIGNED, only if `input1` is QASYMM8_SIGNED. S16. QSYMM16, only if both inputs are QSYMM16. S32, only if both inputs are S32 or both are QSYMM16. F16, only if `input1` is F16. F32, only if both inputs are F32.
[in]	scale	Scale to apply after multiplication. Scale must be positive and its value must be either 1/255 or 1/2^n where n is between 0 and 15. If `input1`, `input2` and `output` are all of data type S32, scale cannot be 1/255.
[in]	overflow_policy	Overflow policy. ConvertPolicy cannot be WRAP if any of the inputs is of a quantized data type.
[in]	rounding_policy	Rounding policy.
[in]	act_info	(Optional) Activation layer information in case of a fused activation. Currently not supported.

Returns: a status

Definition at line 47 of file NEPixelWiseMultiplication.cpp.

 {
     return cpu::CpuMul::validate(input1, input2, output, scale, overflow_policy, rounding_policy, act_info);
 }

References arm_compute::test::validation::act_info, arm_compute::test::validation::scale, and CpuMul::validate().

Referenced by NENormalizationLayer::validate(), NELSTMLayerQuantized::validate(), NELSTMLayer::validate(), and NEQLSTMLayer::validate().
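
One of the conditions documented for validate() is the constraint on `scale`: it must be positive and equal to either 1/255 or 1/2^n with n between 0 and 15. The following standalone sketch shows one way to test that constraint before calling validate(). The function name `is_supported_scale` and the comparison tolerance for 1/255 are assumptions made for illustration, and the check does not cover the S32 restriction or any other validation rule.

```cpp
#include <cmath>

// Returns true if scale is 1/255 (within an assumed tolerance) or exactly 1/2^n, n in [0, 15].
inline bool is_supported_scale(float scale)
{
    // Reject zero, negative values and NaN.
    if (!(scale > 0.0f))
    {
        return false;
    }
    // Assumed tolerance for the 1/255 case; 1/256 lies outside it and is handled below.
    if (std::fabs(scale - 1.0f / 255.0f) < 1e-5f)
    {
        return true;
    }
    // frexp splits scale into mantissa * 2^exponent with mantissa in [0.5, 1).
    int         exponent = 0;
    const float mantissa = std::frexp(scale, &exponent);
    // 1/2^n equals 0.5 * 2^(1 - n), so n in [0, 15] maps to exponent in [-14, 1].
    return mantissa == 0.5f && exponent >= -14 && exponent <= 1;
}
```

Under these assumptions, 1.0f, 1.0f / 255.0f and 1.0f / 32768.0f are accepted, while 2.0f, 0.3f and 1.0f / 65536.0f are rejected.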

The documentation for this class was generated from the following files:

arm_compute/runtime/NEON/functions/NEPixelWiseMultiplication.h
src/runtime/NEON/functions/NEPixelWiseMultiplication.cpp
