Interface for the normalization layer kernel. More...

#include <NENormalizationLayerKernel.h>

Collaboration diagram for NENormalizationLayerKernel:

Public Member Functions
const char *	name () const override
	Name of the kernel. More...

	NENormalizationLayerKernel ()
	Default constructor. More...

	NENormalizationLayerKernel (const NENormalizationLayerKernel &)=delete
	Prevent instances of this class from being copied (As this class contains pointers) More...

NENormalizationLayerKernel &	operator= (const NENormalizationLayerKernel &)=delete
	Prevent instances of this class from being copied (As this class contains pointers) More...

	NENormalizationLayerKernel (NENormalizationLayerKernel &&)=default
	Default Move Constructor. More...

NENormalizationLayerKernel &	operator= (NENormalizationLayerKernel &&)=default
	Default move assignment operator. More...

	~NENormalizationLayerKernel ()=default
	Default destructor. More...

void	configure (const ITensor input, const ITensor input_squared, ITensor *output, NormalizationLayerInfo norm_info)
	Set the input and output tensors. More...

void	run (const Window &window, const ThreadInfo &info) override
	Execute the kernel on the passed window. More...

Public Member Functions inherited from ICPPKernel
virtual	~ICPPKernel ()=default
	Default destructor. More...

virtual void	run_nd (const Window &window, const ThreadInfo &info, const Window &thread_locator)
	legacy compatibility layer for implemantions which do not support thread_locator In these cases we simply narrow the interface down the legacy version More...

virtual void	run_op (ITensorPack &tensors, const Window &window, const ThreadInfo &info)
	Execute the kernel on the passed window. More...

virtual size_t	get_mws (const CPUInfo &platform, size_t thread_count) const
	Return minimum workload size of the relevant kernel. More...

Public Member Functions inherited from IKernel
	IKernel ()
	Constructor. More...

virtual	~IKernel ()=default
	Destructor. More...

virtual bool	is_parallelisable () const
	Indicates whether or not the kernel is parallelisable. More...

virtual BorderSize	border_size () const
	The size of the border for that kernel. More...

const Window &	window () const
	The maximum window the kernel can be executed on. More...

bool	is_window_configured () const
	Function to check if the embedded window of this kernel has been configured. More...

Static Public Member Functions
static Status	validate (const ITensorInfo input, const ITensorInfo input_squared, const ITensorInfo *output, NormalizationLayerInfo norm_info)
	Static function to check if given info will lead to a valid configuration of NENormalizationLayerKernel. More...

Additional Inherited Members
Static Public Attributes inherited from ICPPKernel
static constexpr size_t	default_mws = 1

Detailed Description

Interface for the normalization layer kernel.

Definition at line 35 of file NENormalizationLayerKernel.h.

Constructor & Destructor Documentation

◆ NENormalizationLayerKernel() [1/3]

NENormalizationLayerKernel ( )

Default constructor.

Definition at line 74 of file NENormalizationLayerKernel.cpp.

     : _func(nullptr), _input(nullptr), _input_squared(nullptr), _output(nullptr), _norm_info(NormType::IN_MAP_1D)
 {
 }

References arm_compute::IN_MAP_1D.

◆ NENormalizationLayerKernel() [2/3]

NENormalizationLayerKernel ( const NENormalizationLayerKernel & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

◆ NENormalizationLayerKernel() [3/3]

NENormalizationLayerKernel ( NENormalizationLayerKernel && )

default

Default Move Constructor.

◆ ~NENormalizationLayerKernel()

~NENormalizationLayerKernel ( )

default

Default destructor.

Member Function Documentation

◆ configure()

void configure	(	const ITensor *	input,
		const ITensor *	input_squared,
		ITensor *	output,
		NormalizationLayerInfo	norm_info
	)

Set the input and output tensors.

Parameters

[in]	input	Source tensor. 3 lower dims represent a single input with dimensions [width, height, IFM], and an optional 4th dimension for batch of inputs. Data types supported: FP16/F32. Data layouts supported: NCHW/NHWC.
[in]	input_squared	Source with each element has been squared. 3 lower dims represent a single input with dimensions [width, height, IFM], Data type and layout supported: same as `input`.
[out]	output	Destination tensor. Output will have the same number of dimensions as input. Data type and layout supported: same as `input`.
[in]	norm_info	Normalization layer information like the normalization type, normalization size and other parameters.

Definition at line 79 of file NENormalizationLayerKernel.cpp.

 {
     ARM_COMPUTE_ERROR_ON_NULLPTR(input, input_squared, output);
     // Output tensor auto initialization if not yet initialized
     auto_init_if_empty(*output->info(), *input->info());
  
     // Perform validation step
     ARM_COMPUTE_ERROR_THROW_ON(validate_arguments(input->info(), input_squared->info(), output->info(), norm_info));
  
     const unsigned int norm_idx = get_normalization_dimension_index(input->info()->data_layout(), norm_info);
  
     _input         = input;
     _input_squared = input_squared;
     _output        = output;
     _norm_info     = norm_info;
     switch (_input->info()->data_type())
     {
         case DataType::F32:
         {
             switch (norm_idx)
             {
                 case 0:
                 {
                     if (norm_info.type() == NormType::IN_MAP_2D)
                     {
                         _func = REGISTER_FP32_NEON(cpu::neon_normalize_float32_4_0_2D);
                     }
                     else
                     {
                         _func = REGISTER_FP32_NEON(cpu::neon_normalize_float32_4_0);
                     }
                     break;
                 }
                 case 1:
                     if (norm_info.type() == NormType::IN_MAP_2D)
                     {
                         _func = REGISTER_FP32_NEON(cpu::neon_normalize_float32_4_1_2D);
                     }
                     else
                     {
                         _func = REGISTER_FP32_NEON(cpu::neon_normalize_float32_4_1);
                     }
                     break;
                 case 2:
                     _func = REGISTER_FP32_NEON(cpu::neon_normalize_float32_4_2);
                     break;
                 default:
                     break;
             }
             break;
         }
 #ifdef ARM_COMPUTE_ENABLE_FP16
         case DataType::F16:
         {
             switch (norm_idx)
             {
                 case 0:
                 {
                     if (norm_info.type() == NormType::IN_MAP_2D)
                     {
                         _func = REGISTER_FP16_NEON(cpu::neon_normalize_float16_8_0_2D);
                     }
                     else
                     {
                         _func = REGISTER_FP16_NEON(cpu::neon_normalize_float16_8_0);
                     }
                     break;
                 }
                 case 1:
                     if (norm_info.type() == NormType::IN_MAP_2D)
                     {
                         _func = REGISTER_FP16_NEON(cpu::neon_normalize_float16_8_1_2D);
                     }
                     else
                     {
                         _func = REGISTER_FP16_NEON(cpu::neon_normalize_float16_8_1);
                     }
                     break;
                 case 2:
                     _func = REGISTER_FP16_NEON(cpu::neon_normalize_float16_8_2);
                     break;
                 default:
                     break;
             }
             break;
         }
 #endif /* ARM_COMPUTE_ENABLE_FP16 */
         default:
             ARM_COMPUTE_ERROR("NOT SUPPORTED!");
     }
  
     // Configure kernel window
     Window win = calculate_max_window(*input->info(), Steps());
     INEKernel::configure(win);
 }

◆ name()

const char* name ( ) const

inlineoverridevirtual

Name of the kernel.

Returns: Kernel name

Implements ICPPKernel.

Definition at line 38 of file NENormalizationLayerKernel.h.

     {
         return "NENormalizationLayerKernel";
     }

◆ operator=() [1/2]

NENormalizationLayerKernel& operator= ( const NENormalizationLayerKernel & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

◆ operator=() [2/2]

NENormalizationLayerKernel& operator= ( NENormalizationLayerKernel && )

default

Default move assignment operator.

◆ run()

void run	(	const Window &	window,
		const ThreadInfo &	info
	)

overridevirtual

Execute the kernel on the passed window.

Warning: If is_parallelisable() returns false then the passed window must be equal to window()

Note: The window has to be a region within the window returned by the window() method; The width of the window has to be a multiple of num_elems_processed_per_iteration().

Parameters

[in]	window	Region on which to execute the kernel. (Must be a region of the window returned by window())
[in]	info	Info about executing thread and CPU.

Reimplemented from ICPPKernel.

Definition at line 188 of file NENormalizationLayerKernel.cpp.

 {
     ARM_COMPUTE_UNUSED(info);
     ARM_COMPUTE_ERROR_ON_UNCONFIGURED_KERNEL(this);
     ARM_COMPUTE_ERROR_ON_INVALID_SUBWINDOW(INEKernel::window(), window);
     ARM_COMPUTE_ERROR_ON(_func == nullptr);
  
     // Run function
     (*_func)(window, _input, _input_squared, _output, _norm_info);
 }

References ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_ERROR_ON_INVALID_SUBWINDOW, ARM_COMPUTE_ERROR_ON_UNCONFIGURED_KERNEL, ARM_COMPUTE_UNUSED, arm_compute::test::validation::info, and IKernel::window().

◆ validate()

Status validate	(	const ITensorInfo *	input,
		const ITensorInfo *	input_squared,
		const ITensorInfo *	output,
		NormalizationLayerInfo	norm_info
	)

static

Static function to check if given info will lead to a valid configuration of NENormalizationLayerKernel.

Parameters

[in]	input	Source tensor. 3 lower dims represent a single input with dimensions [width, height, IFM], and an optional 4th dimension for batch of inputs. Data types supported: FP16/F32. Data layouts supported: NCHW/NHWC.
[in]	input_squared	Source with each element has been squared. 3 lower dims represent a single input with dimensions [width, height, IFM], Data type and layout supported: same as `input`.
[in]	output	Destination tensor. Output will have the same number of dimensions as input. Data type and layout supported: same as `input`.
[in]	norm_info	Normalization layer information like the normalization type, normalization size and other parameters.

Returns: a status

Definition at line 178 of file NENormalizationLayerKernel.cpp.

 {
     ARM_COMPUTE_RETURN_ON_ERROR(validate_arguments(input, input_squared, output, norm_info));
  
     return Status{};
 }

References ARM_COMPUTE_RETURN_ON_ERROR, arm_compute::test::validation::input, and arm_compute::cpu::kernels::validate_arguments().

Referenced by NENormalizationLayer::validate().

The documentation for this class was generated from the following files:

src/core/NEON/kernels/NENormalizationLayerKernel.h
src/core/NEON/kernels/NENormalizationLayerKernel.cpp

Public Member Functions

Static Public Member Functions

Additional Inherited Members

Detailed Description

Constructor & Destructor Documentation

◆ NENormalizationLayerKernel() [1/3]

◆ NENormalizationLayerKernel() [2/3]

◆ NENormalizationLayerKernel() [3/3]

◆ ~NENormalizationLayerKernel()

Member Function Documentation

◆ configure()

◆ name()

◆ operator=() [1/2]

◆ operator=() [2/2]

◆ run()

◆ validate()