Basic function to concatenate tensors along a given axis.
More...
#include <GCConcatenateLayer.h>
Basic function to concatenate tensors along a given axis.
This function calls the following kernels:
- GCDepthConcatenateLayerKernel
- Note
- Only axis Z is supported.
- Deprecated:
- This function is deprecated and is intended to be removed in the 21.05 release.
Definition at line 47 of file GCConcatenateLayer.h.
◆ GCConcatenateLayer()
Default constructor.
Definition at line 36 of file GCConcatenateLayer.cpp.
◆ configure()
void configure(std::vector<IGCTensor *> inputs_vector, IGCTensor *output, size_t axis)
Initialise the kernel's inputs vector and output.
- Note
- Input and output tensor dimension preconditions differ depending on the concatenation axis.
- Parameters
[in,out] | inputs_vector | The vector containing all the tensors to concatenate. Data types supported: F16/F32. |
[out] | output | Output tensor. Data types supported: Same as input. |
[in] | axis | Concatenation axis. The only supported underlying concatenation axis is 2 (Z). |
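The shape rule implied by these parameters (via the referenced calculate_concatenate_shape() helper) is that the output matches the inputs on every dimension except the concatenation axis, where the extents are summed. A minimal stand-alone sketch of that rule, with a hypothetical concatenate_shape() in place of the library helper and plain vectors in place of tensor infos:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Shape = std::vector<size_t>; // indexed as {x, y, z}

// Hypothetical stand-in for misc::shape_calculator::calculate_concatenate_shape():
// the output shape equals the input shapes on every dimension except the
// concatenation axis, where the extents are summed.
Shape concatenate_shape(const std::vector<Shape> &inputs, size_t axis)
{
    Shape out = inputs.front();
    for(size_t i = 1; i < inputs.size(); ++i)
    {
        for(size_t d = 0; d < out.size(); ++d)
        {
            if(d == axis)
            {
                out[d] += inputs[i][d]; // extents add up along the concat axis
            }
            else
            {
                assert(out[d] == inputs[i][d]); // all other dimensions must match
            }
        }
    }
    return out;
}
```

For example, concatenating shapes {4, 4, 2} and {4, 4, 3} along axis 2 yields {4, 4, 5}.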
Definition at line 43 of file GCConcatenateLayer.cpp.
References ARM_COMPUTE_ERROR, ARM_COMPUTE_ERROR_ON, arm_compute::auto_init_if_empty(), arm_compute::misc::shape_calculator::calculate_concatenate_shape(), Window::DimZ, ITensor::info(), offset(), and arm_compute::test::validation::output_shape.
_num_inputs = inputs_vector.size();
// ...
for(unsigned int i = 0; i < _num_inputs; ++i)
{
    auto kernel = std::make_unique<GCDepthConcatenateLayerKernel>();
    kernel->configure(inputs_vector.at(i), offset, output);
    offset += inputs_vector.at(i)->info()->dimension(axis);
    _concat_kernels.emplace_back(std::move(kernel));
}
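The loop above gives each kernel a write offset along the concatenation axis and advances that offset by the input's extent on the same axis. A self-contained sketch of that bookkeeping, with a hypothetical concatenate_offsets() and plain shape vectors standing in for ITensorInfo:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for ITensorInfo: each "tensor" is reduced to its
// shape, indexed as {x, y, z}.
using Shape = std::vector<size_t>;

// Mirrors the offset bookkeeping in configure(): each kernel writes its
// input at a running offset along the concatenation axis, and the offset
// then advances by that input's extent on the same axis.
std::vector<size_t> concatenate_offsets(const std::vector<Shape> &inputs, size_t axis)
{
    std::vector<size_t> offsets;
    size_t offset = 0;
    for(const auto &shape : inputs)
    {
        offsets.push_back(offset);     // where this input starts in the output
        offset += shape.at(axis);      // advance past this input's extent
    }
    return offsets;
}
```

For inputs of depth 2, 3, and 5 concatenated along axis 2, the kernels write at offsets 0, 2, and 5 respectively.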
◆ run()
Run the kernels contained in the function.
For Neon kernels:
- Multi-threading is used for the kernels which are parallelisable.
- By default std::thread::hardware_concurrency() threads are used.
- Note
- CPPScheduler::set_num_threads() can be used to manually set the number of threads
For OpenCL kernels:
- All the kernels are enqueued on the queue associated with CLScheduler.
- The queue is then flushed.
- Note
- The function will not block until the kernels are executed. It is the user's responsibility to wait.
- prepare() will be called on the first run if it has not been done already.
Implements IFunction.
Definition at line 74 of file GCConcatenateLayer.cpp.
References GCScheduler::dispatch(), and GCScheduler::get().
for(auto &kernel : _concat_kernels)
{
    GCScheduler::get().dispatch(*kernel);
}
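The run() loop relies on a scheduler singleton: GCScheduler::get() returns the single instance, and dispatch() submits each kernel in order. A minimal stand-in for that pattern, assuming hypothetical Kernel and Scheduler types rather than the Arm Compute Library ones:

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Illustrative kernel: carries only an id so dispatch order can be observed.
struct Kernel
{
    explicit Kernel(int id) : id(id) {}
    int id;
};

// Hypothetical stand-in for GCScheduler: a Meyers singleton whose
// dispatch() records the order in which kernels are submitted.
class Scheduler
{
public:
    static Scheduler &get()
    {
        static Scheduler instance; // one instance per process, like GCScheduler::get()
        return instance;
    }
    void dispatch(const Kernel &kernel)
    {
        executed.push_back(kernel.id); // stand-in for enqueueing the GPU work
    }
    std::vector<int> executed; // record of dispatch order
private:
    Scheduler() = default;
};

// Mirrors the shape of GCConcatenateLayer::run(): dispatch every configured kernel.
void run(const std::vector<std::unique_ptr<Kernel>> &kernels)
{
    for(const auto &kernel : kernels)
    {
        Scheduler::get().dispatch(*kernel);
    }
}
```

Because the kernels are dispatched sequentially, each depth slice of the output is written in the same order the inputs were configured.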
The documentation for this class was generated from the following files: