Namespaces
	assembly_utils

	cl_direct_conv

	cl_dwc

	cl_gemm

	cl_indirect_conv

	cl_matmul

	cl_tuner

	cpu

	cpuinfo

	detail

	experimental

	gpu

	graph

	graph_utils

	helpers

	io

	kernels

	logging

	misc

	mlgo

	opencl
	Interface to convert the 2D Fully Connected weights from NCHW to NHWC or vice versa.

	quantization

	scale_helpers

	scale_utils

	scheduler_utils

	softmax_helpers

	support

	test

	utility

	utils

	weights_transformations

	wrapper

Data Structures
class	AccessWindowAutoPadding
	Dummy access window. More...

class	AccessWindowHorizontal
	Implementation of a row access pattern. More...

class	AccessWindowRectangle
	Implementation of a rectangular access pattern. More...

class	AccessWindowStatic
	Implementation of a static rectangular access pattern. More...

class	AccessWindowTranspose
	Implementation of a XY-transpose access pattern. More...

class	AccessWindowVertical
	Implementation of a column access pattern. More...

class	ActivationLayerInfo
	Activation Layer Information class. More...

class	Allocator
	Default malloc allocator implementation. More...

class	AllocatorWrapper
	Default malloc allocator implementation. More...

class	Array
	Basic implementation of the IArray interface which allocates a static number of T values More...

class	bfloat16
	Brain floating point representation class. More...

struct	BlobInfo
	Meta-data information for each blob. More...

class	BlobLifetimeManager
	Concrete class that tracks the lifetime of registered tensors and calculates the systems memory requirements in terms of blobs. More...

class	BlobMemoryPool
	Blob memory pool. More...

struct	BorderSize
	Container for 2D border size. More...

class	BoundingBoxTransformInfo
	Bounding Box Transform information class. More...

class	BoxNMSLimitInfo
	BoxWithNonMaximaSuppressionLimit Information class. More...

class	CLAbsLayer
	Basic function to get the absolute value of an input tensor. More...

class	CLActivationLayer
	Basic function to run opencl::kernels::ClActivationKernel. More...

class	CLArgMinMaxLayer
	Function to calculate the index of the minimum or maximum values in a tensor based on an axis. More...

class	CLArgMinMaxLayerKernel
	Interface for the reduction operation kernel. More...

class	CLArithmeticAddition
	Basic function to run opencl::kernels::ClSaturatedArithmeticKernel for addition. More...

class	CLArithmeticDivision
	Basic function to run opencl::kernels::ClSaturatedArithmeticKernel for division. More...

class	CLArithmeticSubtraction
	Basic function to run opencl::kernels::ClSaturatedArithmeticKernel for subtraction. More...

class	CLArray
	CLArray implementation More...

class	CLBatchNormalizationLayer
	Basic function to run CLNormalizationLayerKernel and simulate a batch normalization layer. More...

class	CLBatchNormalizationLayerKernel
	Interface for the BatchNormalization layer kernel. More...

class	CLBatchToSpaceLayer
	Basic function to run CLBatchToSpaceLayerKernel. More...

class	CLBatchToSpaceLayerKernel
	Interface for the batch to space kernel. More...

class	CLBitwiseAnd
	Basic function to perform bitwise AND by running CLBitwiseKernel. More...

class	CLBitwiseKernel
	Interface for the bitwise operation kernel. More...

class	CLBitwiseNot
	Basic function to perform bitwise NOT by running CLBitwiseKernel. More...

class	CLBitwiseOr
	Basic function to perform bitwise OR by running CLBitwiseKernel. More...

class	CLBitwiseXor
	Basic function to perform bitwise XOR by running CLBitwiseKernel. More...

class	CLBoundingBoxTransform
	Basic function to run CLBoundingBoxTransformKernel. More...

class	CLBoundingBoxTransformKernel
	Interface for the bounding box kernel. More...

class	CLBufferAllocator
	Default OpenCL cl buffer allocator implementation. More...

class	CLBufferMemoryRegion
	OpenCL buffer memory region implementation. More...

class	CLBuildOptions
	Build options. More...

class	CLCast
	Basic function to run opencl::kernels::ClCastKernel. More...

class	CLChannelShuffleLayer
	Basic function to run CLChannelShuffleLayerKernel. More...

class	CLChannelShuffleLayerKernel
	Interface for the channel shuffle kernel. More...

class	CLCoarseSVMMemoryRegion
	OpenCL coarse-grain SVM memory region implementation. More...

class	CLCommandBuffer
	Command buffer contains a list of commands that is constructed once and later enqueued multiple times. More...

class	CLComparison
	Basic function to run CLComparisonKernel. More...

class	CLComparisonKernel
	Interface for the comparison kernel. More...

class	CLComparisonStatic
	Basic function to run CLComparisonKernel. More...

class	CLCompatCommandBuffer
	Command buffer implementation for platform without mutable dispatch command buffer extension. More...

class	CLCompileContext
	CLCompileContext class. More...

class	CLComplexPixelWiseMultiplication
	Basic function to run opencl::ClComplexMul. More...

class	CLComputeAllAnchorsKernel
	Interface for Compute All Anchors kernel. More...

class	CLComputeMeanVariance
	Interface for compute Mean and Variance per channel. More...

class	CLConcatenateLayer
	Basic function to execute concatenate tensors along a given axis. More...

class	CLConv3D
	Basic function to compute the convolution3d layer. More...

class	CLConvertFullyConnectedWeights
	Basic function to run an opencl::kernels::ClConvertFullyConnectedWeightsKernel. More...

class	CLConvolutionLayer
	Basic function to compute the convolution layer. More...

class	CLCopy
	Basic function to run opencl::kernels::ClCopyKernel. More...

class	CLCrop
	Basic function to run opencl::kernels::ClCropKernel. More...

class	CLCropResize
	Function to perform cropping and resizing. More...

class	CLDeconvolutionLayer
	Basic function to compute the deconvolution layer. More...

class	CLDeconvolutionLayerUpsample
	Basic function to execute deconvolution upsample on OpenCL. More...

class	CLDeconvolutionLayerUpsampleKernel
	Interface for the Deconvolution layer kernel on OpenCL. More...

class	CLDeconvolutionReshapeOutputKernel
	Interface for the OpenCL kernel to be used for reshaping the tensor before returning the result of deconvolution. More...

class	CLDepthConvertLayer
	Basic function to run opencl::kernels::ClCastKernel. More...

class	CLDepthToSpaceLayer
	Basic function to run CLDepthToSpaceLayerKernel. More...

class	CLDepthToSpaceLayerKernel
	Interface for the depth to space kernel. More...

class	CLDepthwiseConvolutionLayer
	Function to execute a depthwise convolution. More...

class	CLDepthwiseConvolutionLayerNativeKernel
	Interface for the kernel to run a MxN depthwise convolution. More...

class	CLDequantizationLayer
	Basic function to run opencl::ClDequantize that dequantizes an input tensor. More...

class	CLDevice
	OpenCL device type class. More...

struct	CLDeviceOptions
	OpenCL device options. More...

class	CLDirectConvolutionLayer
	Basic function to execute direct convolution function: More...

class	CLDirectDeconvolutionLayer
	Function to run the deconvolution layer. More...

class	CLElementwiseMax
	Basic function to run opencl::kernels::ClArithmeticKernel for max. More...

class	CLElementwiseMin
	Basic function to run opencl::kernels::ClArithmeticKernel for min. More...

class	CLElementwisePower
	Basic function to run opencl::kernels::ClArithmeticKernel for power. More...

class	CLElementwiseSquaredDiff
	Basic function to run opencl::kernels::ClArithmeticKernel for squared difference. More...

class	CLExpLayer
	Basic function to perform exponential on an input tensor. More...

class	CLFFT1D
	Basic function to execute one dimensional FFT. More...

class	CLFFT2D
	Basic function to execute two dimensional FFT. More...

class	CLFFTConvolutionLayer
	Basic function to execute FFT-based convolution on OpenCL. More...

class	CLFFTDigitReverseKernel
	Interface for the digit reverse operation kernel. More...

class	CLFFTRadixStageKernel
	Interface for the FFT radix stage kernel. More...

class	CLFFTScaleKernel
	Interface for the inverse fft scale kernel. More...

class	CLFill
	Basic function to run opencl::kernels::ClFillKernel. More...

class	CLFillBorderKernel
	Interface for filling the border of a kernel. More...

class	CLFineSVMMemoryRegion
	OpenCL fine-grain SVM memory region implementation. More...

class	CLFlattenLayer
	Basic function to execute flatten. More...

class	CLFloor
	Basic function to run opencl::kernels::ClFloorKernel. More...

class	CLFullyConnectedLayer
	Basic function to compute a Fully Connected layer on OpenCL. More...

class	CLFuseBatchNormalization
	Basic function to fuse the batch normalization node to a preceding convolution node. More...

class	CLFuseBatchNormalizationKernel
	OpenCL kernel to fuse the batch normalization node to a preceding convolution node. More...

class	CLGather
	Basic function to run CLGatherKernel. More...

class	CLGatherKernel
	Interface for the kernel to perform tensor reshaping. More...

class	CLGEMM
	Basic function to execute GEMM on OpenCL. More...

class	CLGEMMConvolutionLayer
	Basic function to compute the convolution layer. More...

class	CLGEMMDeconvolutionLayer
	Function to run the deconvolution layer through a call to GEMM. More...

class	CLGEMMHeuristicsHandle
	Handle for loading and retrieving GEMM heuristics. More...

struct	CLGEMMKernelSelectionParams
	OpenCL GEMM kernel selection parameters. More...

class	CLGEMMLowpMatrixMultiplyCore
	Basic function to execute GEMMLowpMatrixMultiplyCore on OpenCL. More...

class	CLGEMMLowpOutputStage
	Basic function to execute GEMMLowpQuantizeDown kernels on CL. More...

class	CLGenerateProposalsLayer
	Basic function to generate proposals for a RPN (Region Proposal Network) More...

class	CLIndirectConvolutionLayer
	Basic function to run the indirect convolution function. More...

class	CLInstanceNormalizationLayer
	Basic function to perform a Instance normalization. More...

class	CLInstanceNormalizationLayerKernel
	Interface for performing an instance normalization. More...

class	CLKernelLibrary
	CLKernelLibrary class. More...

class	CLL2NormalizeLayer
	Basic function to perform a L2 normalization on a given axis. More...

class	CLL2NormalizeLayerKernel
	Interface for performing a L2 normalize on a given axis given the square sum of it in this axis. More...

class	CLLogicalAnd
	Basic function to run arm_compute::opencl::kernels::ClLogicalBinaryKernel. More...

class	CLLogicalNot
	Basic function to do logical NOT operation. More...

class	CLLogicalOr
	Basic function to run arm_compute::opencl::kernels::ClLogicalBinaryKernel. More...

class	CLLogLayer
	Basic function to perform elementwise log on an input tensor. More...

class	CLLSTMLayer
	This function performs a single time step in a Long Short-Term Memory (LSTM) layer. More...

class	CLLSTMLayerQuantized
	Basic function to run CLLSTMLayerQuantized. More...

class	CLMatMul
	Basic function to execute MatMul (Matrix Multiplication) on OpenCL. More...

class	CLMaxUnpoolingLayer
	Function to perform MaxUnpooling. More...

class	CLMaxUnpoolingLayerKernel
	Interface for the pooling layer kernel. More...

class	CLMeanStdDevNormalizationKernel
	Interface for the kernel to normalize the input 2D tensor across the first dimension with respect to mean and standard deviation of the same dimension. More...

class	CLMeanStdDevNormalizationLayer
	Basic function to execute mean and standard deviation normalization by calling CLMeanStdDevNormalizationKernel. More...

class	CLMemory
	OpenCL implementation of memory object. More...

class	CLMutableCommandBuffer
	Command buffer implementaton based on CL mutable dispatch command buffer extension. More...

class	CLNegLayer
	Basic function to negate an input tensor. More...

class	CLNormalizationLayer
	Basic function to compute a normalization layer. More...

class	CLNormalizationLayerKernel
	Interface for the normalization layer kernel. More...

class	CLNormalizePlanarYUVLayer
	Basic function to run CLNormalizePlanarYUVLayerKernel. More...

class	CLNormalizePlanarYUVLayerKernel
	Interface for the NormalizePlanarYUV layer kernel. More...

class	CLPadLayer
	Basic function to pad a tensor. More...

class	CLPadLayerKernel
	Interface for the PadLayer function. More...

class	CLPermute
	Basic function to execute an opencl::kernels::ClPermuteKernel. More...

class	CLPixelWiseMultiplication
	Basic function to run opencl::ClMul. More...

class	CLPooling3dLayer
	Basic function to run opencl::ClPool3d. More...

class	CLPoolingLayer
	Basic function to run opencl::ClPool2d. More...

class	CLPReluLayer
	Basic function to run opencl::kernels::ClArithmeticKernel for PRELU. More...

class	CLPriorBoxLayer
	Basic function to run CLPriorBoxLayerKernel. More...

class	CLPriorBoxLayerKernel
	Interface for the PriorBox layer kernel. More...

class	CLQLSTMLayer
	Basic function to run CLQLSTMLayer. More...

class	CLQLSTMLayerNormalizationKernel
	Interface for the kernel to do layer normalization. More...

struct	CLQuantization
	OpenCL quantization data. More...

class	CLQuantizationLayer
	Basic function to simulate a quantization layer. More...

class	CLRange
	Basic function to run CLRangeKernel. More...

class	CLRangeKernel
	Kernel class for Range. More...

class	CLReduceMean
	Basic function to perform reduce operation. More...

class	CLReductionOperation
	Perform reduction operation. More...

class	CLReductionOperationKernel
	Interface for the reduction operation kernel. More...

class	CLReorgLayer

class	CLReorgLayerKernel
	OpenCL kernel to perform a reorg layer. More...

class	CLReshapeLayer
	Basic function to run opencl::kernels::ClReshapeKernel. More...

class	CLReverse
	Basic function to run CLReverseKernel. More...

class	CLReverseKernel
	Interface for the reverse kernel. More...

class	CLRNNLayer
	Basic function to run CLRNNLayer. More...

class	CLROIAlignLayer
	Basic function to run CLROIAlignLayerKernel. More...

class	CLROIAlignLayerKernel
	Interface for the RoIAlign kernel. More...

class	CLROIPoolingLayer
	Basic function to run CLROIPoolingLayerKernel. More...

class	CLROIPoolingLayerKernel
	Interface for the ROI pooling layer kernel. More...

class	CLRoundLayer
	Basic function to get the round (to the nearest even) value of an input tensor. More...

class	CLRsqrtLayer
	Basic function to perform inverse square root on an input tensor. More...

class	CLRuntimeContext
	Runtime context. More...

class	CLScale
	Basic function to run opencl::ClScale. More...

class	CLScheduler
	Provides global access to a CL context and command queue. More...

class	CLSelect
	Basic function to run CLSelect. More...

class	CLSelectKernel
	OpenCL interface for executing the select kernel. More...

class	CLSinLayer
	Basic function to calculate sine of an input tensor. More...

class	CLSlice
	Basic function to perform tensor slicing. More...

class	CLSoftmaxLayerGeneric
	Basic function to compute a SoftmaxLayer. More...

class	CLSpaceToBatchLayer
	Basic function to spatial divide a tensor. More...

class	CLSpaceToBatchLayerKernel
	Interface for the space to batch kernel. More...

class	CLSpaceToDepthLayer
	Basic function to run CLSpaceToDepthLayerKernel. More...

class	CLSpaceToDepthLayerKernel
	Interface for the space to depth kernel. More...

class	CLSplit
	Basic function to split a tensor along a given axis. More...

class	CLStackLayer
	Basic function to stack tensors along an axis. More...

class	CLStackLayerKernel
	OpenCL kernel to stacks a rank-R tensor into one with rank-(R+1) along the axis dimension. More...

class	CLStridedSlice
	Basic function to run CLStridedSliceKernel. More...

class	CLStridedSliceKernel
	Interface for the kernel to perform tensor strided slicing. More...

class	CLSubTensor
	Basic implementation of the OpenCL sub-tensor interface. More...

class	CLSymbols
	Class for loading OpenCL symbols. More...

class	CLTensor
	Basic implementation of the OpenCL tensor interface. More...

class	CLTensorAllocator
	Basic implementation of a CL memory tensor allocator. More...

class	CLTile
	Basic function to run CLTileKernel. More...

class	CLTileKernel
	OpenCL kernel to perform a Tile operation. More...

class	CLTranspose
	Basic function to execute an opencl::kernels::ClTransposeKernel. More...

class	CLTuner
	Basic implementation of the OpenCL tuner interface. More...

struct	CLTuningInfo

class	CLTuningParams
	< OpenCL tuner parameters More...

class	CLUnstack
	Basic function to unpack a rank-R tensor into rank-(R-1) tensors. More...

class	CLWinogradConvolutionLayer
	Basic function to execute Winograd-based convolution on OpenCL. More...

class	ComputeAnchorsInfo
	ComputeAnchors information class. More...

struct	Conv2dInfo
	Descriptor used by the 2d Convolution function. More...

struct	Conv3dInfo
	Descriptor used by the 3d Convolution function. More...

struct	ConvolutionInfo

class	Coordinates
	Coordinates of an item. More...

struct	Coordinates2D
	Coordinate type. More...

struct	Coordinates3D
	Coordinate type. More...

class	CPPBoxWithNonMaximaSuppressionLimit
	Basic function to run CPPBoxWithNonMaximaSuppressionLimitKernel. More...

class	CPPBoxWithNonMaximaSuppressionLimitKernel
	CPP kernel to perform computation of BoxWithNonMaximaSuppressionLimit. More...

class	CPPDetectionOutputLayer
	CPP Function to generate the detection output based on location and confidence predictions by doing non maximum suppression. More...

class	CPPDetectionPostProcessLayer
	CPP Function to generate the detection output based on center size encoded boxes, class prediction and anchors by doing non maximum suppression. More...

class	CPPNonMaximumSuppression
	CPP Function to perform non maximum suppression on the bounding boxes and scores. More...

class	CPPNonMaximumSuppressionKernel
	CPP Function to perform non maximum suppression on the bounding boxes and scores. More...

class	CPPPermute
	Basic function to run CPPPermuteKernel. More...

class	CPPPermuteKernel
	CPP kernel to perform tensor permutation. More...

class	CPPScheduler
	C++11 implementation of a pool of threads to automatically split a kernel's execution among several threads. More...

class	CPPSplit
	Basic function to split a tensor along a given axis. More...

class	CPPTopKV
	Basic function to run CPPTopKVKernel. More...

class	CPPTopKVKernel
	CPP kernel to perform tensor TopKV operation. More...

class	CPPUpsample
	Basic function to run CPPUpsample. More...

class	CPPUpsampleKernel
	CPP kernel to perform tensor upsample. More...

class	CPUInfo

class	CpuMatMulSettings
	Settings for MatMul Cpu implementation. More...

class	DetectionOutputLayerInfo
	Detection Output layer info. More...

class	DetectionPostProcessLayerInfo
	Detection Output layer info. More...

struct	DetectionWindow
	Detection window used for the object detection. More...

class	Dimensions
	Dimensions with dimensionality. More...

struct	DirectConvComputeKernelInfo
	Compute descriptor used by the direct convolution kernel. More...

struct	DirectConvolutionLayerOutputStageKernelInfo
	Descriptor used by the direct convolution layer output stage kernels. More...

struct	DWCComputeKernelInfo
	Compute descriptor used by the depthwise convolution native kernel. More...

struct	enable_bitwise_ops
	Disable bitwise operations by default. More...

struct	enable_bitwise_ops< arm_compute::GPUTarget >
	Enable bitwise operations on GPUTarget enumerations. More...

struct	ErfLutEntry
	Lookup table for erf(x) calculation. More...

struct	FFT1DInfo
	Descriptor used by the FFT1D function. More...

struct	FFT2DInfo
	Descriptor used by the FFT2D function. More...

struct	FFTDigitReverseKernelInfo
	Descriptor for FFT digit reverse kernels. More...

struct	FFTRadixStageKernelInfo
	Descriptor used by the FFT core kernels. More...

struct	FFTScaleKernelInfo
	Descriptor for FFT scale kernels. More...

struct	FullyConnectedLayerInfo
	Fully connected layer info. More...

class	GEMMInfo
	GEMM information class. More...

struct	GEMMKernelInfo
	Descriptor used by the GEMM kernels. More...

struct	GEMMLHSMatrixInfo
	GEMM LHS (Left Hand Side) matrix information. More...

struct	GEMMLowpOutputStageInfo
	GEMMLowp output stage info. More...

struct	GEMMLowpReductionKernelInfo

class	GEMMReshapeInfo
	GEMM reshape information class. More...

struct	GEMMRHSMatrixInfo
	GEMM RHS (Right Hand Side) matrix information. More...

class	GenerateProposalsInfo
	Generate Proposals Information class. More...

class	GpuMatMulSettings
	Settings for MatMul OpenCL implementation. More...

class	IAccessWindow
	Interface describing methods to update access window and padding based on kernel parameters. More...

class	IAllocator
	Allocator interface. More...

class	IArray
	Array of type T. More...

class	IAssetManager
	Asset manager interface. More...

class	ICLArray
	Interface for OpenCL Array. More...

class	ICLKernel
	Common interface for all the OpenCL kernels. More...

class	ICLMemoryRegion
	OpenCL memory region interface. More...

class	ICLSimple2DKernel
	Interface for simple OpenCL kernels having 1 tensor input and 1 tensor output. More...

class	ICLSimple3DKernel
	Interface for simple OpenCL kernels having 1 tensor input and 1 tensor output. More...

class	ICLSimpleFunction
	Basic interface for functions which have a single OpenCL kernel. More...

class	ICLSimpleKernel
	Interface for simple OpenCL kernels having 1 tensor input and 1 tensor output. More...

class	ICLSVMMemoryRegion
	OpenCL SVM memory region interface. More...

class	ICLTensor
	Interface for OpenCL tensor. More...

class	ICLTuner
	Basic interface for tuning the OpenCL kernels. More...

class	IContext
	Context interface. More...

class	ICPPKernel
	Common interface for all kernels implemented in C++. More...

class	ICPPSimpleFunction
	Basic interface for functions which have a single CPP kernel. More...

class	IDevice
	Interface for device object. More...

class	IFunction
	Base class for all functions. More...

class	IKernel
	Common information for all the kernels. More...

class	ILifetimeManager
	Interface for managing the lifetime of objects. More...

class	IMemory
	Memory interface. More...

class	IMemoryGroup
	Memory group interface. More...

class	IMemoryManageable
	Interface of an object than can be memory managed. More...

class	IMemoryManager
	Memory manager interface to handle allocations of backing memory. More...

class	IMemoryPool
	Memory Pool Inteface. More...

class	IMemoryRegion
	Memory region interface. More...

class	INESimpleFunction
	Basic interface for functions which have a single CPU kernel. More...

class	INESimpleFunctionNoBorder
	Basic interface for functions which have a single CPU kernel and no border. More...

struct	InstanceNormalizationLayerKernelInfo

struct	IOFormatInfo
	IO formatting information class. More...

class	IOperator
	Base class specifying the operator interface. More...

class	IPoolManager
	Memory pool manager interface. More...

class	IQueue
	Base class specifying the queue interface. More...

class	IRuntimeContext
	Context interface. More...

class	IScheduler
	Scheduler interface to run kernels. More...

class	ISimpleLifetimeManager
	Abstract class of the simple lifetime manager interface. More...

class	ITensor
	Interface for CPU tensor. More...

class	ITensorAllocator
	Interface to allocate tensors. More...

class	ITensorInfo
	Store the tensor's metadata. More...

class	ITensorPack
	Tensor packing service. More...

class	ITensorV2
	Base class specifying the tensor interface. More...

class	Iterator
	Iterator updated by execute_window_loop for each window element. More...

class	ITransformWeights
	Weights tensor transform interface In order to identify the different reshape functions, each reshape function has to generate a unique id. More...

class	IWeightsManager
	Weights manager interface to handle weights transformations. More...

class	Kernel
	Kernel class. More...

class	LSTMParams

struct	LUTInfo

class	LUTManager

class	MatMulInfo
	Class for holding information related to matrix multiplication function. More...

struct	MatMulKernelInfo

class	Memory
	CPU implementation of memory object. More...

class	MemoryGroup
	Memory group. More...

class	MemoryGroupResourceScope
	Memory group resources scope handling class. More...

class	MemoryManagerOnDemand
	On-demand memory manager. More...

class	MemoryRegion
	Memory region CPU implementation. More...

struct	MinMaxLocationValues
	Min and max values and locations. More...

class	NEActivationLayer
	Basic function to run cpu::kernels::CpuActivationKernel. More...

class	NEAddMulAdd
	Function to compute Add+Mul+Add fused operation. More...

class	NEArgMinMaxLayer
	Function to calculate the index of the minimum or maximum values in a tensor based on an axis. More...

class	NEArithmeticAddition
	Basic function to run cpu::kernels::CpuAddKernel. More...

class	NEArithmeticSubtraction
	Basic function to run cpu::kernels::CpuSubKernel. More...

class	NEBatchNormalizationLayer
	Basic function to run NENormalizationLayerKernel and simulate a batch normalization layer. More...

class	NEBatchNormalizationLayerKernel
	Interface for the batch normalization layer kernel. More...

class	NEBatchToSpaceLayer
	Basic function to run NEBatchToSpaceLayerKernel. More...

class	NEBatchToSpaceLayerKernel
	Interface for the batch to space kernel. More...

class	NEBitwiseAnd
	Basic function to run NEBitwiseAndKernel. More...

class	NEBitwiseAndKernel
	Interface for the kernel to perform bitwise AND between XY-planes of two tensors. More...

class	NEBitwiseNot
	Basic function to run NEBitwiseNotKernel. More...

class	NEBitwiseNotKernel
	Interface for the kernel to perform bitwise NOT operation. More...

class	NEBitwiseOr
	Basic function to run NEBitwiseOrKernel. More...

class	NEBitwiseOrKernel
	Interface for the kernel to perform bitwise inclusive OR between two tensors. More...

class	NEBitwiseXor
	Basic function to run NEBitwiseXorKernel. More...

class	NEBitwiseXorKernel
	Interface for the kernel to perform bitwise exclusive OR (XOR) between two tensors. More...

class	NEBoundingBoxTransform
	Basic function to run NEBoundingBoxTransformKernel. More...

class	NEBoundingBoxTransformKernel
	Interface for the bounding box kernel. More...

class	NECast
	Basic function to run cpu::kernels::CpuCastKernel. More...

class	NEChannelShuffleLayer
	Basic function to run NEChannelShuffleLayerKernel. More...

class	NEChannelShuffleLayerKernel
	Interface for the channel shuffle kernel. More...

class	NECol2ImKernel
	Kernel to perform col2im reshaping. More...

class	NEComplexPixelWiseMultiplication
	Basic function to run cpu::CpuComplexMul. More...

class	NEComputeAllAnchorsKernel
	Interface for Compute All Anchors kernel. More...

class	NEConcatenateLayer
	Basic function to execute concatenate tensors along a given axis. More...

class	NEConv3D
	Basic function to simulate a 3d convolution. More...

class	NEConvertFullyConnectedWeights
	Basic function to run cpu::kernels::CpuConvertFullyConnectedWeightsKernel. More...

class	NEConvolutionLayer
	Basic function to simulate a convolution layer. More...

class	NECopy
	Basic function to run cpu::kernels::CpuCopyKernel. More...

class	NECropKernel
	Interface for the kernel to perform tensor cropping. More...

class	NECropResize
	Function to perform cropping and resizing. More...

class	NEDeconvolutionLayer
	Function to run the deconvolution layer. More...

class	NEDepthConvertLayer
	Basic function to run cpu::kernels::CpuCastKernel. More...

class	NEDepthToSpaceLayer
	Basic function to run NEDepthToSpaceLayerKernel. More...

class	NEDepthToSpaceLayerKernel
	Interface for the depth to space kernel. More...

class	NEDepthwiseConvolutionLayer
	Function to execute a depthwise convolution. More...

class	NEDequantizationLayer
	Basic function to run cpu::CpuDequantize that dequantizes an input tensor. More...

class	NEDetectionPostProcessLayer
	NE Function to generate the detection output based on center size encoded boxes, class prediction and anchors by doing non maximum suppression. More...

class	NEDirectConvolutionLayer
	Function to run the direct convolution. More...

class	NEElementwiseComparison
	Basic function to run cpu::kernels::CpuComparisonKernel. More...

class	NEElementwiseComparisonStatic
	Basic function to run cpu::kernels::CpuComparisonKernel. More...

class	NEElementwiseDivision
	Basic function to run cpu::kernels::CpuArithmeticKernel for division. More...

class	NEElementwiseMax
	Basic function to run cpu::kernels::CpuArithmeticKernel for max. More...

class	NEElementwiseMin
	Basic function to run cpu::kernels::CpuArithmeticKernel for min. More...

class	NEElementwisePower
	Basic function to run cpu::kernels::CpuArithmeticKernel for power. More...

class	NEElementwiseSquaredDiff
	Basic function to run cpu::kernels::CpuArithmeticKernel for squared difference. More...

class	NEElementwiseUnaryLayer
	Basic function to perform unary elementwise operations. More...

class	NEFFT1D
	Basic function to execute one dimensional FFT. More...

class	NEFFT2D
	Basic function to execute two dimensional FFT. More...

class	NEFFTConvolutionLayer
	Basic function to execute FFT-based convolution on CPU. More...

class	NEFFTDigitReverseKernel
	Interface for the digit reverse operation kernel. More...

class	NEFFTRadixStageKernel
	Interface for the FFT kernel. More...

class	NEFFTScaleKernel
	Interface for the inverse fft scale kernel. More...

class	NEFill
	Basic function to run cpu::kernels::CpuFillKernel. More...

class	NEFillBorder
	Basic function to run NEFillBorderKernel. More...

class	NEFillBorderKernel
	Interface for the kernel to fill borders. More...

class	NEFlattenLayer
	Basic function to execute flatten layer kernel. More...

class	NEFloor
	Basic function to run cpu::kernels::CpuFloorKernel. More...

class	NEFullyConnectedLayer
	Basic function to compute a Fully Connected layer. More...

class	NEFuseBatchNormalization
	Basic function to fuse the batch normalization node to a preceding convolution node. More...

class	NEFuseBatchNormalizationKernel
	OpenNE kernel to fuse the batch normalization node to a preceding convolution node. More...

class	NEGather
	Basic function to run NEGatherKernel. More...

class	NEGatherKernel
	Kernel to perform gather operation. More...

class	NEGEMM
	Basic function to execute GEMM. More...

class	NEGEMMConv2d
	Basic function to compute the convolution layer. More...

class	NEGEMMConvolutionLayer
	Basic function to compute the convolution layer. More...

class	NEGEMMLowpMatrixMultiplyCore
	Function to run Gemm on quantized types. More...

class	NEGEMMLowpOutputStage
	Basic function to execute GEMMLowpQuantizeDown kernels. More...

class	NEGenerateProposalsLayer
	Basic function to generate proposals for a RPN (Region Proposal Network) More...

class	NEInstanceNormalizationLayer
	Basic function to perform a Instance normalization. More...

class	NEInstanceNormalizationLayerKernel
	Interface for performing an instance normalization. More...

class	NEL2NormalizeLayer
	Basic function to perform a L2 normalization on a given axis. More...

class	NEL2NormalizeLayerKernel
	Interface for performing a L2 normalize on a given axis given the square sum of it in this axis. More...

class	NELogicalAnd
	Basic function to perform logical AND. More...

class	NELogicalNot
	Basic function to perform logical NOT. More...

class	NELogicalOr
	Basic function to perform logical OR. More...

class	NELSTMLayer
	Basic function to run NELSTMLayer. More...

class	NELSTMLayerQuantized
	Basic function to run NELSTMLayerQuantized. More...

class	NEMatMul
	Basic function to run the following operators: More...

class	NEMaxUnpoolingLayer
	Function to perform MaxUnpooling. More...

class	NEMeanStdDevNormalizationKernel
	Interface for the kernel to normalize the input 2D tensor across the first dimension with respect to mean and standard deviation of the same dimension. More...

class	NEMeanStdDevNormalizationLayer
	Basic function to execute mean and standard deviation normalization by calling NEMeanStdDevNormalizationKernel. More...

class	NENormalizationLayer
	Basic function to compute a normalization layer. More...

class	NENormalizationLayerKernel
	Interface for the normalization layer kernel. More...

class	NEPadLayer
	Basic function to pad a tensor. More...

class	NEPadLayerKernel
	Basic kernel to pad the input tensor given padding information. More...

class	NEPermute
	Basic function to run cpu::kernels::CpuPermuteKernel. More...

class	NEPixelWiseMultiplication
	Basic function to run cpu::CpuMul. More...

class	NEPooling3dLayer
	Basic function to simulate a pooling 3d layer with the specified pooling operation. More...

class	NEPoolingLayer
	Basic function to simulate a pooling layer with the specified pooling operation. More...

class	NEPReluLayer
	Basic function to run cpu::kernels::CpuArithmeticKernel for PRELU. More...

class	NEPriorBoxLayer
	Basic function to run NEPriorBoxLayerKernel. More...

class	NEPriorBoxLayerKernel
	Interface for the kernel to calculate prior boxes. More...

class	NEQLSTMLayer
	Basic function to run NEQLSTMLayer. More...

class	NEQLSTMLayerNormalizationKernel
	Kernel to perform layer normalization for QLSTM. More...

class	NEQuantizationLayer
	Basic function to run a quantization layer using cpu::CpuQuantize. More...

class	NERange
	Basic function to run NERangeKernel. More...

class	NERangeKernel
	Kernel class for Range. More...

class	NEReduceMean
	Basic function to perform reduce operation. More...

class	NEReductionOperation
	Basic function to simulate a reduction operation. More...

class	NEReductionOperationKernel
	Kernel to perform a reduction operation. More...

class	NEReorgLayer
	Basic function to run NEReorgLayerKernel. More...

class	NEReorgLayerKernel
	Interface for the kernel to perform tensor re-organization. More...

class	NEReshapeLayer
	Basic function to run cpu::kernels::CpuReshapeKernel. More...

class	NEReverse
	Basic function to run NEReverseKernel. More...

class	NEReverseKernel
	Interface for the reverse layer kernel. More...

class	NERNNLayer
	Basic function to run NERNNLayer. More...

class	NEROIAlignLayer
	Basic function to run NEROIAlignLayerKernel. More...

class	NEROIAlignLayerKernel
	Interface for the RoIAlign kernel. More...

class	NEROIPoolingLayer
	Basic function to run NEROIPoolingLayerKernel. More...

class	NEROIPoolingLayerKernel
	Interface for the ROI pooling layer kernel. More...

class	NEScale
	Basic function to compute Scale. More...

class	NESelect
	Basic function to run NESelect. More...

class	NESelectKernel
	Interface for the select kernel. More...

class	NESlice
	Basic function to perform tensor slicing. More...

class	NESoftmaxLayerGeneric
	Basic function to compute a SoftmaxLayer and a Log SoftmaxLayer. More...

class	NESpaceToBatchLayer
	Basic function to spatial divide a tensor. More...

class	NESpaceToBatchLayerKernel
	Interface for the space to batch kernel. More...

class	NESpaceToDepthLayer
	Basic function to run NESpaceToDepthLayerKernel. More...

class	NESpaceToDepthLayerKernel
	Interface for the space to depth kernel. More...

class	NESplit
	Basic function to split a tensor along a given axis. More...

class	NEStackLayer
	Basic function to stack tensors along an axis. More...

class	NEStackLayerKernel
	Basic kernel to stack a rank-R tensor into one with rank-(R+1) along the axis dimension. More...

class	NEStridedSlice
	Basic function to run NEStridedSliceKernel. More...

class	NEStridedSliceKernel
	Interface for the kernel to perform tensor strided slicing. More...

class	NETile
	Basic function to run NETileKernel. More...

class	NETileKernel
	Basic kernel to perform a tile operation. More...

class	NETranspose
	Basic function to run cpu::kernels::CpuTransposeKernel. More...

class	NEUnstack
	Basic function to unpack a rank-R tensor into rank-(R-1) tensors. More...

class	NEWinogradConvolutionLayer
	Basic function to simulate a convolution layer. More...

class	NormalizationLayerInfo
	Normalization Layer Information class. More...

class	OffsetLifetimeManager
	Concrete class that tracks the lifetime of registered tensors and calculates the systems memory requirements in terms of a single blob and a list of offsets. More...

class	OffsetMemoryPool
	Offset based memory pool. More...

class	OMPScheduler
	Pool of threads to automatically split a kernel's execution among several threads. More...

struct	Padding2D
	Padding and stride information class. More...

struct	Padding3D
	Padding information for 3D operations like Conv3d. More...

class	PadStrideInfo

class	PixelValue
	Class describing the value of a pixel for any image format. More...

struct	Pooling3dLayerInfo
	Pooling Layer Information struct. More...

struct	PoolingLayerInfo
	Pooling Layer Information struct. More...

class	PoolManager
	Memory pool manager. More...

class	PriorBoxLayerInfo
	PriorBox layer info. More...

class	Program
	Program class. More...

struct	Qasymm8QuantizationHelper

class	QuantizationInfo
	Quantization information. More...

struct	Rectangle
	Rectangle type. More...

class	ROIPoolingLayerInfo
	ROI Pooling Layer Information class. More...

class	RuntimeContext
	Runtime context. More...

struct	ScaleKernelInfo

class	Scheduler
	Configurable scheduler which supports multiple multithreading APIs and choosing between different schedulers at runtime. More...

class	SchedulerFactory
	Scheduler Factory. More...

class	Semaphore
	Semamphore class. More...

class	SingleThreadScheduler
	Pool of threads to automatically split a kernel's execution among several threads. More...

class	Size2D
	Class for specifying the size of an image or rectangle. More...

class	Size3D
	Class for specifying the size of a 3D shape or object. More...

struct	SoftmaxKernelInfo
	Descriptor used by the softmax kernels. More...

class	Status
	Status class. More...

class	Steps
	Class to describe a number of elements in each dimension. More...

class	StridedSliceLayerInfo

class	Strides
	Strides of an item in bytes. More...

class	SubTensor
	Basic implementation of the sub-tensor interface. More...

class	SubTensorInfo
	Store the sub tensor's metadata. More...

class	Tensor
	Basic implementation of the tensor interface. More...

class	TensorAccessor
	Tensor accessors to make it easier to interface with arm_gemm. More...

class	TensorAllocator
	Basic implementation of a CPU memory tensor allocator. More...

class	TensorInfo
	Store the tensor's metadata. More...

class	TensorPack
	Tensor packing service. More...

class	TensorShape
	Shape of a tensor. More...

struct	ThreadInfo
	Information about executing thread and CPU. More...

struct	UniformQuantizationInfo
	Quantization info when assuming per layer quantization. More...

struct	ValidRegion
	Container for valid region of a window. More...

class	WeightsInfo
	Convolution Layer Weights Information class. More...

class	Window
	Describe a multidimensional execution window. More...

class	WindowIterator
	Iterate over a portion of a Window. More...

struct	WinogradInfo
	Winograd information. More...

struct	WorkspaceDataElement

Typedefs
using	ICLUInt8Array = ICLArray< cl_uchar >
	Interface for OpenCL Array of uint8s. More...

using	ICLUInt16Array = ICLArray< cl_ushort >
	Interface for OpenCL Array of uint16s. More...

using	ICLUInt32Array = ICLArray< cl_uint >
	Interface for OpenCL Array of uint32s. More...

using	ICLInt16Array = ICLArray< cl_short >
	Interface for OpenCL Array of int16s. More...

using	ICLInt32Array = ICLArray< cl_int >
	Interface for OpenCL Array of int32s. More...

using	ICLFloatArray = ICLArray< cl_float >
	Interface for OpenCL Array of floats. More...

using	ICLImage = ICLTensor

using	half = half_float::half
	16-bit floating point type More...

using	PermutationVector = Strides
	Permutation vector. More...

using	IUInt8Array = IArray< uint8_t >
	Interface for Array of uint8s. More...

using	IUInt16Array = IArray< uint16_t >
	Interface for Array of uint16s. More...

using	IUInt32Array = IArray< uint32_t >
	Interface for Array of uint32s. More...

using	IInt16Array = IArray< int16_t >
	Interface for Array of int16s. More...

using	IInt32Array = IArray< int32_t >
	Interface for Array of int32s. More...

using	IFloatArray = IArray< float >
	Interface for Array of floats. More...

using	IImage = ITensor

using	qasymm8_signed_t = int8_t
	8 bit signed quantized asymmetric scalar value More...

using	qasymm8_t = uint8_t
	8 bit quantized asymmetric scalar value More...

using	qsymm16_t = int16_t
	16 bit quantized symmetric scalar value More...

using	qasymm16_t = uint16_t
	16 bit quantized asymmetric scalar value More...

using	BiStrides = Coordinates
	Bidirectional strides. More...

using	PaddingSize = BorderSize
	Container for 2D padding size. More...

using	PaddingInfo = std::pair< uint32_t, uint32_t >
	Padding information as a pair of unsigned int start/end. More...

using	PaddingList = std::vector< PaddingInfo >
	List of padding information. More...

using	Multiples = std::vector< uint32_t >
	Information to produce a tiled version of a Tensor. More...

using	BBox = std::array< float, 4 >

using	LabelBBox = std::map< int, std::vector< BBox > >

using	CropInfo = Padding2D
	Class for holding information related to cropping. More...

using	UInt8Array = Array< uint8_t >
	Array of uint8s. More...

using	UInt16Array = Array< uint16_t >
	Array of uint16s. More...

using	UInt32Array = Array< uint32_t >
	Array of uint32s. More...

using	Int16Array = Array< int16_t >
	Array of int16s. More...

using	Int32Array = Array< int32_t >
	Array of int32s. More...

using	FloatArray = Array< float >
	Array of floats. More...

using	CLUInt8Array = CLArray< cl_uchar >
	OpenCL Array of uint8s. More...

using	CLUInt16Array = CLArray< cl_ushort >
	OpenCL Array of uint16s. More...

using	CLUInt32Array = CLArray< cl_uint >
	OpenCL Array of uint32s. More...

using	CLInt16Array = CLArray< cl_short >
	OpenCL Array of int16s. More...

using	CLInt32Array = CLArray< cl_int >
	OpenCL Array of int32s. More...

using	CLFloatArray = CLArray< cl_float >
	OpenCL Array of floats. More...

using	CLImage = CLTensor
	OpenCL Image. More...

using	CLEqual = CLComparisonStatic< ComparisonOperation::Equal >
	Basic function to run equal comparison. More...

using	CLNotEqual = CLComparisonStatic< ComparisonOperation::NotEqual >
	Basic function to run not equal comparison. More...

using	CLGreater = CLComparisonStatic< ComparisonOperation::Greater >
	Basic function to run greater comparison. More...

using	CLGreaterEqual = CLComparisonStatic< ComparisonOperation::GreaterEqual >
	Basic function to run greater-equal comparison. More...

using	CLLess = CLComparisonStatic< ComparisonOperation::Less >
	Basic function to run less comparison. More...

using	CLLessEqual = CLComparisonStatic< ComparisonOperation::LessEqual >
	Basic function to run less-equal comparison. More...

using	CLSoftmaxLayer = CLSoftmaxLayerGeneric< false >

using	CLLogSoftmaxLayer = CLSoftmaxLayerGeneric< true >

using	NEEqual = NEElementwiseComparisonStatic< ComparisonOperation::Equal >
	Basic function to run equal comparison. More...

using	NENotEqual = NEElementwiseComparisonStatic< ComparisonOperation::NotEqual >
	Basic function to run not equal comparison. More...

using	NEGreater = NEElementwiseComparisonStatic< ComparisonOperation::Greater >
	Basic function to run greater comparison. More...

using	NEGreaterEqual = NEElementwiseComparisonStatic< ComparisonOperation::GreaterEqual >
	Basic function to run greater-equal comparison. More...

using	NELess = NEElementwiseComparisonStatic< ComparisonOperation::Less >
	Basic function to run less comparison. More...

using	NELessEqual = NEElementwiseComparisonStatic< ComparisonOperation::LessEqual >
	Basic function to run less-equal comparison. More...

using	NERsqrtLayer = NEElementwiseUnaryLayer< ElementWiseUnary::RSQRT >

using	NEExpLayer = NEElementwiseUnaryLayer< ElementWiseUnary::EXP >

using	NENegLayer = NEElementwiseUnaryLayer< ElementWiseUnary::NEG >

using	NELogLayer = NEElementwiseUnaryLayer< ElementWiseUnary::LOG >

using	NEAbsLayer = NEElementwiseUnaryLayer< ElementWiseUnary::ABS >

using	NERoundLayer = NEElementwiseUnaryLayer< ElementWiseUnary::ROUND >

using	NESinLayer = NEElementwiseUnaryLayer< ElementWiseUnary::SIN >

using	NESoftmaxLayer = NESoftmaxLayerGeneric< false >

using	NELogSoftmaxLayer = NESoftmaxLayerGeneric< true >

using	INEKernel = ICPPKernel
	Common interface for all kernels implemented in Neon. More...

using	NEScheduler = Scheduler
	CPU Scheduler. More...

using	Image = Tensor
	Image. More...

using	MemoryMappings = std::map< IMemory *, size_t >
	A map of (handle, index/offset), where handle is the memory handle of the object to provide the memory for and index/offset is the buffer/offset from the pool that should be used. More...

using	GroupMappings = std::map< size_t, MemoryMappings >
	A map of the groups and memory mappings. More...

using	MemoryRequirements = experimental::MemoryRequirements

template<typename TensorType >
using	WorkspaceData = std::vector< WorkspaceDataElement< TensorType > >

using	qasymm8x8_t = uint8x8_t
	8 bit quantized asymmetric vector with 8 elements More...

using	qasymm8x8x2_t = uint8x8x2_t
	8 bit quantized asymmetric vector with 16 elements More...

using	qasymm8x8x3_t = uint8x8x3_t
	8 bit quantized asymmetric vector with 24 elements More...

using	qasymm8x8x4_t = uint8x8x4_t
	8 bit quantized asymmetric vector with 32 elements More...

using	qasymm8x16_t = uint8x16_t
	8 bit quantized asymmetric vector with 16 elements More...

using	qasymm8x8_signed_t = int8x8_t
	8 bit quantized signed asymmetric vector with 8 elements More...

using	qasymm8x8x2_signed_t = int8x8x2_t
	8 bit quantized signed asymmetric vector with 16 elements More...

using	qasymm8x8x3_signed_t = int8x8x3_t
	8 bit quantized signed asymmetric vector with 24 elements More...

using	qasymm8x8x4_signed_t = int8x8x4_t
	8 bit quantized signed asymmetric vector with 32 elements More...

using	qasymm8x16_signed_t = int8x16_t
	8 bit quantized signed asymmetric vector with 16 elements More...

using	qsymm8_t = int8_t
	8 bit quantized symmetric scalar value More...

using	qsymm16x8_t = int16x8_t
	16 bit quantized symmetric vector with 8 elements More...

using	qsymm16x8x2_t = int16x8x2_t
	16 bit quantized symmetric vector with 16 elements More...

typedef float	float32_t

using	OperatorType = opencl::ClGemm

using	Mutex = std::mutex
	Wrapper of Mutex data-object. More...

template<typename Mutex >
using	lock_guard = std::lock_guard< Mutex >
	Wrapper of lock_guard data-object. More...

template<typename Mutex >
using	unique_lock = std::unique_lock< Mutex >
	Wrapper of lock_guard data-object. More...

Enumerations
enum	CLVersion { CL10, CL11, CL12, CL20, CL30, UNKNOWN }
	Available OpenCL Version. More...

enum	CLKernelType { UNKNOWN, UNKNOWN, DEPTHWISE, DIRECT, ELEMENTWISE, GEMM, POOL, WINOGRAD }

enum	Channel { UNKNOWN, C0, C1, C2, C3, R, G, B, A, Y, U, V }
	Available channels. More...

enum	Format { UNKNOWN, U8, S16, U16, S32, U32, S64, U64, BFLOAT16, F16, F32, UV88, RGB888, RGBA8888, YUV444, YUYV422, NV12, NV21, IYUV, UYVY422 }
	Image colour formats. More...

enum	DataType { UNKNOWN, U8, S8, QSYMM8, QASYMM8, QASYMM8_SIGNED, QSYMM8_PER_CHANNEL, U16, S16, QSYMM16, QASYMM16, U32, S32, U64, S64, BFLOAT16, F16, F32, F64, SIZET }
	Available data types. More...

enum	DataLayout { UNKNOWN, NCHW, NHWC, NCDHW, NDHWC }
	[DataLayout enum definition] More...

enum	DataLayoutDimension { CHANNEL, HEIGHT, WIDTH, DEPTH, BATCHES }
	[DataLayout enum definition] More...

enum	DimensionRoundingType { FLOOR, CEIL }
	Dimension rounding type when down-scaling on CNNs. More...

enum	WeightFormat { UNSPECIFIED = 0x1, ANY = 0x2, OHWI = 0x100100, OHWIo2 = 0x100200, OHWIo4 = 0x100400, OHWIo8 = 0x100800, OHWIo16 = 0x101000, OHWIo32 = 0x102000, OHWIo64 = 0x104000, OHWIo128 = 0x108000, OHWIo4i2 = 0x200400, OHWIo4i2_bf16 = 0x200410, OHWIo8i2 = 0x200800, OHWIo8i2_bf16 = 0x200810, OHWIo16i2 = 0x201000, OHWIo16i2_bf16 = 0x201010, OHWIo32i2 = 0x202000, OHWIo32i2_bf16 = 0x202010, OHWIo64i2 = 0x204000, OHWIo64i2_bf16 = 0x204010, OHWIo4i4 = 0x400400, OHWIo4i4_bf16 = 0x400410, OHWIo8i4 = 0x400800, OHWIo8i4_bf16 = 0x400810, OHWIo16i4 = 0x401000, OHWIo16i4_bf16 = 0x401010, OHWIo32i4 = 0x402000, OHWIo32i4_bf16 = 0x402010, OHWIo64i4 = 0x404000, OHWIo64i4_bf16 = 0x404010, OHWIo2i8 = 0x800200, OHWIo4i8 = 0x800400, OHWIo8i8 = 0x800800, OHWIo16i8 = 0x801000, OHWIo32i8 = 0x802000, OHWIo64i8 = 0x804000 }
	Memory layouts for the weights tensor. More...

enum	CPUModel { X, GENERIC, GENERIC_FP16, GENERIC_FP16_DOT, A53, A55r0, A55r1, A35, A73, A76, A510, X1, V1, A64FX, N1 }
	CPU models types. More...

enum	ErrorCode { OK, RUNTIME_ERROR, UNSUPPORTED_EXTENSION_USE }
	Available error codes. More...

enum	TensorType : int32_t { ACL_UNKNOWN = -1, ACL_SRC_DST = 0, ACL_SRC = 0, ACL_SRC_0 = 0, ACL_SRC_1 = 1, ACL_SRC_2 = 2, ACL_SRC_3 = 3, ACL_SRC_4 = 4, ACL_SRC_5 = 5, ACL_SRC_6 = 6, ACL_SRC_END = 6, ACL_DST = 30, ACL_DST_0 = 30, ACL_DST_1 = 31, ACL_DST_2 = 32, ACL_DST_END = 32, ACL_INT = 50, ACL_INT_0 = 50, ACL_INT_1 = 51, ACL_INT_2 = 52, ACL_INT_3 = 53, ACL_INT_4 = 54, ACL_SRC_VEC = 256, ACL_DST_VEC = 512, ACL_INT_VEC = 1024, ACL_BIAS = ACL_SRC_2, ACL_VEC_ROW_SUM = ACL_SRC_3, ACL_VEC_COL_SUM = ACL_SRC_4, ACL_SHIFTS = ACL_SRC_5, ACL_MULTIPLIERS = ACL_SRC_6 }
	Memory type. More...

enum	GPUTarget { UNKNOWN = 0x101, GPU_ARCH_MASK = 0xF00, GPU_GENERATION_MASK = 0x0F0, MIDGARD = 0x100, BIFROST = 0x200, VALHALL = 0x300, FIFTHGEN = 0X400, T600 = 0x110, T700 = 0x120, T800 = 0x130, G71 = 0x210, G72 = 0x220, G51 = 0x221, G51BIG = 0x222, G51LIT = 0x223, G31 = 0x224, G76 = 0x230, G52 = 0x231, G52LIT = 0x232, G77 = 0x310, G57 = 0x311, G78 = 0x320, G68 = 0x321, G78AE = 0x330, G710 = 0x340, G610 = 0x341, G510 = 0x342, G310 = 0x343, G715 = 0x350, G615 = 0x351, G720 = 0x410, G620 = 0X411 }
	Available GPU Targets. More...

enum	DeviceType { NEON, CL }
	Device types. More...

enum	RoundingPolicy { TO_ZERO, TO_NEAREST_UP, TO_NEAREST_EVEN }
	Rounding method. More...

enum	SamplingPolicy { CENTER, TOP_LEFT }
	Available Sampling Policies. More...

enum	ConvolutionMethod { GEMM, GEMM_CONV2D, DIRECT, INDIRECT, WINOGRAD, FFT }
	Available ConvolutionMethod. More...

enum	DepthwiseConvolutionFunction { OPTIMIZED, GENERIC }
	Available DepthwiseConvolutionFunction. More...

enum	DeconvolutionMethod { GEMM, DIRECT, UPSCALE_CONV2D }
	Available DeconvolutionMethod. More...

enum	FuseBatchNormalizationType { CONVOLUTION, DEPTHWISECONVOLUTION }
	Available FuseBatchNormalizationType. More...

enum	PaddingMode { CONSTANT, REFLECT, SYMMETRIC }
	Padding mode to use for PadLayer. More...

enum	ComparisonOperation { Equal, NotEqual, Greater, GreaterEqual, Less, LessEqual }
	Supported comparison operations. More...

enum	BorderMode { UNDEFINED, CONSTANT, REPLICATE }
	Methods available to handle borders. More...

enum	ConvertPolicy { WRAP, SATURATE }
	Policy to handle integer overflow. More...

enum	InterpolationPolicy { NEAREST_NEIGHBOR, BILINEAR, AREA }
	Interpolation method. More...

enum	BilinearInterpolation { BILINEAR_OLD_NEW, BILINEAR_SCHARR }
	Bilinear Interpolation method used by LKTracker. More...

enum	ReductionOperation { ARG_IDX_MAX, ARG_IDX_MIN, MEAN_SUM, PROD, SUM_SQUARE, SUM, MIN, MAX }
	Available reduction operations. More...

enum	ArithmeticOperation { ADD, SUB, DIV, MIN, MAX, SQUARED_DIFF, POWER, PRELU }
	Available element-wise operations. More...

enum	ElementWiseUnary { RSQRT, EXP, NEG, LOG, ABS, SIN, ROUND, LOGICAL_NOT }
	Available element wise unary operations. More...

enum	BitwiseOperation { AND, NOT, OR, XOR }
	Available bitwise operations. More...

enum	NormType { IN_MAP_1D, IN_MAP_2D, CROSS_MAP }
	The normalization type used for the normalization layer. More...

enum	PoolingType { MAX, AVG, L2 }
	Available pooling types. More...

enum	NMSType { LINEAR, GAUSSIAN, ORIGINAL }
	Available non maxima suppression types. More...

enum	DetectionOutputLayerCodeType { CORNER, CENTER_SIZE, CORNER_SIZE, TF_CENTER }
	Available Detection Output code types. More...

enum	ActivationFunction { LOGISTIC, TANH, RELU, BOUNDED_RELU, LU_BOUNDED_RELU, LEAKY_RELU, SOFT_RELU, ELU, ABS, SQUARE, SQRT, LINEAR, IDENTITY, HARD_SWISH, SWISH, GELU }
	Available activation functions. More...

enum	GEMMLowpOutputStageType { NONE, QUANTIZE_DOWN, QUANTIZE_DOWN_FIXEDPOINT, QUANTIZE_DOWN_FLOAT }
	GEMMLowp output stage type. More...

enum	CLTunerMode { EXHAUSTIVE, NORMAL, RAPID }
	< OpenCL tuner modes More...

enum	CLGEMMKernelType { NATIVE, RESHAPED, RESHAPED_ONLY_RHS, RESHAPED_ONLY_RHS_MMUL }
	OpenCL GEMM kernel types. More...

enum	CLBackendType { Native, Clvk }
	List the possible OpenCL backends. More...

enum	FFTDirection { Forward, Inverse }
	FFT direction to use. More...

enum	MappingType { BLOBS, OFFSETS }
	Mapping type. More...

enum	StatusCode { Success = AclSuccess, RuntimeError = AclRuntimeError, OutOfMemory = AclOutOfMemory, Unimplemented = AclUnimplemented, UnsupportedTarget = AclUnsupportedTarget, InvalidTarget = AclInvalidTarget, InvalidArgument = AclInvalidArgument, UnsupportedConfig = AclUnsupportedConfig, InvalidObjectState = AclInvalidObjectState }

enum	Target { Cpu = AclTarget::AclCpu, GpuOcl = AclTarget::AclGpuOcl }

enum	ExecutionMode { FastRerun = AclPreferFastRerun, FastStart = AclPreferFastStart }

enum	ImportMemoryType { HostPtr = AclImportMemoryType::AclHostPtr }

enum	CLImage2DType { ReadOnly, WriteOnly }
	OpenCL Image2D types. More...

enum	LogicalOperation { Unknown, And, Or, Not }
	List of supported logical operations. More...

Functions
std::string	get_cl_type_from_data_type (const DataType &dt)
	Translates a tensor data type to the appropriate OpenCL type. More...

std::string	get_cl_promoted_type_from_data_type (const DataType &dt)
	Translates a tensor data type to the appropriate OpenCL promoted type. More...

std::string	get_cl_unsigned_type_from_element_size (size_t element_size)
	Translates the element size to an unsigned integer data type. More...

std::string	get_cl_signed_type_from_element_size (size_t element_size)
	Translates the element size to an signed integer data type. More...

std::string	get_cl_select_type_from_data_type (const DataType &dt)
	Translates a tensor data type to the appropriate OpenCL select type. More...

std::string	get_cl_dot8_acc_type_from_data_type (const DataType &dt)
	Translates a tensor data type to the appropriate OpenCL dot8 accumulator type. More...

std::string	get_data_size_from_data_type (const DataType &dt)
	Get the size of a data type in number of bits. More...

GPUTarget	get_target_from_device (const cl::Device &device)
	Helper function to get the GPU target from CL device. More...

CLVersion	get_cl_version (const cl::Device &device)
	Helper function to get the highest OpenCL version supported. More...

size_t	get_cl_image_pitch_alignment (const cl::Device &device)
	Helper function to get the cl_image pitch alignment in pixels. More...

bool	get_cl_non_uniform_work_group_supported (const cl::Device &device)
	Helper function to check whether non-uniform work group is supported. More...

bool	device_supports_extension (const cl::Device &device, const char *extension_name)
	Helper function to check whether a given extension is supported. More...

bool	fp16_supported (const cl::Device &device)
	Helper function to check whether the cl_khr_fp16 extension is supported. More...

bool	arm_non_uniform_workgroup_supported (const cl::Device &device)
	Helper function to check whether the arm_non_uniform_work_group_size extension is supported. More...

bool	dot8_supported (const cl::Device &device)
	Helper function to check whether the cl_arm_integer_dot_product_int8 extension is supported. More...

bool	dot8_acc_supported (const cl::Device &device)
	Helper function to check whether the cl_arm_integer_dot_product_accumulate_int8 extension is supported. More...

bool	cl_winograd_convolution_layer_supported (const Size2D &output_tile, const Size2D &kernel_size, DataLayout data_layout)
	This function checks if the Winograd configuration (defined through the output tile, kernel size and the data layout) is supported on OpenCL. More...

size_t	preferred_vector_width (const cl::Device &device, DataType dt)
	Helper function to get the preferred native vector width size for built-in scalar types that can be put into vectors. More...

bool	preferred_dummy_work_items_support (const cl::Device &device)
	Helper function to check if "dummy work-items" are preferred to have a power of two NDRange In case dummy work-items is enabled, it is OpenCL kernel responsibility to check if the work-item is out-of range or not. More...

bool	image2d_from_buffer_supported (const cl::Device &device)
	Helper function to check whether the cl_khr_image2d_from_buffer extension is supported. More...

cl::Kernel	create_kernel (const CLCompileContext &ctx, const std::string &kernel_name, const std::set< std::string > &build_opts=std::set< std::string >())
	Creates an opencl kernel using a compile context. More...

cl::NDRange	create_lws_hint_parallel_implementations (unsigned int input_dimension, unsigned int vector_size)
	Creates a suitable LWS hint object for parallel implementations. More...

bool	get_wbsm_support_info (const cl::Device &device)

void	set_wbsm (cl::Kernel &kernel, cl_int wbsm_hint)

bool	export_to_cl_image (const ITensorInfo *tensor)

void	set_unroll_with_pragma (CLBuildOptions &built_opts, std::initializer_list< int > values)

bool	arm_matrix_multiply_supported (const cl::Device &device)
	Helper function to check whether the cl_arm_matrix_multiply extension is supported. More...

bool	command_buffer_supported (const cl::Device &device)
	Check whether cl_khr_command_buffer extension is supported by the specified CL device. More...

bool	command_buffer_mutable_dispatch_supported (const cl::Device &device)
	Check whether cl_khr_command_buffer_mutable_dispatch extension is supported by the specified CL device. More...

bool	opencl_is_available ()
	Check if OpenCL is available. More...

template<typename T >
bool	operator== (const Dimensions< T > &lhs, const Dimensions< T > &rhs)
	Check that given dimensions are equal. More...

template<typename T >
bool	operator!= (const Dimensions< T > &lhs, const Dimensions< T > &rhs)
	Check that given dimensions are not equal. More...

template<typename... T>
void	ignore_unused (T &&...)
	Ignores unused arguments. More...

Status	create_error (ErrorCode error_code, std::string msg)
	Creates an error containing the error message. More...

Status	create_error_msg (ErrorCode error_code, const char func, const char file, int line, const char *msg)
	Creates an error and the error message. More...

void	throw_error (Status err)
	Throw an std::runtime_error. More...

const std::string &	string_from_target (GPUTarget target)
	Translates a given gpu device target to string. More...

GPUTarget	get_target_from_name (const std::string &device_name)
	Helper function to get the GPU target from a device name. More...

GPUTarget	get_arch_from_target (GPUTarget target)
	Helper function to get the GPU arch. More...

template<typename... Args>
bool	gpu_target_is_in (GPUTarget target_to_check, GPUTarget target, Args... targets)
	Helper function to check whether a gpu target is equal to the provided targets. More...

bool	gpu_target_is_in (GPUTarget target_to_check, GPUTarget target)
	Variant of gpu_target_is_in for comparing two targets. More...

template<typename L , typename... Ts>
void	execute_window_loop (const Window &w, L &&lambda_function, Ts &&...iterators)
	Iterate through the passed window, automatically adjusting the iterators and calling the lambda_functino for each element. More...

template<typename T >
void	permute (Dimensions< T > &dimensions, const PermutationVector &perm)
	Permutes given Dimensions according to a permutation vector. More...

void	permute (TensorShape &shape, const PermutationVector &perm)
	Permutes given TensorShape according to a permutation vector. More...

ValidRegion	calculate_valid_region_scale (const ITensorInfo &src_info, const TensorShape &dst_shape, InterpolationPolicy interpolate_policy, SamplingPolicy sampling_policy, bool border_undefined)
	Helper function to calculate the Valid Region for Scale. More...

Coordinates	index2coords (const TensorShape &shape, int index)
	Convert a linear index into n-dimensional coordinates. More...

int	coords2index (const TensorShape &shape, const Coordinates &coord)
	Convert n-dimensional coordinates into a linear index. More...

const std::map< DataLayout, std::vector< DataLayoutDimension > > &	get_layout_map ()
	Returns a static map used to find an index or dimension based on a data layout. More...

size_t	get_data_layout_dimension_index (const DataLayout &data_layout, const DataLayoutDimension &data_layout_dimension)
	Get the index of the given dimension. More...

DataLayoutDimension	get_index_data_layout_dimension (const DataLayout &data_layout, const size_t index)
	Get the DataLayoutDimension of a given index and layout. More...

Size2D	compute_winograd_convolution_tiles (const Size2D &in_dims, const Size2D &kernel_size, const Size2D &output_tile_size, const PadStrideInfo &conv_info)
	Calculate the number of output tiles required by Winograd Convolution layer. More...

template<typename T >
T	wrap_around (T x, T m)
	Wrap-around a number within the range 0 <= x < m. More...

Coordinates &	convert_negative_axis (Coordinates &coords, int max_value)
	Convert negative coordinates to positive in the range [0, num_dims_input]. More...

int	adjust_down (int required, int available, int step)
	Decrease `required` in steps of `step` until it's less than `available`. More...

int	adjust_up (int required, int available, int step)
	Increase `required` in steps of `step` until it's greater than `available`. More...

bool	operator== (const QuantizationInfo &lhs, const QuantizationInfo &rhs)
	Check whether two quantization info are equal. More...

bool	operator!= (const QuantizationInfo &lhs, const QuantizationInfo &rhs)
	Check whether two quantization info are not equal. More...

bool	operator== (const UniformQuantizationInfo &lhs, const UniformQuantizationInfo &rhs)
	Check whether two quantization info are equal. More...

bool	operator!= (const UniformQuantizationInfo &lhs, const UniformQuantizationInfo &rhs)
	Check whether two quantization info are not equal. More...

template<typename INFO_TYPE >
uint8_t	quantize_qasymm8 (float value, const INFO_TYPE &qinfo, RoundingPolicy rounding_policy=RoundingPolicy::TO_NEAREST_UP)
	Quantize a value given an unsigned 8-bit asymmetric quantization scheme. More...

template<typename INFO_TYPE >
int8_t	quantize_qasymm8_signed (float value, const INFO_TYPE &qinfo, RoundingPolicy rounding_policy=RoundingPolicy::TO_NEAREST_UP)
	Quantize a value given a signed 8-bit asymmetric quantization scheme. More...

int8_t	quantize_qsymm8 (float value, const QuantizationInfo &qinfo)
	Quantize a value given a 8-bit symmetric quantization scheme. More...

int8_t	quantize_qsymm8_per_channel (float value, const QuantizationInfo &qinfo, size_t channel_id=0)
	Quantize a value given a 8-bit symmetric per channel quantization scheme. More...

template<typename INFO_TYPE >
float	dequantize_qasymm8 (uint8_t value, const INFO_TYPE &qinfo)
	Dequantize a value given an unsigned 8-bit asymmetric quantization scheme. More...

template<typename INFO_TYPE >
float	dequantize_qasymm8_signed (int8_t value, const INFO_TYPE &qinfo)
	Dequantize a value given a signed 8-bit asymmetric quantization scheme. More...

float	dequantize (uint8_t value, float scale, int32_t offset)
	Dequantize a value given an 8-bit asymmetric quantization scheme. More...

float	dequantize_qsymm8 (int8_t value, const UniformQuantizationInfo &qinfo)
	Dequantize a value given a 8-bit symmetric quantization scheme. More...

float	dequantize (int8_t value, float scale)
	Dequantize a value given a 8-bit symmetric quantization scheme. More...

float	dequantize (int16_t value, float scale)
	Dequantize a value given a 16-bit symmetric quantization scheme. More...

float	dequantize (uint16_t value, float scale, int32_t offset)
	Dequantize a value given a 16-bit asymmetric quantization scheme. More...

int16_t	quantize_qsymm16 (float value, const UniformQuantizationInfo &qinfo, RoundingPolicy rounding_policy=RoundingPolicy::TO_NEAREST_UP)
	Quantize a value given a 16-bit symmetric quantization scheme. More...

float	dequantize_qsymm16 (int16_t value, const UniformQuantizationInfo &qinfo)
	Dequantize a value given a 16-bit symmetric quantization scheme. More...

int16_t	quantize_qsymm16 (float value, const QuantizationInfo &qinfo)
	Quantize a value given a 16-bit symmetric quantization scheme. More...

float	dequantize_qsymm16 (int16_t value, const QuantizationInfo &qinfo)
	Dequantize a value given a 16-bit symmetric quantization scheme. More...

uint16_t	quantize_qasymm16 (float value, const UniformQuantizationInfo &qinfo, RoundingPolicy rounding_policy=RoundingPolicy::TO_NEAREST_UP)
	Quantize a value given a 16-bit asymmetric quantization scheme. More...

float	dequantize_qasymm16 (uint16_t value, const UniformQuantizationInfo &qinfo)
	Dequantize a value given a 16-bit asymmetric quantization scheme. More...

uint16_t	quantize_qasymm16 (float value, const QuantizationInfo &qinfo)
	Quantize a value given a 16-bit asymmetric quantization scheme. More...

float	dequantize_qasymm16 (uint16_t value, const QuantizationInfo &qinfo)
	Dequantize a value given a 16-bit asymmetric quantization scheme. More...

UniformQuantizationInfo	compute_requantization_scale_offset (const UniformQuantizationInfo &uqinfo_in, const UniformQuantizationInfo &uqinfo_out)

int	round (float x, RoundingPolicy rounding_policy)
	Return a rounded value of x. More...

bool	operator== (const TensorInfo &lhs, const TensorInfo &rhs)
	Check whether two tensor info are equal. More...

bool	operator== (const ValidRegion &lhs, const ValidRegion &rhs)

int	interleave_by (const WeightFormat wf)

int	block_by (const WeightFormat wf)

bool	is_fixed_format (const WeightFormat &wf)

bool	is_fixed_format_fast_math (const WeightFormat &wf)

const std::string &	string_from_activation_func (const ActivationFunction &act)
	Translates a given activation function to a string. More...

const std::string &	string_from_data_layout (DataLayout dl)
	Convert a data layout identity into a string. More...

size_t	data_size_from_type (DataType data_type)
	The size in bytes of the data type. More...

size_t	element_size_from_data_type (DataType dt)
	The size in bytes of the data type. More...

DataType	data_type_from_format (Format format)
	Return the data type used by a given single-planar pixel format. More...

DataType	get_promoted_data_type (DataType dt)
	Return the promoted data type of a given data type. More...

std::tuple< PixelValue, PixelValue >	get_min_max (DataType dt)
	Compute the mininum and maximum values a data type can take. More...

const std::string &	string_from_data_type (DataType dt)
	Convert a data type identity into a string. More...

DataType	data_type_from_name (const std::string &name)
	Convert a string to DataType. More...

inline ::std::istream &	operator>> (::std::istream &stream, DataType &data_type)
	Input Stream operator for DataType. More...

bool	is_data_type_float (DataType dt)
	Check if a given data type is of floating point type. More...

bool	is_data_type_quantized (DataType dt)
	Check if a given data type is of quantized type. More...

bool	is_data_type_quantized_asymmetric (DataType dt)
	Check if a given data type is of asymmetric quantized type. More...

bool	is_data_type_quantized_asymmetric_signed (DataType dt)
	Check if a given data type is of asymmetric quantized signed type. More...

bool	is_data_type_quantized_symmetric (DataType dt)
	Check if a given data type is of symmetric quantized type. More...

bool	is_data_type_quantized_per_channel (DataType dt)
	Check if a given data type is of per channel type. More...

template<typename T >
bool	check_value_range (T val, DataType dt, QuantizationInfo qinfo=QuantizationInfo())
	Returns true if the value can be represented by the given data type. More...

std::string	cpu_impl_dt (const DataType &data_type)
	Returns the suffix string of CPU kernel implementation names based on the given data type. More...

size_t	pixel_size_from_format (Format format)
	The size in bytes of the pixel format. More...

int	plane_idx_from_channel (Format format, Channel channel)
	Return the plane index of a given channel given an input format. More...

int	channel_idx_from_format (Format format, Channel channel)
	Return the channel index of a given channel given an input format. More...

size_t	num_planes_from_format (Format format)
	Return the number of planes for a given format. More...

size_t	num_channels_from_format (Format format)
	Return the number of channels for a given single-planar pixel format. More...

const std::string &	string_from_format (Format format)
	Convert a tensor format into a string. More...

unsigned int	adjust_vec_size (unsigned int vec_size, size_t dim0)
	Returns the adjusted vector size in case it is less than the input's first dimension, getting rounded down to its closest valid vector size. More...

const std::string &	string_from_interpolation_policy (InterpolationPolicy policy)
	Translates a given interpolation policy to a string. More...

template<typename S , typename T >
constexpr auto	DIV_CEIL (S val, T m) -> decltype((val+m - 1)/m)
	Calculate the rounded up quotient of val / m. More...

template<typename S , typename T >
auto	ceil_to_multiple (S value, T divisor) -> decltype(((value+divisor - 1)/divisor) *divisor)
	Computes the smallest number larger or equal to value that is a multiple of divisor. More...

template<typename S , typename T >
auto	floor_to_multiple (S value, T divisor) -> decltype((value/divisor) *divisor)
	Computes the largest number smaller or equal to value that is a multiple of divisor. More...

std::string	lower_string (const std::string &val)
	Lower a given string. More...

std::string	upper_string (const std::string &val)
	Raise a given string to upper case. More...

std::string	float_to_string_with_full_precision (float val)
	Create a string with the float in full precision. More...

std::string	join (const std::vector< std::string > strings, const std::string &sep)
	Join a sequence of strings with separator `sep`. More...

std::string	read_file (const std::string &filename, bool binary)
	Load an entire file in memory. More...

template<typename T >
void	permute_strides (Dimensions< T > &dimensions, const PermutationVector &perm)
	Permutes the given dimensions according the permutation vector. More...

PadStrideInfo	calculate_same_pad (TensorShape input_shape, TensorShape weights_shape, PadStrideInfo conv_info, DataLayout data_layout=DataLayout::NCHW, const Size2D &dilation=Size2D(1u, 1u), const DimensionRoundingType &rounding_type=DimensionRoundingType::FLOOR)
	Calculate padding requirements in case of SAME padding. More...

std::pair< unsigned int, unsigned int >	deconvolution_output_dimensions (unsigned int in_width, unsigned int in_height, unsigned int kernel_width, unsigned int kernel_height, const PadStrideInfo &pad_stride_info)
	Returns expected width and height of the deconvolution's output tensor. More...

std::pair< unsigned int, unsigned int >	scaled_dimensions (int width, int height, int kernel_width, int kernel_height, const PadStrideInfo &pad_stride_info, const Size2D &dilation=Size2D(1U, 1U))
	Returns expected width and height of output scaled tensor depending on dimensions rounding mode. More...

std::pair< int, int >	scaled_dimensions_signed (int width, int height, int kernel_width, int kernel_height, const PadStrideInfo &pad_stride_info)
	Returns calculated width and height of output scaled tensor depending on dimensions rounding mode. More...

std::tuple< int, int, int >	scaled_3d_dimensions_signed (int width, int height, int depth, int kernel_width, int kernel_height, int kernel_depth, const Pooling3dLayerInfo &pool3d_info)
	Returns calculated width, height and depth of output scaled tensor depending on dimensions rounding mode. More...

bool	needs_serialized_reduction (ReductionOperation op, DataType dt, unsigned int axis)
	Check if the given reduction operation should be handled in a serial way. More...

QuantizationInfo	get_softmax_output_quantization_info (DataType input_type, bool is_log)
	Returns output quantization information for softmax layer. More...

std::pair< int32_t, int32_t >	get_quantized_activation_min_max (const ActivationLayerInfo &act_info, DataType data_type, UniformQuantizationInfo oq_info)
	Returns a pair of minimum and maximum values for a quantized activation. More...

const std::string &	string_from_channel (Channel channel)
	Convert a channel identity into a string. More...

const std::string &	string_from_border_mode (BorderMode border_mode)
	Translates a given border mode policy to a string. More...

const std::string &	string_from_norm_type (NormType type)
	Translates a given normalization type to a string. More...

const std::string &	string_from_pooling_type (PoolingType type)
	Translates a given pooling type to a string. More...

bool	is_pool_region_entirely_outside_input (const PoolingLayerInfo &info)
	Check if the pool region is entirely outside the input tensor. More...

bool	is_pool_3d_region_entirely_outside_input (const Pooling3dLayerInfo &info)
	Check if the 3d pool region is entirely outside the input tensor. More...

bool	is_symmetric (const Padding3D &info)
	Check if the 3D padding is symmetric i.e. More...

const std::string &	string_from_gemmlowp_output_stage (GEMMLowpOutputStageType output_stage)
	Translates a given GEMMLowp output stage to a string. More...

std::string	string_from_pixel_value (const PixelValue &value, const DataType data_type)
	Convert a PixelValue to a string, represented through the specific data type. More...

std::unordered_map< const ITensorInfo *, PaddingSize >	get_padding_info (std::initializer_list< const ITensorInfo * > infos)
	Stores padding information before configuring a kernel. More...

std::unordered_map< const ITensorInfo *, PaddingSize >	get_padding_info (std::initializer_list< const ITensor * > tensors)
	Stores padding information before configuring a kernel. More...

bool	has_padding_changed (const std::unordered_map< const ITensorInfo *, PaddingSize > &padding_map)
	Check if the previously stored padding info has changed after configuring a kernel. More...

size_t	num_of_elements_in_range (const float start, const float end, const float step)
	Returns the number of elements required to go from start to end with the wanted step. More...

template<typename... Ts>
arm_compute::Status	error_on_nullptr (const char function, const char file, const int line, Ts &&...pointers)
	Create an error if one of the pointers is a nullptr. More...

arm_compute::Status	error_on_mismatching_windows (const char function, const char file, const int line, const Window &full, const Window &win)
	Return an error if the passed window is invalid. More...

arm_compute::Status	error_on_invalid_subwindow (const char function, const char file, const int line, const Window &full, const Window &sub)
	Return an error if the passed subwindow is invalid. More...

arm_compute::Status	error_on_window_not_collapsable_at_dimension (const char function, const char file, const int line, const Window &full, const Window &window, const int dim)
	Return an error if the window can't be collapsed at the given dimension. More...

arm_compute::Status	error_on_coordinates_dimensions_gte (const char function, const char file, const int line, const Coordinates &pos, unsigned int max_dim)
	Return an error if the passed coordinates have too many dimensions. More...

arm_compute::Status	error_on_window_dimensions_gte (const char function, const char file, const int line, const Window &win, unsigned int max_dim)
	Return an error if the passed window has too many dimensions. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_mismatching_dimensions (const char function, const char file, int line, const Dimensions< T > &dim1, const Dimensions< T > &dim2, Ts &&...dims)
	Return an error if the passed dimension objects differ. More...

bool	has_format_horizontal_subsampling (Format format)
	Return true if the given format has horizontal subsampling. More...

bool	has_format_vertical_subsampling (Format format)
	Return true if the given format has vertical subsampling. More...

TensorShape	adjust_odd_shape (const TensorShape &shape, Format format)
	Adjust tensor shape size if width or height are odd for a given multi-planar format. More...

template<typename... Ts>
arm_compute::Status	error_on_tensors_not_even (const char function, const char file, int line, const Format &format, const ITensor *tensor1, Ts... tensors)
	Return an error if the passed tensor objects are not even. More...

TensorShape	calculate_subsampled_shape (const TensorShape &shape, Format format, Channel channel=Channel::UNKNOWN)
	Calculate subsampled shape for a given format and channel. More...

template<typename... Ts>
arm_compute::Status	error_on_tensors_not_subsampled (const char function, const char file, int line, const Format &format, const TensorShape &shape, const ITensor *tensor1, Ts... tensors)
	Return an error if the passed tensor objects are not sub-sampled. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_shapes (const char function, const char file, const int line, const ITensorInfo tensor_info_1, const ITensorInfo tensor_info_2, Ts... tensor_infos)
	Return an error if the passed two tensor infos have different shapes from the given dimension. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_shapes (const char function, const char file, const int line, const ITensor tensor_1, const ITensor tensor_2, Ts... tensors)
	Return an error if the passed two tensors have different shapes from the given dimension. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_shapes (const char function, const char file, const int line, unsigned int upper_dim, const ITensorInfo tensor_info_1, const ITensorInfo tensor_info_2, Ts... tensor_infos)
	Return an error if the passed two tensors have different shapes from the given dimension. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_shapes (const char function, const char file, const int line, unsigned int upper_dim, const ITensor tensor_1, const ITensor tensor_2, Ts... tensors)
	Return an error if the passed two tensors have different shapes from the given dimension. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_data_layouts (const char function, const char file, const int line, const ITensorInfo *tensor_info, Ts... tensor_infos)
	Return an error if the passed tensor infos have different data layouts. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_data_layouts (const char function, const char file, const int line, const ITensor *tensor, Ts... tensors)
	Return an error if the passed tensors have different data layouts. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_data_types (const char function, const char file, const int line, const ITensorInfo *tensor_info, Ts... tensor_infos)
	Return an error if the passed two tensor infos have different data types. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_data_types (const char function, const char file, const int line, const ITensor *tensor, Ts... tensors)
	Return an error if the passed two tensors have different data types. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_quantization_info (const char function, const char file, const int line, const ITensorInfo tensor_info_1, const ITensorInfo tensor_info_2, Ts... tensor_infos)
	Return an error if the passed tensor infos have different asymmetric quantized data types or different quantization info. More...

template<typename... Ts>
arm_compute::Status	error_on_mismatching_quantization_info (const char function, const char file, const int line, const ITensor tensor_1, const ITensor tensor_2, Ts... tensors)
	Return an error if the passed tensor have different asymmetric quantized data types or different quantization info. More...

template<typename T , typename F , typename... Fs>
void	error_on_format_not_in (const char function, const char file, const int line, const T *object, F &&format, Fs &&...formats)
	Throw an error if the format of the passed tensor/multi-image does not match any of the formats provided. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_data_type_not_in (const char function, const char file, const int line, const ITensorInfo *tensor_info, T &&dt, Ts &&...dts)
	Return an error if the data type of the passed tensor info does not match any of the data types provided. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_data_type_not_in (const char function, const char file, const int line, const ITensor *tensor, T &&dt, Ts &&...dts)
	Return an error if the data type of the passed tensor does not match any of the data types provided. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_data_layout_not_in (const char function, const char file, const int line, const ITensorInfo *tensor_info, T &&dl, Ts &&...dls)
	Return an error if the data layout of the passed tensor info does not match any of the data layouts provided. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_data_layout_not_in (const char function, const char file, const int line, const ITensor *tensor, T &&dl, Ts &&...dls)
	Return an error if the data layout of the passed tensor does not match any of the data layout provided. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_data_type_channel_not_in (const char function, const char file, const int line, const ITensorInfo *tensor_info, size_t num_channels, T &&dt, Ts &&...dts)
	Return an error if the data type or the number of channels of the passed tensor info does not match any of the data types and number of channels provided. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_data_type_channel_not_in (const char function, const char file, const int line, const ITensor *tensor, size_t num_channels, T &&dt, Ts &&...dts)
	Return an error if the data type or the number of channels of the passed tensor does not match any of the data types and number of channels provided. More...

arm_compute::Status	error_on_unsupported_fp16 (const char function, const char file, const int line, const ITensorInfo *tensor_info, bool is_fp16_supported)
	Return an error if the data type of the passed tensor info is FP16 and FP16 extension is not supported by the device. More...

arm_compute::Status	error_on_unsupported_fp16 (const char function, const char file, const int line, const ITensor *tensor, bool is_fp16_supported)
	Return an error if the data type of the passed tensor is FP16 and FP16 extension is not supported by the device. More...

arm_compute::Status	error_on_tensor_not_2d (const char function, const char file, const int line, const ITensor *tensor)
	Return an error if the tensor is not 2D. More...

arm_compute::Status	error_on_tensor_not_2d (const char function, const char file, const int line, const ITensorInfo *tensor)
	Return an error if the tensor info is not 2D. More...

template<typename T , typename... Ts>
arm_compute::Status	error_on_channel_not_in (const char function, const char file, const int line, T cn, T &&channel, Ts &&...channels)
	Return an error if the channel is not in channels. More...

arm_compute::Status	error_on_channel_not_in_known_format (const char function, const char file, const int line, Format fmt, Channel cn)
	Return an error if the channel is not in format. More...

arm_compute::Status	error_on_unconfigured_kernel (const char function, const char file, const int line, const IKernel *kernel)
	Return an error if the kernel is not configured. More...

arm_compute::Status	error_on_invalid_subtensor (const char function, const char file, const int line, const TensorShape &parent_shape, const Coordinates &coords, const TensorShape &shape)
	Return an error if if the coordinates and shape of the subtensor are within the parent tensor. More...

arm_compute::Status	error_on_invalid_subtensor_valid_region (const char function, const char file, const int line, const ValidRegion &parent_valid_region, const ValidRegion &valid_region)
	Return an error if the valid region of a subtensor is not inside the valid region of the parent tensor. More...

std::string	build_information ()
	Returns the arm_compute library build information. More...

void	swap (Window &lhs, Window &rhs)

bool	operator== (const Window &lhs, const Window &rhs)

Coordinates	convert_window_coord_to_position (const Window &w, const Coordinates &offset)
	Convert an offset in window steps into absolute coordinates. More...

template<typename L >
WindowIterator< L >	create_window_iterator (const Window &w, const Coordinates &start, const Coordinates &end, L &&lambda_function)
	Create a WindowIterator object. More...

arm_compute::DataLayout	data_layout_from_name (const std::string &name)
	Converts a string to a strong types enumeration DataLayout. More...

inline ::std::istream &	operator>> (::std::istream &stream, arm_compute::DataLayout &data_layout)
	Input Stream operator for DataLayout. More...

std::tuple< cl::Context, cl::Device, cl_int >	create_opencl_context_and_device (CLBackendType cl_backend_type)
	This function creates an OpenCL context and a device. More...

void	schedule_kernel_on_ctx (CLRuntimeContext ctx, ICLKernel kernel, bool flush=true)
	Schedules a kernel using the context if not nullptr else uses the legacy scheduling flow. More...

cl::Platform	select_preferable_platform (CLBackendType cl_backend_type)
	This function selects the OpenCL platform based on the backend type. More...

CLTunerMode	tuner_mode_from_name (const std::string &name)
	Converts a string to a strong types enumeration CLTunerMode. More...

inline ::std::istream &	operator>> (::std::istream &stream, CLTunerMode &tuner_mode)
	Input Stream operator for CLTunerMode. More...

void	save_program_cache_to_file (const std::string &filename="cache.bin")
	This function saves opencl kernels library to a file. More...

void	restore_program_cache_from_file (const std::string &filename="cache.bin")
	This function loads prebuilt opencl kernels from a file. More...

IContext *	get_internal (AclContext ctx)
	Extract internal representation of a Context. More...

IOperator *	get_internal (AclOperator op)
	Extract internal representation of an Operator. More...

IQueue *	get_internal (AclQueue queue)
	Extract internal representation of a Queue. More...

ITensorV2 *	get_internal (AclTensor tensor)
	Extract internal representation of a Tensor. More...

TensorPack *	get_internal (AclTensorPack pack)
	Extract internal representation of a TensoPack. More...

cl::Image2D	create_image2d_from_tensor (const ICLTensor *tensor, CLImage2DType image_type)
	Create a cl::Image2D object from a tensor. More...

cl::Image2D	create_image2d_from_buffer (const cl::Context &ctx, const cl::Buffer &buffer, const TensorShape &shape2d, DataType data_type, size_t image_row_pitch, CLImage2DType image_type)
	Create a cl::Image2D object from an OpenCL buffer. More...

void	handle_cl_error (const std::string &function_name, cl_int error_code)
	Check for CL error code and throw exception accordingly. More...

arm_compute::Status	error_on_unsupported_int64_base_atomics (const char function, const char file, const int line)
	Return an error if int64_base_atomics extension is not supported by the device. More...

cl::NDRange	get_default_lws_for_type (CLKernelType kernel_type, cl::NDRange gws)

void	enqueue (cl::CommandQueue &queue, ICLKernel &kernel, const Window &window, const cl::NDRange &lws_hint=CLKernelLibrary::get().default_ndrange(), bool use_dummy_work_items=false)
	Add the kernel to the command queue with the given window. More...

Status	error_on_unsupported_cpu_fp16 (const char function, const char file, const int line, const ITensorInfo *tensor_info)
	Return an error if the data type of the passed tensor info is FP16 and FP16 support is not compiled in. More...

Status	error_on_unsupported_cpu_bf16 (const char function, const char file, const int line, const ITensorInfo *tensor_info)
	Return an error if the data type of the passed tensor info is BFLOAT16 and BFLOAT16 support is not compiled in. More...

Status	error_on_unsupported_cpu_fp16 (const char function, const char file, const int line, const ITensor *tensor)
	Return an error if the data type of the passed tensor is FP16 and FP16 support is not compiled in. More...

Status	error_on_unsupported_cpu_bf16 (const char function, const char file, const int line, const ITensor *tensor)
	Return an error if the data type of the passed tensor is BFLOAT16 and BFLOAT16 support is not compiled in. More...

bool	auto_init_if_empty (ITensorInfo &info, const TensorShape &shape, int num_channels, DataType data_type, QuantizationInfo quantization_info=QuantizationInfo())
	Auto initialize the tensor info (shape, number of channels and data type) if the current assignment is empty. More...

bool	auto_init_if_empty (ITensorInfo &info_sink, const ITensorInfo &info_source)
	Auto initialize the tensor info using another tensor info. More...

bool	set_shape_if_empty (ITensorInfo &info, const TensorShape &shape)
	Set the shape to the specified value if the current assignment is empty. More...

bool	set_format_if_unknown (ITensorInfo &info, Format format)
	Set the format, data type and number of channels to the specified value if the current data type is unknown. More...

bool	set_data_type_if_unknown (ITensorInfo &info, DataType data_type)
	Set the data type and number of channels to the specified value if the current data type is unknown. More...

bool	set_data_layout_if_unknown (ITensorInfo &info, DataLayout data_layout)
	Set the data layout to the specified value if the current data layout is unknown. More...

bool	set_quantization_info_if_empty (ITensorInfo &info, QuantizationInfo quantization_info)
	Set the quantization info to the specified value if the current quantization info is empty and the data type of asymmetric quantized type. More...

int	offset_int_vec (int offset)

template<typename TensorType >
WorkspaceData< TensorType >	manage_workspace (const experimental::MemoryRequirements &mem_reqs, MemoryGroup &mgroup, ITensorPack &run_pack)

template<typename TensorType >
WorkspaceData< TensorType >	manage_workspace (const experimental::MemoryRequirements &mem_reqs, MemoryGroup &mgroup, ITensorPack &run_pack, ITensorPack &prep_pack)

template<typename TensorType >
void	release_prepare_tensors (WorkspaceData< TensorType > &workspace, ITensorPack &prep_pack)

template<typename TensorType >
void	release_temporaries (const experimental::MemoryRequirements &mem_reqs, WorkspaceData< TensorType > &workspace)
	Utility function to release tensors with lifetime marked as Prepare. More...

unsigned int	get_normalization_dimension_index (DataLayout layout, const NormalizationLayerInfo &info)
	Calculate the normalization dimension index for a given normalization type. More...

bool	has_holes (const ITensorInfo &info)
	Check if the tensor has any holes. More...

bool	has_holes (const ITensorInfo &info, size_t dimension)
	Check if the tensor has any holes. More...

template<typename T , typename... Ts>
Strides	compute_strides (const ITensorInfo &info, T stride_x, Ts &&...fixed_strides)
	Create a strides object based on the provided strides and the tensor dimensions. More...

template<typename... Ts>
Strides	compute_strides (const ITensorInfo &info)
	Create a strides object based on the tensor dimensions. More...

unsigned int	get_next_power_two (unsigned int x)
	Given an integer value, this function returns the next power of two. More...

Window	calculate_max_window (const ValidRegion &valid_region, const Steps &steps, bool skip_border, BorderSize border_size)

Window	calculate_max_window (const TensorShape &shape, const Steps &steps, bool skip_border, BorderSize border_size)

Window	calculate_max_enlarged_window (const ValidRegion &valid_region, const Steps &steps, BorderSize border_size)

Window	calculate_max_window_horizontal (const ValidRegion &valid_region, const Steps &steps, bool skip_border, BorderSize border_size)

std::pair< Window, size_t >	calculate_squashed_or_max_window (const ITensorInfo &src0, const ITensorInfo &src1)

std::pair< Window, size_t >	calculate_squashed_or_max_window (const ITensorInfo &src)

template<typename... Ts>
bool	update_window_and_padding (Window &win, Ts &&...patterns)
	Update window and padding size for each of the access patterns. More...

template<typename... Ts>
ValidRegion	intersect_valid_regions (const Ts &...regions)
	Intersect multiple valid regions. More...

void	colorconvert_rgb_to_rgbx (const void __restrict input, void __restrict output, const Window &win)
	Convert RGB to RGBX. More...

void	colorconvert_rgb_to_u8 (const void __restrict input, void __restrict output, const Window &win)
	Convert RGB to U8. More...

void	colorconvert_rgbx_to_rgb (const void input, void output, const Window &win)
	Convert RGBX to RGB. More...

template<bool yuyv, bool alpha>
void	colorconvert_yuyv_to_rgb (const void __restrict input, void __restrict output, const Window &win)
	Convert YUYV to RGB. More...

template<bool uv, bool alpha>
void	colorconvert_nv12_to_rgb (const void __restrict input, void __restrict output, const Window &win)
	Convert NV12 to RGB. More...

template<bool alpha>
void	colorconvert_iyuv_to_rgb (const void __restrict input, void __restrict output, const Window &win)
	Convert IYUV to RGB. More...

template<bool yuyv>
void	colorconvert_yuyv_to_nv12 (const void __restrict input, void __restrict output, const Window &win)
	Convert YUYV to NV12. More...

void	colorconvert_iyuv_to_nv12 (const void __restrict input, void __restrict output, const Window &win)
	Convert IYUV to NV12. More...

template<bool uv>
void	colorconvert_nv12_to_iyuv (const void __restrict input, void __restrict output, const Window &win)
	Convert NV12 to IYUV. More...

template<bool yuyv>
void	colorconvert_yuyv_to_iyuv (const void __restrict input, void __restrict output, const Window &win)
	Convert YUYV to IYUV. More...

template<bool uv>
void	colorconvert_nv12_to_yuv4 (const void __restrict input, void __restrict output, const Window &win)
	Convert NV12 to YUV4. More...

void	colorconvert_iyuv_to_yuv4 (const void __restrict input, void __restrict output, const Window &win)
	Convert IYUV to YUV4. More...

template<bool alpha>
void	colorconvert_rgb_to_nv12 (const void __restrict input, void __restrict output, const Window &win)
	Convert RGB to NV12. More...

template<bool alpha>
void	colorconvert_rgb_to_iyuv (const void __restrict input, void __restrict output, const Window &win)
	Convert RGB to IYUV. More...

template<bool alpha>
void	colorconvert_rgb_to_yuv4 (const void __restrict input, void __restrict output, const Window &win)
	Convert RGB to YUV4. More...

template<typename T >
void	run_reverse (const Window &window, const ITensor input, const ITensor axis, ITensor *output, bool use_inverted_axis)

template<RoundingPolicy round_policy = RoundingPolicy::TO_ZERO>
qasymm8x16_t	vmlaq_qasymm8 (qasymm8x16_t vd, float32x4_t vs, float32x4_t vo)
	Perform a multiply-accumulate on all 16 components of a QASYMM8 vector. More...

template<RoundingPolicy round_policy = RoundingPolicy::TO_ZERO>
qasymm8x16_signed_t	vmlaq_qasymm8_signed (qasymm8x16_signed_t vd, float32x4_t vs, float32x4_t vo)
	Perform a multiply-accumulate on all 16 components of a QASYMM8_SIGNED vector. More...

uint8x16_t	finalize_quantization (int32x4x4_t &in_s32, int result_fixedpoint_multiplier, int32_t result_shift, int32x4_t result_offset_after_shift_s32, uint8x16_t min_u8, uint8x16_t max_u8, bool is_bounded_relu)
	Performs final quantization step on 16 elements. More...

int8x16_t	finalize_quantization (int32x4x4_t &in_s32, int result_fixedpoint_multiplier, int32_t result_shift, int32x4_t result_offset_after_shift_s32, int8x16_t min_s8, int8x16_t max_s8, bool is_bounded_relu)
	Performs final quantization step on 16 elements. More...

int8x16_t	finalize_quantization_symm (int32x4x4_t &in_s32, const int32x4x4_t &result_fixedpoint_multiplier, const int32x4x4_t &result_shift, const int32x4_t &result_offset_after_shift_s32, const int8x16_t &min_s8, const int8x16_t &max_s8, const bool is_bounded_relu)
	Performs final quantization step on 16 elements for symmetric quantization. More...

uint8_t	finalize_quantization (int32_t in_value, int result_fixedpoint_multiplier, int32_t result_shift, int32_t result_offset_after_shift_s32, uint8_t min_u8, uint8_t max_u8, bool is_bounded_relu)
	Performs final quantization step on single element. More...

int8_t	finalize_quantization (int32_t in_value, int result_fixedpoint_multiplier, int32_t result_shift, int32_t result_offset_after_shift_s32, int8_t min_s8, int8_t max_s8, bool is_bounded_relu)
	Performs final quantization step on single element. More...

float32x4x2_t	vdequantize (const uint8x8_t &qv, const UniformQuantizationInfo &qi)
	Dequantize a neon vector holding 8 quantized values. More...

float32x4x2_t	vdequantize (const int8x8_t &qv, const UniformQuantizationInfo &qi)
	Dequantize a neon vector holding 8 singed quantized values. More...

float32x4x4_t	vdequantize (const uint8x16_t &qv, const UniformQuantizationInfo &qi)
	Dequantize a neon vector holding 16 quantized values. More...

float32x4x4_t	vdequantize (const int8x16_t &qv, const UniformQuantizationInfo &qi)
	Dequantize a neon vector holding 16 signed quantized values. More...

float32x4x4_t	vdequantize (const uint8x16_t &qv, float scale, int32_t offset)
	Dequantize following an asymmetric quantization scheme a neon vector holding 16 quantized values. More...

float32x4x4_t	vdequantize (const int8x16_t &qv, float scale, int32_t offset)
	Dequantize a vector of 16 values stored as signed asymmetric. More...

float32x4x4_t	vdequantize (const int8x16_t &qv, const float32x4x4_t vscale)
	Dequantize following symmetric quantization scheme a neon vector holding 16 quantized values. More...

float32x4x4_t	vdequantize (const int8x16_t &qv, float scale)
	Dequantize following a symmetric quantization scheme a neon vector holding 16 quantized values. More...

uint8x8_t	vquantize (const float32x4x2_t &qv, const UniformQuantizationInfo &qi)
	Quantize a neon vector holding 8 floating point values. More...

int8x8_t	vquantize_signed (const float32x4x2_t &qv, const UniformQuantizationInfo &qi)
	Quantize a neon vector holding 8 floating point values. More...

int32x4x4_t	vquantize_internal (const float32x4x4_t &qv, float scale, int32_t offset)

uint8x16_t	vquantize (const float32x4x4_t &qv, const UniformQuantizationInfo &qi)
	Quantize a neon vector holding 16 floating point values. More...

int8x16_t	vquantize_signed (const float32x4x4_t &qv, const UniformQuantizationInfo &qi)
	Signed quantize a neon vector holding 16 floating point values. More...

uint16x8x2_t	vquantize_qasymm16 (const float32x4x4_t &qv, const UniformQuantizationInfo &qi)
	Quantize to QASYMM16 a neon vector holding 16 floating point values. More...

float32x4x2_t	vmax2q_f32 (float32x4x2_t a, float32x4x2_t b)
	Compute lane-by-lane maximum between elements of a float vector with 4x2 elements. More...

float32x4_t	vfloorq_f32 (float32x4_t val)
	Calculate floor of a vector. More...

float32x4_t	vroundq_rte_f32 (float32x4_t val)
	Calculate round value of a vector to nearest with ties to even. More...

float32x2_t	vinvsqrt_f32 (float32x2_t x)
	Calculate inverse square root. More...

float32x4_t	vinvsqrtq_f32 (float32x4_t x)
	Calculate inverse square root. More...

float32x2_t	vinv_f32 (float32x2_t x)
	Calculate reciprocal. More...

float32x4_t	vinvq_f32 (float32x4_t x)
	Calculate reciprocal. More...

float32x4_t	vtaylor_polyq_f32 (float32x4_t x, const std::array< float32x4_t, 8 > &coeffs)
	Perform a 7th degree polynomial approximation using Estrin's method. More...

float32x4_t	vexpq_f32 (float32x4_t x)
	Calculate exponential. More...

float32x4_t	verfq_f32 (float32x4_t x)
	Calculate error function. More...

float32x4_t	vlogq_f32 (float32x4_t x)
	Calculate logarithm. More...

float32x4_t	vtanhq_f32 (float32x4_t val)
	Calculate hyperbolic tangent. More...

float32x4_t	vpowq_f32 (float32x4_t val, float32x4_t n)
	Calculate n power of a number. More...

int32x4_t	rounding_divide_by_pow2 (int32x4_t x, int32x4_t exponent)
	Round to the nearest division by a power-of-two using exponent. More...

int32x4_t	rounding_divide_by_pow2 (int32x4_t x, int exponent)
	Round to the nearest division by a power-of-two using exponent. More...

int32_t	rounding_divide_by_pow2 (int32_t x, int exponent)
	Round to the nearest division by a power-of-two using exponent. More...

float32x4x4_t	convert_uint8x16_to_float32x4x4 (const uint8x16_t &in)
	Converts from uint8x16 to float32x4x4_t. More...

float32x4x4_t	convert_int8x16_to_float32x4x4 (const int8x16_t &in)
	Converts from int8x16 to float32x4x4_t. More...

template<typename T >
float32x4x4_t	convert_to_float32x4x4 (const T &in)
	Converts to float32x4x4_t from the specified templated 16 elements vectors. More...

void	convert_float32x4x3_to_uint8x8x3 (const float32x4x3_t &in1, const float32x4x3_t &in2, uint8x8x3_t &out)
	Converts from two float32x4x3_t to just one uint8x8x3_t. More...

void	convert_float32x4x4_to_uint8x16 (const float32x4x4_t &in, uint8x16_t &out)
	Converts from two float32x4x4_t to just one uint8x16_t. More...

void	convert_float32x4x4_to_int8x16 (const float32x4x4_t &in, int8x16_t &out)
	Converts from float32x4x4_t to just one int8x16_t. More...

template<typename float_vec_type , typename int_vec_type >
int_vec_type	convert_float_to_int (const float_vec_type &in)
	Converts from float vector to integer vector. More...

template<typename float_vec_type , typename int_vec_type >
float_vec_type	convert_int_to_float (const int_vec_type &in)
	Converts from integer vector to float vector. More...

float32x4_t	vsinq_f32 (float32x4_t val)
	Calculate sine. More...

float32x2_t	vsin_f32 (float32x2_t val)
	Calculate sine. More...

float	vreduce (const float32x4_t &v)
	Reduce a vector to be a scalar by accumulating all lanes in the vector. More...

template<>
float32x4x4_t	convert_to_float32x4x4 (const uint8x16_t &in)

template<>
float32x4x4_t	convert_to_float32x4x4 (const int8x16_t &in)

template<>
uint8x16_t	convert_float_to_int< float32x4x4_t, uint8x16_t > (const float32x4x4_t &in)

template<>
float32x4x4_t	convert_int_to_float< float32x4x4_t, uint8x16_t > (const uint8x16_t &in)

template<>
int8x16_t	convert_float_to_int< float32x4x4_t, int8x16_t > (const float32x4x4_t &in)

template<>
float32x4x4_t	convert_int_to_float< float32x4x4_t, int8x16_t > (const int8x16_t &in)

template<bool is_bounded_relu>
int16x8_t	finalize_quantization_int16 (int32x4x2_t &in_s32, int result_fixedpoint_multiplier, int32_t result_shift, int16x8_t min_s16, int16x8_t max_s16)
	Performs final quantization step on 8 signed 16-bit elements. More...

template<bool is_bounded_relu>
int16_t	finalize_quantization_int16 (int32_t in_value, int result_fixedpoint_multiplier, int32_t result_shift, int16_t min_s16, int16_t max_s16)
	Performs final quantization step on single signed 16-bit element. More...

float32x4x2_t	vdequantize_int16 (const int16x8_t &qv, float scale)
	Dequantize a neon vector holding 8 16-bit quantized values. More...

int16x8_t	vquantize_int16 (const float32x4x2_t &qv, float scale)
	Quantize a neon vector holding 8 floating point values. More...

float32x4x4_t	vdequantize (const int16x8x2_t &qv, const UniformQuantizationInfo &qi)
	Dequantize a neon vector holding 16 16-bit quantized values. More...

qsymm16x8x2_t	vquantize_qsymm16 (const float32x4x4_t &qv, const UniformQuantizationInfo &qi)
	Quantize a neon vector holding 16 floating point values. More...

int32x4x2_t	multiply_by_quantized_multiplier_2row (int32x4x2_t input, int32_t qmul, int32_t shift)
	Multiply a neon vector using quantized multiplier and shift. More...

template<typename T , unsigned int S, unsigned int dim, bool do_2D_norm>
void	normalize_float (const Window &window, const ITensor in, const ITensor in_squared, ITensor *out, NormalizationLayerInfo ninfo)
	Function to perform normalization depending on the given template dimension. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ClComponentElementwiseBinary::Attributes::ElementwiseOp &op)
	Formatted output of the pute::experimental::dynamic_fusion::ClComponentElementwiseBinary::Attributes::ElementwiseOp type. More...

std::string	to_string (const ClComponentElementwiseBinary::Attributes::ElementwiseOp &op)
	Formatted output of the arm_compute::experimental::dynamic_fusion::ClComponentElementwiseBinary::Attributes::ElementwiseOp type. More...

Status	validate (const ITensorInfo scores_in, const ITensorInfo boxes_in, const ITensorInfo batch_splits_in, const ITensorInfo scores_out, const ITensorInfo boxes_out, const ITensorInfo classes, const ITensorInfo batch_splits_out, const ITensorInfo keeps, const ITensorInfo *keeps_size, const BoxNMSLimitInfo info)

inline ::std::istream &	operator>> (::std::istream &is, BorderMode &mode)
	Formatted input of the BorderMode type. More...

template<typename T >
std::string	to_string_if_not_null (T *arg)
	Formatted output if arg is not null. More...

template<typename T >
std::string	to_string (const T &val)
	Fallback method: try to use std::to_string: More...

template<typename T >
::std::ostream &	operator<< (::std::ostream &os, const std::vector< T > &args)
	Formatted output of a vector of objects. More...

template<typename T >
std::string	to_string (const std::vector< T > &args)
	Formatted output of a vector of objects. More...

template<typename T >
inline ::std::ostream &	operator<< (::std::ostream &os, const Dimensions< T > &dimensions)
	Formatted output of the Dimensions type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const RoundingPolicy &rounding_policy)
	Formatted output of the RoundingPolicy type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const WeightsInfo &weights_info)
	Formatted output of the WeightsInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ROIPoolingLayerInfo &pool_info)
	Formatted output of the ROIPoolingInfo type. More...

std::string	to_string (const ROIPoolingLayerInfo &pool_info)
	Formatted output of the ROIPoolingInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMKernelInfo &gemm_info)
	Formatted output of the GEMMKernelInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMLHSMatrixInfo &gemm_info)
	Formatted output of the GEMMLHSMatrixInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMRHSMatrixInfo &gemm_info)
	Formatted output of the GEMMRHSMatrixInfo type. More...

std::string	to_string (const GEMMRHSMatrixInfo &gemm_info)
	Formatted output of the GEMMRHSMatrixInfo type. More...

std::string	to_string (const GEMMLHSMatrixInfo &gemm_info)
	Formatted output of the GEMMLHSMatrixInfo type. More...

std::string	to_string (const GEMMKernelInfo &gemm_info)
	Formatted output of the GEMMKernelInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const BoundingBoxTransformInfo &bbox_info)
	Formatted output of the BoundingBoxTransformInfo type. More...

std::string	to_string (const BoundingBoxTransformInfo &bbox_info)
	Formatted output of the BoundingBoxTransformInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ComputeAnchorsInfo &anchors_info)
	Formatted output of the ComputeAnchorsInfo type. More...

std::string	to_string (const ComputeAnchorsInfo &anchors_info)
	Formatted output of the ComputeAnchorsInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GenerateProposalsInfo &proposals_info)
	Formatted output of the GenerateProposalsInfo type. More...

std::string	to_string (const GenerateProposalsInfo &proposals_info)
	Formatted output of the GenerateProposalsInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const QuantizationInfo &qinfo)
	Formatted output of the QuantizationInfo type. More...

std::string	to_string (const QuantizationInfo &quantization_info)
	Formatted output of the QuantizationInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ActivationLayerInfo::ActivationFunction &act_function)
	Formatted output of the activation function type. More...

std::string	to_string (const arm_compute::ActivationLayerInfo &info)
	Formatted output of the activation function info type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ActivationLayerInfo *info)
	Formatted output of the activation function info. More...

std::string	to_string (const arm_compute::ActivationLayerInfo::ActivationFunction &function)
	Formatted output of the activation function type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const NormType &norm_type)
	Formatted output of the NormType type. More...

std::string	to_string (const arm_compute::NormalizationLayerInfo &info)
	Formatted output of NormalizationLayerInfo. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const NormalizationLayerInfo &info)
	Formatted output of NormalizationLayerInfo. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PoolingType &pool_type)
	Formatted output of the PoolingType type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PoolingLayerInfo &info)
	Formatted output of PoolingLayerInfo. More...

std::string	to_string (const RoundingPolicy &rounding_policy)
	Formatted output of RoundingPolicy. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DataLayout &data_layout)
	[Print DataLayout type] More...

std::string	to_string (const arm_compute::DataLayout &data_layout)
	Formatted output of the DataLayout type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DataLayoutDimension &data_layout_dim)
	[Print DataLayout type] More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DataType &data_type)
	Formatted output of the DataType type. More...

std::string	to_string (const arm_compute::DataType &data_type)
	Formatted output of the DataType type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Format &format)
	Formatted output of the Format type. More...

std::string	to_string (const Format &format)
	Formatted output of the Format type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Channel &channel)
	Formatted output of the Channel type. More...

std::string	to_string (const Channel &channel)
	Formatted output of the Channel type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const BorderMode &mode)
	Formatted output of the BorderMode type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const BorderSize &border)
	Formatted output of the BorderSize type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PaddingList &padding)
	Formatted output of the PaddingList type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Multiples &multiples)
	Formatted output of the Multiples type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const InterpolationPolicy &policy)
	Formatted output of the InterpolationPolicy type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const SamplingPolicy &policy)
	Formatted output of the SamplingPolicy type. More...

inline ::std::ostream &	operator<< (std::ostream &os, const ITensorInfo *info)
	Formatted output of the ITensorInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const TensorInfo &info)
	Formatted output of the const TensorInfo& type. More...

std::string	to_string (const TensorInfo &info)
	Formatted output of the const TensorInfo& type. More...

std::string	to_string (const ITensorInfo &info)
	Formatted output of the const ITensorInfo& type. More...

std::string	to_string (const ITensorInfo *info)
	Formatted output of the const ITensorInfo* type. More...

std::string	to_string (ITensorInfo *info)
	Formatted output of the ITensorInfo* type. More...

std::string	to_string (const ITensor *tensor)
	Formatted output of the ITensorInfo type obtained from const ITensor* type. More...

std::string	to_string (ITensor *tensor)
	Formatted output of the ITensorInfo type obtained from the ITensor* type. More...

std::string	to_string (ITensor &tensor)
	Formatted output of the ITensorInfo type obtained from the ITensor& type. More...

template<typename T >
std::string	to_string (const Dimensions< T > &dimensions)
	Formatted output of the Dimensions type. More...

std::string	to_string (const Strides &stride)
	Formatted output of the Strides type. More...

std::string	to_string (const TensorShape &shape)
	Formatted output of the TensorShape type. More...

std::string	to_string (const Coordinates &coord)
	Formatted output of the Coordinates type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMReshapeInfo &info)
	Formatted output of the GEMMReshapeInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMInfo &info)
	Formatted output of the GEMMInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Window::Dimension &dim)
	Formatted output of the Window::Dimension type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Window &win)
	Formatted output of the Window type. More...

std::string	to_string (const WeightsInfo &info)
	Formatted output of the WeightsInfo type. More...

std::string	to_string (const GEMMReshapeInfo &info)
	Formatted output of the GEMMReshapeInfo type. More...

std::string	to_string (const GEMMInfo &info)
	Formatted output of the GEMMInfo type. More...

std::string	to_string (const Window::Dimension &dim)
	Formatted output of the Window::Dimension type. More...

std::string	to_string (const Window &win)
	Formatted output of the Window& type. More...

std::string	to_string (Window *win)
	Formatted output of the Window* type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Rectangle &rect)
	Formatted output of the Rectangle type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PaddingMode &mode)
	Formatted output of the PaddingMode type. More...

std::string	to_string (const PaddingMode &mode)
	Formatted output of the PaddingMode type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PadStrideInfo &pad_stride_info)
	Formatted output of the PadStrideInfo type. More...

std::string	to_string (const PadStrideInfo &pad_stride_info)
	Formatted output of the PadStrideInfo type. More...

std::string	to_string (const BorderMode &mode)
	Formatted output of the BorderMode type. More...

std::string	to_string (const BorderSize &border)
	Formatted output of the BorderSize type. More...

std::string	to_string (const PaddingList &padding)
	Formatted output of the PaddingList type. More...

std::string	to_string (const Multiples &multiples)
	Formatted output of the Multiples type. More...

std::string	to_string (const InterpolationPolicy &policy)
	Formatted output of the InterpolationPolicy type. More...

std::string	to_string (const SamplingPolicy &policy)
	Formatted output of the SamplingPolicy type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ConvertPolicy &policy)
	Formatted output of the ConvertPolicy type. More...

std::string	to_string (const ConvertPolicy &policy)

inline ::std::ostream &	operator<< (::std::ostream &os, const ArithmeticOperation &op)
	Formatted output of the ArithmeticOperation type. More...

std::string	to_string (const ArithmeticOperation &op)
	Formatted output of the Arithmetic Operation. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ReductionOperation &op)
	Formatted output of the Reduction Operations. More...

std::string	to_string (const ReductionOperation &op)
	Formatted output of the Reduction Operations. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ComparisonOperation &op)
	Formatted output of the Comparison Operations. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ElementWiseUnary &op)
	Formatted output of the Elementwise unary Operations. More...

std::string	to_string (const ComparisonOperation &op)
	Formatted output of the Comparison Operations. More...

std::string	to_string (const ElementWiseUnary &op)
	Formatted output of the Elementwise unary Operations. More...

std::string	to_string (const NormType &type)
	Formatted output of the Norm Type. More...

std::string	to_string (const PoolingType &type)
	Formatted output of the Pooling Type. More...

std::string	to_string (const PoolingLayerInfo &info)
	Formatted output of the Pooling Layer Info. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Size3D &size)
	Formatted output of the Size3D type. More...

std::string	to_string (const Size3D &type)
	Formatted output of the Size3D type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Padding3D &padding3d)
	Formatted output of the Padding3D type. More...

std::string	to_string (const Padding3D &padding3d)
	Converts a Padding3D to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DimensionRoundingType &rounding_type)
	Formatted output of the DimensionRoundingType type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Pooling3dLayerInfo &info)
	Formatted output of the Pooling 3d Layer Info. More...

std::string	to_string (const Pooling3dLayerInfo &info)
	Formatted output of the Pooling 3d Layer Info. More...

std::string	to_string (const PriorBoxLayerInfo &info)
	Formatted output of the PriorBoxLayerInfo. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Size2D &size)
	Formatted output of the Size2D type. More...

std::string	to_string (const Size2D &type)
	Formatted output of the Size2D type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ConvolutionMethod &conv_method)
	Formatted output of the ConvolutionMethod type. More...

std::string	to_string (const ConvolutionMethod &conv_method)
	Formatted output of the ConvolutionMethod type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GPUTarget &gpu_target)
	Formatted output of the GPUTarget type. More...

std::string	to_string (const GPUTarget &gpu_target)
	Formatted output of the GPUTarget type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DetectionWindow &detection_window)
	Formatted output of the DetectionWindow type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DetectionOutputLayerCodeType &detection_code)
	Formatted output of the DetectionOutputLayerCodeType type. More...

std::string	to_string (const DetectionOutputLayerCodeType &detection_code)
	Formatted output of the DetectionOutputLayerCodeType type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DetectionOutputLayerInfo &detection_info)
	Formatted output of the DetectionOutputLayerInfo type. More...

std::string	to_string (const DetectionOutputLayerInfo &detection_info)
	Formatted output of the DetectionOutputLayerInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const DetectionPostProcessLayerInfo &detection_info)
	Formatted output of the DetectionPostProcessLayerInfo type. More...

std::string	to_string (const DetectionPostProcessLayerInfo &detection_info)
	Formatted output of the DetectionPostProcessLayerInfo type. More...

std::string	to_string (const DetectionWindow &detection_window)
	Formatted output of the DetectionWindow type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PriorBoxLayerInfo &info)
	Formatted output of PriorBoxLayerInfo. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const WinogradInfo &info)
	Formatted output of the WinogradInfo type. More...

std::string	to_string (const WinogradInfo &type)

std::string	to_string (const CLTunerMode val)
	Convert a CLTunerMode value to a string. More...

std::string	to_string (CLGEMMKernelType val)
	Converts a CLGEMMKernelType to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const CLTunerMode &val)
	[Print CLTunerMode type] More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ConvolutionInfo &conv_info)
	Formatted output of the ConvolutionInfo type. More...

std::string	to_string (const ConvolutionInfo &info)
	Converts a ConvolutionInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const FullyConnectedLayerInfo &layer_info)
	Formatted output of the FullyConnectedLayerInfo type. More...

std::string	to_string (const FullyConnectedLayerInfo &info)
	Converts a FullyConnectedLayerInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMLowpOutputStageType &gemm_type)
	Formatted output of the GEMMLowpOutputStageType type. More...

std::string	to_string (const GEMMLowpOutputStageType &gemm_type)
	Converts a GEMMLowpOutputStageType to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const GEMMLowpOutputStageInfo &gemm_info)
	Formatted output of the GEMMLowpOutputStageInfo type. More...

std::string	to_string (const GEMMLowpOutputStageInfo &gemm_info)
	Converts a GEMMLowpOutputStageInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Conv2dInfo &conv_info)
	Formatted output of the Conv2dInfo type. More...

std::string	to_string (const Conv2dInfo &conv_info)
	Converts a Conv2dInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const PixelValue &pixel_value)
	Formatted output of the PixelValue type. More...

std::string	to_string (const PixelValue &pixel_value)
	Converts a PixelValue to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const ScaleKernelInfo &scale_info)
	Formatted output of the ScaleKernelInfo type. More...

std::string	to_string (const ScaleKernelInfo &scale_info)
	Converts a ScaleKernelInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const FFTDirection &fft_dir)
	Formatted output of the FFTDirection type. More...

std::string	to_string (const FFTDirection &fft_dir)
	Converts a FFT1DInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const FFT1DInfo &fft1d_info)
	Formatted output of the FFT1DInfo type. More...

std::string	to_string (const FFT1DInfo &fft1d_info)
	Converts a FFT1DInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const FFT2DInfo &fft2d_info)
	Formatted output of the FFT2DInfo type. More...

std::string	to_string (const FFT2DInfo &fft2d_info)
	Converts a FFT2DInfo to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Coordinates2D &coord_2d)
	Formatted output of the Coordinates2D type. More...

std::string	to_string (const Coordinates2D &coord_2d)
	Converts a Coordinates2D to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const FuseBatchNormalizationType &fuse_type)
	Formatted output of the FuseBatchNormalizationType type. More...

std::string	to_string (const FuseBatchNormalizationType &fuse_type)
	Converts a FuseBatchNormalizationType to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const SoftmaxKernelInfo &info)
	Formatted output of the SoftmaxKernelInfo type. More...

std::string	to_string (const SoftmaxKernelInfo &info)
	Converts a SoftmaxKernelInfo to string. More...

template<typename T >
::std::ostream &	operator<< (::std::ostream &os, const LSTMParams< T > &lstm_params)
	Formatted output of the ScaleKernelInfo type. More...

template<typename T >
std::string	to_string (const LSTMParams< T > &lstm_params)
	Converts a LSTMParams to string. More...

std::string	to_string (const uint8_t num)
	Converts a LSTMParams to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const NMSType &nms_type)
	Available non maxima suppression types. More...

std::string	to_string (const NMSType nms_type)
	Converts a NMSType to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const BoxNMSLimitInfo &info)
	Formatted output of the BoxNMSLimitInfo type. More...

std::string	to_string (const BoxNMSLimitInfo &info)
	Converts a BoxNMSLimitInfo to string. More...

std::string	to_string (const DimensionRoundingType &rounding_type)
	Converts a DimensionRoundingType to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Conv3dInfo &conv3d_info)
	Formatted output of the Conv3dInfo type. More...

std::string	to_string (const Conv3dInfo &conv3d_info)
	Formatted output of the Conv3dInfo type. More...

std::string	to_string (const WeightFormat wf)
	Formatted output of the arm_compute::WeightFormat type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const arm_compute::WeightFormat &wf)
	Formatted output of the arm_compute::WeightFormat type. More...

std::string	to_string (const std::tuple< TensorShape, TensorShape, arm_compute::WeightFormat > values)
	Formatted output of the std::tuple<TensorShape, TensorShape, arm_compute::WeightFormat> tuple. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const Padding2D &padding2d)
	Formatted output of the Padding2D type. More...

std::string	to_string (const Padding2D &padding2d)
	Converts a Padding2D to string. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::Pool2dAttributes &pool2d_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::Pool2dAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::Pool2dAttributes &pool2d_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::Pool2dAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::GpuPool2dSettings &settings)
	Formatted output of the arm_compute::experimental::dynamic_fusion::GpuPool2dSettings type. More...

std::string	to_string (const experimental::dynamic_fusion::GpuPool2dSettings &settings)
	Formatted output of the arm_compute::experimental::dynamic_fusion::GpuPool2dSettings type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::Conv2dAttributes &conv2d_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::Conv2dAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::Conv2dAttributes &conv2d_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::Conv2dAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::CastAttributes &cast_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::CastAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::CastAttributes &cast_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::CastAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::DepthwiseConv2dAttributes &dw_conv2d_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::DepthwiseConv2dAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::DepthwiseConv2dAttributes &dw_conv2d_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::DepthwiseConv2dAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::ClampAttributes &clamp_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::ClampAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::ClampAttributes &clamp_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::ClampAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::ResizeAttributes &resize_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::ResizeAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::ResizeAttributes &resize_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::ResizeAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const experimental::dynamic_fusion::SoftmaxAttributes &softmax_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::SoftmaxAttributes type. More...

std::string	to_string (const experimental::dynamic_fusion::SoftmaxAttributes &softmax_attr)
	Formatted output of the arm_compute::experimental::dynamic_fusion::SoftmaxAttributes type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const arm_compute::MatMulInfo &matmul_info)
	Formatted output of the arm_compute::MatMulInfo type. More...

std::string	to_string (const arm_compute::MatMulInfo &matmul_info)
	Formatted output of the arm_compute::MatMulInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const arm_compute::MatMulKernelInfo &matmul_info)
	Formatted output of the arm_compute::MatMulKernelInfo type. More...

std::string	to_string (const arm_compute::MatMulKernelInfo &matmul_info)
	Formatted output of the arm_compute::MatMulKernelInfo type. More...

inline ::std::ostream &	operator<< (::std::ostream &os, const arm_compute::CpuMatMulSettings &settings)
	Formatted output of the arm_compute::CpuMatMulSettings type. More...

std::string	to_string (const arm_compute::CpuMatMulSettings &settings)
	Formatted output of the arm_compute::CpuMatMulSettings type. More...

Variables
constexpr size_t	MAX_DIMS = 6
	Constant value used to indicate maximum dimensions of a Window, TensorShape and Coordinates. More...

const std::array< float32x4_t, 8 >	log_tab
	Logarithm polynomial coefficients. More...

constexpr float	te_sin_coeff2 = 0.166666666666f
	Sin polynomial coefficients. More...

constexpr float	te_sin_coeff3 = 0.05f

constexpr float	te_sin_coeff4 = 0.023809523810f

constexpr float	te_sin_coeff5 = 0.013888888889f

const std::array< ErfLutEntry< float >, 513 >	erf_f32_lut
	The lookup table for FP32 erf(x) calculation. More...

Detailed Description

Synced with tests/validation/dynamic_fusion/gpu/cl/DirectConv2d.cpp Please check there for any differences in the coverage.

A DotMLGO file parser (LL(k) parser)

[CLReshapeLayer snippet]

[ClReshapeKernel Kernel]

[NEReshapeLayerKernel Kernel]

This file contains all available output stages for GEMMLowp.

This file contains all available output stages for GEMMLowp on OpenCL.

The following symbols have been moved to: half PermutationVector Format DataType DataLayout DataLayoutDimension PadStrideInfo WeightFormat Channel DimensionRoundingType.

CoreTypes.h groups together essential small types that are used across functions.

SPDX-License-Identifier: MIT

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

The following symbols have been moved to: ActivationFunction ActivationLayerInfo The following symbols have been moved to: ConvolutionInfo The following symbols have been moved to: FullyConnectedLayerInfo The following symbols have been moved to: GEMMLowpOutputStageType GEMMLowpOutputStageInfo GEMMInfo The following symbols have been moved to: MatMulInfo

In gemmlowp, the "output stage" is the process that takes a final int32 accumulator value (the output of CLGEMMLowpMatrixMultiplyCore), and processes it to obtain the final QASYMM8/QASYMM8_SIGNED value.

More information about the GEMMLowp output stage can be found at https://github.com/google/gemmlowp/blob/master/doc/output.md

In gemmlowp, the "output stage" is the process that takes a final int32 accumulator value (the output of NEGEMMLowpMatrixMultiplyCore), and processes it to obtain the final ASYMM8 value.

More information about the GEMMLowp output stage can be found at https://github.com/google/gemmlowp/blob/master/doc/output.md

The grammar of DotMLGO is defined as the following ENBF:

delim = "," | "\n"; // Note that delimiters are omitted from the definition below

mlgo = header, heuristics-table, {heuristic-tree};

header = "<header>", gemm-version, ip-type, "</header>"; gemm-version = "gemm-version", "[", int, int, int, "]"; ip-type = "ip-type", ("gpu" | "cpu");

heiristics-table = "<heuristics-table>", {heuristics-table-entry}, "</heuristics-table>"; heuristics-table-entry = entry-id, ip-name, num-cores, data-type, gpu-priority, gpu-behavior, heuristic-type, free-vars; entry-id = int; ip-name = char-sequence; num-cores = int; data-type = "f32" | "f16" | "qasymm8"; gpu-priority = "best-performance" | "best-memory-usage"; gpu-behavior = "static" | "dynamic"; heuristic-type = "gemm-type" | "gemm-config-native" | "gemm-config-reshaped-only-rhs" | "gemm-config-reshaped"; free-vars = "[", {char-sequence}, "]";

heuristic-tree = "<heuristic", entry-id, ">", {tree-node}, "</heuristic>"; tree-node = branch-node | leaf-node; branch-node = "b", entry-id, lhs-type, lhs-value, conditional-op, rhs-type, rhs-value, true-node, false-node; lhs-type = comparator-type; lhs-value = comparator-value; rhs-type = comparator-type; rhs-value = comparator-value; comparator-type = "var" | "num" | "enum"; comparator-value = char-sequence | float; conditional-op = "<" | "<=" | "==" | ">=" | ">"; true-node = entry-id; false-node = entry-id; leaf-node = "l", entry-id, heuristic-type, leaf-value; leaf-value = gemm-type | gemm-config-native | gemm-config-reshaped-only-rhs | gemm-config-reshaped gemm-type = "native" | "reshaped-only-rhs" | "reshaped"; gemm-config-native = "[", int, int, int, "]"; gemm-config-reshaped-only-rhs = "[", int, int, int, int, bool, bool, bool, "]"; gemm-config-reshaped = "[", int, int, int, int, int, bool, bool, bool, bool, "]";

Typedef Documentation

◆ BBox

using BBox = std::array<float, 4>

Definition at line 776 of file Types.h.

◆ BiStrides

using BiStrides = Coordinates

Bidirectional strides.

Definition at line 81 of file Types.h.

◆ CLEqual

using CLEqual = CLComparisonStatic<ComparisonOperation::Equal>

Basic function to run equal comparison.

Definition at line 131 of file CLComparison.h.

◆ CLFloatArray

using CLFloatArray = CLArray<cl_float>

OpenCL Array of floats.

Definition at line 117 of file CLArray.h.

◆ CLGreater

using CLGreater = CLComparisonStatic<ComparisonOperation::Greater>

Basic function to run greater comparison.

Definition at line 135 of file CLComparison.h.

◆ CLGreaterEqual

using CLGreaterEqual = CLComparisonStatic<ComparisonOperation::GreaterEqual>

Basic function to run greater-equal comparison.

Definition at line 137 of file CLComparison.h.

◆ CLImage

using CLImage = CLTensor

OpenCL Image.

Definition at line 104 of file CLTensor.h.

◆ CLInt16Array

using CLInt16Array = CLArray<cl_short>

OpenCL Array of int16s.

Definition at line 113 of file CLArray.h.

◆ CLInt32Array

using CLInt32Array = CLArray<cl_int>

OpenCL Array of int32s.

Definition at line 115 of file CLArray.h.

◆ CLLess

using CLLess = CLComparisonStatic<ComparisonOperation::Less>

Basic function to run less comparison.

Definition at line 139 of file CLComparison.h.

◆ CLLessEqual

using CLLessEqual = CLComparisonStatic<ComparisonOperation::LessEqual>

Basic function to run less-equal comparison.

Definition at line 141 of file CLComparison.h.

◆ CLLogSoftmaxLayer

using CLLogSoftmaxLayer = CLSoftmaxLayerGeneric<true>

Definition at line 110 of file CLSoftmaxLayer.h.

◆ CLNotEqual

using CLNotEqual = CLComparisonStatic<ComparisonOperation::NotEqual>

Basic function to run not equal comparison.

Definition at line 133 of file CLComparison.h.

◆ CLSoftmaxLayer

using CLSoftmaxLayer = CLSoftmaxLayerGeneric<false>

Definition at line 109 of file CLSoftmaxLayer.h.

◆ CLUInt16Array

using CLUInt16Array = CLArray<cl_ushort>

OpenCL Array of uint16s.

Definition at line 109 of file CLArray.h.

◆ CLUInt32Array

using CLUInt32Array = CLArray<cl_uint>

OpenCL Array of uint32s.

Definition at line 111 of file CLArray.h.

◆ CLUInt8Array

using CLUInt8Array = CLArray<cl_uchar>

OpenCL Array of uint8s.

Definition at line 107 of file CLArray.h.

◆ CropInfo

using CropInfo = Padding2D

Class for holding information related to cropping.

Definition at line 2017 of file Types.h.

◆ float32_t

typedef float float32_t

Definition at line 33 of file CpuTypes.h.

◆ FloatArray

using FloatArray = Array<float>

Array of floats.

Definition at line 72 of file Array.h.

◆ GroupMappings

using GroupMappings = std::map<size_t, MemoryMappings>

A map of the groups and memory mappings.

Definition at line 48 of file Types.h.

◆ half

using half = half_float::half

16-bit floating point type

Definition at line 36 of file CoreTypes.h.

◆ ICLFloatArray

using ICLFloatArray = ICLArray<cl_float>

Interface for OpenCL Array of floats.

Definition at line 126 of file ICLArray.h.

◆ ICLImage

using ICLImage = ICLTensor

Definition at line 114 of file ICLTensor.h.

◆ ICLInt16Array

using ICLInt16Array = ICLArray<cl_short>

Interface for OpenCL Array of int16s.

Definition at line 122 of file ICLArray.h.

◆ ICLInt32Array

using ICLInt32Array = ICLArray<cl_int>

Interface for OpenCL Array of int32s.

Definition at line 124 of file ICLArray.h.

◆ ICLUInt16Array

using ICLUInt16Array = ICLArray<cl_ushort>

Interface for OpenCL Array of uint16s.

Definition at line 118 of file ICLArray.h.

◆ ICLUInt32Array

using ICLUInt32Array = ICLArray<cl_uint>

Interface for OpenCL Array of uint32s.

Definition at line 120 of file ICLArray.h.

◆ ICLUInt8Array

using ICLUInt8Array = ICLArray<cl_uchar>

Interface for OpenCL Array of uint8s.

Definition at line 116 of file ICLArray.h.

◆ IFloatArray

using IFloatArray = IArray<float>

Interface for Array of floats.

Definition at line 143 of file IArray.h.

◆ IImage

using IImage = ITensor

Definition at line 100 of file ITensor.h.

◆ IInt16Array

using IInt16Array = IArray<int16_t>

Interface for Array of int16s.

Definition at line 139 of file IArray.h.

◆ IInt32Array

using IInt32Array = IArray<int32_t>

Interface for Array of int32s.

Definition at line 141 of file IArray.h.

◆ Image

using Image = Tensor

Image.

Definition at line 69 of file Tensor.h.

◆ INEKernel

typedef ICPPKernel INEKernel

Common interface for all kernels implemented in Neon.

Definition at line 39 of file INEOperator.h.

◆ Int16Array

using Int16Array = Array<int16_t>

Array of int16s.

Definition at line 68 of file Array.h.

◆ Int32Array

using Int32Array = Array<int32_t>

Array of int32s.

Definition at line 70 of file Array.h.

◆ IUInt16Array

using IUInt16Array = IArray<uint16_t>

Interface for Array of uint16s.

Definition at line 135 of file IArray.h.

◆ IUInt32Array

using IUInt32Array = IArray<uint32_t>

Interface for Array of uint32s.

Definition at line 137 of file IArray.h.

◆ IUInt8Array

using IUInt8Array = IArray<uint8_t>

Interface for Array of uint8s.

Definition at line 133 of file IArray.h.

◆ LabelBBox

using LabelBBox = std::map<int, std::vector<BBox> >

Definition at line 778 of file Types.h.

◆ lock_guard

using lock_guard = std::lock_guard<Mutex>

Wrapper of lock_guard data-object.

Definition at line 37 of file Mutex.h.

◆ MemoryMappings

using MemoryMappings = std::map<IMemory *, size_t>

A map of (handle, index/offset), where handle is the memory handle of the object to provide the memory for and index/offset is the buffer/offset from the pool that should be used.

Note: All objects are pre-pinned to specific buffers to avoid any relevant overheads

Definition at line 45 of file Types.h.

◆ MemoryRequirements

using MemoryRequirements = experimental::MemoryRequirements

Definition at line 56 of file IOperator.h.

◆ Multiples

using Multiples = std::vector<uint32_t>

Information to produce a tiled version of a Tensor.

Definition at line 405 of file Types.h.

◆ Mutex

using Mutex = std::mutex

Wrapper of Mutex data-object.

Definition at line 33 of file Mutex.h.

◆ NEAbsLayer

using NEAbsLayer = NEElementwiseUnaryLayer<ElementWiseUnary::ABS>

Definition at line 91 of file NEElementwiseUnaryLayer.h.

◆ NEEqual

using NEEqual = NEElementwiseComparisonStatic<ComparisonOperation::Equal>

Basic function to run equal comparison.

Definition at line 468 of file NEElementwiseOperations.h.

◆ NEExpLayer

using NEExpLayer = NEElementwiseUnaryLayer<ElementWiseUnary::EXP>

Definition at line 88 of file NEElementwiseUnaryLayer.h.

◆ NEGreater

using NEGreater = NEElementwiseComparisonStatic<ComparisonOperation::Greater>

Basic function to run greater comparison.

Definition at line 472 of file NEElementwiseOperations.h.

◆ NEGreaterEqual

using NEGreaterEqual = NEElementwiseComparisonStatic<ComparisonOperation::GreaterEqual>

Basic function to run greater-equal comparison.

Definition at line 474 of file NEElementwiseOperations.h.

◆ NELess

using NELess = NEElementwiseComparisonStatic<ComparisonOperation::Less>

Basic function to run less comparison.

Definition at line 476 of file NEElementwiseOperations.h.

◆ NELessEqual

using NELessEqual = NEElementwiseComparisonStatic<ComparisonOperation::LessEqual>

Basic function to run less-equal comparison.

Definition at line 478 of file NEElementwiseOperations.h.

◆ NELogLayer

using NELogLayer = NEElementwiseUnaryLayer<ElementWiseUnary::LOG>

Definition at line 90 of file NEElementwiseUnaryLayer.h.

◆ NELogSoftmaxLayer

using NELogSoftmaxLayer = NESoftmaxLayerGeneric<true>

Definition at line 98 of file NESoftmaxLayer.h.

◆ NENegLayer

using NENegLayer = NEElementwiseUnaryLayer<ElementWiseUnary::NEG>

Definition at line 89 of file NEElementwiseUnaryLayer.h.

◆ NENotEqual

using NENotEqual = NEElementwiseComparisonStatic<ComparisonOperation::NotEqual>

Basic function to run not equal comparison.

Definition at line 470 of file NEElementwiseOperations.h.

◆ NERoundLayer

using NERoundLayer = NEElementwiseUnaryLayer<ElementWiseUnary::ROUND>

Definition at line 92 of file NEElementwiseUnaryLayer.h.

◆ NERsqrtLayer

using NERsqrtLayer = NEElementwiseUnaryLayer<ElementWiseUnary::RSQRT>

Definition at line 87 of file NEElementwiseUnaryLayer.h.

◆ NEScheduler

using NEScheduler = Scheduler

CPU Scheduler.

Definition at line 32 of file NEScheduler.h.

◆ NESinLayer

using NESinLayer = NEElementwiseUnaryLayer<ElementWiseUnary::SIN>

Definition at line 93 of file NEElementwiseUnaryLayer.h.

◆ NESoftmaxLayer

using NESoftmaxLayer = NESoftmaxLayerGeneric<false>

Definition at line 97 of file NESoftmaxLayer.h.

◆ OperatorType

typedef cpu::CpuPRelu OperatorType

Definition at line 40 of file CLGEMM.cpp.

◆ PaddingInfo

using PaddingInfo = std::pair<uint32_t, uint32_t>

Padding information as a pair of unsigned int start/end.

Definition at line 399 of file Types.h.

◆ PaddingList

using PaddingList = std::vector<PaddingInfo>

List of padding information.

Definition at line 402 of file Types.h.

◆ PaddingSize

using PaddingSize = BorderSize

Container for 2D padding size.

Definition at line 346 of file Types.h.

◆ PermutationVector

using PermutationVector = Strides

Permutation vector.

Definition at line 38 of file CoreTypes.h.

◆ qasymm16_t

using qasymm16_t = uint16_t

16 bit quantized asymmetric scalar value

Definition at line 39 of file QuantizationInfo.h.

◆ qasymm8_signed_t

using qasymm8_signed_t = int8_t

8 bit signed quantized asymmetric scalar value

Definition at line 36 of file QuantizationInfo.h.

◆ qasymm8_t

using qasymm8_t = uint8_t

8 bit quantized asymmetric scalar value

Definition at line 37 of file QuantizationInfo.h.

◆ qasymm8x16_signed_t

using qasymm8x16_signed_t = int8x16_t

8 bit quantized signed asymmetric vector with 16 elements

Definition at line 44 of file NEAsymm.h.

◆ qasymm8x16_t

using qasymm8x16_t = uint8x16_t

8 bit quantized asymmetric vector with 16 elements

Definition at line 38 of file NEAsymm.h.

◆ qasymm8x8_signed_t

using qasymm8x8_signed_t = int8x8_t

8 bit quantized signed asymmetric vector with 8 elements

Definition at line 40 of file NEAsymm.h.

◆ qasymm8x8_t

using qasymm8x8_t = uint8x8_t

8 bit quantized asymmetric vector with 8 elements

Definition at line 34 of file NEAsymm.h.

◆ qasymm8x8x2_signed_t

using qasymm8x8x2_signed_t = int8x8x2_t

8 bit quantized signed asymmetric vector with 16 elements

Definition at line 41 of file NEAsymm.h.

◆ qasymm8x8x2_t

using qasymm8x8x2_t = uint8x8x2_t

8 bit quantized asymmetric vector with 16 elements

Definition at line 35 of file NEAsymm.h.

◆ qasymm8x8x3_signed_t

using qasymm8x8x3_signed_t = int8x8x3_t

8 bit quantized signed asymmetric vector with 24 elements

Definition at line 42 of file NEAsymm.h.

◆ qasymm8x8x3_t

using qasymm8x8x3_t = uint8x8x3_t

8 bit quantized asymmetric vector with 24 elements

Definition at line 36 of file NEAsymm.h.

◆ qasymm8x8x4_signed_t

using qasymm8x8x4_signed_t = int8x8x4_t

8 bit quantized signed asymmetric vector with 32 elements

Definition at line 43 of file NEAsymm.h.

◆ qasymm8x8x4_t

using qasymm8x8x4_t = uint8x8x4_t

8 bit quantized asymmetric vector with 32 elements

Definition at line 37 of file NEAsymm.h.

◆ qsymm16_t

typedef int16_t qsymm16_t

16 bit quantized symmetric scalar value

Definition at line 38 of file QuantizationInfo.h.

◆ qsymm16x8_t

using qsymm16x8_t = int16x8_t

16 bit quantized symmetric vector with 8 elements

Definition at line 38 of file NESymm.h.

◆ qsymm16x8x2_t

using qsymm16x8x2_t = int16x8x2_t

16 bit quantized symmetric vector with 16 elements

Definition at line 39 of file NESymm.h.

◆ qsymm8_t

using qsymm8_t = int8_t

8 bit quantized symmetric scalar value

Definition at line 35 of file NESymm.h.

◆ UInt16Array

using UInt16Array = Array<uint16_t>

Array of uint16s.

Definition at line 64 of file Array.h.

◆ UInt32Array

using UInt32Array = Array<uint32_t>

Array of uint32s.

Definition at line 66 of file Array.h.

◆ UInt8Array

using UInt8Array = Array<uint8_t>

Array of uint8s.

Definition at line 62 of file Array.h.

◆ unique_lock

using unique_lock = std::unique_lock<Mutex>

Wrapper of lock_guard data-object.

Definition at line 41 of file Mutex.h.

◆ WorkspaceData

using WorkspaceData = std::vector<WorkspaceDataElement<TensorType> >

Definition at line 52 of file MemoryHelpers.h.

Enumeration Type Documentation

◆ ActivationFunction

enum ActivationFunction

strong

Available activation functions.

Enumerator
LOGISTIC	Logistic ( \( f(x) = \frac{1}{1 + e^{-x}} \) )
TANH	Hyperbolic tangent ( \( f(x) = a \cdot tanh(b \cdot x) \) )
RELU	Rectifier ( \( f(x) = max(0,x) \) )
BOUNDED_RELU	Upper Bounded Rectifier ( \( f(x) = min(a, max(0,x)) \) )
LU_BOUNDED_RELU	Lower and Upper Bounded Rectifier ( \( f(x) = min(a, max(b,x)) \) )
LEAKY_RELU	Leaky Rectifier ( \( f(x) = \begin{cases} \alpha x & \quad \text{if } x \text{ < 0}\\ x & \quad \text{if } x \geq \text{ 0 } \end{cases} \) )
SOFT_RELU	Soft Rectifier ( \( f(x)= log(1+e^x) \) )
ELU	Exponential Linear Unit ( \( f(x) = \begin{cases} \alpha (exp(x) - 1) & \quad \text{if } x \text{ < 0}\\ x & \quad \text{if } x \geq \text{ 0 } \end{cases} \) )
ABS	Absolute ( \( f(x)= \|x\| \) )
SQUARE	Square ( \( f(x)= x^2 \) )
SQRT	Square root ( \( f(x) = \sqrt{x} \) )
LINEAR	Linear ( \( f(x)= ax + b \) )
IDENTITY	Identity ( \( f(x)= x \) )
HARD_SWISH	Hard-swish ( \( f(x) = (x \text{ReLU6}(x+3))/6 = x \min(\max(0,x+3),6)/6 \) )
SWISH	Swish ( \( f(x) = \frac{x}{1 + e^{-ax}} = x \text{logistic}(ax) \) )
GELU	GELU ( \( f(x) = x * 1/2 * 1 + erf(x / \sqrt{2}) \) )

Definition at line 41 of file ActivationLayerInfo.h.

 {
     LOGISTIC,        /**< Logistic ( \f$ f(x) = \frac{1}{1 + e^{-x}} \f$ ) */
     TANH,            /**< Hyperbolic tangent ( \f$ f(x) = a \cdot tanh(b \cdot x) \f$ ) */
     RELU,            /**< Rectifier ( \f$ f(x) = max(0,x) \f$ ) */
     BOUNDED_RELU,    /**< Upper Bounded Rectifier ( \f$ f(x) = min(a, max(0,x)) \f$ ) */
     LU_BOUNDED_RELU, /**< Lower and Upper Bounded Rectifier ( \f$ f(x) = min(a, max(b,x)) \f$ ) */
     LEAKY_RELU, /**< Leaky Rectifier ( \f$ f(x) = \begin{cases}  \alpha x & \quad \text{if } x \text{ < 0}\\  x & \quad \text{if } x \geq \text{ 0 } \end{cases} \f$ ) */
     SOFT_RELU,  /**< Soft Rectifier ( \f$ f(x)= log(1+e^x) \f$ ) */
     ELU, /**< Exponential Linear Unit ( \f$ f(x) = \begin{cases}  \alpha (exp(x) - 1) & \quad \text{if } x \text{ < 0}\\  x & \quad \text{if } x \geq \text{ 0 } \end{cases} \f$ ) */
     ABS, /**< Absolute ( \f$ f(x)= |x| \f$ ) */
     SQUARE,     /**< Square ( \f$ f(x)= x^2 \f$ )*/
     SQRT,       /**< Square root ( \f$ f(x) = \sqrt{x} \f$ )*/
     LINEAR,     /**< Linear ( \f$ f(x)= ax + b \f$ ) */
     IDENTITY,   /**< Identity ( \f$ f(x)= x \f$ ) */
     HARD_SWISH, /**< Hard-swish ( \f$ f(x) = (x \text{ReLU6}(x+3))/6 = x \min(\max(0,x+3),6)/6 \f$ ) */
     SWISH,      /**< Swish ( \f$ f(x) = \frac{x}{1 + e^{-ax}} = x \text{logistic}(ax) \f$ ) */
     GELU        /**< GELU ( \f$ f(x) = x * 1/2 * 1 + erf(x / \sqrt{2}) \f$ ) */
 };

◆ ArithmeticOperation

enum ArithmeticOperation

strong

Available element-wise operations.

Enumerator
ADD	(x + y)
SUB	(x - y)
DIV	(x / y)
MIN	Min(x, y)
MAX	Max(x, y)
SQUARED_DIFF	(x - y)^2
POWER	x ^ y
PRELU	y*x if x < 0, x otherwise

Definition at line 421 of file Types.h.

 {
     ADD,          /**< (x + y) */
     SUB,          /**< (x  - y) */
     DIV,          /**< (x / y) */
     MIN,          /**< Min(x, y) */
     MAX,          /**< Max(x, y) */
     SQUARED_DIFF, /**< (x - y)^2 */
     POWER,        /**< x ^ y */
     PRELU,        /**< y*x if x < 0, x otherwise */
 };

◆ BilinearInterpolation

enum BilinearInterpolation

strong

Bilinear Interpolation method used by LKTracker.

Enumerator
BILINEAR_OLD_NEW	Old-new method.
BILINEAR_SCHARR	Scharr method.

Definition at line 368 of file Types.h.

 {
     BILINEAR_OLD_NEW, /**< Old-new method */
     BILINEAR_SCHARR   /**< Scharr method */
 };

◆ BitwiseOperation

enum BitwiseOperation

strong

Available bitwise operations.

Enumerator
AND	Bitwise AND operation.
NOT	Bitwise NOT operation.
OR	Bitwise OR operation
XOR	Bitwise XOR operation

Definition at line 447 of file Types.h.

 {
     AND, /**< Bitwise AND operation */
     NOT, /**< Bitwise NOT operation */
     OR,  /**< Bitwise OR operation  */
     XOR, /**< Bitwise XOR operation  */
 };

◆ BorderMode

enum BorderMode

strong

Methods available to handle borders.

Enumerator
UNDEFINED	Borders are left undefined.
CONSTANT	Pixels outside the image are assumed to have a constant value.
REPLICATE	Pixels outside the image are assumed to have the same value as the closest image pixel.

Definition at line 231 of file Types.h.

 {
     UNDEFINED, /**< Borders are left undefined */
     CONSTANT,  /**< Pixels outside the image are assumed to have a constant value */
     REPLICATE  /**< Pixels outside the image are assumed to have the same value as the closest image pixel */
 };

◆ Channel

enum Channel

strong

Available channels.

Enumerator
UNKNOWN
C0	Unknown channel format. First channel (used by formats with unknown channel types).
C1	Second channel (used by formats with unknown channel types).
C2	Third channel (used by formats with unknown channel types).
C3	Fourth channel (used by formats with unknown channel types).
R	Red channel.
G	Green channel.
B	Blue channel.
A	Alpha channel.
Y	Luma channel.
U	Cb/U channel.
V	Cr/V/Value channel.

Definition at line 41 of file CoreTypes.h.

 {
     UNKNOWN, /** Unknown channel format */
     C0,      /**< First channel (used by formats with unknown channel types). */
     C1,      /**< Second channel (used by formats with unknown channel types). */
     C2,      /**< Third channel (used by formats with unknown channel types). */
     C3,      /**< Fourth channel (used by formats with unknown channel types). */
     R,       /**< Red channel. */
     G,       /**< Green channel. */
     B,       /**< Blue channel. */
     A,       /**< Alpha channel. */
     Y,       /**< Luma channel. */
     U,       /**< Cb/U channel. */
     V        /**< Cr/V/Value channel. */
 };

◆ CLBackendType

enum CLBackendType

strong

List the possible OpenCL backends.

Enumerator
Native	OpenCL native backend.
Clvk	CLVK backend.

Definition at line 55 of file CLTypes.h.

 {
     Native, /**< OpenCL native backend */
     Clvk,   /**< CLVK backend */
 };

◆ CLGEMMKernelType

enum CLGEMMKernelType

strong

OpenCL GEMM kernel types.

Enumerator
NATIVE	Native GEMM kernel with configurable block size.
RESHAPED	Reshaped GEMM kernel where both lhs and rhs matrices are reshaped. Configurable reshape and block size
RESHAPED_ONLY_RHS	Reshaped GEMM kernel where only the rhs matrix is reshaped. Configurable reshape and block size
RESHAPED_ONLY_RHS_MMUL	Reshaped GEMM kernel where only the rhs matrix is reshaped. Using MMUL with configurable block size.

Definition at line 31 of file CLTypes.h.

 {
     /** Native GEMM kernel with configurable block size.*/
     NATIVE,
     /** Reshaped GEMM kernel where both lhs and rhs matrices are reshaped. Configurable reshape and block size */
     RESHAPED,
     /** Reshaped GEMM kernel where only the rhs matrix is reshaped. Configurable reshape and block size */
     RESHAPED_ONLY_RHS,
     /** Reshaped GEMM kernel where only the rhs matrix is reshaped. Using MMUL with configurable block size. */
     RESHAPED_ONLY_RHS_MMUL
 };

◆ CLImage2DType

enum CLImage2DType

strong

OpenCL Image2D types.

Enumerator
ReadOnly
WriteOnly

Definition at line 41 of file CLUtils.h.

 {
     ReadOnly,
     WriteOnly
 };

◆ CLKernelType

enum CLKernelType

Enumerator
UNKNOWN	Unknown CL kernel type.
UNKNOWN
DEPTHWISE	Depthwise CL kernel type.
DIRECT	Direct Convolution CL kernel type.
ELEMENTWISE	Elementwise CL kernel type.
GEMM	GEMM CL kernel type.
POOL	Pool CL kernel type.
WINOGRAD	Winograd CL kernel type.

Definition at line 78 of file CLTypes.h.

 {
     UNKNOWN,     /**< Unknown CL kernel type */
     DEPTHWISE,   /**< Depthwise CL kernel type */
     DIRECT,      /**< Direct Convolution CL kernel type */
     ELEMENTWISE, /**< Elementwise CL kernel type */
     GEMM,        /**< GEMM CL kernel type */
     POOL,        /**< Pool CL kernel type */
     WINOGRAD     /**< Winograd CL kernel type */
 };

◆ CLTunerMode

enum CLTunerMode

strong

< OpenCL tuner modes

Enumerator
EXHAUSTIVE	Searches all possible LWS configurations while tuning.
NORMAL	Searches a subset of LWS configurations while tuning.
RAPID	Searches a minimal subset of LWS configurations while tuning.

Definition at line 35 of file CLTunerTypes.h.

 {
     EXHAUSTIVE, /**< Searches all possible LWS configurations while tuning */
     NORMAL,     /**< Searches a subset of LWS configurations while tuning */
     RAPID       /**< Searches a minimal subset of LWS configurations while tuning */
 };

◆ CLVersion

enum CLVersion

strong

Available OpenCL Version.

Enumerator
CL10
CL11
CL12
CL20
CL30
UNKNOWN

Definition at line 39 of file CLTypes.h.

 {
     CL10,   /* the OpenCL 1.0 */
     CL11,   /* the OpenCL 1.1 */
     CL12,   /* the OpenCL 1.2 */
     CL20,   /* the OpenCL 2.x */
     CL30,   /* the OpenCL 3.x */
     UNKNOWN /* unkown version */
 };

◆ ComparisonOperation

enum ComparisonOperation

strong

Supported comparison operations.

Enumerator
Equal	Equal comparison ( \( x == y \) )
NotEqual	NotEqual comparison ( \( x != y \) )
Greater	Greater comparison ( \( x > y \) )
GreaterEqual	Greater equal comparison ( \( x >= y \) )
Less	Less comparison ( \( x < y \) )
LessEqual	Less equal comparison ( \( x <= y \) )

Definition at line 132 of file Types.h.

 {
     Equal,        /**< Equal comparison ( \f$ x == y \f$ ) */
     NotEqual,     /**< NotEqual comparison ( \f$ x != y \f$ ) */
     Greater,      /**< Greater comparison ( \f$ x > y \f$ ) */
     GreaterEqual, /**< Greater equal comparison ( \f$ x >= y \f$ ) */
     Less,         /**< Less comparison ( \f$ x < y \f$ ) */
     LessEqual     /**< Less equal comparison ( \f$ x <= y \f$ ) */
 };

◆ ConvertPolicy

enum ConvertPolicy

strong

Policy to handle integer overflow.

Note: : This is ignored by floating point operations where the overflow behavior adheres to the IEEE-754 standard which states that in case of overflow ±infinity is returned for the round-to-nearest modes (and follows the rounding rules for the directed rounding modes) by default.

Enumerator
WRAP	Wrap around.
SATURATE	Saturate.

Definition at line 353 of file Types.h.

 {
     WRAP,    /**< Wrap around */
     SATURATE /**< Saturate */
 };

◆ ConvolutionMethod

enum ConvolutionMethod

strong

Available ConvolutionMethod.

Enumerator
GEMM	Convolution using GEMM.
GEMM_CONV2D	Direct 2D GEMM convolution.
DIRECT	Direct convolution.
INDIRECT	Indirect convolution.
WINOGRAD	Convolution using Winograd.
FFT	Convolution using FFT.

Definition at line 91 of file Types.h.

 {
     GEMM,        /**< Convolution using GEMM */
     GEMM_CONV2D, /**< Direct 2D GEMM convolution */
     DIRECT,      /**< Direct convolution */
     INDIRECT,    /**< Indirect convolution */
     WINOGRAD,    /**< Convolution using Winograd */
     FFT          /**< Convolution using FFT */
 };

◆ CPUModel

enum CPUModel

strong

CPU models types.

Note: We only need to detect CPUs we have microarchitecture-specific code for.; Architecture features are detected via HWCAPs.

Enumerator
X
GENERIC
GENERIC_FP16
GENERIC_FP16_DOT
A53
A55r0
A55r1
A35
A73
A76
A510
X1
V1
A64FX
N1

Definition at line 59 of file CPPTypes.h.

 {
 #define X(model) model,
     ARM_COMPUTE_CPU_MODEL_LIST
 #undef X
 };

◆ DataLayout

enum DataLayout

strong

[DataLayout enum definition]

Supported tensor data layouts

Enumerator
UNKNOWN	Unknown data layout.
NCHW	Num samples, channels, height, width.
NHWC	Num samples, height, width, channels.
NCDHW	Num samples, channels, depth, height, width.
NDHWC	Num samples, depth, height, width, channels.

Definition at line 110 of file CoreTypes.h.

 {
     UNKNOWN, /**< Unknown data layout */
     NCHW,    /**< Num samples, channels, height, width */
     NHWC,    /**< Num samples, height, width, channels */
     NCDHW,   /**< Num samples, channels, depth, height, width */
     NDHWC    /**< Num samples, depth, height, width, channels */
 };

◆ DataLayoutDimension

enum DataLayoutDimension

strong

[DataLayout enum definition]

Supported tensor data layout dimensions

Enumerator
CHANNEL	channel
HEIGHT	height
WIDTH	width
DEPTH	depth
BATCHES	batches

Definition at line 121 of file CoreTypes.h.

 {
     CHANNEL, /**< channel */
     HEIGHT,  /**< height */
     WIDTH,   /**< width */
     DEPTH,   /**< depth */
     BATCHES  /**< batches */
 };

◆ DataType

enum DataType

strong

Available data types.

Enumerator
UNKNOWN	Unknown data type.
U8	unsigned 8-bit number
S8	signed 8-bit number
QSYMM8	quantized, symmetric fixed-point 8-bit number
QASYMM8	quantized, asymmetric fixed-point 8-bit number unsigned
QASYMM8_SIGNED	quantized, asymmetric fixed-point 8-bit number signed
QSYMM8_PER_CHANNEL	quantized, symmetric per channel fixed-point 8-bit number
U16	unsigned 16-bit number
S16	signed 16-bit number
QSYMM16	quantized, symmetric fixed-point 16-bit number
QASYMM16	quantized, asymmetric fixed-point 16-bit number
U32	unsigned 32-bit number
S32	signed 32-bit number
U64	unsigned 64-bit number
S64	signed 64-bit number
BFLOAT16	16-bit brain floating-point number
F16	16-bit floating-point number
F32	32-bit floating-point number
F64	64-bit floating-point number
SIZET	size_t

Definition at line 83 of file CoreTypes.h.

 {
     UNKNOWN,            /**< Unknown data type */
     U8,                 /**< unsigned 8-bit number */
     S8,                 /**< signed 8-bit number */
     QSYMM8,             /**< quantized, symmetric fixed-point 8-bit number */
     QASYMM8,            /**< quantized, asymmetric fixed-point 8-bit number unsigned */
     QASYMM8_SIGNED,     /**< quantized, asymmetric fixed-point 8-bit number signed */
     QSYMM8_PER_CHANNEL, /**< quantized, symmetric per channel fixed-point 8-bit number */
     U16,                /**< unsigned 16-bit number */
     S16,                /**< signed 16-bit number */
     QSYMM16,            /**< quantized, symmetric fixed-point 16-bit number */
     QASYMM16,           /**< quantized, asymmetric fixed-point 16-bit number */
     U32,                /**< unsigned 32-bit number */
     S32,                /**< signed 32-bit number */
     U64,                /**< unsigned 64-bit number */
     S64,                /**< signed 64-bit number */
     BFLOAT16,           /**< 16-bit brain floating-point number */
     F16,                /**< 16-bit floating-point number */
     F32,                /**< 32-bit floating-point number */
     F64,                /**< 64-bit floating-point number */
     SIZET               /**< size_t */
 };

◆ DeconvolutionMethod

enum DeconvolutionMethod

strong

Available DeconvolutionMethod.

Enumerator
GEMM	Deconvolution using GEMM.
DIRECT	Direct deconvolution.
UPSCALE_CONV2D	Deconvolution with Upscaling.

Definition at line 109 of file Types.h.

 {
     GEMM,          /**< Deconvolution using GEMM */
     DIRECT,        /**< Direct deconvolution */
     UPSCALE_CONV2D /**< Deconvolution with Upscaling */
 };

◆ DepthwiseConvolutionFunction

enum DepthwiseConvolutionFunction

strong

Available DepthwiseConvolutionFunction.

Enumerator
OPTIMIZED	Optimized Depthwise Convolution.
GENERIC	Generic Depthwise Convolution.

Definition at line 102 of file Types.h.

 {
     OPTIMIZED, /**< Optimized Depthwise Convolution */
     GENERIC,   /**< Generic Depthwise Convolution */
 };

◆ DetectionOutputLayerCodeType

enum DetectionOutputLayerCodeType

strong

Available Detection Output code types.

Enumerator
CORNER	Use box corners.
CENTER_SIZE	Use box centers and size.
CORNER_SIZE	Use box centers and size.
TF_CENTER	Use box centers and size but flip x and y co-ordinates.

Definition at line 781 of file Types.h.

 {
     CORNER,      /**< Use box corners */
     CENTER_SIZE, /**< Use box centers and size */
     CORNER_SIZE, /**< Use box centers and size */
     TF_CENTER    /**< Use box centers and size but flip x and y co-ordinates */
 };

◆ DeviceType

enum DeviceType

strong

Device types.

Enumerator
NEON
CL

Definition at line 33 of file IDevice.h.

 {
     NEON,
     CL,
 };

◆ DimensionRoundingType

enum DimensionRoundingType

strong

Dimension rounding type when down-scaling on CNNs.

Note: Used in pooling and convolution layer

Enumerator
FLOOR	Floor rounding.
CEIL	Ceil rounding.

Definition at line 133 of file CoreTypes.h.

 {
     FLOOR, /**< Floor rounding */
     CEIL   /**< Ceil rounding */
 };

◆ ElementWiseUnary

enum ElementWiseUnary

strong

Available element wise unary operations.

Enumerator
RSQRT	Reverse square root.
EXP	Exponential.
NEG	Negate.
LOG	Natural Logarithm.
ABS	Absolute value.
SIN	Sine.
ROUND	Round.
LOGICAL_NOT	Logical Not.

Definition at line 434 of file Types.h.

 {
     RSQRT,       /**< Reverse square root */
     EXP,         /**< Exponential */
     NEG,         /**< Negate */
     LOG,         /**< Natural Logarithm */
     ABS,         /**< Absolute value */
     SIN,         /**< Sine */
     ROUND,       /**< Round */
     LOGICAL_NOT, /**< Logical Not */
 };

◆ ErrorCode

enum ErrorCode

strong

Available error codes.

Enumerator
OK	No error.
RUNTIME_ERROR	Generic runtime error.
UNSUPPORTED_EXTENSION_USE	Unsupported extension used.

Definition at line 44 of file Error.h.

 {
     OK,                       /**< No error */
     RUNTIME_ERROR,            /**< Generic runtime error */
     UNSUPPORTED_EXTENSION_USE /**< Unsupported extension used*/
 };

◆ ExecutionMode

enum ExecutionMode

strong

Enumerator
FastRerun
FastStart

Definition at line 51 of file Types.h.

 {
     FastRerun = AclPreferFastRerun,
     FastStart = AclPreferFastStart,
 };

◆ FFTDirection

enum FFTDirection

strong

FFT direction to use.

Enumerator
Forward
Inverse

Definition at line 35 of file FunctionDescriptors.h.

 {
     Forward,
     Inverse
 };

◆ Format

enum Format

strong

Image colour formats.

Enumerator
UNKNOWN	Unknown image format.
U8	1 channel, 1 U8 per channel
S16	1 channel, 1 S16 per channel
U16	1 channel, 1 U16 per channel
S32	1 channel, 1 S32 per channel
U32	1 channel, 1 U32 per channel
S64	1 channel, 1 S64 per channel
U64	1 channel, 1 U64 per channel
BFLOAT16	16-bit brain floating-point number
F16	1 channel, 1 F16 per channel
F32	1 channel, 1 F32 per channel
UV88	2 channel, 1 U8 per channel
RGB888	3 channels, 1 U8 per channel
RGBA8888	4 channels, 1 U8 per channel
YUV444	A 3 plane of 8 bit 4:4:4 sampled Y, U, V planes.
YUYV422	A single plane of 32-bit macro pixel of Y0, U0, Y1, V0 bytes.
NV12	A 2 plane YUV format of Luma (Y) and interleaved UV data at 4:2:0 sampling.
NV21	A 2 plane YUV format of Luma (Y) and interleaved VU data at 4:2:0 sampling.
IYUV	A 3 plane of 8-bit 4:2:0 sampled Y, U, V planes.
UYVY422	A single plane of 32-bit macro pixel of U0, Y0, V0, Y1 byte.

Definition at line 58 of file CoreTypes.h.

 {
     UNKNOWN,  /**< Unknown image format */
     U8,       /**< 1 channel, 1 U8 per channel */
     S16,      /**< 1 channel, 1 S16 per channel */
     U16,      /**< 1 channel, 1 U16 per channel */
     S32,      /**< 1 channel, 1 S32 per channel */
     U32,      /**< 1 channel, 1 U32 per channel */
     S64,      /**< 1 channel, 1 S64 per channel */
     U64,      /**< 1 channel, 1 U64 per channel */
     BFLOAT16, /**< 16-bit brain floating-point number */
     F16,      /**< 1 channel, 1 F16 per channel */
     F32,      /**< 1 channel, 1 F32 per channel */
     UV88,     /**< 2 channel, 1 U8 per channel */
     RGB888,   /**< 3 channels, 1 U8 per channel */
     RGBA8888, /**< 4 channels, 1 U8 per channel */
     YUV444,   /**< A 3 plane of 8 bit 4:4:4 sampled Y, U, V planes */
     YUYV422,  /**< A single plane of 32-bit macro pixel of Y0, U0, Y1, V0 bytes */
     NV12,     /**< A 2 plane YUV format of Luma (Y) and interleaved UV data at 4:2:0 sampling */
     NV21,     /**< A 2 plane YUV format of Luma (Y) and interleaved VU data at 4:2:0 sampling */
     IYUV,     /**< A 3 plane of 8-bit 4:2:0 sampled Y, U, V planes */
     UYVY422   /**< A single plane of 32-bit macro pixel of U0, Y0, V0, Y1 byte */
 };

◆ FuseBatchNormalizationType

enum FuseBatchNormalizationType

strong

Available FuseBatchNormalizationType.

Enumerator
CONVOLUTION	For Convolution weights.
DEPTHWISECONVOLUTION	For Depthwise Convolution weights.

Definition at line 117 of file Types.h.

 {
     CONVOLUTION,         /**< For Convolution weights */
     DEPTHWISECONVOLUTION /**< For Depthwise Convolution weights*/
 };

◆ GEMMLowpOutputStageType

enum GEMMLowpOutputStageType

strong

GEMMLowp output stage type.

Enumerator
NONE	No quantization.
QUANTIZE_DOWN	Quantize using an integer multiplication.
QUANTIZE_DOWN_FIXEDPOINT	Quantize using a fixed point multiplication.
QUANTIZE_DOWN_FLOAT	Quantize using a floating point multiplication.

Definition at line 36 of file GEMMInfo.h.

 {
     NONE,                     /**< No quantization */
     QUANTIZE_DOWN,            /**< Quantize using an integer multiplication */
     QUANTIZE_DOWN_FIXEDPOINT, /**< Quantize using a fixed point multiplication */
     QUANTIZE_DOWN_FLOAT       /**< Quantize using a floating point multiplication */
 };

◆ GPUTarget

enum GPUTarget

strong

Available GPU Targets.

Enumerator
UNKNOWN
GPU_ARCH_MASK
GPU_GENERATION_MASK
MIDGARD
BIFROST
VALHALL
FIFTHGEN
T600
T700
T800
G71
G72
G51
G51BIG
G51LIT
G31
G76
G52
G52LIT
G77
G57
G78
G68
G78AE
G710
G610
G510
G310
G715
G615
G720
G620

Definition at line 34 of file GPUTarget.h.

 {
     UNKNOWN             = 0x101,
     GPU_ARCH_MASK       = 0xF00,
     GPU_GENERATION_MASK = 0x0F0,
     MIDGARD             = 0x100,
     BIFROST             = 0x200,
     VALHALL             = 0x300,
     FIFTHGEN            = 0X400,
     T600                = 0x110,
     T700                = 0x120,
     T800                = 0x130,
     G71                 = 0x210,
     G72                 = 0x220,
     G51                 = 0x221,
     G51BIG              = 0x222,
     G51LIT              = 0x223,
     G31                 = 0x224,
     G76                 = 0x230,
     G52                 = 0x231,
     G52LIT              = 0x232,
     G77                 = 0x310,
     G57                 = 0x311,
     G78                 = 0x320,
     G68                 = 0x321,
     G78AE               = 0x330,
     G710                = 0x340,
     G610                = 0x341,
     G510                = 0x342,
     G310                = 0x343,
     G715                = 0x350,
     G615                = 0x351,
     G720                = 0x410,
     G620                = 0X411
 };

◆ ImportMemoryType

enum ImportMemoryType

strong

Enumerator
HostPtr

Definition at line 57 of file Types.h.

 {
     HostPtr = AclImportMemoryType::AclHostPtr
 };

◆ InterpolationPolicy

enum InterpolationPolicy

strong

Interpolation method.

Enumerator
NEAREST_NEIGHBOR	Output values are defined to match the source pixel whose center is nearest to the sample position.
BILINEAR	Output values are defined by bilinear interpolation between the pixels.
AREA	Output values are determined by averaging the source pixels whose areas fall under the area of the destination pixel, projected onto the source image.

Definition at line 360 of file Types.h.

 {
     NEAREST_NEIGHBOR, /**< Output values are defined to match the source pixel whose center is nearest to the sample position */
     BILINEAR,         /**< Output values are defined by bilinear interpolation between the pixels */
     AREA, /**< Output values are determined by averaging the source pixels whose areas fall under the area of the destination pixel, projected onto the source image */
 };

◆ LogicalOperation

enum LogicalOperation

strong

List of supported logical operations.

Enumerator
Unknown	Unknown.
And	Logical And &&.
Or	Logical Or \|\|.
Not	Logical Not !

Definition at line 30 of file KernelTypes.h.

 {
     Unknown, /**< Unknown */
     And,     /**< Logical And && */
     Or,      /**< Logical Or || */
     Not,     /**< Logical Not ! */
 };

◆ MappingType

enum MappingType

strong

Mapping type.

Enumerator
BLOBS	Mappings are in blob granularity.
OFFSETS	Mappings are in offset granularity in the same blob.

Definition at line 34 of file Types.h.

 {
     BLOBS,  /**< Mappings are in blob granularity */
     OFFSETS /**< Mappings are in offset granularity in the same blob */
 };

◆ NMSType

enum NMSType

strong

Available non maxima suppression types.

Enumerator
LINEAR	Linear NMS.
GAUSSIAN	Gaussian NMS.
ORIGINAL	Original NMS.

Definition at line 488 of file Types.h.

 {
     LINEAR,   /**< Linear NMS */
     GAUSSIAN, /**< Gaussian NMS */
     ORIGINAL  /**< Original NMS */
 };

◆ NormType

enum NormType

strong

The normalization type used for the normalization layer.

Enumerator
IN_MAP_1D	Normalization applied within the same map in 1D region.
IN_MAP_2D	Normalization applied within the same map in 2D region.
CROSS_MAP	Normalization applied cross maps.

Definition at line 456 of file Types.h.

 {
     IN_MAP_1D, /**< Normalization applied within the same map in 1D region */
     IN_MAP_2D, /**< Normalization applied within the same map in 2D region */
     CROSS_MAP  /**< Normalization applied cross maps */
 };

◆ PaddingMode

enum PaddingMode

strong

Padding mode to use for PadLayer.

Enumerator
CONSTANT
REFLECT
SYMMETRIC

Definition at line 124 of file Types.h.

 {
     CONSTANT,
     REFLECT,
     SYMMETRIC
 };

◆ PoolingType

enum PoolingType

strong

Available pooling types.

Enumerator
MAX	Max Pooling.
AVG	Average Pooling.
L2	L2 Pooling.

Definition at line 480 of file Types.h.

 {
     MAX, /**< Max Pooling */
     AVG, /**< Average Pooling */
     L2   /**< L2 Pooling */
 };

◆ ReductionOperation

enum ReductionOperation

strong

Available reduction operations.

Enumerator
ARG_IDX_MAX	Index of the max value.
ARG_IDX_MIN	Index of the min value.
MEAN_SUM	Mean of sum.
PROD	Product.
SUM_SQUARE	Sum of squares.
SUM	Sum.
MIN	Min.
MAX	Max.

Definition at line 408 of file Types.h.

 {
     ARG_IDX_MAX, /**< Index of the max value */
     ARG_IDX_MIN, /**< Index of the min value */
     MEAN_SUM,    /**< Mean of sum */
     PROD,        /**< Product */
     SUM_SQUARE,  /**< Sum of squares */
     SUM,         /**< Sum */
     MIN,         /**< Min */
     MAX,         /**< Max */
 };

◆ RoundingPolicy

enum RoundingPolicy

strong

Rounding method.

Enumerator
TO_ZERO	Truncates the least significant values that are lost in operations.
TO_NEAREST_UP	Rounds to nearest value; half rounds away from zero.
TO_NEAREST_EVEN	Rounds to nearest value; half rounds to nearest even.

Definition at line 30 of file Rounding.h.

 {
     TO_ZERO,         /**< Truncates the least significant values that are lost in operations. */
     TO_NEAREST_UP,   /**< Rounds to nearest value; half rounds away from zero */
     TO_NEAREST_EVEN, /**< Rounds to nearest value; half rounds to nearest even */
 };

◆ SamplingPolicy

enum SamplingPolicy

strong

Available Sampling Policies.

Enumerator
CENTER	Samples are taken at pixel center.
TOP_LEFT	Samples are taken at pixel top left corner.

Definition at line 84 of file Types.h.

 {
     CENTER,  /**< Samples are taken at pixel center */
     TOP_LEFT /**< Samples are taken at pixel top left corner */
 };

◆ StatusCode

enum StatusCode

strong

Enumerator
Success
RuntimeError
OutOfMemory
Unimplemented
UnsupportedTarget
InvalidTarget
InvalidArgument
UnsupportedConfig
InvalidObjectState

Definition at line 32 of file Types.h.

 {
     Success            = AclSuccess,
     RuntimeError       = AclRuntimeError,
     OutOfMemory        = AclOutOfMemory,
     Unimplemented      = AclUnimplemented,
     UnsupportedTarget  = AclUnsupportedTarget,
     InvalidTarget      = AclInvalidTarget,
     InvalidArgument    = AclInvalidArgument,
     UnsupportedConfig  = AclUnsupportedConfig,
     InvalidObjectState = AclInvalidObjectState,
 };

◆ Target

enum Target

strong

Enumerator
Cpu
GpuOcl

Definition at line 45 of file Types.h.

 {
     Cpu    = AclTarget::AclCpu,
     GpuOcl = AclTarget::AclGpuOcl,
 };

◆ TensorType

enum TensorType : int32_t

Memory type.

Enumerator
ACL_UNKNOWN
ACL_SRC_DST
ACL_SRC
ACL_SRC_0
ACL_SRC_1
ACL_SRC_2
ACL_SRC_3
ACL_SRC_4
ACL_SRC_5
ACL_SRC_6
ACL_SRC_END
ACL_DST
ACL_DST_0
ACL_DST_1
ACL_DST_2
ACL_DST_END
ACL_INT
ACL_INT_0
ACL_INT_1
ACL_INT_2
ACL_INT_3
ACL_INT_4
ACL_SRC_VEC
ACL_DST_VEC
ACL_INT_VEC
ACL_BIAS
ACL_VEC_ROW_SUM
ACL_VEC_COL_SUM
ACL_SHIFTS
ACL_MULTIPLIERS

Definition at line 38 of file Types.h.

                 : int32_t
 {
     ACL_UNKNOWN = -1,
     ACL_SRC_DST = 0,
  
     // Src
     ACL_SRC     = 0,
     ACL_SRC_0   = 0,
     ACL_SRC_1   = 1,
     ACL_SRC_2   = 2,
     ACL_SRC_3   = 3,
     ACL_SRC_4   = 4,
     ACL_SRC_5   = 5,
     ACL_SRC_6   = 6,
     ACL_SRC_END = 6,
  
     // Dst
     ACL_DST     = 30,
     ACL_DST_0   = 30,
     ACL_DST_1   = 31,
     ACL_DST_2   = 32,
     ACL_DST_END = 32,
  
     // Aux
     ACL_INT     = 50,
     ACL_INT_0   = 50,
     ACL_INT_1   = 51,
     ACL_INT_2   = 52,
     ACL_INT_3   = 53,
     ACL_INT_4   = 54,
     ACL_SRC_VEC = 256,
     ACL_DST_VEC = 512,
     ACL_INT_VEC = 1024,
  
     // Aliasing Types
     // Conv etc
     ACL_BIAS = ACL_SRC_2,
  
     // Gemm
     ACL_VEC_ROW_SUM = ACL_SRC_3,
     ACL_VEC_COL_SUM = ACL_SRC_4,
     ACL_SHIFTS      = ACL_SRC_5,
     ACL_MULTIPLIERS = ACL_SRC_6,
 };

◆ WeightFormat

enum WeightFormat

strong

Memory layouts for the weights tensor.

UNSPECIFIED is used to select kernels that do not run in variable weights mode.
ANY is used to query the kernel database to retrieve any of the kernels that runs in variable weights mode. Once a kernel is found, the specific format expected by the kernel can be retrieved by the user for reordering the weights tensor accordingly.

The other values OHWIo{interleave_by}i{block_by} describe the memory layout of a 4D tensor with layout OHWI that has been transformed into a 4D tensor with dimensions O'HWI' where:

O' = first multiple of {interleave_by} s.t. O<=O' I' = first multiple of {block_by} s.t. I<=I'

The total size of the dst tensor is O' x H x W x I'

The access function of the tensor with layout OHWIo{interleave_by}i{block_by} and size O'HWI' is a 6-parameter access function, where the 6 parameters are computed as follows:

x5 = floor(o/{interleave_by}) RANGE [0, O'/{interleave_by} -1] SIZE: O'/{interleave_by}

x4 = h RANGE [0, H-1] SIZE: H x3 = w RANGE [0, W-1] SIZE: W x2 = floor(i/{block_by}) RANGE [0, I'/{block_by} -1] SIZE: I'/{block_by} x1 = o%{interleave_by} RANGE [0, {interleave_by} -1] SIZE: {interleave_by} x0 = i%{block_by} RANGE [0, {block_by} -1] SIZE: {block_by} TOTAL SIZE: O' * H * W * I'

4D 6D

value(o, h, w, i) = x5 * H * W * I' * {interleave_by}

x4 * W * I' * {interleave_by}
x3 * I' * {interleave_by}
x2 * {interleave_by} * {block_by}
x1 * {block_by}
x0

Notice that in arm_gemm the 4D tensor of dimension O'HWI' created for the OHWIo{interleave_by}i{block_by} format is in reality seen as a 2D tensor, where the number of rows is O'/{interleave_by} and the number of columns is {interleave_by} * H * W * I'.

The postfix *_bf16 is for the memory layout needed for the fast-mode kernels, in which the weights are passed in bfloat16 format.

Enumerator
UNSPECIFIED
ANY
OHWI
OHWIo2
OHWIo4
OHWIo8
OHWIo16
OHWIo32
OHWIo64
OHWIo128
OHWIo4i2
OHWIo4i2_bf16
OHWIo8i2
OHWIo8i2_bf16
OHWIo16i2
OHWIo16i2_bf16
OHWIo32i2
OHWIo32i2_bf16
OHWIo64i2
OHWIo64i2_bf16
OHWIo4i4
OHWIo4i4_bf16
OHWIo8i4
OHWIo8i4_bf16
OHWIo16i4
OHWIo16i4_bf16
OHWIo32i4
OHWIo32i4_bf16
OHWIo64i4
OHWIo64i4_bf16
OHWIo2i8
OHWIo4i8
OHWIo8i8
OHWIo16i8
OHWIo32i8
OHWIo64i8

Definition at line 311 of file CoreTypes.h.

 {
     UNSPECIFIED    = 0x1,
     ANY            = 0x2,
     OHWI           = 0x100100,
     OHWIo2         = 0x100200,
     OHWIo4         = 0x100400,
     OHWIo8         = 0x100800,
     OHWIo16        = 0x101000,
     OHWIo32        = 0x102000,
     OHWIo64        = 0x104000,
     OHWIo128       = 0x108000,
     OHWIo4i2       = 0x200400,
     OHWIo4i2_bf16  = 0x200410,
     OHWIo8i2       = 0x200800,
     OHWIo8i2_bf16  = 0x200810,
     OHWIo16i2      = 0x201000,
     OHWIo16i2_bf16 = 0x201010,
     OHWIo32i2      = 0x202000,
     OHWIo32i2_bf16 = 0x202010,
     OHWIo64i2      = 0x204000,
     OHWIo64i2_bf16 = 0x204010,
     OHWIo4i4       = 0x400400,
     OHWIo4i4_bf16  = 0x400410,
     OHWIo8i4       = 0x400800,
     OHWIo8i4_bf16  = 0x400810,
     OHWIo16i4      = 0x401000,
     OHWIo16i4_bf16 = 0x401010,
     OHWIo32i4      = 0x402000,
     OHWIo32i4_bf16 = 0x402010,
     OHWIo64i4      = 0x404000,
     OHWIo64i4_bf16 = 0x404010,
     OHWIo2i8       = 0x800200,
     OHWIo4i8       = 0x800400,
     OHWIo8i8       = 0x800800,
     OHWIo16i8      = 0x801000,
     OHWIo32i8      = 0x802000,
     OHWIo64i8      = 0x804000
 };

Function Documentation

◆ adjust_down()

int arm_compute::adjust_down	(	int	required,
		int	available,
		int	step
	)

inline

Decrease required in steps of step until it's less than available.

Parameters

[in]	required	Number of required bytes.
[in]	available	Number of available bytes.
[in]	step	Step size used to decrease required bytes.

Returns: Largest value smaller than available that is a multiple of step

Definition at line 47 of file IAccessWindow.h.

 {
     ARM_COMPUTE_ERROR_ON(step <= 0);
  
     return required - step * ((required - available + step - 1) / step);
 }

References ARM_COMPUTE_ERROR_ON, GemmTuner::required, and arm_compute::cpu::step.

Referenced by AccessWindowTranspose::update_window_if_needed(), and AccessWindowRectangle::update_window_if_needed().

◆ adjust_odd_shape()

TensorShape arm_compute::adjust_odd_shape	(	const TensorShape &	shape,
		Format	format
	)

inline

Adjust tensor shape size if width or height are odd for a given multi-planar format.

No modification is done for other formats.

Note: Adding here a few links discussing the issue of odd size and sharing the same solution: Android Source WebM libYUV YUVPlayer *

Parameters

[in,out]	shape	Tensor shape of 2D size
[in]	format	Format of the tensor

Returns: The adjusted tensor shape.

Definition at line 338 of file Validate.h.

 {
     TensorShape output{shape};
  
     // Force width to be even for formats which require subsampling of the U and V channels
     if (has_format_horizontal_subsampling(format))
     {
         output.set(0, (output.x() + 1) & ~1U);
     }
  
     // Force height to be even for formats which require subsampling of the U and V channels
     if (has_format_vertical_subsampling(format))
     {
         output.set(1, (output.y() + 1) & ~1U);
     }
  
     return output;
 }

References update_supported_ops::format, has_format_horizontal_subsampling(), has_format_vertical_subsampling(), arm_compute::test::validation::shape, and arm_compute::utils::cast::U.

◆ adjust_up()

int arm_compute::adjust_up	(	int	required,
		int	available,
		int	step
	)

inline

Increase required in steps of step until it's greater than available.

Parameters

[in]	required	Number of required bytes.
[in]	available	Number of available bytes.
[in]	step	Step size used to increase required bytes.

Returns: Largest value smaller than available that is a multiple of step

Definition at line 63 of file IAccessWindow.h.

 {
     ARM_COMPUTE_ERROR_ON(step <= 0);
  
     return required + step * ((available - required + step - 1) / step);
 }

References ARM_COMPUTE_ERROR_ON, GemmTuner::required, and arm_compute::cpu::step.

Referenced by AccessWindowTranspose::update_window_if_needed(), and AccessWindowRectangle::update_window_if_needed().

◆ adjust_vec_size()

unsigned int arm_compute::adjust_vec_size	(	unsigned int	vec_size,
		size_t	dim0
	)

inline

Returns the adjusted vector size in case it is less than the input's first dimension, getting rounded down to its closest valid vector size.

Parameters

[in]	vec_size	vector size to be adjusted
[in]	dim0	size of the first dimension

Returns: the number of element processed along the X axis per thread

Definition at line 38 of file AdjustVecSize.h.

 {
     ARM_COMPUTE_ERROR_ON(vec_size > 16);
  
     if ((vec_size >= dim0) && (dim0 == 3))
     {
         return dim0;
     }
  
     while (vec_size > dim0)
     {
         vec_size >>= 1;
     }
  
     return vec_size;
 }

References ARM_COMPUTE_ERROR_ON.

◆ arm_matrix_multiply_supported()

bool arm_matrix_multiply_supported ( const cl::Device & device )

Helper function to check whether the cl_arm_matrix_multiply extension is supported.

Parameters

[in] device A CL device

Returns: True if the extension is supported

Definition at line 499 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_arm_matrix_multiply");
 }

References device_supports_extension().

Referenced by ClMatMulLowpNativeMMULKernel::validate(), and ClMatMulNativeMMULKernel::validate().

◆ arm_non_uniform_workgroup_supported()

bool arm_non_uniform_workgroup_supported ( const cl::Device & device )

Helper function to check whether the arm_non_uniform_work_group_size extension is supported.

Parameters

[in] device A CL device

Returns: True if the extension is supported

Definition at line 232 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_arm_non_uniform_work_group_size");
 }

References device_supports_extension().

◆ auto_init_if_empty() [1/2]

bool arm_compute::auto_init_if_empty	(	ITensorInfo &	info,
		const TensorShape &	shape,
		int	num_channels,
		DataType	data_type,
		QuantizationInfo	quantization_info = `QuantizationInfo()`
	)

inline

Auto initialize the tensor info (shape, number of channels and data type) if the current assignment is empty.

Parameters

[in,out]	info	Tensor info used to check and assign.
[in]	shape	New shape.
[in]	num_channels	New number of channels.
[in]	data_type	New data type
[in]	quantization_info	(Optional) New quantization info

Returns: True if the tensor info has been initialized

Definition at line 43 of file AutoConfiguration.h.

 {
     if (info.tensor_shape().total_size() == 0)
     {
         info.set_data_type(data_type);
         info.set_num_channels(num_channels);
         info.set_tensor_shape(shape);
         info.set_quantization_info(quantization_info);
         return true;
     }
  
     return false;
 }

References arm_compute::test::validation::data_type, arm_compute::test::validation::info, and arm_compute::test::validation::shape.

◆ auto_init_if_empty() [2/2]

bool arm_compute::auto_init_if_empty	(	ITensorInfo &	info_sink,
		const ITensorInfo &	info_source
	)

inline

Auto initialize the tensor info using another tensor info.

(COMPMID-6012) This method should remain in sync with the fields of ITensorInfo that have setters.

Parameters

info_sink	Tensor info used to check and assign
info_source	Tensor info used to assign

Returns: True if the tensor info has been initialized

Definition at line 72 of file AutoConfiguration.h.

 {
     if (info_sink.tensor_shape().total_size() == 0)
     {
         info_sink.set_data_type(info_source.data_type());
         info_sink.set_num_channels(info_source.num_channels());
         info_sink.set_tensor_shape(info_source.tensor_shape());
         info_sink.set_quantization_info(info_source.quantization_info());
         info_sink.set_data_layout(info_source.data_layout());
         info_sink.set_are_values_constant(info_source.are_values_constant());
         return true;
     }
  
     return false;
 }

References ITensorInfo::are_values_constant(), ITensorInfo::data_layout(), ITensorInfo::data_type(), ITensorInfo::num_channels(), ITensorInfo::quantization_info(), ITensorInfo::set_are_values_constant(), ITensorInfo::set_data_layout(), ITensorInfo::set_data_type(), ITensorInfo::set_num_channels(), ITensorInfo::set_quantization_info(), ITensorInfo::set_tensor_shape(), ITensorInfo::tensor_shape(), and TensorShape::total_size().

◆ block_by()

int arm_compute::block_by ( const WeightFormat wf )

inline

Definition at line 1656 of file Types.h.

 {
     return (static_cast<int>(wf) >> 20) & 0xF;
 }

Referenced by CpuGemmConv2d::configure(), CpuGemm::validate(), and CpuGemmConv2d::validate().

◆ build_information()

std::string arm_compute::build_information ( )

Returns the arm_compute library build information.

Contains the version number and the build options used to build the library

Returns: The arm_compute library build information

Referenced by main().

◆ calculate_max_enlarged_window()

Window arm_compute::calculate_max_enlarged_window	(	const ValidRegion &	valid_region,
		const Steps &	steps,
		BorderSize	border_size
	)

Definition at line 147 of file WindowHelpers.cpp.

 {
     const Coordinates &anchor = valid_region.anchor;
     const TensorShape &shape  = valid_region.shape;
  
     Window window;
  
     window.set(0, Window::Dimension(
                       // move the anchor to the start from the border
                       anchor[0] - border_size.left,
                       // move the anchor to include the right end border
                       // Make sure the window width is a multiple of the step size
                       anchor[0] - border_size.left +
                           ceil_to_multiple(shape[0] + border_size.left + border_size.right, steps[0]),
                       steps[0]));
  
     size_t n = 1;
  
     if (anchor.num_dimensions() > 1)
     {
         window.set(1, Window::Dimension(
                           // Include the border above the image
                           anchor[1] - border_size.top,
                           // Include the border below the image
                           anchor[1] - border_size.top +
                               ceil_to_multiple(shape[1] + border_size.top + border_size.bottom, steps[1]),
                           steps[1]));
  
         ++n;
     }
  
     if (anchor.num_dimensions() > 2)
     {
         window.set(2, Window::Dimension(0, std::max<size_t>(1, shape[n]), steps[2]));
  
         ++n;
     }
  
     for (; n < anchor.num_dimensions(); ++n)
     {
         window.set(n, Window::Dimension(anchor[n], std::max<size_t>(1, shape[n])));
     }
  
     for (; n < Coordinates::num_max_dimensions; ++n)
     {
         window.set(n, Window::Dimension(0, 1));
     }
  
     return window;
 }

References ValidRegion::anchor, BorderSize::bottom, ceil_to_multiple(), BorderSize::left, Dimensions< T >::num_dimensions(), Dimensions< int >::num_max_dimensions, BorderSize::right, Window::set(), arm_compute::test::validation::shape, ValidRegion::shape, BorderSize::top, and arm_compute::test::validation::valid_region.

◆ calculate_max_window() [1/2]

Window arm_compute::calculate_max_window	(	const TensorShape &	shape,
		const Steps &	steps,
		bool	skip_border,
		BorderSize	border_size
	)

Definition at line 90 of file WindowHelpers.cpp.

 {
     if (!skip_border)
     {
         border_size = BorderSize(0);
     }
  
     Window window;
  
     window.set(0, Window::Dimension(
                       // Skip the border left of the image
                       border_size.left,
                       // Skip the border right of the image
                       // Make sure the window width is a multiple of the step size
                       border_size.left +
                           ceil_to_multiple(std::max(0, static_cast<int>(shape[0]) - static_cast<int>(border_size.left) -
                                                            static_cast<int>(border_size.right)),
                                            steps[0]),
                       steps[0]));
  
     size_t n = 1;
  
     if (shape.num_dimensions() > 1)
     {
         window.set(1, Window::Dimension(
                           // Skip the border above the image
                           border_size.top,
                           // Skip the border below the image
                           border_size.top + ceil_to_multiple(std::max(0, static_cast<int>(shape[1]) -
                                                                              static_cast<int>(border_size.top) -
                                                                              static_cast<int>(border_size.bottom)),
                                                              steps[1]),
                           steps[1]));
  
         ++n;
     }
  
     if (shape.num_dimensions() > 2)
     {
         window.set(2, Window::Dimension(0, std::max<size_t>(1, shape[2]), steps[2]));
  
         ++n;
     }
  
     for (; n < shape.num_dimensions(); ++n)
     {
         window.set(n, Window::Dimension(0, std::max<size_t>(1, shape[n])));
     }
  
     for (; n < Coordinates::num_max_dimensions; ++n)
     {
         window.set(n, Window::Dimension(0, 1));
     }
  
     return window;
 }

References BorderSize::bottom, ceil_to_multiple(), BorderSize::left, Dimensions< int >::num_max_dimensions, BorderSize::right, Window::set(), arm_compute::test::validation::shape, and BorderSize::top.

◆ calculate_max_window() [2/2]

Window arm_compute::calculate_max_window	(	const ValidRegion &	valid_region,
		const Steps &	steps,
		bool	skip_border,
		BorderSize	border_size
	)

Definition at line 29 of file WindowHelpers.cpp.

 {
     if (!skip_border)
     {
         border_size = BorderSize(0);
     }
  
     const Coordinates &anchor = valid_region.anchor;
     const TensorShape &shape  = valid_region.shape;
  
     Window window;
  
     window.set(0, Window::Dimension(
                       // Skip the border left of the image
                       anchor[0] + border_size.left,
                       // Skip the border right of the image
                       // Make sure the window width is a multiple of the step size
                       anchor[0] + border_size.left +
                           ceil_to_multiple(std::max(0, static_cast<int>(shape[0]) - static_cast<int>(border_size.left) -
                                                            static_cast<int>(border_size.right)),
                                            steps[0]),
                       steps[0]));
  
     size_t n = 1;
  
     if (anchor.num_dimensions() > 1)
     {
         window.set(1,
                    Window::Dimension(
                        // Skip the border above the image
                        anchor[1] + border_size.top,
                        // Skip the border below the image
                        anchor[1] + border_size.top +
                            ceil_to_multiple(std::max(0, static_cast<int>(shape[1]) - static_cast<int>(border_size.top) -
                                                             static_cast<int>(border_size.bottom)),
                                             steps[1]),
                        steps[1]));
  
         ++n;
     }
  
     if (anchor.num_dimensions() > 2)
     {
         window.set(2, Window::Dimension(anchor[2], std::max<size_t>(1, shape[2]), steps[2]));
  
         ++n;
     }
  
     for (; n < anchor.num_dimensions(); ++n)
     {
         window.set(n, Window::Dimension(anchor[n], std::max<size_t>(1, shape[n])));
     }
  
     for (; n < Coordinates::num_max_dimensions; ++n)
     {
         window.set(n, Window::Dimension(0, 1));
     }
  
     return window;
 }

References ValidRegion::anchor, BorderSize::bottom, ceil_to_multiple(), BorderSize::left, Dimensions< T >::num_dimensions(), Dimensions< int >::num_max_dimensions, BorderSize::right, Window::set(), arm_compute::test::validation::shape, ValidRegion::shape, BorderSize::top, and arm_compute::test::validation::valid_region.

◆ calculate_max_window_horizontal()

Window arm_compute::calculate_max_window_horizontal	(	const ValidRegion &	valid_region,
		const Steps &	steps,
		bool	skip_border,
		BorderSize	border_size
	)

Definition at line 198 of file WindowHelpers.cpp.

 {
     if (skip_border)
     {
         border_size.top    = 0;
         border_size.bottom = 0;
     }
     else
     {
         border_size.left  = 0;
         border_size.right = 0;
     }
  
     const Coordinates &anchor = valid_region.anchor;
     const TensorShape &shape  = valid_region.shape;
  
     Window window;
  
     window.set(0, Window::Dimension(
                       // Skip the border left of the image
                       anchor[0] + border_size.left,
                       // Skip the border right of the image
                       // Make sure the window width is a multiple of the step size
                       anchor[0] + border_size.left +
                           ceil_to_multiple(std::max(0, static_cast<int>(shape[0]) - static_cast<int>(border_size.left) -
                                                            static_cast<int>(border_size.right)),
                                            steps[0]),
                       steps[0]));
  
     size_t n = 1;
  
     if (anchor.num_dimensions() > 1)
     {
         window.set(1, Window::Dimension(
                           // Skip the border above the image
                           anchor[1] - border_size.top,
                           // Skip the border below the image
                           anchor[1] + shape[1] + border_size.bottom, 1));
  
         ++n;
     }
  
     for (; n < anchor.num_dimensions(); ++n)
     {
         window.set(n, Window::Dimension(anchor[n], std::max<size_t>(1, shape[n])));
     }
  
     for (; n < Coordinates::num_max_dimensions; ++n)
     {
         window.set(n, Window::Dimension(0, 1));
     }
  
     return window;
 }

References ValidRegion::anchor, BorderSize::bottom, ceil_to_multiple(), BorderSize::left, Dimensions< T >::num_dimensions(), Dimensions< int >::num_max_dimensions, BorderSize::right, Window::set(), arm_compute::test::validation::shape, ValidRegion::shape, BorderSize::top, and arm_compute::test::validation::valid_region.

Referenced by CpuGemmLowpMatrixBReductionKernel::configure().

◆ calculate_same_pad()

PadStrideInfo calculate_same_pad	(	TensorShape	input_shape,
		TensorShape	weights_shape,
		PadStrideInfo	conv_info,
		DataLayout	data_layout = `DataLayout::NCHW`,
		const Size2D &	dilation = `Size2D(1u, 1u)`,
		const DimensionRoundingType &	rounding_type = `DimensionRoundingType::FLOOR`
	)

Calculate padding requirements in case of SAME padding.

Parameters

[in]	input_shape	Input shape
[in]	weights_shape	Weights shape
[in]	conv_info	Convolution information (containing strides)
[in]	data_layout	(Optional) Data layout of the input and weights tensor
[in]	dilation	(Optional) Dilation factor used in the convolution.
[in]	rounding_type	(Optional) Dimension rounding type when down-scaling.

Returns: PadStrideInfo for SAME padding

Definition at line 218 of file Utils.cpp.

 {
     const auto &strides = conv_info.stride();
     ARM_COMPUTE_ERROR_ON_MSG((strides.first < 1 || strides.second < 1),
                              "Stride values should be greater than or equal to 1.");
  
     const unsigned int width_idx     = get_data_layout_dimension_index(data_layout, DataLayoutDimension::WIDTH);
     const unsigned int height_idx    = get_data_layout_dimension_index(data_layout, DataLayoutDimension::HEIGHT);
     const unsigned int in_width      = input_shape[width_idx];
     const unsigned int in_height     = input_shape[height_idx];
     const unsigned int kernel_width  = weights_shape[width_idx];
     const unsigned int kernel_height = weights_shape[height_idx];
  
     // Calculate output dimensions
     const auto         is_ceil    = static_cast<unsigned int>(rounding_type == DimensionRoundingType::CEIL);
     const unsigned int out_width  = ((in_width - is_ceil) + strides.first - 1) / strides.first + is_ceil;
     const unsigned int out_height = ((in_height - is_ceil) + strides.second - 1) / strides.second + is_ceil;
  
     // Calculate effective weights sizes
     const int real_weight_width  = (kernel_width - 1) * dilation.x() + 1;
     const int real_weight_height = (kernel_height - 1) * dilation.y() + 1;
  
     // Calculate total pad
     const int pad_width = std::max(0, static_cast<int>((out_width - 1) * strides.first + real_weight_width - in_width));
     const int pad_height =
         std::max(0, static_cast<int>((out_height - 1) * strides.second + real_weight_height - in_height));
  
     // Calculate individual paddings
     const unsigned int pad_left   = pad_width / 2;
     const unsigned int pad_top    = pad_height / 2;
     const unsigned int pad_right  = pad_width - pad_left;
     const unsigned int pad_bottom = pad_height - pad_top;
  
     PadStrideInfo same_info(strides.first, strides.second, pad_left, pad_right, pad_top, pad_bottom, rounding_type);
  
     // Check for correctness of predicted output shape against the one calculated using the generated info
     const auto out_dims = scaled_dimensions(in_width, in_height, kernel_width, kernel_height, same_info, dilation);
     ARM_COMPUTE_ERROR_ON(out_dims.first != out_width || out_dims.second != out_height);
     ARM_COMPUTE_UNUSED(out_dims);
  
     return same_info;
 }

References ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_ERROR_ON_MSG, ARM_COMPUTE_UNUSED, CEIL, arm_compute::test::validation::conv_info, arm_compute::cpu::data_layout, get_data_layout_dimension_index(), HEIGHT, arm_compute::cpu::height_idx, arm_compute::test::validation::input_shape, scaled_dimensions(), WIDTH, arm_compute::cpu::width_idx, Size2D::x(), and Size2D::y().

Referenced by arm_compute::utils::calculate_convolution_padding().

◆ calculate_squashed_or_max_window() [1/2]

std::pair<Window, size_t> arm_compute::calculate_squashed_or_max_window ( const ITensorInfo & src )

Definition at line 307 of file WindowHelpers.cpp.

 {
     const auto &shape          = src.tensor_shape();
     const auto &strides        = src.strides_in_bytes();
     const auto  num_dimensions = src.num_dimensions();
  
     Window win;
     size_t split_dimension = Window::DimY;
     size_t dim             = 0;
     size_t squashed_bytes  = src.element_size();
  
     // Try to squash the low dimensions together.
     for (; dim < num_dimensions; ++dim)
     {
         if (strides[dim] != squashed_bytes)
         {
             break;
         }
         squashed_bytes *= shape[dim];
     }
     if (dim == num_dimensions)
     {
         const auto squashed_elements = squashed_bytes / src.element_size();
         split_dimension              = Window::DimX;
         // The input tensor can be interpreted as 1D array.
         win.set(0, Window::Dimension(0, squashed_elements, 1));
         for (dim = 1; dim < Coordinates::num_max_dimensions; ++dim)
         {
             win.set(dim, Window::Dimension(0, 1, 1));
         }
     }
     else
     {
         // Generate the max window.
         for (dim = 0; dim < Coordinates::num_max_dimensions; ++dim)
         {
             win.set(dim, Window::Dimension(0, shape[dim], 1));
         }
     }
     return std::make_pair(win, split_dimension);
 }

References Window::DimX, Window::DimY, Dimensions< int >::num_max_dimensions, Window::set(), arm_compute::test::validation::shape, and arm_compute::test::validation::src.

◆ calculate_squashed_or_max_window() [2/2]

std::pair<Window, size_t> arm_compute::calculate_squashed_or_max_window	(	const ITensorInfo &	src0,
		const ITensorInfo &	src1
	)

Definition at line 256 of file WindowHelpers.cpp.

 {
     const auto &shape0         = src0.tensor_shape();
     const auto &shape1         = src1.tensor_shape();
     const auto &strides0       = src0.strides_in_bytes();
     const auto &strides1       = src1.strides_in_bytes();
     const auto  num_dimensions = std::max(src0.num_dimensions(), src1.num_dimensions());
  
     Window win;
     size_t split_dimension = Window::DimY;
     size_t dim             = 0;
  
     size_t squashed_bytes = src0.element_size();
  
     // Try to squash the low dimensions together.
     for (; dim < num_dimensions; ++dim)
     {
         if (shape0[dim] != shape1[dim] || strides0[dim] != squashed_bytes || strides1[dim] != squashed_bytes)
         {
             break;
         }
  
         squashed_bytes *= shape0[dim];
     }
  
     if (dim == num_dimensions)
     {
         auto squashed_elements = squashed_bytes / src0.element_size();
  
         split_dimension = Window::DimX;
  
         // The input tensors can be interpreted as 1D array.
         win.set(0, Window::Dimension(0, squashed_elements, 1));
  
         for (dim = 1; dim < Coordinates::num_max_dimensions; ++dim)
         {
             win.set(dim, Window::Dimension(0, 1, 1));
         }
     }
     else
     {
         // Generates the max window.
         for (dim = 0; dim < Coordinates::num_max_dimensions; ++dim)
         {
             win.set(dim, Window::Dimension(0, std::max(shape0[dim], shape1[dim]), 1));
         }
     }
  
     return std::make_pair(win, split_dimension);
 }

References Window::DimX, Window::DimY, ITensorInfo::element_size(), ITensorInfo::num_dimensions(), Dimensions< int >::num_max_dimensions, Window::set(), ITensorInfo::strides_in_bytes(), and ITensorInfo::tensor_shape().

Referenced by CpuActivationKernel::configure(), CpuSubKernel::configure(), CpuAddKernel::configure(), CpuMulKernel::configure(), and CpuReshapeKernel::prepare().

◆ calculate_subsampled_shape()

TensorShape arm_compute::calculate_subsampled_shape	(	const TensorShape &	shape,
		Format	format,
		Channel	channel = `Channel::UNKNOWN`
	)

inline

Calculate subsampled shape for a given format and channel.

Parameters

[in]	shape	Shape of the tensor to calculate the extracted channel.
[in]	format	Format of the tensor.
[in]	channel	Channel to create tensor shape to be extracted.

Returns: The subsampled tensor shape.

Definition at line 400 of file Validate.h.

 {
     TensorShape output{shape};
  
     // Subsample shape only for U or V channel
     if (Channel::U == channel || Channel::V == channel || Channel::UNKNOWN == channel)
     {
         // Subsample width for the tensor shape when channel is U or V
         if (has_format_horizontal_subsampling(format))
         {
             output.set(0, output.x() / 2U);
         }
  
         // Subsample height for the tensor shape when channel is U or V
         if (has_format_vertical_subsampling(format))
         {
             output.set(1, output.y() / 2U);
         }
     }
  
     return output;
 }

References update_supported_ops::format, has_format_horizontal_subsampling(), has_format_vertical_subsampling(), arm_compute::test::validation::shape, U, arm_compute::utils::cast::U, UNKNOWN, and V.

Referenced by error_on_tensors_not_subsampled().

◆ calculate_valid_region_scale()

ValidRegion calculate_valid_region_scale	(	const ITensorInfo &	src_info,
		const TensorShape &	dst_shape,
		InterpolationPolicy	interpolate_policy,
		SamplingPolicy	sampling_policy,
		bool	border_undefined
	)

Helper function to calculate the Valid Region for Scale.

Parameters

[in]	src_info	Input tensor info used to check.
[in]	dst_shape	Shape of the output.
[in]	interpolate_policy	Interpolation policy.
[in]	sampling_policy	Sampling policy.
[in]	border_undefined	True if the border is undefined.

Returns: The corresponding valid region

Definition at line 28 of file Helpers.cpp.

 {
     const DataLayout data_layout = src_info.data_layout();
     const int        idx_width   = get_data_layout_dimension_index(data_layout, DataLayoutDimension::WIDTH);
     const int        idx_height  = get_data_layout_dimension_index(data_layout, DataLayoutDimension::HEIGHT);
  
     const float scale_x        = static_cast<float>(dst_shape[idx_width]) / src_info.tensor_shape()[idx_width];
     const float scale_y        = static_cast<float>(dst_shape[idx_height]) / src_info.tensor_shape()[idx_height];
     const float sampling_point = (sampling_policy == SamplingPolicy::CENTER) ? 0.5f : 0.0f;
  
     // Get input's valid region start and end points
     const int valid_start_in_x = src_info.valid_region().anchor[idx_width];
     const int valid_start_in_y = src_info.valid_region().anchor[idx_height];
     const int valid_end_in_x   = src_info.valid_region().anchor[idx_width] + src_info.valid_region().shape[idx_width];
     const int valid_end_in_y   = src_info.valid_region().anchor[idx_height] + src_info.valid_region().shape[idx_height];
  
     // Initialize output's valid region start and end points
     auto valid_start_out_x = static_cast<int>(valid_start_in_x * scale_x);
     auto valid_start_out_y = static_cast<int>(valid_start_in_y * scale_y);
     auto valid_end_out_x   = std::min<int>(std::ceil(valid_end_in_x * scale_x), dst_shape[idx_width]);
     auto valid_end_out_y   = std::min<int>(std::ceil(valid_end_in_y * scale_y), dst_shape[idx_height]);
  
     // Handle valid points in case of the bi-linear interpolation
     if (border_undefined)
     {
         switch (interpolate_policy)
         {
             case InterpolationPolicy::NEAREST_NEIGHBOR:
             {
                 // (start_out + sampling_point) >= (start_in * scale)
                 // start_out = ceil((start_in * scale) - sampling_point)
                 valid_start_out_x = std::ceil(valid_start_in_x * scale_x - sampling_point);
                 valid_start_out_y = std::ceil(valid_start_in_y * scale_y - sampling_point);
  
                 // (end_out - 1 + sampling_point) < (end_in * scale)
                 // end_out   = ceil((end_in * scale) - sampling_point); // <-- ceil(x - 1) strictly less
                 valid_end_out_x = std::ceil(valid_end_in_x * scale_x - sampling_point);
                 valid_end_out_y = std::ceil(valid_end_in_y * scale_y - sampling_point);
                 break;
             }
             case InterpolationPolicy::BILINEAR:
             {
                 // (start_out + sampling_point) >= ((start_in + sampling_point) * scale)
                 // start_out = ceil(((start_in + sampling_point) * scale) - sampling_point)
                 valid_start_out_x = std::ceil((valid_start_in_x + sampling_point) * scale_x - sampling_point);
                 valid_start_out_y = std::ceil((valid_start_in_y + sampling_point) * scale_y - sampling_point);
  
                 // (end_out - 1 + sampling_point) <= ((end_in - 1 + sampling_point) * scale)
                 // end_out   = floor(((end_in - 1 + sampling_point) * scale) - sampling_point + 1)
                 valid_end_out_x = std::floor((valid_end_in_x - 1.f + sampling_point) * scale_x - sampling_point + 1.f);
                 valid_end_out_y = std::floor((valid_end_in_y - 1.f + sampling_point) * scale_y - sampling_point + 1.f);
                 break;
             }
             case InterpolationPolicy::AREA:
                 break;
             default:
             {
                 ARM_COMPUTE_ERROR("Invalid InterpolationPolicy");
                 break;
             }
         }
     }
  
     // Setup output valid region
     ValidRegion valid_region{Coordinates(), dst_shape, dst_shape.num_dimensions()};
  
     valid_region.anchor.set(idx_width, std::max(0, valid_start_out_x));
     valid_region.anchor.set(idx_height, std::max(0, valid_start_out_y));
  
     valid_region.shape.set(idx_width, std::min<size_t>(valid_end_out_x - valid_start_out_x, dst_shape[idx_width]));
     valid_region.shape.set(idx_height, std::min<size_t>(valid_end_out_y - valid_start_out_y, dst_shape[idx_height]));
  
     return valid_region;
 }

Referenced by arm_compute::test::validation::FIXTURE_DATA_TEST_CASE().

◆ ceil_to_multiple()

auto arm_compute::ceil_to_multiple	(	S	value,
		T	divisor
	)		-> decltype(((value + divisor - 1) / divisor) * divisor)

inline

Computes the smallest number larger or equal to value that is a multiple of divisor.

Parameters

[in]	value	Lower bound value
[in]	divisor	Value to compute multiple of.

Returns: the result.

Definition at line 50 of file Math.h.

 {
     ARM_COMPUTE_ERROR_ON(value < 0 || divisor <= 0);
     return DIV_CEIL(value, divisor) * divisor;
 }

References ARM_COMPUTE_ERROR_ON, and DIV_CEIL().

◆ channel_idx_from_format()

int arm_compute::channel_idx_from_format	(	Format	format,
		Channel	channel
	)

inline

Return the channel index of a given channel given an input format.

Parameters

[in]	format	Input format
[in]	channel	Input channel

Returns: The channel index of the specific channel of the specific format

Definition at line 143 of file FormatUtils.h.

 {
     switch (format)
     {
         case Format::RGB888:
         {
             switch (channel)
             {
                 case Channel::R:
                     return 0;
                 case Channel::G:
                     return 1;
                 case Channel::B:
                     return 2;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::RGBA8888:
         {
             switch (channel)
             {
                 case Channel::R:
                     return 0;
                 case Channel::G:
                     return 1;
                 case Channel::B:
                     return 2;
                 case Channel::A:
                     return 3;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::YUYV422:
         {
             switch (channel)
             {
                 case Channel::Y:
                     return 0;
                 case Channel::U:
                     return 1;
                 case Channel::V:
                     return 3;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::UYVY422:
         {
             switch (channel)
             {
                 case Channel::Y:
                     return 1;
                 case Channel::U:
                     return 0;
                 case Channel::V:
                     return 2;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::NV12:
         {
             switch (channel)
             {
                 case Channel::Y:
                     return 0;
                 case Channel::U:
                     return 0;
                 case Channel::V:
                     return 1;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::NV21:
         {
             switch (channel)
             {
                 case Channel::Y:
                     return 0;
                 case Channel::U:
                     return 1;
                 case Channel::V:
                     return 0;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::YUV444:
         case Format::IYUV:
         {
             switch (channel)
             {
                 case Channel::Y:
                     return 0;
                 case Channel::U:
                     return 0;
                 case Channel::V:
                     return 0;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         default:
             ARM_COMPUTE_ERROR("Not supported format");
             return 0;
     }
 }

References A, ARM_COMPUTE_ERROR, B, update_supported_ops::format, G, IYUV, NV12, NV21, R, RGB888, RGBA8888, U, UYVY422, V, Y, YUV444, and YUYV422.

◆ check_value_range()

bool arm_compute::check_value_range	(	T	val,
		DataType	dt,
		QuantizationInfo	qinfo = `QuantizationInfo()`
	)

Returns true if the value can be represented by the given data type.

Parameters

[in]	val	value to be checked
[in]	dt	data type that is checked
[in]	qinfo	(Optional) quantization info if the data type is QASYMM8

Returns: true if the data type can hold the value.

Definition at line 421 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::U8:
         {
             const auto val_u8 = static_cast<uint8_t>(val);
             return ((val_u8 == val) && val >= std::numeric_limits<uint8_t>::lowest() &&
                     val <= std::numeric_limits<uint8_t>::max());
         }
         case DataType::QASYMM8:
         {
             double min = static_cast<double>(dequantize_qasymm8(0, qinfo));
             double max = static_cast<double>(dequantize_qasymm8(std::numeric_limits<uint8_t>::max(), qinfo));
             return ((double)val >= min && (double)val <= max);
         }
         case DataType::S8:
         {
             const auto val_s8 = static_cast<int8_t>(val);
             return ((val_s8 == val) && val >= std::numeric_limits<int8_t>::lowest() &&
                     val <= std::numeric_limits<int8_t>::max());
         }
         case DataType::U16:
         {
             const auto val_u16 = static_cast<uint16_t>(val);
             return ((val_u16 == val) && val >= std::numeric_limits<uint16_t>::lowest() &&
                     val <= std::numeric_limits<uint16_t>::max());
         }
         case DataType::S16:
         {
             const auto val_s16 = static_cast<int16_t>(val);
             return ((val_s16 == val) && val >= std::numeric_limits<int16_t>::lowest() &&
                     val <= std::numeric_limits<int16_t>::max());
         }
         case DataType::U32:
         {
             const auto val_d64 = static_cast<double>(val);
             const auto val_u32 = static_cast<uint32_t>(val);
             return ((val_u32 == val_d64) && val_d64 >= std::numeric_limits<uint32_t>::lowest() &&
                     val_d64 <= std::numeric_limits<uint32_t>::max());
         }
         case DataType::S32:
         {
             const auto val_d64 = static_cast<double>(val);
             const auto val_s32 = static_cast<int32_t>(val);
             return ((val_s32 == val_d64) && val_d64 >= std::numeric_limits<int32_t>::lowest() &&
                     val_d64 <= std::numeric_limits<int32_t>::max());
         }
         case DataType::BFLOAT16:
             return (val >= bfloat16::lowest() && val <= bfloat16::max());
         case DataType::F16:
             return (val >= std::numeric_limits<half>::lowest() && val <= std::numeric_limits<half>::max());
         case DataType::F32:
             return (val >= std::numeric_limits<float>::lowest() && val <= std::numeric_limits<float>::max());
         default:
             ARM_COMPUTE_ERROR("Data type not supported");
             return false;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, dequantize_qasymm8(), dt, F16, F32, bfloat16::lowest(), arm_compute::support::cpp11::lowest(), bfloat16::max(), QASYMM8, arm_compute::test::validation::qinfo, S16, S32, S8, U16, U32, and U8.

◆ cl_winograd_convolution_layer_supported()

bool cl_winograd_convolution_layer_supported	(	const Size2D &	output_tile,
		const Size2D &	kernel_size,
		DataLayout	data_layout
	)

This function checks if the Winograd configuration (defined through the output tile, kernel size and the data layout) is supported on OpenCL.

Parameters

[in]	output_tile	Output tile for the Winograd filtering algorithm
[in]	kernel_size	Kernel size for the Winograd filtering algorithm
[in]	data_layout	Data layout of the input tensor

Returns: True if the configuration is supported

Definition at line 292 of file CLHelpers.cpp.

 {
     ARM_COMPUTE_ERROR_ON(data_layout == DataLayout::UNKNOWN);
  
     using WinogradConfiguration = std::pair<std::pair<int, int>, std::pair<int, int>>;
  
     std::vector<WinogradConfiguration> winograd_configs_nchw = {
         WinogradConfiguration(std::pair<int, int>(1, 2), std::pair<int, int>(1, 3)),
         WinogradConfiguration(std::pair<int, int>(1, 4), std::pair<int, int>(1, 3)),
         WinogradConfiguration(std::pair<int, int>(2, 1), std::pair<int, int>(3, 1)),
         WinogradConfiguration(std::pair<int, int>(4, 1), std::pair<int, int>(3, 1)),
         WinogradConfiguration(std::pair<int, int>(2, 2), std::pair<int, int>(3, 3)),
         WinogradConfiguration(std::pair<int, int>(4, 4), std::pair<int, int>(3, 3)),
         WinogradConfiguration(std::pair<int, int>(4, 4), std::pair<int, int>(5, 5)),
         WinogradConfiguration(std::pair<int, int>(4, 1), std::pair<int, int>(5, 1)),
         WinogradConfiguration(std::pair<int, int>(1, 4), std::pair<int, int>(1, 5))};
  
     std::vector<WinogradConfiguration> winograd_configs_nhwc = {
         WinogradConfiguration(std::pair<int, int>(2, 2), std::pair<int, int>(3, 3)),
         WinogradConfiguration(std::pair<int, int>(1, 4), std::pair<int, int>(1, 3)),
         WinogradConfiguration(std::pair<int, int>(4, 1), std::pair<int, int>(3, 1)),
         WinogradConfiguration(std::pair<int, int>(4, 4), std::pair<int, int>(3, 3)),
         WinogradConfiguration(std::pair<int, int>(4, 4), std::pair<int, int>(5, 5)),
         WinogradConfiguration(std::pair<int, int>(4, 1), std::pair<int, int>(5, 1)),
         WinogradConfiguration(std::pair<int, int>(1, 4), std::pair<int, int>(1, 5)),
         WinogradConfiguration(std::pair<int, int>(1, 2), std::pair<int, int>(1, 7)),
         WinogradConfiguration(std::pair<int, int>(2, 1), std::pair<int, int>(7, 1)),
         WinogradConfiguration(std::pair<int, int>(2, 2), std::pair<int, int>(7, 7)),
     };
  
     auto p = std::make_pair(std::pair<int, int>(output_tile.width, output_tile.height),
                             std::pair<int, int>(kernel_size.width, kernel_size.height));
  
     // Return true if supported
     if (data_layout == DataLayout::NCHW)
     {
         return (std::find(winograd_configs_nchw.begin(), winograd_configs_nchw.end(), p) !=
                 winograd_configs_nchw.end());
     }
     else
     {
         return (std::find(winograd_configs_nhwc.begin(), winograd_configs_nhwc.end(), p) !=
                 winograd_configs_nhwc.end());
     }
 }

References ARM_COMPUTE_ERROR_ON, arm_compute::cpu::data_layout, Size2D::height, NCHW, UNKNOWN, and Size2D::width.

◆ colorconvert_iyuv_to_nv12()

void arm_compute::colorconvert_iyuv_to_nv12	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert IYUV to NV12.

Parameters

[in]	input	Input IYUV data buffer.
[out]	output	Output NV12 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 717 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IMultiImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in_y(input_ptr->plane(0), win);
     Iterator in_u(input_ptr->plane(1), win_uv);
     Iterator in_v(input_ptr->plane(2), win_uv);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_uv(output_ptr->plane(1), win_uv);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto   ta_y_top    = vld2q_u8(in_y.ptr());
             const auto   ta_y_bottom = vld2q_u8(in_y.ptr() + input_ptr->plane(0)->info()->strides_in_bytes().y());
             uint8x16x2_t ta_uv;
             ta_uv.val[0] = vld1q_u8(in_u.ptr());
             ta_uv.val[1] = vld1q_u8(in_v.ptr());
             //ta_y.val[0] = Y0 Y2 Y4 Y6 ...
             //ta_y.val[1] = Y1 Y3 Y5 Y7 ...
             //ta_uv.val[0] = U0 U2 U4 U6 ...
             //ta_uv.val[1] = V0 V2 V4 V6 ...
  
             vst2q_u8(out_y.ptr(), ta_y_top);
             vst2q_u8(out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), ta_y_bottom);
             vst2q_u8(out_uv.ptr(), ta_uv);
         },
         in_y, in_u, in_v, out_y, out_uv);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_iyuv_to_rgb()

void arm_compute::colorconvert_iyuv_to_rgb	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert IYUV to RGB.

Parameters

[in]	input	Input IYUV data buffer.
[out]	output	Output RGB buffer.
[in]	win	Window for iterating the buffers.

Definition at line 562 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IMultiImage *__restrict>(input);
     const auto output_ptr = static_cast<IImage *__restrict>(output);
  
     constexpr auto element_size = alpha ? 32 : 24;
     const auto     out_stride   = output_ptr->info()->strides_in_bytes().y();
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in_y(input_ptr->plane(0), win);
     Iterator in_u(input_ptr->plane(1), win_uv);
     Iterator in_v(input_ptr->plane(2), win_uv);
     Iterator out(output_ptr, win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto *y_top_ptr    = in_y.ptr();
             const auto *y_bottom_ptr = in_y.ptr() + input_ptr->plane(0)->info()->strides_in_bytes().y();
             const auto *u_ptr        = in_u.ptr();
             const auto *v_ptr        = in_v.ptr();
  
         // Work-around issue in gcc 9(>=) where vld2q might cause issues with register allocation
 #if defined(__arch64__)
             const auto ta0_y_top    = vld1q_u8(y_top_ptr);
             const auto ta1_y_top    = vld1q_u8(y_top_ptr + 16);
             const auto ta0_y_bottom = vld1q_u8(y_bottom_ptr);
             const auto ta1_y_bottom = vld1q_u8(y_bottom_ptr + 16);
             const auto ta_u         = vld1q_u8(u_ptr);
             const auto ta_v         = vld1q_u8(v_ptr);
  
             // Convert the uint8x16x4_t to float32x4x4_t
             float32x4x4_t yvec_top  = arm_compute::convert_uint8x16_to_float32x4x4(vuzp1q_u8(ta0_y_top, ta1_y_top));
             float32x4x4_t yyvec_top = arm_compute::convert_uint8x16_to_float32x4x4(vuzp2q_u8(ta0_y_top, ta1_y_top));
             float32x4x4_t yvec_bottom =
                 arm_compute::convert_uint8x16_to_float32x4x4(vuzp1q_u8(ta0_y_bottom, ta1_y_bottom));
             float32x4x4_t yyvec_bottom =
                 arm_compute::convert_uint8x16_to_float32x4x4(vuzp2q_u8(ta0_y_bottom, ta1_y_bottom));
             float32x4x4_t uvec = arm_compute::convert_uint8x16_to_float32x4x4(ta_u);
             float32x4x4_t vvec = arm_compute::convert_uint8x16_to_float32x4x4(ta_v);
 #else  /* defined(__arch64__) */
             const auto ta_y_top    = vld2q_u8(y_top_ptr);
             const auto ta_y_bottom = vld2q_u8(y_bottom_ptr);
             const auto ta_u        = vld1q_u8(u_ptr);
             const auto ta_v        = vld1q_u8(v_ptr);
             //ta_y.val[0] = Y0 Y2 Y4 Y6 ...
             //ta_y.val[1] = Y1 Y3 Y5 Y7 ...
             //ta_u.val[0] = U0 U2 U4 U6 ...
             //ta_v.val[0] = V0 V2 V4 V6 ...
  
             // Convert the uint8x16x4_t to float32x4x4_t
             float32x4x4_t yvec_top     = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_top.val[0]);
             float32x4x4_t yyvec_top    = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_top.val[1]);
             float32x4x4_t yvec_bottom  = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_bottom.val[0]);
             float32x4x4_t yyvec_bottom = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_bottom.val[1]);
             float32x4x4_t uvec         = arm_compute::convert_uint8x16_to_float32x4x4(ta_u);
             float32x4x4_t vvec         = arm_compute::convert_uint8x16_to_float32x4x4(ta_v);
 #endif /* defined(__arch64__) */
  
             yuyv_to_rgb_calculation(yvec_top.val[0], uvec.val[0], yyvec_top.val[0], vvec.val[0],
                                     out.ptr() + 0 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_top.val[1], uvec.val[1], yyvec_top.val[1], vvec.val[1],
                                     out.ptr() + 1 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_top.val[2], uvec.val[2], yyvec_top.val[2], vvec.val[2],
                                     out.ptr() + 2 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_top.val[3], uvec.val[3], yyvec_top.val[3], vvec.val[3],
                                     out.ptr() + 3 * element_size, alpha);
  
             yuyv_to_rgb_calculation(yvec_bottom.val[0], uvec.val[0], yyvec_bottom.val[0], vvec.val[0],
                                     out.ptr() + out_stride + 0 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_bottom.val[1], uvec.val[1], yyvec_bottom.val[1], vvec.val[1],
                                     out.ptr() + out_stride + 1 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_bottom.val[2], uvec.val[2], yyvec_bottom.val[2], vvec.val[2],
                                     out.ptr() + out_stride + 2 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_bottom.val[3], uvec.val[3], yyvec_bottom.val[3], vvec.val[3],
                                     out.ptr() + out_stride + 3 * element_size, alpha);
         },
         in_y, in_u, in_v, out);
 }

References ARM_COMPUTE_ERROR_ON, convert_uint8x16_to_float32x4x4(), Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), ITensor::info(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), ITensorInfo::strides_in_bytes(), Window::validate(), Window::x(), Dimensions< T >::y(), Window::y(), and arm_compute::test::colorconvert_helper::detail::yuyv_to_rgb_calculation().

◆ colorconvert_iyuv_to_yuv4()

void arm_compute::colorconvert_iyuv_to_yuv4	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert IYUV to YUV4.

Parameters

[in]	input	Input IYUV data buffer.
[out]	output	Output YUV4 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 940 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IMultiImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in_y(input_ptr->plane(0), win);
     Iterator in_u(input_ptr->plane(1), win_uv);
     Iterator in_v(input_ptr->plane(2), win_uv);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_u(output_ptr->plane(1), win);
     Iterator out_v(output_ptr->plane(2), win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_y_top    = vld2q_u8(in_y.ptr());
             const auto ta_y_bottom = vld2q_u8(in_y.ptr() + input_ptr->plane(0)->info()->strides_in_bytes().y());
             const auto ta_u        = vld1q_u8(in_u.ptr());
             const auto ta_v        = vld1q_u8(in_v.ptr());
             //ta_y.val[0] = Y0 Y2 Y4 Y6 ...
             //ta_y.val[1] = Y1 Y3 Y5 Y7 ...
             //ta_u = U0 U2 U4 U6 ...
             //ta_v = V0 V2 V4 V6 ...
  
             vst2q_u8(out_y.ptr(), ta_y_top);
             vst2q_u8(out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), ta_y_bottom);
  
             uint8x16x2_t uvec;
             uvec.val[0] = ta_u;
             uvec.val[1] = ta_u;
             vst2q_u8(out_u.ptr(), uvec);
             vst2q_u8(out_u.ptr() + output_ptr->plane(1)->info()->strides_in_bytes().y(), uvec);
  
             uint8x16x2_t vvec;
             vvec.val[0] = ta_v;
             vvec.val[1] = ta_v;
             vst2q_u8(out_v.ptr(), vvec);
             vst2q_u8(out_v.ptr() + output_ptr->plane(2)->info()->strides_in_bytes().y(), vvec);
         },
         in_y, in_u, in_v, out_y, out_u, out_v);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_nv12_to_iyuv()

void arm_compute::colorconvert_nv12_to_iyuv	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert NV12 to IYUV.

Parameters

[in]	input	Input NV12 data buffer.
[out]	output	Output IYUV buffer.
[in]	win	Window for iterating the buffers.

Definition at line 767 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IMultiImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     constexpr auto shift = uv ? 0 : 1;
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in_y(input_ptr->plane(0), win);
     Iterator in_uv(input_ptr->plane(1), win_uv);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_u(output_ptr->plane(1), win_uv);
     Iterator out_v(output_ptr->plane(2), win_uv);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_y_top    = vld2q_u8(in_y.ptr());
             const auto ta_y_bottom = vld2q_u8(in_y.ptr() + input_ptr->plane(0)->info()->strides_in_bytes().y());
             const auto ta_uv       = vld2q_u8(in_uv.ptr());
             //ta_y.val[0] = Y0 Y2 Y4 Y6 ...
             //ta_y.val[1] = Y1 Y3 Y5 Y7 ...
             //ta_uv.val[0] = U0 U2 U4 U6 ...
             //ta_uv.val[1] = V0 V2 V4 V6 ...
  
             vst2q_u8(out_y.ptr(), ta_y_top);
             vst2q_u8(out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), ta_y_bottom);
             vst1q_u8(out_u.ptr(), ta_uv.val[0 + shift]);
             vst1q_u8(out_v.ptr(), ta_uv.val[1 - shift]);
         },
         in_y, in_uv, out_y, out_u, out_v);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_nv12_to_rgb()

void arm_compute::colorconvert_nv12_to_rgb	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert NV12 to RGB.

Parameters

[in]	input	Input NV12 data buffer.
[out]	output	Output RGB buffer.
[in]	win	Window for iterating the buffers.

Definition at line 490 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IMultiImage *__restrict>(input);
     const auto output_ptr = static_cast<IImage *__restrict>(output);
  
     constexpr auto element_size = alpha ? 32 : 24;
     const auto     out_stride   = output_ptr->info()->strides_in_bytes().y();
     constexpr auto shift        = uv ? 0 : 1;
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in_y(input_ptr->plane(0), win);
     Iterator in_uv(input_ptr->plane(1), win_uv);
     Iterator out(output_ptr, win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_y_top    = vld2q_u8(in_y.ptr());
             const auto ta_y_bottom = vld2q_u8(in_y.ptr() + input_ptr->plane(0)->info()->strides_in_bytes().y());
             const auto ta_uv       = vld2q_u8(in_uv.ptr());
             //ta_y.val[0] = Y0 Y2 Y4 Y6 ...
             //ta_y.val[1] = Y1 Y3 Y5 Y7 ...
             //ta_uv.val[0] = U0 U2 U4 U6 ...
             //ta_uv.val[1] = V0 V2 V4 V6 ...
  
             // Convert the uint8x16x4_t to float32x4x4_t
             float32x4x4_t yvec_top     = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_top.val[0]);
             float32x4x4_t yyvec_top    = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_top.val[1]);
             float32x4x4_t yvec_bottom  = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_bottom.val[0]);
             float32x4x4_t yyvec_bottom = arm_compute::convert_uint8x16_to_float32x4x4(ta_y_bottom.val[1]);
             float32x4x4_t uvec         = arm_compute::convert_uint8x16_to_float32x4x4(ta_uv.val[0 + shift]);
             float32x4x4_t vvec         = arm_compute::convert_uint8x16_to_float32x4x4(ta_uv.val[1 - shift]);
  
             yuyv_to_rgb_calculation(yvec_top.val[0], uvec.val[0], yyvec_top.val[0], vvec.val[0],
                                     out.ptr() + 0 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_top.val[1], uvec.val[1], yyvec_top.val[1], vvec.val[1],
                                     out.ptr() + 1 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_top.val[2], uvec.val[2], yyvec_top.val[2], vvec.val[2],
                                     out.ptr() + 2 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_top.val[3], uvec.val[3], yyvec_top.val[3], vvec.val[3],
                                     out.ptr() + 3 * element_size, alpha);
  
             yuyv_to_rgb_calculation(yvec_bottom.val[0], uvec.val[0], yyvec_bottom.val[0], vvec.val[0],
                                     out.ptr() + out_stride + 0 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_bottom.val[1], uvec.val[1], yyvec_bottom.val[1], vvec.val[1],
                                     out.ptr() + out_stride + 1 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_bottom.val[2], uvec.val[2], yyvec_bottom.val[2], vvec.val[2],
                                     out.ptr() + out_stride + 2 * element_size, alpha);
             yuyv_to_rgb_calculation(yvec_bottom.val[3], uvec.val[3], yyvec_bottom.val[3], vvec.val[3],
                                     out.ptr() + out_stride + 3 * element_size, alpha);
         },
         in_y, in_uv, out);
 }

References ARM_COMPUTE_ERROR_ON, convert_uint8x16_to_float32x4x4(), Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), ITensor::info(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), ITensorInfo::strides_in_bytes(), Window::validate(), Window::x(), Dimensions< T >::y(), Window::y(), and arm_compute::test::colorconvert_helper::detail::yuyv_to_rgb_calculation().

◆ colorconvert_nv12_to_yuv4()

void arm_compute::colorconvert_nv12_to_yuv4	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert NV12 to YUV4.

Parameters

[in]	input	Input NV12 data buffer.
[out]	output	Output YUV4 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 880 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IMultiImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     constexpr auto shift = uv ? 0 : 1;
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in_y(input_ptr->plane(0), win);
     Iterator in_uv(input_ptr->plane(1), win_uv);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_u(output_ptr->plane(1), win);
     Iterator out_v(output_ptr->plane(2), win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_y_top    = vld2q_u8(in_y.ptr());
             const auto ta_y_bottom = vld2q_u8(in_y.ptr() + input_ptr->plane(0)->info()->strides_in_bytes().y());
             const auto ta_uv       = vld2q_u8(in_uv.ptr());
             //ta_y.val[0] = Y0 Y2 Y4 Y6 ...
             //ta_y.val[1] = Y1 Y3 Y5 Y7 ...
             //ta_uv.val[0] = U0 U2 U4 U6 ...
             //ta_uv.val[1] = V0 V2 V4 V6 ...
  
             vst2q_u8(out_y.ptr(), ta_y_top);
             vst2q_u8(out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), ta_y_bottom);
  
             uint8x16x2_t uvec;
             uvec.val[0] = ta_uv.val[0 + shift];
             uvec.val[1] = ta_uv.val[0 + shift];
             vst2q_u8(out_u.ptr(), uvec);
             vst2q_u8(out_u.ptr() + output_ptr->plane(1)->info()->strides_in_bytes().y(), uvec);
  
             uint8x16x2_t vvec;
             vvec.val[0] = ta_uv.val[1 - shift];
             vvec.val[1] = ta_uv.val[1 - shift];
             vst2q_u8(out_v.ptr(), vvec);
             vst2q_u8(out_v.ptr() + output_ptr->plane(2)->info()->strides_in_bytes().y(), vvec);
         },
         in_y, in_uv, out_y, out_u, out_v);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_rgb_to_iyuv()

void arm_compute::colorconvert_rgb_to_iyuv	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert RGB to IYUV.

Parameters

[in]	input	Input RGB data buffer.
[out]	output	Output IYUV buffer.
[in]	win	Window for iterating the buffers.

Definition at line 1045 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in(input_ptr, win);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_u(output_ptr->plane(1), win_uv);
     Iterator out_v(output_ptr->plane(2), win_uv);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_rgb_top    = load_rgb(in.ptr(), alpha);
             const auto ta_rgb_bottom = load_rgb(in.ptr() + input_ptr->info()->strides_in_bytes().y(), alpha);
             //ta_rgb.val[0] = R0 R1 R2 R3 ...
             //ta_rgb.val[1] = G0 G1 G2 G3 ...
             //ta_rgb.val[2] = B0 B1 B2 B3 ...
  
             store_rgb_to_iyuv(ta_rgb_top.val[0], ta_rgb_top.val[1], ta_rgb_top.val[2], ta_rgb_bottom.val[0],
                               ta_rgb_bottom.val[1], ta_rgb_bottom.val[2], out_y.ptr(),
                               out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), out_u.ptr(),
                               out_v.ptr());
         },
         in, out_y, out_u, out_v);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_rgb_to_nv12()

void arm_compute::colorconvert_rgb_to_nv12	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert RGB to NV12.

Parameters

[in]	input	Input RGB data buffer.
[out]	output	Output NV12 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 1001 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     // UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in(input_ptr, win);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_uv(output_ptr->plane(1), win_uv);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_rgb_top    = load_rgb(in.ptr(), alpha);
             const auto ta_rgb_bottom = load_rgb(in.ptr() + input_ptr->info()->strides_in_bytes().y(), alpha);
             //ta_rgb.val[0] = R0 R1 R2 R3 ...
             //ta_rgb.val[1] = G0 G1 G2 G3 ...
             //ta_rgb.val[2] = B0 B1 B2 B3 ...
  
             store_rgb_to_nv12(ta_rgb_top.val[0], ta_rgb_top.val[1], ta_rgb_top.val[2], ta_rgb_bottom.val[0],
                               ta_rgb_bottom.val[1], ta_rgb_bottom.val[2], out_y.ptr(),
                               out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), out_uv.ptr());
         },
         in, out_y, out_uv);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_rgb_to_rgbx()

void arm_compute::colorconvert_rgb_to_rgbx	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert RGB to RGBX.

Parameters

[in]	input	Input RGB data buffer.
[out]	output	Output RGBX buffer.
[in]	win	Window for iterating the buffers.

Definition at line 344 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IImage *__restrict>(output);
  
     Iterator in(input_ptr, win);
     Iterator out(output_ptr, win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto   ta1 = vld3q_u8(in.ptr());
             uint8x16x4_t ta2;
             ta2.val[0] = ta1.val[0];
             ta2.val[1] = ta1.val[1];
             ta2.val[2] = ta1.val[2];
             ta2.val[3] = vdupq_n_u8(255);
             vst4q_u8(out.ptr(), ta2);
         },
         in, out);
 }

References ARM_COMPUTE_ERROR_ON, execute_window_loop(), arm_compute::test::validation::input, and Iterator::ptr().

◆ colorconvert_rgb_to_u8()

void arm_compute::colorconvert_rgb_to_u8	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert RGB to U8.

Parameters

[in]	input	Input RGB data buffer.
[out]	output	Output U8 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 377 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IImage *__restrict>(output);
  
     Iterator in(input_ptr, win);
     Iterator out(output_ptr, win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta1 = vld3q_u8(in.ptr());
             uint8x16_t ta2;
             rgb_to_u8_conversion(ta1, ta2);
             vst1q_u8(out.ptr(), ta2);
         },
         in, out);
 }

References ARM_COMPUTE_ERROR_ON, execute_window_loop(), arm_compute::test::validation::input, and Iterator::ptr().

◆ colorconvert_rgb_to_yuv4()

void arm_compute::colorconvert_rgb_to_yuv4	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert RGB to YUV4.

Parameters

[in]	input	Input RGB data buffer.
[out]	output	Output YUV4 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 1091 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     Iterator in(input_ptr, win);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_u(output_ptr->plane(1), win);
     Iterator out_v(output_ptr->plane(2), win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_rgb = load_rgb(in.ptr(), alpha);
             //ta_rgb.val[0] = R0 R1 R2 R3 ...
             //ta_rgb.val[1] = G0 G1 G2 G3 ...
             //ta_rgb.val[2] = B0 B1 B2 B3 ...
  
             store_rgb_to_yuv4(ta_rgb.val[0], ta_rgb.val[1], ta_rgb.val[2], out_y.ptr(), out_u.ptr(), out_v.ptr());
         },
         in, out_y, out_u, out_v);
 }

References ARM_COMPUTE_ERROR_ON, execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), and Window::validate().

◆ colorconvert_rgbx_to_rgb()

void arm_compute::colorconvert_rgbx_to_rgb	(	const void *	input,
		void *	output,
		const Window &	win
	)

Convert RGBX to RGB.

Parameters

[in]	input	Input RGBX data buffer.
[out]	output	Output RGB buffer.
[in]	win	Window for iterating the buffers.

Definition at line 407 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IImage *__restrict>(output);
  
     Iterator in(input_ptr, win);
     Iterator out(output_ptr, win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto   ta1 = vld4q_u8(in.ptr());
             uint8x16x3_t ta2;
             ta2.val[0] = ta1.val[0];
             ta2.val[1] = ta1.val[1];
             ta2.val[2] = ta1.val[2];
             vst3q_u8(out.ptr(), ta2);
         },
         in, out);
 }

References ARM_COMPUTE_ERROR_ON, execute_window_loop(), arm_compute::test::validation::input, and Iterator::ptr().

◆ colorconvert_yuyv_to_iyuv()

void arm_compute::colorconvert_yuyv_to_iyuv	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert YUYV to IYUV.

Parameters

[in]	input	Input YUYV data buffer.
[out]	output	Output IYUV buffer.
[in]	win	Window for iterating the buffers.

Definition at line 818 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     constexpr auto shift = yuyv ? 0 : 1;
  
     // Destination's UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in(input_ptr, win);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_u(output_ptr->plane(1), win_uv);
     Iterator out_v(output_ptr->plane(2), win_uv);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_top    = vld4q_u8(in.ptr());
             const auto ta_bottom = vld4q_u8(in.ptr() + input_ptr->info()->strides_in_bytes().y());
             //ta.val[0] = Y0 Y2 Y4 Y6 ...
             //ta.val[1] = U0 U2 U4 U6 ...
             //ta.val[2] = Y1 Y3 Y5 Y7 ...
             //ta.val[3] = V0 V2 V4 V7 ...
  
             uint8x16x2_t yvec;
             yvec.val[0] = ta_top.val[0 + shift];
             yvec.val[1] = ta_top.val[2 + shift];
             vst2q_u8(out_y.ptr(), yvec);
  
             uint8x16x2_t yyvec;
             yyvec.val[0] = ta_bottom.val[0 + shift];
             yyvec.val[1] = ta_bottom.val[2 + shift];
             vst2q_u8(out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), yyvec);
  
             uint8x16_t uvec;
             uvec = vhaddq_u8(ta_top.val[1 - shift], ta_bottom.val[1 - shift]);
             vst1q_u8(out_u.ptr(), uvec);
  
             uint8x16_t vvec;
             vvec = vhaddq_u8(ta_top.val[3 - shift], ta_bottom.val[3 - shift]);
             vst1q_u8(out_v.ptr(), vvec);
         },
         in, out_y, out_u, out_v);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_yuyv_to_nv12()

void arm_compute::colorconvert_yuyv_to_nv12	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert YUYV to NV12.

Parameters

[in]	input	Input YUYV data buffer.
[out]	output	Output NV12 buffer.
[in]	win	Window for iterating the buffers.

Definition at line 660 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
     win.validate();
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IMultiImage *__restrict>(output);
  
     constexpr auto shift = yuyv ? 0 : 1;
  
     // NV12's UV's width and height are subsampled
     Window win_uv(win);
     win_uv.set(Window::DimX, Window::Dimension(win_uv.x().start() / 2, win_uv.x().end() / 2, win_uv.x().step() / 2));
     win_uv.set(Window::DimY, Window::Dimension(win_uv.y().start() / 2, win_uv.y().end() / 2, 1));
     win_uv.validate();
  
     Iterator in(input_ptr, win);
     Iterator out_y(output_ptr->plane(0), win);
     Iterator out_uv(output_ptr->plane(1), win_uv);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta_top    = vld4q_u8(in.ptr());
             const auto ta_bottom = vld4q_u8(in.ptr() + input_ptr->info()->strides_in_bytes().y());
             //ta.val[0] = Y0 Y2 Y4 Y6 ...
             //ta.val[1] = U0 U2 U4 U6 ...
             //ta.val[2] = Y1 Y3 Y5 Y7 ...
             //ta.val[3] = V0 V2 V4 V7 ...
  
             uint8x16x2_t yvec;
             yvec.val[0] = ta_top.val[0 + shift];
             yvec.val[1] = ta_top.val[2 + shift];
             vst2q_u8(out_y.ptr(), yvec);
  
             uint8x16x2_t yyvec;
             yyvec.val[0] = ta_bottom.val[0 + shift];
             yyvec.val[1] = ta_bottom.val[2 + shift];
             vst2q_u8(out_y.ptr() + output_ptr->plane(0)->info()->strides_in_bytes().y(), yyvec);
  
             uint8x16x2_t uvvec;
             uvvec.val[0] = vhaddq_u8(ta_top.val[1 - shift], ta_bottom.val[1 - shift]);
             uvvec.val[1] = vhaddq_u8(ta_top.val[3 - shift], ta_bottom.val[3 - shift]);
             vst2q_u8(out_uv.ptr(), uvvec);
         },
         in, out_y, out_uv);
 }

References ARM_COMPUTE_ERROR_ON, Window::DimX, Window::DimY, Window::Dimension::end(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), Window::set(), Window::Dimension::start(), Window::Dimension::step(), Window::validate(), Window::x(), and Window::y().

◆ colorconvert_yuyv_to_rgb()

void arm_compute::colorconvert_yuyv_to_rgb	(	const void *__restrict	input,
		void *__restrict	output,
		const Window &	win
	)

Convert YUYV to RGB.

Parameters

[in]	input	Input YUYV data buffer.
[out]	output	Output RGB buffer.
[in]	win	Window for iterating the buffers.

Definition at line 440 of file NEColorConvertHelper.inl.

 {
     ARM_COMPUTE_ERROR_ON(nullptr == input);
     ARM_COMPUTE_ERROR_ON(nullptr == output);
  
     const auto input_ptr  = static_cast<const IImage *__restrict>(input);
     const auto output_ptr = static_cast<IImage *__restrict>(output);
  
     constexpr auto element_size = alpha ? 32 : 24;
     constexpr auto shift        = yuyv ? 0 : 1;
  
     Iterator in(input_ptr, win);
     Iterator out(output_ptr, win);
  
     execute_window_loop(
         win,
         [&](const Coordinates &)
         {
             const auto ta = vld4q_u8(in.ptr());
             //ta.val[0] = Y0 Y2 Y4 Y6 ...
             //ta.val[1] = U0 U2 U4 U6 ...
             //ta.val[2] = Y1 Y3 Y5 Y7 ...
             //ta.val[3] = V0 V2 V4 V7 ...
  
             // Convert the uint8x16x4_t to float32x4x4_t
             const float32x4x4_t yvec  = arm_compute::convert_uint8x16_to_float32x4x4(ta.val[0 + shift]);
             const float32x4x4_t uvec  = arm_compute::convert_uint8x16_to_float32x4x4(ta.val[1 - shift]);
             const float32x4x4_t yyvec = arm_compute::convert_uint8x16_to_float32x4x4(ta.val[2 + shift]);
             const float32x4x4_t vvec  = arm_compute::convert_uint8x16_to_float32x4x4(ta.val[3 - shift]);
  
             yuyv_to_rgb_calculation(yvec.val[0], uvec.val[0], yyvec.val[0], vvec.val[0], out.ptr() + 0 * element_size,
                                     alpha);
             yuyv_to_rgb_calculation(yvec.val[1], uvec.val[1], yyvec.val[1], vvec.val[1], out.ptr() + 1 * element_size,
                                     alpha);
             yuyv_to_rgb_calculation(yvec.val[2], uvec.val[2], yyvec.val[2], vvec.val[2], out.ptr() + 2 * element_size,
                                     alpha);
             yuyv_to_rgb_calculation(yvec.val[3], uvec.val[3], yyvec.val[3], vvec.val[3], out.ptr() + 3 * element_size,
                                     alpha);
         },
         in, out);
 }

References ARM_COMPUTE_ERROR_ON, convert_uint8x16_to_float32x4x4(), execute_window_loop(), arm_compute::test::validation::input, Iterator::ptr(), and arm_compute::test::colorconvert_helper::detail::yuyv_to_rgb_calculation().

◆ command_buffer_mutable_dispatch_supported()

bool command_buffer_mutable_dispatch_supported ( const cl::Device & device )

Check whether cl_khr_command_buffer_mutable_dispatch extension is supported by the specified CL device.

Parameters

[in] device The CL device

Returns: True if the extension is supported by the CL device.

Definition at line 509 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_khr_command_buffer_mutable_dispatch");
 }

References device_supports_extension().

Referenced by CLCommandBuffer::create().

◆ command_buffer_supported()

bool command_buffer_supported ( const cl::Device & device )

Check whether cl_khr_command_buffer extension is supported by the specified CL device.

Parameters

[in] device The CL device

Returns: True if the extension is supported by the CL device.

Definition at line 504 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_khr_command_buffer");
 }

References device_supports_extension().

◆ compute_requantization_scale_offset()

UniformQuantizationInfo arm_compute::compute_requantization_scale_offset	(	const UniformQuantizationInfo &	uqinfo_in,
		const UniformQuantizationInfo &	uqinfo_out
	)

inline

Definition at line 569 of file QuantizationInfo.h.

 {
     float   scale_to_apply  = uqinfo_out.scale;
     int32_t offset_to_apply = uqinfo_out.offset;
  
     scale_to_apply /= uqinfo_in.scale;
     // In order to minimize flooring we convert the offset to a float,
     // then compute the new offset in the float domain,
     // finally we convert it back as int32_t
     offset_to_apply -= static_cast<int32_t>(static_cast<float>(uqinfo_in.offset) * uqinfo_in.scale / uqinfo_out.scale);
     return UniformQuantizationInfo(scale_to_apply, offset_to_apply);
 }

References UniformQuantizationInfo::offset, and UniformQuantizationInfo::scale.

Referenced by CLROIPoolingLayerKernel::configure().

◆ compute_strides() [1/2]

Strides arm_compute::compute_strides ( const ITensorInfo & info )

inline

Create a strides object based on the tensor dimensions.

Parameters

[in] info Tensor info object used to compute the strides.

Returns: Strides object based on element size and tensor shape.

Definition at line 63 of file Utils.h.

 {
     return compute_strides(info, info.element_size());
 }

References compute_strides(), and arm_compute::test::validation::info.

◆ compute_strides() [2/2]

Strides arm_compute::compute_strides	(	const ITensorInfo &	info,
		T	stride_x,
		Ts &&...	fixed_strides
	)

inline

Create a strides object based on the provided strides and the tensor dimensions.

Parameters

[in]	info	Tensor info object providing the shape of the tensor for unspecified strides.
[in]	stride_x	Stride to be used in X dimension (in bytes).
[in]	fixed_strides	Strides to be used in higher dimensions starting at Y (in bytes).

Returns: Strides object based on the specified strides. Missing strides are calculated based on the tensor shape and the strides of lower dimensions.

Definition at line 41 of file Utils.h.

 {
     const TensorShape &shape = info.tensor_shape();
  
     // Create strides object
     Strides strides(stride_x, fixed_strides...);
  
     for (size_t i = 1 + sizeof...(Ts); i < info.num_dimensions(); ++i)
     {
         strides.set(i, shape[i - 1] * strides[i - 1]);
     }
  
     return strides;
 }

References arm_compute::test::validation::info, Dimensions< T >::set(), and arm_compute::test::validation::shape.

Referenced by compute_strides(), and TensorInfo::set_tensor_shape().

◆ compute_winograd_convolution_tiles()

Size2D arm_compute::compute_winograd_convolution_tiles	(	const Size2D &	in_dims,
		const Size2D &	kernel_size,
		const Size2D &	output_tile_size,
		const PadStrideInfo &	conv_info
	)

inline

Calculate the number of output tiles required by Winograd Convolution layer.

This utility function can be used by the Winograd input transform to know the number of tiles on the x and y direction

Parameters

[in]	in_dims	Spatial dimensions of the input tensor of convolution layer
[in]	kernel_size	Kernel size
[in]	output_tile_size	Size of a single output tile
[in]	conv_info	Convolution info (i.e. pad, stride,...)

Returns: the number of output tiles along the x and y directions of size "output_tile_size"

Definition at line 250 of file Helpers.h.

 {
     int num_tiles_x =
         std::ceil((in_dims.width - (kernel_size.width - 1) + conv_info.pad_left() + conv_info.pad_right()) /
                   static_cast<float>(output_tile_size.width));
     int num_tiles_y =
         std::ceil((in_dims.height - (kernel_size.height - 1) + conv_info.pad_top() + conv_info.pad_bottom()) /
                   static_cast<float>(output_tile_size.height));
  
     // Clamp in case we provide paddings but we have 1D convolution
     num_tiles_x = std::min(num_tiles_x, static_cast<int>(in_dims.width));
     num_tiles_y = std::min(num_tiles_y, static_cast<int>(in_dims.height));
  
     return Size2D(num_tiles_x, num_tiles_y);
 }

References arm_compute::test::validation::conv_info, Size2D::height, and Size2D::width.

Referenced by arm_compute::misc::shape_calculator::compute_winograd_input_transform_shape(), ClWinogradInputTransformKernel::configure(), ClWinogradOutputTransformKernel::configure(), arm_compute::test::validation::reference::winograd_input_transform(), and arm_compute::test::validation::reference::winograd_output_transform().

◆ convert_float32x4x3_to_uint8x8x3()

void convert_float32x4x3_to_uint8x8x3	(	const float32x4x3_t &	in1,
		const float32x4x3_t &	in2,
		uint8x8x3_t &	out
	)

inline

Converts from two float32x4x3_t to just one uint8x8x3_t.

Parameters

[in]	in1	First input vector of float to be converted
[in]	in2	Second input vector of float to be converted
[out]	out	Converted output vector uint8 to store the result

Definition at line 488 of file NEMath.inl.

 {
     out.val[0] = vqmovn_u16(vcombine_u16(vqmovn_u32(vcvtq_u32_f32(in1.val[0])), vqmovn_u32(vcvtq_u32_f32(in2.val[0]))));
     out.val[1] = vqmovn_u16(vcombine_u16(vqmovn_u32(vcvtq_u32_f32(in1.val[1])), vqmovn_u32(vcvtq_u32_f32(in2.val[1]))));
     out.val[2] = vqmovn_u16(vcombine_u16(vqmovn_u32(vcvtq_u32_f32(in1.val[2])), vqmovn_u32(vcvtq_u32_f32(in2.val[2]))));
 }

◆ convert_float32x4x4_to_int8x16()

void convert_float32x4x4_to_int8x16	(	const float32x4x4_t &	in,
		int8x16_t &	out
	)

inline

Converts from float32x4x4_t to just one int8x16_t.

Parameters

[in]	in	Vector of float to be converted
[out]	out	Converted vector of uint8 to store the result

Definition at line 502 of file NEMath.inl.

 {
     const auto low  = vcombine_s16(vqmovn_s32(vcvtq_s32_f32(in.val[0])), vqmovn_s32(vcvtq_s32_f32(in.val[1])));
     const auto high = vcombine_s16(vqmovn_s32(vcvtq_s32_f32(in.val[2])), vqmovn_s32(vcvtq_s32_f32(in.val[3])));
     out             = vcombine_s8(vqmovn_s16(low), vqmovn_s16(high));
 }

Referenced by convert_float_to_int< float32x4x4_t, int8x16_t >().

◆ convert_float32x4x4_to_uint8x16()

void convert_float32x4x4_to_uint8x16	(	const float32x4x4_t &	in,
		uint8x16_t &	out
	)

inline

Converts from two float32x4x4_t to just one uint8x16_t.

Parameters

[in]	in	Vector of float to be converted
[out]	out	Converted vector of uint8 to store the result

Definition at line 495 of file NEMath.inl.

 {
     const auto low  = vcombine_u16(vqmovn_u32(vcvtq_u32_f32(in.val[0])), vqmovn_u32(vcvtq_u32_f32(in.val[1])));
     const auto high = vcombine_u16(vqmovn_u32(vcvtq_u32_f32(in.val[2])), vqmovn_u32(vcvtq_u32_f32(in.val[3])));
     out             = vcombine_u8(vqmovn_u16(low), vqmovn_u16(high));
 }

Referenced by convert_float_to_int< float32x4x4_t, uint8x16_t >().

◆ convert_float_to_int()

int_vec_type arm_compute::convert_float_to_int ( const float_vec_type & in )

Converts from float vector to integer vector.

Parameters

[in] in Float vector to converted

Returns: The converted integer vector

◆ convert_float_to_int< float32x4x4_t, int8x16_t >()

int8x16_t arm_compute::convert_float_to_int< float32x4x4_t, int8x16_t > ( const float32x4x4_t & in )

inline

Definition at line 524 of file NEMath.inl.

 {
     int8x16_t out;
     convert_float32x4x4_to_int8x16(in, out);
     return out;
 }

References convert_float32x4x4_to_int8x16().

◆ convert_float_to_int< float32x4x4_t, uint8x16_t >()

uint8x16_t arm_compute::convert_float_to_int< float32x4x4_t, uint8x16_t > ( const float32x4x4_t & in )

inline

Definition at line 510 of file NEMath.inl.

 {
     uint8x16_t out;
     convert_float32x4x4_to_uint8x16(in, out);
     return out;
 }

References convert_float32x4x4_to_uint8x16().

◆ convert_int8x16_to_float32x4x4()

float32x4x4_t convert_int8x16_to_float32x4x4 ( const int8x16_t & in )

inline

Converts from int8x16 to float32x4x4_t.

Parameters

[in] in Vector of int8 to be converted

Returns: Converted vector of float

Definition at line 462 of file NEMath.inl.

 {
     float32x4x4_t out;
  
     const auto tmp1 = vmovl_s8(vget_low_s8(in));
     out.val[0]      = vcvtq_f32_s32(vmovl_s16(vget_low_s16(tmp1)));
     out.val[1]      = vcvtq_f32_s32(vmovl_s16(vget_high_s16(tmp1)));
  
     const auto tmp2 = vmovl_s8(vget_high_s8(in));
     out.val[2]      = vcvtq_f32_s32(vmovl_s16(vget_low_s16(tmp2)));
     out.val[3]      = vcvtq_f32_s32(vmovl_s16(vget_high_s16(tmp2)));
     return out;
 }

Referenced by convert_int_to_float< float32x4x4_t, int8x16_t >(), and convert_to_float32x4x4().

◆ convert_int_to_float()

float_vec_type arm_compute::convert_int_to_float ( const int_vec_type & in )

Converts from integer vector to float vector.

Parameters

[in] in Integer vector to converted

Returns: The converted float vector

◆ convert_int_to_float< float32x4x4_t, int8x16_t >()

float32x4x4_t arm_compute::convert_int_to_float< float32x4x4_t, int8x16_t > ( const int8x16_t & in )

inline

Definition at line 532 of file NEMath.inl.

 {
     return convert_int8x16_to_float32x4x4(in);
 }

References convert_int8x16_to_float32x4x4().

◆ convert_int_to_float< float32x4x4_t, uint8x16_t >()

float32x4x4_t arm_compute::convert_int_to_float< float32x4x4_t, uint8x16_t > ( const uint8x16_t & in )

inline

Definition at line 518 of file NEMath.inl.

 {
     return convert_uint8x16_to_float32x4x4(in);
 }

References convert_uint8x16_to_float32x4x4().

◆ convert_negative_axis()

Coordinates& arm_compute::convert_negative_axis	(	Coordinates &	coords,
		int	max_value
	)

inline

Convert negative coordinates to positive in the range [0, num_dims_input].

Parameters

[out]	coords	Array of coordinates to be converted.
[in]	max_value	Maximum value to be used when wrapping the negative values in coords

Definition at line 287 of file Helpers.h.

 {
     for (unsigned int i = 0; i < coords.num_dimensions(); ++i)
     {
         coords[i] = wrap_around(coords[i], max_value);
     }
     return coords;
 }

References Dimensions< T >::num_dimensions(), and wrap_around().

Referenced by arm_compute::misc::shape_calculator::calculate_reduce_mean_shape(), NEReduceMean::configure(), and CLReduceMean::configure().

◆ convert_to_float32x4x4() [1/3]

float32x4x4_t arm_compute::convert_to_float32x4x4 ( const int8x16_t & in )

inline

Definition at line 483 of file NEMath.inl.

 {
     return convert_int8x16_to_float32x4x4(in);
 }

References convert_int8x16_to_float32x4x4().

◆ convert_to_float32x4x4() [2/3]

float32x4x4_t arm_compute::convert_to_float32x4x4 ( const T & in )

Converts to float32x4x4_t from the specified templated 16 elements vectors.

Parameters

[in] in Vector of float to be converted

Returns: Converted vector of float

◆ convert_to_float32x4x4() [3/3]

float32x4x4_t arm_compute::convert_to_float32x4x4 ( const uint8x16_t & in )

inline

Definition at line 477 of file NEMath.inl.

 {
     return convert_uint8x16_to_float32x4x4(in);
 }

References convert_uint8x16_to_float32x4x4().

◆ convert_uint8x16_to_float32x4x4()

float32x4x4_t convert_uint8x16_to_float32x4x4 ( const uint8x16_t & in )

inline

Converts from uint8x16 to float32x4x4_t.

Parameters

[in] in Vector of uint8 to be converted

Returns: Converted vector of float

Definition at line 448 of file NEMath.inl.

 {
     float32x4x4_t out;
  
     const auto tmp1 = vmovl_u8(vget_low_u8(in));
     out.val[0]      = vcvtq_f32_u32(vmovl_u16(vget_low_u16(tmp1)));
     out.val[1]      = vcvtq_f32_u32(vmovl_u16(vget_high_u16(tmp1)));
  
     const auto tmp2 = vmovl_u8(vget_high_u8(in));
     out.val[2]      = vcvtq_f32_u32(vmovl_u16(vget_low_u16(tmp2)));
     out.val[3]      = vcvtq_f32_u32(vmovl_u16(vget_high_u16(tmp2)));
     return out;
 }

Referenced by colorconvert_iyuv_to_rgb(), colorconvert_nv12_to_rgb(), colorconvert_yuyv_to_rgb(), convert_int_to_float< float32x4x4_t, uint8x16_t >(), and convert_to_float32x4x4().

◆ convert_window_coord_to_position()

Coordinates arm_compute::convert_window_coord_to_position	(	const Window &	w,
		const Coordinates &	offset
	)

inline

Convert an offset in window steps into absolute coordinates.

Parameters

[in]	w	Window `offset` is related to.
[in]	offset	Offset inside the window expressed in number of window steps.

Returns: Absolute coordinates.

Definition at line 40 of file WindowIterator.h.

 {
     Coordinates position;
     for (unsigned int i = 0; i < Coordinates::num_max_dimensions; ++i)
     {
         position.set(i, w[i].start() + offset[i] * w[i].step());
     }
     return position;
 }

References Dimensions< int >::num_max_dimensions, offset(), Dimensions< T >::set(), arm_compute::cpu::step, and arm_compute::test::validation::w.

◆ coords2index()

int coords2index	(	const TensorShape &	shape,
		const Coordinates &	coord
	)

inline

Convert n-dimensional coordinates into a linear index.

Parameters

[in]	shape	Shape of the n-dimensional tensor.
[in]	coord	N-dimensional coordinates.

Returns: linead index

Definition at line 183 of file Helpers.inl.

 {
     int num_elements = shape.total_size();
     ARM_COMPUTE_UNUSED(num_elements);
     ARM_COMPUTE_ERROR_ON_MSG(num_elements == 0, "Cannot create linear index from empty shape!");
  
     int index  = 0;
     int stride = 1;
  
     for (unsigned int d = 0; d < coord.num_dimensions(); ++d)
     {
         index += coord[d] * stride;
         stride *= shape[d];
     }
  
     return index;
 }

References ARM_COMPUTE_ERROR_ON_MSG, ARM_COMPUTE_UNUSED, Dimensions< T >::num_dimensions(), and arm_compute::test::validation::shape.

Referenced by arm_compute::test::validation::reference::convert_fully_connected_weights(), arm_compute::test::validation::reference::gather(), and arm_compute::test::validation::reference::winograd_input_transform().

◆ cpu_impl_dt()

std::string arm_compute::cpu_impl_dt ( const DataType & data_type )

inline

Returns the suffix string of CPU kernel implementation names based on the given data type.

Parameters

[in] data_type The data type the CPU kernel implemetation uses

Returns: the suffix string of CPU kernel implementations

Definition at line 487 of file DataTypeUtils.h.

 {
     std::string ret = "";
  
     switch (data_type)
     {
         case DataType::F32:
             ret = "fp32";
             break;
         case DataType::F16:
             ret = "fp16";
             break;
         case DataType::U8:
             ret = "u8";
             break;
         case DataType::S16:
             ret = "s16";
             break;
         case DataType::S32:
             ret = "s32";
             break;
         case DataType::QASYMM8:
             ret = "qu8";
             break;
         case DataType::QASYMM8_SIGNED:
             ret = "qs8";
             break;
         case DataType::QSYMM16:
             ret = "qs16";
             break;
         case DataType::QSYMM8_PER_CHANNEL:
             ret = "qp8";
             break;
         case DataType::BFLOAT16:
             ret = "bf16";
             break;
         default:
             ARM_COMPUTE_ERROR("Unsupported.");
     }
  
     return ret;
 }

References ARM_COMPUTE_ERROR, BFLOAT16, arm_compute::test::validation::data_type, F16, F32, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8_PER_CHANNEL, S16, S32, and U8.

◆ create_error()

Status create_error	(	ErrorCode	error_code,
		std::string	msg
	)

Creates an error containing the error message.

Parameters

[in]	error_code	Error code
[in]	msg	Message to display before abandoning.

Returns: status containing the error

Definition at line 34 of file Error.cpp.

 {
     return Status(error_code, msg);
 }

◆ create_error_msg()

Status create_error_msg	(	ErrorCode	error_code,
		const char *	func,
		const char *	file,
		int	line,
		const char *	msg
	)

Creates an error and the error message.

Parameters

[in]	error_code	Error code
[in]	func	Function in which the error occurred.
[in]	file	File in which the error occurred.
[in]	line	Line in which the error occurred.
[in]	msg	Message to display before abandoning.

Returns: status containing the error

Definition at line 40 of file Error.cpp.

 {
     std::array<char, 512> out{0};
     snprintf(out.data(), out.size(), "in %s %s:%d: %s", func, file, line, msg);
     return Status(error_code, std::string(out.data()));
 }

References arm_compute::support::cpp11::snprintf().

◆ create_image2d_from_buffer()

cl::Image2D create_image2d_from_buffer	(	const cl::Context &	ctx,
		const cl::Buffer &	buffer,
		const TensorShape &	shape2d,
		DataType	data_type,
		size_t	image_row_pitch,
		CLImage2DType	image_type
	)

Create a cl::Image2D object from an OpenCL buffer.

Note

The following conditions are required to create a OpenCL image object from OpenCL buffer,

The platform should support the OpenCL cl_khr_image2d_from_buffer extension
The stride Y for the input1 should satisfy the OpenCL pitch alignment requirement
input width should be less or equal to (CL_DEVICE_IMAGE2D_MAX_WIDTH * 4)
input height should be less or equal to CL_DEVICE_IMAGE2D_MAX_HEIGHT

It is user responsibility to ensure the above conditions are satisfied since no checks are performed within this function

Parameters

[in]	ctx	cl::Context object
[in]	buffer	cl::Buffer object from which the OpenCL image2d object is created
[in]	shape2d	2D tensor shape
[in]	data_type	DataType to use. Only supported: F32,F16
[in]	image_row_pitch	Image row pitch (a.k.a. stride Y) to be used in the image2d object
[in]	image_type	Image 2D type (CLImage2DType)

Returns: cl::Image2D object

Definition at line 62 of file CLUtils.cpp.

 {
     ARM_COMPUTE_ERROR_ON_MSG(!image2d_from_buffer_supported(CLKernelLibrary::get().get_device()),
                              "The extension cl_khr_image2d_from_buffer is not supported on the target platform");
     ARM_COMPUTE_ERROR_ON_MSG(get_cl_image_pitch_alignment(CLKernelLibrary::get().get_device()) == 0,
                              "Impossible to retrieve the cl_image pitch alignment");
     ARM_COMPUTE_ERROR_ON_MSG(buffer.get() == nullptr, "Cannot create cl_image from empty cl_buffer");
  
     cl_channel_type cl_data_type;
  
     switch (data_type)
     {
         case DataType::F32:
             cl_data_type = CL_FLOAT;
             break;
         case DataType::F16:
             cl_data_type = CL_HALF_FLOAT;
             break;
         default:
             ARM_COMPUTE_ERROR("Data type not support with OpenCL image2d");
     }
  
     cl_mem cl_image;
     cl_int err = CL_SUCCESS;
  
     const cl_image_format format = {CL_RGBA, cl_data_type};
  
     cl_image_desc desc;
     memset(&desc, 0, sizeof(desc));
     desc.image_type      = CL_MEM_OBJECT_IMAGE2D;
     desc.mem_object      = buffer();
     desc.image_row_pitch = image_row_pitch;
     desc.image_width     = shape2d[0];
     desc.image_height    = shape2d[1];
  
     switch (image_type)
     {
         case CLImage2DType::ReadOnly:
             cl_image = clCreateImage(ctx(), CL_MEM_READ_ONLY, &format, &desc, nullptr, &err);
             break;
         case CLImage2DType::WriteOnly:
             cl_image = clCreateImage(ctx(), CL_MEM_WRITE_ONLY, &format, &desc, nullptr, &err);
             break;
         default:
             ARM_COMPUTE_ERROR("Unsupported CLImage2DType");
     }
  
     ARM_COMPUTE_UNUSED(err);
     ARM_COMPUTE_ERROR_ON_MSG(err != CL_SUCCESS, "Error during the creation of CL image from buffer");
  
     return cl::Image2D(cl_image);
 }

References ARM_COMPUTE_ERROR, ARM_COMPUTE_ERROR_ON_MSG, ARM_COMPUTE_UNUSED, clCreateImage(), arm_compute::test::validation::data_type, F16, F32, update_supported_ops::format, CLKernelLibrary::get(), get_cl_image_pitch_alignment(), image2d_from_buffer_supported(), ReadOnly, and WriteOnly.

Referenced by create_image2d_from_tensor(), CLDepthwiseConvolutionLayerNativeKernel::run(), ClMatMulNativeKernel::run_op(), ClIndirectConv2dKernel::run_op(), ClGemmMatrixMultiplyReshapedOnlyRhsMMULKernel::run_op(), ClGemmMatrixMultiplyReshapedOnlyRhsKernel::run_op(), and ClGemmMatrixMultiplyReshapedKernel::run_op().

◆ create_image2d_from_tensor()

cl::Image2D create_image2d_from_tensor	(	const ICLTensor *	tensor,
		CLImage2DType	image_type
	)

Create a cl::Image2D object from a tensor.

Parameters

[in]	tensor	Tensor from which to construct Image 2D object
[in]	image_type	Image 2D type (CLImage2DType)

Returns: cl::Image2D object

Definition at line 37 of file CLUtils.cpp.

 {
     ARM_COMPUTE_ERROR_ON_NULLPTR(tensor);
  
     const cl::Context &ctx    = CLKernelLibrary::get().context();
     const cl::Buffer  &buffer = tensor->cl_buffer();
     const ITensorInfo *info   = tensor->info();
     ARM_COMPUTE_ERROR_ON_MSG(info->lock_paddings(), "Tensor paddings must not be locked to allow extending paddings to "
                                                     "satisfy cl_image pitch alignment requirement");
  
     const size_t image_w{info->dimension(0) / 4};
     const size_t image_h{info->tensor_shape().total_size() / info->dimension(0)};
     const size_t max_image_w{CLKernelLibrary::get().get_device().getInfo<CL_DEVICE_IMAGE2D_MAX_WIDTH>()};
     const size_t max_image_h{CLKernelLibrary::get().get_device().getInfo<CL_DEVICE_IMAGE2D_MAX_HEIGHT>()};
  
     ARM_COMPUTE_UNUSED(max_image_w, max_image_h);
     ARM_COMPUTE_ERROR_ON_MSG(image_w > max_image_w, "Image width exceeds maximum width for exporting to cl_image");
     ARM_COMPUTE_ERROR_ON_MSG(image_h > max_image_h, "Image height exceeds maximum height for exporting to cl_image");
  
     const TensorShape shape2d(image_w, image_h);
     const size_t      image_row_pitch = info->strides_in_bytes()[1];
  
     return create_image2d_from_buffer(ctx, buffer, shape2d, info->data_type(), image_row_pitch, image_type);
 }

References ARM_COMPUTE_ERROR_ON_MSG, ARM_COMPUTE_ERROR_ON_NULLPTR, ARM_COMPUTE_UNUSED, CLTensor::cl_buffer(), CLKernelLibrary::context(), create_image2d_from_buffer(), CLKernelLibrary::get(), CLKernelLibrary::get_device(), CLTensor::info(), arm_compute::test::validation::info, and tensor.

Referenced by ClDirectConv2dKernel::run_op().

◆ create_kernel()

cl::Kernel create_kernel	(	const CLCompileContext &	ctx,
		const std::string &	kernel_name,
		const std::set< std::string > &	build_opts = `std::set<std::string>()`
	)

Creates an opencl kernel using a compile context.

Parameters

[in]	ctx	A compile context to be used to create the opencl kernel.
[in]	kernel_name	The kernel name.
[in]	build_opts	The build options to be used for the opencl kernel compilation.

Returns: An opencl kernel

Definition at line 409 of file CLHelpers.cpp.

 {
     opencl::ClKernelLibrary &klib = opencl::ClKernelLibrary::get();
  
     const std::string program_name = klib.program_name(kernel_name);
     auto              kernel_src   = klib.program(program_name);
     const std::string kernel_path  = klib.kernel_path();
  
     return static_cast<cl::Kernel>(ctx.create_kernel(kernel_name, program_name, kernel_src.program, kernel_path,
                                                      build_opts, kernel_src.is_binary));
 }

References CLCompileContext::create_kernel(), ClKernelLibrary::get(), kernel_name, ClKernelLibrary::kernel_path(), ClKernelLibrary::program(), and ClKernelLibrary::program_name().

◆ create_lws_hint_parallel_implementations()

cl::NDRange create_lws_hint_parallel_implementations	(	unsigned int	input_dimension,
		unsigned int	vector_size
	)

Creates a suitable LWS hint object for parallel implementations.

Sets the number of WG based on the input size. If input width is smaller than 128 we can use fewer threads than 8.

Parameters

[in]	input_dimension	number of elements along the dimension to apply the parallellization
[in]	vector_size	size of the vector in OpenCL

Returns: An LWS hint object

Definition at line 421 of file CLHelpers.cpp.

 {
     const unsigned int width_leftover = input_dimension % vector_size;
     const unsigned int border_width   = (width_leftover != 0) ? vector_size - width_leftover : 0;
     const unsigned int num_of_threads = ((input_dimension + border_width) / 16);
     return cl::NDRange(std::min(8U, num_of_threads));
 }

References arm_compute::utils::cast::U, and arm_compute::cpu::vector_size.

◆ create_opencl_context_and_device()

std::tuple< cl::Context, cl::Device, cl_int > create_opencl_context_and_device ( CLBackendType cl_backend_type )

This function creates an OpenCL context and a device.

Note: In debug builds, the function will automatically enable cl_arm_printf if the driver/device supports it.

Parameters

[in] cl_backend_type The OpenCL backend type to use.

Returns: A std::tuple where the first element is the opencl context, the second element is the opencl device and the third one an error code. The error code will be CL_SUCCESS upon successful creation, otherwise a value telling why the function failed.

Definition at line 121 of file CLHelpers.cpp.

 {
     ARM_COMPUTE_ERROR_ON(!opencl_is_available());
     cl::Platform            p = select_preferable_platform(cl_backend_type);
     cl::Device              device;
     std::vector<cl::Device> platform_devices;
     p.getDevices(CL_DEVICE_TYPE_DEFAULT, &platform_devices);
     ARM_COMPUTE_ERROR_ON_MSG(platform_devices.size() == 0, "Couldn't find any OpenCL device");
     device                                          = platform_devices[0];
     cl_int                               err        = CL_SUCCESS;
     std::array<cl_context_properties, 7> properties = {0, 0, 0, 0, 0, 0, 0};
     initialise_context_properties(p, device, properties);
     cl::Context cl_context = cl::Context(device, properties.data(), nullptr, nullptr, &err);
     ARM_COMPUTE_ERROR_ON_MSG(err != CL_SUCCESS, "Failed to create OpenCL context");
     return std::make_tuple(cl_context, device, err);
 }

References ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_ERROR_ON_MSG, opencl_is_available(), and select_preferable_platform().

Referenced by CLRuntimeContext::CLRuntimeContext(), CLScheduler::default_init(), and main().

◆ create_window_iterator()

WindowIterator<L> arm_compute::create_window_iterator	(	const Window &	w,
		const Coordinates &	start,
		const Coordinates &	end,
		L &&	lambda_function
	)

Create a WindowIterator object.

Parameters

[in]	w	Window to use for the iteration
[in]	start	Where to start iterating from (In Window coordinates)
[in]	end	Where to stop iterating (In Window coordinates).
[in]	lambda_function	Lambda function to call for every iteration between start and end. (It will be called last for end - 1)

Returns: A WindowIterator object.

Definition at line 313 of file WindowIterator.h.

 {
     return WindowIterator<L>(w, start, end, std::move(lambda_function));
 }

References arm_compute::mlgo::parser::end(), and arm_compute::test::validation::w.

◆ data_layout_from_name()

arm_compute::DataLayout data_layout_from_name ( const std::string & name )

Converts a string to a strong types enumeration DataLayout.

Parameters

[in] name String to convert

Returns: Converted DataLayout enumeration

Definition at line 32 of file TypeLoader.cpp.

 {
     static const std::map<std::string, arm_compute::DataLayout> data_layouts = {
         {"nhwc", DataLayout::NHWC},
         {"nchw", DataLayout::NCHW},
     };
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     try
     {
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
         return data_layouts.at(arm_compute::utility::tolower(name));
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     }
     catch (const std::out_of_range &)
     {
         throw std::invalid_argument(name);
     }
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
 }

References name, NCHW, NHWC, and arm_compute::utility::tolower().

Referenced by operator>>().

◆ data_size_from_type()

size_t arm_compute::data_size_from_type ( DataType data_type )

inline

The size in bytes of the data type.

Parameters

[in] data_type Input data type

Returns: The size in bytes of the data type

Definition at line 38 of file DataTypeUtils.h.

 {
     switch (data_type)
     {
         case DataType::U8:
         case DataType::S8:
         case DataType::QSYMM8:
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
             return 1;
         case DataType::U16:
         case DataType::S16:
         case DataType::QSYMM16:
         case DataType::QASYMM16:
         case DataType::BFLOAT16:
         case DataType::F16:
             return 2;
         case DataType::F32:
         case DataType::U32:
         case DataType::S32:
             return 4;
         case DataType::F64:
         case DataType::U64:
         case DataType::S64:
             return 8;
         case DataType::SIZET:
             return sizeof(size_t);
         default:
             ARM_COMPUTE_ERROR("Invalid data type");
             return 0;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, arm_compute::test::validation::data_type, F16, F32, F64, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, SIZET, U16, U32, U64, and U8.

◆ data_type_from_format()

DataType arm_compute::data_type_from_format ( Format format )

inline

Return the data type used by a given single-planar pixel format.

Parameters

[in] format Input format

Returns: The size in bytes of the pixel format

Definition at line 115 of file DataTypeUtils.h.

 {
     switch (format)
     {
         case Format::U8:
         case Format::UV88:
         case Format::RGB888:
         case Format::RGBA8888:
         case Format::YUYV422:
         case Format::UYVY422:
             return DataType::U8;
         case Format::U16:
             return DataType::U16;
         case Format::S16:
             return DataType::S16;
         case Format::U32:
             return DataType::U32;
         case Format::S32:
             return DataType::S32;
         case Format::BFLOAT16:
             return DataType::BFLOAT16;
         case Format::F16:
             return DataType::F16;
         case Format::F32:
             return DataType::F32;
         //Doesn't make sense for planar formats:
         case Format::NV12:
         case Format::NV21:
         case Format::IYUV:
         case Format::YUV444:
         default:
             ARM_COMPUTE_ERROR("Not supported data_type for given format");
             return DataType::UNKNOWN;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U16, U32, U8, UNKNOWN, UV88, UYVY422, YUV444, and YUYV422.

Referenced by SimpleTensor< uint8_t >::data_type(), TensorInfo::init(), TensorInfo::init_auto_padding(), and TensorInfo::set_format().

◆ data_type_from_name()

DataType data_type_from_name ( const std::string & name )

Convert a string to DataType.

Parameters

[in] name The name of the data type

Returns: DataType

Definition at line 58 of file DataTypeUtils.cpp.

 {
     static const std::map<std::string, DataType> data_types = {
         {"f16", DataType::F16},
         {"f32", DataType::F32},
         {"qasymm8", DataType::QASYMM8},
         {"qasymm8_signed", DataType::QASYMM8_SIGNED},
     };
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     try
     {
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
         return data_types.at(utility::tolower(name));
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     }
     catch (const std::out_of_range &)
     {
         ARM_COMPUTE_ERROR_VAR("Invalid data type name: %s", name.c_str());
     }
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
 }

References ARM_COMPUTE_ERROR_VAR, F16, F32, name, QASYMM8, QASYMM8_SIGNED, and arm_compute::utility::tolower().

Referenced by operator>>().

◆ deconvolution_output_dimensions()

std::pair< unsigned int, unsigned int > deconvolution_output_dimensions	(	unsigned int	in_width,
		unsigned int	in_height,
		unsigned int	kernel_width,
		unsigned int	kernel_height,
		const PadStrideInfo &	pad_stride_info
	)

Returns expected width and height of the deconvolution's output tensor.

Parameters

[in]	in_width	Width of input tensor (Number of columns)
[in]	in_height	Height of input tensor (Number of rows)
[in]	kernel_width	Kernel width.
[in]	kernel_height	Kernel height.
[in]	pad_stride_info	Pad and stride information.

Returns: A pair with the new width in the first position and the new height in the second.

Definition at line 266 of file Utils.cpp.

 {
     const unsigned int pad_left   = pad_stride_info.pad_left();
     const unsigned int pad_top    = pad_stride_info.pad_top();
     const unsigned int pad_right  = pad_stride_info.pad_right();
     const unsigned int pad_bottom = pad_stride_info.pad_bottom();
     const unsigned int stride_x   = pad_stride_info.stride().first;
     const unsigned int stride_y   = pad_stride_info.stride().second;
  
     ARM_COMPUTE_ERROR_ON(in_width < 1 || in_height < 1);
     ARM_COMPUTE_ERROR_ON(((in_width - 1) * stride_x + kernel_width) < (pad_left + pad_right));
     ARM_COMPUTE_ERROR_ON(((in_height - 1) * stride_y + kernel_height) < (pad_top + pad_bottom));
     const int w = stride_x * (in_width - 1) + kernel_width - (pad_left + pad_right);
     const int h = stride_y * (in_height - 1) + kernel_height - (pad_top + pad_bottom);
  
     return std::make_pair<unsigned int, unsigned int>(w, h);
 }

References ARM_COMPUTE_ERROR_ON, PadStrideInfo::pad_bottom(), PadStrideInfo::pad_left(), PadStrideInfo::pad_right(), PadStrideInfo::pad_top(), PadStrideInfo::stride(), and arm_compute::test::validation::w.

Referenced by DeconvolutionLayerNode::compute_output_descriptor(), ClTransposedConvolutionKernel::configure(), NEDeconvolutionLayer::configure(), CLDirectDeconvolutionLayer::configure(), NEDeconvolutionLayer::validate(), CLGEMMDeconvolutionLayer::validate(), and CLDirectDeconvolutionLayer::validate().

◆ dequantize() [1/4]

float arm_compute::dequantize	(	int16_t	value,
		float	scale
	)

inline

Dequantize a value given a 16-bit symmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	scale	Scale to use for dequantization

Returns: Dequantized value

Definition at line 415 of file QuantizationInfo.h.

 {
     return value * scale;
 }

References arm_compute::test::validation::scale.

◆ dequantize() [2/4]

float arm_compute::dequantize	(	int8_t	value,
		float	scale
	)

inline

Dequantize a value given a 8-bit symmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	scale	Scale to use for dequantization

Returns: Dequantized value

Definition at line 403 of file QuantizationInfo.h.

 {
     return value * scale;
 }

References arm_compute::test::validation::scale.

◆ dequantize() [3/4]

float arm_compute::dequantize	(	uint16_t	value,
		float	scale,
		int32_t	offset
	)

inline

Dequantize a value given a 16-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	scale	Scale to use for dequantization
[in]	offset	Zero-offset to use for dequantization

Returns: Dequantized value

Definition at line 428 of file QuantizationInfo.h.

 {
     return (static_cast<int>(value) - offset) * scale;
 }

References offset(), and arm_compute::test::validation::scale.

◆ dequantize() [4/4]

float arm_compute::dequantize	(	uint8_t	value,
		float	scale,
		int32_t	offset
	)

inline

Dequantize a value given an 8-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	scale	Scale to use for dequantization
[in]	offset	Zero-offset to use for dequantization

Returns: Dequantized value

Definition at line 379 of file QuantizationInfo.h.

 {
     return (static_cast<int>(value) - offset) * scale;
 }

References offset(), and arm_compute::test::validation::scale.

◆ dequantize_qasymm16() [1/2]

float arm_compute::dequantize_qasymm16	(	uint16_t	value,
		const QuantizationInfo &	qinfo
	)

inline

Dequantize a value given a 16-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 534 of file QuantizationInfo.h.

 {
     return dequantize_qasymm16(value, qinfo.uniform());
 }

References dequantize_qasymm16(), arm_compute::test::validation::qinfo, and QuantizationInfo::uniform().

◆ dequantize_qasymm16() [2/2]

float arm_compute::dequantize_qasymm16	(	uint16_t	value,
		const UniformQuantizationInfo &	qinfo
	)

inline

Dequantize a value given a 16-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 510 of file QuantizationInfo.h.

 {
     return (static_cast<int>(value) - qinfo.offset) * qinfo.scale;
 }

References QuantizationInfo::offset(), arm_compute::test::validation::qinfo, and QuantizationInfo::scale().

Referenced by arm_compute::cpu::bounding_box_transform_qsymm16(), arm_compute::test::validation::convert_from_asymmetric(), dequantize_qasymm16(), and arm_compute::cpu::roi_align().

◆ dequantize_qasymm8()

float arm_compute::dequantize_qasymm8	(	uint8_t	value,
		const INFO_TYPE &	qinfo
	)

inline

Dequantize a value given an unsigned 8-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 353 of file QuantizationInfo.h.

 {
     return Qasymm8QuantizationHelper<uint8_t>::dequantize(value, qinfo);
 }

References Qasymm8QuantizationHelper< QUANTIZED_TYPE >::dequantize(), and arm_compute::test::validation::qinfo.

Referenced by arm_compute::cpu::bounding_box_transform_qsymm16(), check_value_range(), arm_compute::test::validation::convert_from_asymmetric(), arm_compute::scale_helpers::delta_bilinear_c1_quantized(), arm_compute::test::validation::reference::depthconcatenate_layer(), arm_compute::cpu::elementwise_op< uint8_t >(), arm_compute::cpu::elementwise_op_quantized(), arm_compute::cpu::neon_qasymm8_activation(), arm_compute::cpu::roi_align_1x1_qasymm8(), CPPDetectionPostProcessLayer::run(), CpuConcatenateHeightKernel::run_op(), CpuConcatenateWidthKernel::run_op(), and arm_compute::test::validation::reference::scale().

◆ dequantize_qasymm8_signed()

float arm_compute::dequantize_qasymm8_signed	(	int8_t	value,
		const INFO_TYPE &	qinfo
	)

inline

Dequantize a value given a signed 8-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 366 of file QuantizationInfo.h.

 {
     return Qasymm8QuantizationHelper<int8_t>::dequantize(value, qinfo);
 }

References Qasymm8QuantizationHelper< QUANTIZED_TYPE >::dequantize(), and arm_compute::test::validation::qinfo.

Referenced by arm_compute::test::validation::convert_from_asymmetric(), arm_compute::scale_helpers::delta_bilinear_c1_quantized(), arm_compute::cpu::elementwise_comp_quantized_signed(), arm_compute::cpu::elementwise_op< int8_t >(), arm_compute::cpu::elementwise_op_quantized_signed(), arm_compute::cpu::neon_qasymm8_signed_activation(), arm_compute::cpu::roi_align_1x1_qasymm8(), CPPDetectionPostProcessLayer::run(), CpuConcatenateWidthKernel::run_op(), CpuConcatenateHeightKernel::run_op(), and arm_compute::test::validation::reference::scale().

◆ dequantize_qsymm16() [1/2]

float arm_compute::dequantize_qsymm16	(	int16_t	value,
		const QuantizationInfo &	qinfo
	)

inline

Dequantize a value given a 16-bit symmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 481 of file QuantizationInfo.h.

 {
     return dequantize_qsymm16(value, qinfo.uniform());
 }

References dequantize_qsymm16(), arm_compute::test::validation::qinfo, and QuantizationInfo::uniform().

◆ dequantize_qsymm16() [2/2]

float arm_compute::dequantize_qsymm16	(	int16_t	value,
		const UniformQuantizationInfo &	qinfo
	)

inline

Dequantize a value given a 16-bit symmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 457 of file QuantizationInfo.h.

 {
     return value * qinfo.scale;
 }

References arm_compute::test::validation::qinfo, and QuantizationInfo::scale().

Referenced by arm_compute::cpu::compute_all_anchors_qasymm16(), arm_compute::test::validation::convert_from_symmetric(), dequantize_qsymm16(), and arm_compute::cpu::neon_qsymm16_activation().

◆ dequantize_qsymm8()

float arm_compute::dequantize_qsymm8	(	int8_t	value,
		const UniformQuantizationInfo &	qinfo
	)

inline

Dequantize a value given a 8-bit symmetric quantization scheme.

Parameters

[in]	value	Value to dequantize
[in]	qinfo	Quantization information to use for dequantizing

Returns: Dequantized value

Definition at line 391 of file QuantizationInfo.h.

 {
     return value * qinfo.scale;
 }

References arm_compute::test::validation::qinfo, and QuantizationInfo::scale().

◆ device_supports_extension()

bool device_supports_extension	(	const cl::Device &	device,
		const char *	extension_name
	)

Helper function to check whether a given extension is supported.

Parameters

[in]	device	A CL device
[in]	extension_name	Name of the extension to be checked

Returns: True if the extension is supported

Definition at line 285 of file CLHelpers.cpp.

 {
     std::string extensions = device.getInfo<CL_DEVICE_EXTENSIONS>();
     auto        pos        = extensions.find(extension_name);
     return (pos != std::string::npos);
 }

References check_header_guards::extensions.

Referenced by arm_matrix_multiply_supported(), arm_non_uniform_workgroup_supported(), command_buffer_mutable_dispatch_supported(), command_buffer_supported(), dot8_acc_supported(), dot8_supported(), fp16_supported(), image2d_from_buffer_supported(), and arm_compute::test::validation::TEST_CASE().

◆ DIV_CEIL()

constexpr auto arm_compute::DIV_CEIL	(	S	val,
		T	m
	)		-> decltype((val + m - 1) / m)

constexpr

Calculate the rounded up quotient of val / m.

Parameters

[in]	val	Value to divide and round up.
[in]	m	Value to divide by.

Returns: the result.

Definition at line 37 of file Math.h.

 {
     return (val + m - 1) / m;
 }

Referenced by ceil_to_multiple(), and arm_compute::misc::shape_calculator::compute_indirect_buffer_shape().

◆ dot8_acc_supported()

bool dot8_acc_supported ( const cl::Device & device )

Helper function to check whether the cl_arm_integer_dot_product_accumulate_int8 extension is supported.

Parameters

[in] device A CL device

Returns: True if the extension is supported

Definition at line 253 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_arm_integer_dot_product_accumulate_int8");
 }

References device_supports_extension().

◆ dot8_supported()

bool dot8_supported ( const cl::Device & device )

Helper function to check whether the cl_arm_integer_dot_product_int8 extension is supported.

Parameters

[in] device A CL device

Returns: True if the extension is supported

Definition at line 242 of file CLHelpers.cpp.

 {
     std::string     device_name = device.getInfo<CL_DEVICE_NAME>();
     const GPUTarget gpu_target  = get_target_from_name(device_name);
  
     // SW_WORKAROUND: Workaround for DDK revision r14p0.to enable cl_arm_integer_dot_product_int8
     std::set<GPUTarget> sw_workaround_issue = {GPUTarget::G76};
     return (device_supports_extension(device, "cl_arm_integer_dot_product_int8") ||
             sw_workaround_issue.count(gpu_target) != 0);
 }

References device_supports_extension(), G76, and get_target_from_name().

Referenced by ClGemmLowpMatrixMultiplyNativeKernel::configure(), ClGemmLowpMatrixMultiplyReshapedKernel::configure(), ClGemmLowpMatrixMultiplyReshapedOnlyRhsKernel::configure(), and ClGemmLowpMatrixAReductionKernel::configure().

◆ element_size_from_data_type()

size_t arm_compute::element_size_from_data_type ( DataType dt )

inline

The size in bytes of the data type.

Parameters

[in] dt Input data type

Returns: The size in bytes of the data type

Definition at line 78 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::S8:
         case DataType::U8:
         case DataType::QSYMM8:
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
             return 1;
         case DataType::U16:
         case DataType::S16:
         case DataType::QSYMM16:
         case DataType::QASYMM16:
         case DataType::BFLOAT16:
         case DataType::F16:
             return 2;
         case DataType::U32:
         case DataType::S32:
         case DataType::F32:
             return 4;
         case DataType::U64:
         case DataType::S64:
             return 8;
         default:
             ARM_COMPUTE_ERROR("Undefined element size for given data type");
             return 0;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, dt, F16, F32, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, U16, U32, U64, and U8.

Referenced by CLPadLayerKernel::configure(), arm_compute::test::validation::reference::depth_convert(), SimpleTensor< uint8_t >::element_size(), and arm_compute::test::validation::validate().

◆ enqueue()

void enqueue	(	cl::CommandQueue &	queue,
		ICLKernel &	kernel,
		const Window &	window,
		const cl::NDRange &	lws_hint = `CLKernelLibrary::get().default_ndrange()`,
		bool	use_dummy_work_items = `false`
	)

Add the kernel to the command queue with the given window.

Note: Depending on the size of the window, this might translate into several jobs being enqueued.; If kernel->kernel() is empty then the function will return without adding anything to the queue.

Parameters

[in,out]	queue	OpenCL command queue.
[in]	kernel	Kernel to enqueue
[in]	window	Window the kernel has to process.
[in]	lws_hint	(Optional) Local workgroup size requested. Default is based on the device target.
[in]	use_dummy_work_items	(Optional) Use dummy work items in order to have two dimensional power of two NDRange. Default is false Note: it is kernel responsibility to check if the work-item is out-of-range

Note: If any dimension of the lws is greater than the global workgroup size then no lws will be passed.

Definition at line 33 of file ICLKernel.cpp.

 {
     if (kernel.kernel()() == nullptr)
     {
         return;
     }
  
     for (unsigned int i = 0; i < Coordinates::num_max_dimensions; ++i)
     {
         ARM_COMPUTE_ERROR_ON(window[i].step() == 0);
         // Make sure that dimensions > Z are 1
         ARM_COMPUTE_ERROR_ON((i >= 3) && ((window[i].end() - window[i].start()) != 1));
     }
  
     cl::NDRange gws = ICLKernel::gws_from_window(window, use_dummy_work_items);
  
     // Check for empty NDRange
     if (gws.dimensions() == 0)
     {
         return;
     }
  
     kernel.cache_gws(gws);
  
     cl::NDRange valid_lws;
     if (lws_hint[0] * lws_hint[1] * lws_hint[2] > kernel.get_max_workgroup_size())
     {
         valid_lws = cl::NullRange;
     }
     else
     {
         valid_lws = lws_hint;
     }
  
     cl::NDRange lws = cl::NullRange;
  
     if ((valid_lws[0] <= gws[0]) && (valid_lws[1] <= gws[1]) && (valid_lws[2] <= gws[2]))
     {
         lws = valid_lws;
     }
  
     if (CLKernelLibrary::get().is_wbsm_supported())
     {
         set_wbsm(kernel.kernel(), kernel.wbsm_hint());
     }
     queue.enqueueNDRangeKernel(kernel.kernel(), cl::NullRange, gws, lws);
 }

References ARM_COMPUTE_ERROR_ON, ICLKernel::cache_gws(), arm_compute::mlgo::parser::end(), ICLKernel::get_max_workgroup_size(), ICLKernel::kernel(), set_wbsm(), arm_compute::cpu::step, and ICLKernel::wbsm_hint().

◆ error_on_channel_not_in()

arm_compute::Status arm_compute::error_on_channel_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		T	cn,
		T &&	channel,
		Ts &&...	channels
	)

inline

Return an error if the channel is not in channels.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	cn	Input channel
[in]	channel	First channel allowed.
[in]	channels	(Optional) Further allowed channels.

Returns: Status

Definition at line 1036 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(cn == Channel::UNKNOWN, function, file, line);
  
     const std::array<T, sizeof...(Ts)> channels_array{{std::forward<Ts>(channels)...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(channel != cn && std::none_of(channels_array.begin(), channels_array.end(),
                                                                   [&](const T &f) { return f == cn; }),
                                     function, file, line);
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, and UNKNOWN.

Referenced by error_on_channel_not_in_known_format().

◆ error_on_channel_not_in_known_format()

arm_compute::Status error_on_channel_not_in_known_format	(	const char *	function,
		const char *	file,
		const int	line,
		arm_compute::Format	fmt,
		arm_compute::Channel	cn
	)

Return an error if the channel is not in format.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	fmt	Input channel
[in]	cn	First channel allowed.

Returns: Status

Definition at line 127 of file Validate.cpp.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(fmt == arm_compute::Format::UNKNOWN, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(cn == arm_compute::Channel::UNKNOWN, function, file, line);
  
     switch (fmt)
     {
         case arm_compute::Format::RGB888:
             arm_compute::error_on_channel_not_in(function, file, line, cn, arm_compute::Channel::R,
                                                  arm_compute::Channel::G, arm_compute::Channel::B);
             break;
         case arm_compute::Format::RGBA8888:
             arm_compute::error_on_channel_not_in(function, file, line, cn, arm_compute::Channel::R,
                                                  arm_compute::Channel::G, arm_compute::Channel::B,
                                                  arm_compute::Channel::A);
             break;
         case arm_compute::Format::UV88:
             arm_compute::error_on_channel_not_in(function, file, line, cn, arm_compute::Channel::U,
                                                  arm_compute::Channel::V);
             break;
         case arm_compute::Format::IYUV:
         case arm_compute::Format::UYVY422:
         case arm_compute::Format::YUYV422:
         case arm_compute::Format::NV12:
         case arm_compute::Format::NV21:
         case arm_compute::Format::YUV444:
             arm_compute::error_on_channel_not_in(function, file, line, cn, arm_compute::Channel::Y,
                                                  arm_compute::Channel::U, arm_compute::Channel::V);
             break;
         default:
             ARM_COMPUTE_ERROR_LOC(function, file, line, "Not supported format.");
     }
     return arm_compute::Status{};
 }

References A, ARM_COMPUTE_ERROR_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC, B, error_on_channel_not_in(), G, IYUV, NV12, NV21, R, RGB888, RGBA8888, U, UNKNOWN, UV88, UYVY422, V, Y, YUV444, and YUYV422.

◆ error_on_coordinates_dimensions_gte()

arm_compute::Status error_on_coordinates_dimensions_gte	(	const char *	function,
		const char *	file,
		const int	line,
		const Coordinates &	pos,
		unsigned int	max_dim
	)

Return an error if the passed coordinates have too many dimensions.

The coordinates have too many dimensions if any of the dimensions greater or equal to max_dim is different from 0.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	pos	Coordinates to validate
[in]	max_dim	Maximum number of dimensions allowed.

Returns: Status

Definition at line 80 of file Validate.cpp.

 {
     for (unsigned int i = max_dim; i < arm_compute::Coordinates::num_max_dimensions; ++i)
     {
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(pos[i] != 0, function, file, line);
     }
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, and Dimensions< int >::num_max_dimensions.

◆ error_on_data_layout_not_in() [1/2]

arm_compute::Status arm_compute::error_on_data_layout_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor,
		T &&	dl,
		Ts &&...	dls
	)

inline

Return an error if the data layout of the passed tensor does not match any of the data layout provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.
[in]	dl	First data layout allowed.
[in]	dls	(Optional) Further allowed data layouts.

Returns: Status

Definition at line 880 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_data_layout_not_in(
         function, file, line, tensor->info(), std::forward<T>(dl), std::forward<Ts>(dls)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, dl, error_on_data_layout_not_in(), CLTensor::info(), and tensor.

◆ error_on_data_layout_not_in() [2/2]

arm_compute::Status arm_compute::error_on_data_layout_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info,
		T &&	dl,
		Ts &&...	dls
	)

inline

Return an error if the data layout of the passed tensor info does not match any of the data layouts provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	Tensor info to validate.
[in]	dl	First data layout allowed.
[in]	dls	(Optional) Further allowed data layouts.

Returns: Status

Definition at line 853 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
  
     const DataLayout &tensor_dl = tensor_info->data_layout(); //NOLINT
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_dl == DataLayout::UNKNOWN, function, file, line);
  
     const std::array<T, sizeof...(Ts)> dls_array{{std::forward<Ts>(dls)...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR(
         tensor_dl != dl && std::none_of(dls_array.begin(), dls_array.end(), [&](const T &l) { return l == tensor_dl; }),
         function, file, line, "ITensor data layout %s not supported by this kernel",
         string_from_data_layout(tensor_dl).c_str());
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR, TensorInfo::data_layout(), dl, string_from_data_layout(), tensor_info, and UNKNOWN.

Referenced by error_on_data_layout_not_in().

◆ error_on_data_type_channel_not_in() [1/2]

arm_compute::Status arm_compute::error_on_data_type_channel_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor,
		size_t	num_channels,
		T &&	dt,
		Ts &&...	dts
	)

inline

Return an error if the data type or the number of channels of the passed tensor does not match any of the data types and number of channels provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.
[in]	num_channels	Number of channels to check
[in]	dt	First data type allowed.
[in]	dts	(Optional) Further allowed data types.

Returns: Status

Definition at line 936 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(error_on_data_type_channel_not_in(function, file, line, tensor->info(), num_channels,
                                                                   std::forward<T>(dt), std::forward<Ts>(dts)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, dt, error_on_data_type_channel_not_in(), CLTensor::info(), and tensor.

◆ error_on_data_type_channel_not_in() [2/2]

arm_compute::Status arm_compute::error_on_data_type_channel_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info,
		size_t	num_channels,
		T &&	dt,
		Ts &&...	dts
	)

inline

Return an error if the data type or the number of channels of the passed tensor info does not match any of the data types and number of channels provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	Tensor info to validate.
[in]	num_channels	Number of channels to check
[in]	dt	First data type allowed.
[in]	dts	(Optional) Further allowed data types.

Returns: Status

Definition at line 907 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_data_type_not_in(
         function, file, line, tensor_info, std::forward<T>(dt), std::forward<Ts>(dts)...));
     const size_t tensor_nc = tensor_info->num_channels();
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR(tensor_nc != num_channels, function, file, line,
                                             "Number of channels %zu. Required number of channels %zu", tensor_nc,
                                             num_channels);
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR, ARM_COMPUTE_RETURN_ON_ERROR, dt, error_on_data_type_not_in(), TensorInfo::num_channels(), and tensor_info.

Referenced by error_on_data_type_channel_not_in().

◆ error_on_data_type_not_in() [1/2]

arm_compute::Status arm_compute::error_on_data_type_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor,
		T &&	dt,
		Ts &&...	dts
	)

inline

Return an error if the data type of the passed tensor does not match any of the data types provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.
[in]	dt	First data type allowed.
[in]	dts	(Optional) Further allowed data types.

Returns: Status

Definition at line 828 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_data_type_not_in(
         function, file, line, tensor->info(), std::forward<T>(dt), std::forward<Ts>(dts)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, dt, error_on_data_type_not_in(), CLTensor::info(), and tensor.

◆ error_on_data_type_not_in() [2/2]

arm_compute::Status arm_compute::error_on_data_type_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info,
		T &&	dt,
		Ts &&...	dts
	)

inline

Return an error if the data type of the passed tensor info does not match any of the data types provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	Tensor info to validate.
[in]	dt	First data type allowed.
[in]	dts	(Optional) Further allowed data types.

Returns: Status

Definition at line 801 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
  
     const DataType &tensor_dt = tensor_info->data_type(); //NOLINT
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_dt == DataType::UNKNOWN, function, file, line);
  
     const std::array<T, sizeof...(Ts)> dts_array{{std::forward<Ts>(dts)...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR(
         tensor_dt != dt && std::none_of(dts_array.begin(), dts_array.end(), [&](const T &d) { return d == tensor_dt; }),
         function, file, line, "ITensor data type %s not supported by this kernel",
         string_from_data_type(tensor_dt).c_str());
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR, TensorInfo::data_type(), dt, string_from_data_type(), tensor_info, and UNKNOWN.

Referenced by error_on_data_type_channel_not_in(), and error_on_data_type_not_in().

◆ error_on_format_not_in()

void arm_compute::error_on_format_not_in	(	const char *	function,
		const char *	file,
		const int	line,
		const T *	object,
		F &&	format,
		Fs &&...	formats
	)

Throw an error if the format of the passed tensor/multi-image does not match any of the formats provided.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	object	Tensor/multi-image to validate.
[in]	format	First format allowed.
[in]	formats	(Optional) Further allowed formats.

Definition at line 767 of file Validate.h.

 {
     ARM_COMPUTE_ERROR_ON_LOC(object == nullptr, function, file, line);
  
     Format &&object_format = object->info()->format();
     ARM_COMPUTE_UNUSED(object_format);
  
     ARM_COMPUTE_ERROR_ON_LOC(object_format == Format::UNKNOWN, function, file, line);
  
     const std::array<F, sizeof...(Fs)> formats_array{{std::forward<Fs>(formats)...}};
     ARM_COMPUTE_UNUSED(formats_array);
  
     ARM_COMPUTE_ERROR_ON_LOC_MSG(
         object_format != format &&
             std::none_of(formats_array.begin(), formats_array.end(), [&](const F &f) { return f == object_format; }),
         function, file, line, "Format %s not supported by this kernel", string_from_format(object_format).c_str());
     ARM_COMPUTE_UNUSED(function, format, file, line);
 }

References ARM_COMPUTE_ERROR_ON_LOC, ARM_COMPUTE_ERROR_ON_LOC_MSG, ARM_COMPUTE_UNUSED, update_supported_ops::format, string_from_format(), and UNKNOWN.

◆ error_on_invalid_subtensor()

arm_compute::Status error_on_invalid_subtensor	(	const char *	function,
		const char *	file,
		const int	line,
		const TensorShape &	parent_shape,
		const Coordinates &	coords,
		const TensorShape &	shape
	)

Return an error if if the coordinates and shape of the subtensor are within the parent tensor.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	parent_shape	Parent tensor shape
[in]	coords	Coordinates inside the parent tensor where the first element of the subtensor is
[in]	shape	Shape of the subtensor

Returns: Status

Definition at line 174 of file Validate.cpp.

 {
     // Check dimensions
     for (unsigned int i = 0; i < TensorShape::num_max_dimensions; ++i)
     {
         const bool invalid_idx        = coords[i] >= static_cast<int>(parent_shape[i]);
         const bool out_of_bounds_size = coords[i] + static_cast<int>(shape[i]) > static_cast<int>(parent_shape[i]);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(invalid_idx || out_of_bounds_size, function, file, line);
     }
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, and arm_compute::test::validation::shape.

◆ error_on_invalid_subtensor_valid_region()

arm_compute::Status error_on_invalid_subtensor_valid_region	(	const char *	function,
		const char *	file,
		const int	line,
		const ValidRegion &	parent_valid_region,
		const ValidRegion &	valid_region
	)

Return an error if the valid region of a subtensor is not inside the valid region of the parent tensor.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	parent_valid_region	Parent valid region.
[in]	valid_region	Valid region of subtensor.

Returns: Status

Definition at line 191 of file Validate.cpp.

 {
     // Check valid regions
     for (unsigned int d = 0; d < TensorShape::num_max_dimensions; ++d)
     {
         ARM_COMPUTE_RETURN_ERROR_ON_LOC((parent_valid_region.anchor[d] > valid_region.anchor[d]), function, file, line);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(
             (parent_valid_region.anchor[d] + static_cast<int>(parent_valid_region.shape[d])) <
                 (valid_region.anchor[d] + static_cast<int>(valid_region.shape[d])),
             function, file, line);
     }
  
     return arm_compute::Status{};
 }

References ValidRegion::anchor, ARM_COMPUTE_RETURN_ERROR_ON_LOC, ValidRegion::shape, and arm_compute::test::validation::valid_region.

◆ error_on_invalid_subwindow()

arm_compute::Status error_on_invalid_subwindow	(	const char *	function,
		const char *	file,
		const int	line,
		const Window &	full,
		const Window &	sub
	)

Return an error if the passed subwindow is invalid.

The subwindow is invalid if:

It is not a valid window.
It is not fully contained inside the full window
The step for each of its dimension is not identical to the corresponding one of the full window.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	full	Full size window
[in]	sub	Sub-window to validate.

Returns: Status

Definition at line 44 of file Validate.cpp.

 {
     full.validate();
     sub.validate();
  
     for (size_t i = 0; i < arm_compute::Coordinates::num_max_dimensions; ++i)
     {
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[i].start() > sub[i].start(), function, file, line);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[i].end() < sub[i].end(), function, file, line);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[i].step() != sub[i].step(), function, file, line);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC((sub[i].start() - full[i].start()) % sub[i].step(), function, file, line);
     }
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, arm_compute::mlgo::parser::end(), Dimensions< int >::num_max_dimensions, arm_compute::cpu::step, and Window::validate().

◆ error_on_mismatching_data_layouts() [1/2]

arm_compute::Status arm_compute::error_on_mismatching_data_layouts	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor,
		Ts...	tensors
	)

inline

Return an error if the passed tensors have different data layouts.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	The first tensor to be compared.
[in]	tensors	(Optional) Further allowed tensors.

Returns: Status

Definition at line 613 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, std::forward<Ts>(tensors)...));
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_mismatching_data_layouts(
         function, file, line, tensor->info(), detail::get_tensor_info_t<ITensorInfo *>()(tensors)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, error_on_mismatching_data_layouts(), error_on_nullptr(), CLTensor::info(), and tensor.

◆ error_on_mismatching_data_layouts() [2/2]

arm_compute::Status arm_compute::error_on_mismatching_data_layouts	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info,
		Ts...	tensor_infos
	)

inline

Return an error if the passed tensor infos have different data layouts.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	The first tensor info to be compared.
[in]	tensor_infos	(Optional) Further allowed tensor infos.

Returns: Status

Definition at line 588 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, tensor_infos...));
  
     DataLayout                                         &&tensor_data_layout = tensor_info->data_layout();
     const std::array<const ITensorInfo *, sizeof...(Ts)> tensors_infos_array{{tensor_infos...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(std::any_of(tensors_infos_array.begin(), tensors_infos_array.end(),
                                                     [&](const ITensorInfo *tensor_info_obj)
                                                     { return tensor_info_obj->data_layout() != tensor_data_layout; }),
                                         function, file, line, "Tensors have different data layouts");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, ARM_COMPUTE_RETURN_ON_ERROR, TensorInfo::data_layout(), error_on_nullptr(), and tensor_info.

Referenced by error_on_mismatching_data_layouts().

◆ error_on_mismatching_data_types() [1/2]

arm_compute::Status arm_compute::error_on_mismatching_data_types	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor,
		Ts...	tensors
	)

inline

Return an error if the passed two tensors have different data types.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	The first tensor to be compared.
[in]	tensors	(Optional) Further allowed tensors.

Returns: Status

Definition at line 665 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, tensors...));
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_mismatching_data_types(
         function, file, line, tensor->info(), detail::get_tensor_info_t<ITensorInfo *>()(tensors)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, error_on_mismatching_data_types(), error_on_nullptr(), CLTensor::info(), and tensor.

◆ error_on_mismatching_data_types() [2/2]

arm_compute::Status arm_compute::error_on_mismatching_data_types	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info,
		Ts...	tensor_infos
	)

inline

Return an error if the passed two tensor infos have different data types.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	The first tensor info to be compared.
[in]	tensor_infos	(Optional) Further allowed tensor infos.

Returns: Status

Definition at line 640 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, tensor_infos...));
  
     DataType                                           &&tensor_data_type = tensor_info->data_type();
     const std::array<const ITensorInfo *, sizeof...(Ts)> tensors_infos_array{{tensor_infos...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(std::any_of(tensors_infos_array.begin(), tensors_infos_array.end(),
                                                     [&](const ITensorInfo *tensor_info_obj)
                                                     { return tensor_info_obj->data_type() != tensor_data_type; }),
                                         function, file, line, "Tensors have different data types");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, ARM_COMPUTE_RETURN_ON_ERROR, TensorInfo::data_type(), error_on_nullptr(), and tensor_info.

Referenced by error_on_mismatching_data_types().

◆ error_on_mismatching_dimensions()

arm_compute::Status arm_compute::error_on_mismatching_dimensions	(	const char *	function,
		const char *	file,
		int	line,
		const Dimensions< T > &	dim1,
		const Dimensions< T > &	dim2,
		Ts &&...	dims
	)

Return an error if the passed dimension objects differ.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	dim1	The first object to be compared.
[in]	dim2	The second object to be compared.
[in]	dims	(Optional) Further allowed objects.

Returns: Status

Definition at line 280 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ON_ERROR(detail::for_each_error(detail::compare_dimension<T>(dim1, function, file, line), dim2,
                                                        std::forward<Ts>(dims)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ON_ERROR, and arm_compute::detail::for_each_error().

◆ error_on_mismatching_quantization_info() [1/2]

arm_compute::Status arm_compute::error_on_mismatching_quantization_info	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor_1,
		const ITensor *	tensor_2,
		Ts...	tensors
	)

inline

Return an error if the passed tensor have different asymmetric quantized data types or different quantization info.

Note: : If the first tensor doesn't have asymmetric quantized data type, the function returns without throwing an error

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_1	The first tensor to be compared.
[in]	tensor_2	The second tensor to be compared.
[in]	tensors	(Optional) Further allowed tensors.

Returns: Status

Definition at line 738 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ON_ERROR(
         ::arm_compute::error_on_mismatching_quantization_info(function, file, line, tensor_1->info(), tensor_2->info(),
                                                               detail::get_tensor_info_t<ITensorInfo *>()(tensors)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ON_ERROR, error_on_mismatching_quantization_info(), and ITensor::info().

◆ error_on_mismatching_quantization_info() [2/2]

arm_compute::Status arm_compute::error_on_mismatching_quantization_info	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info_1,
		const ITensorInfo *	tensor_info_2,
		Ts...	tensor_infos
	)

inline

Return an error if the passed tensor infos have different asymmetric quantized data types or different quantization info.

Note: : If the first tensor info doesn't have asymmetric quantized data type, the function returns without throwing an error

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info_1	The first tensor info to be compared.
[in]	tensor_info_2	The second tensor info to be compared.
[in]	tensor_infos	(Optional) Further allowed tensor infos.

Returns: Status

Definition at line 695 of file Validate.h.

 {
     DataType             &&first_data_type         = tensor_info_1->data_type();
     const QuantizationInfo first_quantization_info = tensor_info_1->quantization_info();
  
     if (!is_data_type_quantized(first_data_type))
     {
         return arm_compute::Status{};
     }
  
     const std::array<const ITensorInfo *, 1 + sizeof...(Ts)> tensor_infos_array{
         {tensor_info_2, std::forward<Ts>(tensor_infos)...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(std::any_of(tensor_infos_array.begin(), tensor_infos_array.end(),
                                                     [&](const ITensorInfo *tensor_info)
                                                     { return tensor_info->data_type() != first_data_type; }),
                                         function, file, line, "Tensors have different asymmetric quantized data types");
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(
         std::any_of(tensor_infos_array.begin(), tensor_infos_array.end(),
                     [&](const ITensorInfo *tensor_info)
                     { return tensor_info->quantization_info() != first_quantization_info; }),
         function, file, line, "Tensors have different quantization information");
  
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, ITensorInfo::data_type(), is_data_type_quantized(), ITensorInfo::quantization_info(), and tensor_info.

Referenced by error_on_mismatching_quantization_info().

◆ error_on_mismatching_shapes() [1/4]

arm_compute::Status arm_compute::error_on_mismatching_shapes	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor_1,
		const ITensor *	tensor_2,
		Ts...	tensors
	)

inline

Return an error if the passed two tensors have different shapes from the given dimension.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_1	The first tensor to be compared.
[in]	tensor_2	The second tensor to be compared.
[in]	tensors	(Optional) Further allowed tensors.

Returns: Status

Definition at line 497 of file Validate.h.

 {
     return error_on_mismatching_shapes(function, file, line, 0U, tensor_1, tensor_2, std::forward<Ts>(tensors)...);
 }

References error_on_mismatching_shapes(), and arm_compute::utils::cast::U.

◆ error_on_mismatching_shapes() [2/4]

arm_compute::Status arm_compute::error_on_mismatching_shapes	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info_1,
		const ITensorInfo *	tensor_info_2,
		Ts...	tensor_infos
	)

inline

Return an error if the passed two tensor infos have different shapes from the given dimension.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info_1	The first tensor info to be compared.
[in]	tensor_info_2	The second tensor info to be compared.
[in]	tensor_infos	(Optional) Further allowed tensor infos.

Returns: Status

Definition at line 475 of file Validate.h.

 {
     return error_on_mismatching_shapes(function, file, line, 0U, tensor_info_1, tensor_info_2,
                                        std::forward<Ts>(tensor_infos)...);
 }

References arm_compute::utils::cast::U.

Referenced by error_on_mismatching_shapes().

◆ error_on_mismatching_shapes() [3/4]

arm_compute::Status arm_compute::error_on_mismatching_shapes	(	const char *	function,
		const char *	file,
		const int	line,
		unsigned int	upper_dim,
		const ITensor *	tensor_1,
		const ITensor *	tensor_2,
		Ts...	tensors
	)

inline

Return an error if the passed two tensors have different shapes from the given dimension.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	upper_dim	The dimension from which to check.
[in]	tensor_1	The first tensor to be compared.
[in]	tensor_2	The second tensor to be compared.
[in]	tensors	(Optional) Further allowed tensors.

Returns: Status

Definition at line 556 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_1 == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_2 == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, tensors...));
     ARM_COMPUTE_RETURN_ON_ERROR(
         ::arm_compute::error_on_mismatching_shapes(function, file, line, upper_dim, tensor_1->info(), tensor_2->info(),
                                                    detail::get_tensor_info_t<ITensorInfo *>()(tensors)...));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, error_on_mismatching_shapes(), error_on_nullptr(), and ITensor::info().

◆ error_on_mismatching_shapes() [4/4]

arm_compute::Status arm_compute::error_on_mismatching_shapes	(	const char *	function,
		const char *	file,
		const int	line,
		unsigned int	upper_dim,
		const ITensorInfo *	tensor_info_1,
		const ITensorInfo *	tensor_info_2,
		Ts...	tensor_infos
	)

inline

Return an error if the passed two tensors have different shapes from the given dimension.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	upper_dim	The dimension from which to check.
[in]	tensor_info_1	The first tensor info to be compared.
[in]	tensor_info_2	The second tensor info to be compared.
[in]	tensor_infos	(Optional) Further allowed tensor infos.

Returns: Status

Definition at line 519 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info_1 == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info_2 == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, tensor_infos...));
  
     const std::array<const ITensorInfo *, 2 + sizeof...(Ts)> tensors_info_array{
         {tensor_info_1, tensor_info_2, tensor_infos...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(std::any_of(std::next(tensors_info_array.cbegin()), tensors_info_array.cend(),
                                                     [&](const ITensorInfo *tensor_info)
                                                     {
                                                         return detail::have_different_dimensions(
                                                             (*tensors_info_array.cbegin())->tensor_shape(),
                                                             tensor_info->tensor_shape(), upper_dim);
                                                     }),
                                         function, file, line, "Tensors have different shapes");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, ARM_COMPUTE_RETURN_ON_ERROR, error_on_nullptr(), and tensor_info.

◆ error_on_mismatching_windows()

arm_compute::Status error_on_mismatching_windows	(	const char *	function,
		const char *	file,
		const int	line,
		const Window &	full,
		const Window &	win
	)

Return an error if the passed window is invalid.

The subwindow is invalid if:

It is not a valid window.
Its dimensions don't match the full window's ones
The step for each of its dimension is not identical to the corresponding one of the full window.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	full	Full size window
[in]	win	Window to validate.

Returns: Status

Definition at line 26 of file Validate.cpp.

 {
     full.validate();
     win.validate();
  
     for (size_t i = 0; i < arm_compute::Coordinates::num_max_dimensions; ++i)
     {
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[i].start() != win[i].start(), function, file, line);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[i].end() != win[i].end(), function, file, line);
         ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[i].step() != win[i].step(), function, file, line);
     }
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, arm_compute::mlgo::parser::end(), Dimensions< int >::num_max_dimensions, arm_compute::cpu::step, and Window::validate().

◆ error_on_nullptr()

arm_compute::Status arm_compute::error_on_nullptr	(	const char *	function,
		const char *	file,
		const int	line,
		Ts &&...	pointers
	)

inline

Create an error if one of the pointers is a nullptr.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	pointers	Pointers to check against nullptr.

Returns: Status

Definition at line 151 of file Validate.h.

 {
     const std::array<const void *, sizeof...(Ts)> pointers_array{{std::forward<Ts>(pointers)...}};
     bool                                          has_nullptr =
         std::any_of(pointers_array.begin(), pointers_array.end(), [&](const void *ptr) { return (ptr == nullptr); });
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(has_nullptr, function, file, line, "Nullptr object!");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG.

Referenced by error_on_mismatching_data_layouts(), error_on_mismatching_data_types(), error_on_mismatching_shapes(), error_on_tensors_not_even(), and error_on_tensors_not_subsampled().

◆ error_on_tensor_not_2d() [1/2]

arm_compute::Status error_on_tensor_not_2d	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor
	)

Return an error if the tensor is not 2D.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.

Returns: Status

Definition at line 102 of file Validate.cpp.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor->info() == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR(tensor->info()->num_dimensions() != 2, function, file, line,
                                             "Only 2D Tensors are supported by this kernel (%zu passed)",
                                             tensor->info()->num_dimensions());
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR, and tensor.

◆ error_on_tensor_not_2d() [2/2]

arm_compute::Status error_on_tensor_not_2d	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor
	)

Return an error if the tensor info is not 2D.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor info to validate.

Returns: Status

Definition at line 115 of file Validate.cpp.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR(tensor->num_dimensions() != 2, function, file, line,
                                             "Only 2D Tensors are supported by this kernel (%zu passed)",
                                             tensor->num_dimensions());
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR, and tensor.

◆ error_on_tensors_not_even()

arm_compute::Status arm_compute::error_on_tensors_not_even	(	const char *	function,
		const char *	file,
		int	line,
		const Format &	format,
		const ITensor *	tensor1,
		Ts...	tensors
	)

Return an error if the passed tensor objects are not even.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	format	Format to check if odd shape is allowed
[in]	tensor1	The first object to be compared for odd shape.
[in]	tensors	(Optional) Further allowed objects.

Returns: Status

Definition at line 369 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor1 == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, std::forward<Ts>(tensors)...));
     const std::array<const ITensor *, 1 + sizeof...(Ts)> tensors_info_array{{tensor1, std::forward<Ts>(tensors)...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(
         std::any_of(tensors_info_array.cbegin(), tensors_info_array.cend(),
                     [&](const ITensor *tensor)
                     {
                         const TensorShape correct_shape = adjust_odd_shape(tensor->info()->tensor_shape(), format);
                         return detail::have_different_dimensions(tensor->info()->tensor_shape(), correct_shape, 2);
                     }),
         function, file, line, "Tensor shape has odd dimensions");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, ARM_COMPUTE_RETURN_ON_ERROR, error_on_nullptr(), and tensor.

◆ error_on_tensors_not_subsampled()

arm_compute::Status arm_compute::error_on_tensors_not_subsampled	(	const char *	function,
		const char *	file,
		int	line,
		const Format &	format,
		const TensorShape &	shape,
		const ITensor *	tensor1,
		Ts...	tensors
	)

Return an error if the passed tensor objects are not sub-sampled.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	format	Format to check if sub-sampling allowed.
[in]	shape	The tensor shape to calculate sub-sampling from.
[in]	tensor1	The first object to be compared.
[in]	tensors	(Optional) Further allowed objects.

Returns: Status

Definition at line 436 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor1 == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_nullptr(function, file, line, std::forward<Ts>(tensors)...));
     const TensorShape                                    sub2_shape = calculate_subsampled_shape(shape, format);
     const std::array<const ITensor *, 1 + sizeof...(Ts)> tensors_info_array{{tensor1, std::forward<Ts>(tensors)...}};
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(
         std::any_of(tensors_info_array.cbegin(), tensors_info_array.cend(),
                     [&](const ITensor *tensor)
                     { return detail::have_different_dimensions(tensor->info()->tensor_shape(), sub2_shape, 2); }),
         function, file, line, "Tensor shape has mismatch dimensions for sub-sampling");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, ARM_COMPUTE_RETURN_ON_ERROR, calculate_subsampled_shape(), error_on_nullptr(), update_supported_ops::format, arm_compute::test::validation::shape, and tensor.

◆ error_on_unconfigured_kernel()

arm_compute::Status error_on_unconfigured_kernel	(	const char *	function,
		const char *	file,
		const int	line,
		const IKernel *	kernel
	)

Return an error if the kernel is not configured.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	kernel	Kernel to validate.

Returns: Status

Definition at line 163 of file Validate.cpp.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(kernel == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(!kernel->is_window_configured(), function, file, line,
                                         "This kernel hasn't been configured.");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, and IKernel::is_window_configured().

◆ error_on_unsupported_cpu_bf16() [1/2]

Status arm_compute::error_on_unsupported_cpu_bf16	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor
	)

inline

Return an error if the data type of the passed tensor is BFLOAT16 and BFLOAT16 support is not compiled in.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.

Returns: Status

Definition at line 107 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_unsupported_cpu_bf16(function, file, line, tensor->info()));
     return Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, error_on_unsupported_cpu_bf16(), CLTensor::info(), and tensor.

◆ error_on_unsupported_cpu_bf16() [2/2]

Status arm_compute::error_on_unsupported_cpu_bf16	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info
	)

inline

Return an error if the data type of the passed tensor info is BFLOAT16 and BFLOAT16 support is not compiled in.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	Tensor info to validate.

Returns: Status

Definition at line 66 of file Validate.h.

 {
     bool bf16_kernels_enabled = false;
 #if defined(ARM_COMPUTE_ENABLE_BF16)
     bf16_kernels_enabled = true;
 #endif /* defined(ARM_COMPUTE_ENABLE_BF16) */
  
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(
         (tensor_info->data_type() == DataType::BFLOAT16) && (!CPUInfo::get().has_bf16() || !bf16_kernels_enabled),
         function, file, line, "This CPU architecture does not support BFloat16 data type, you need v8.6 or above");
     return Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, BFLOAT16, TensorInfo::data_type(), CPUInfo::get(), CPUInfo::has_bf16(), and tensor_info.

Referenced by error_on_unsupported_cpu_bf16().

◆ error_on_unsupported_cpu_fp16() [1/2]

Status arm_compute::error_on_unsupported_cpu_fp16	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor
	)

inline

Return an error if the data type of the passed tensor is FP16 and FP16 support is not compiled in.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.

Returns: Status

Definition at line 90 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(::arm_compute::error_on_unsupported_cpu_fp16(function, file, line, tensor->info()));
     return Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, error_on_unsupported_cpu_fp16(), CLTensor::info(), and tensor.

◆ error_on_unsupported_cpu_fp16() [2/2]

Status arm_compute::error_on_unsupported_cpu_fp16	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info
	)

inline

Return an error if the data type of the passed tensor info is FP16 and FP16 support is not compiled in.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	Tensor info to validate.

Returns: Status

Definition at line 42 of file Validate.h.

 {
     bool fp16_kernels_enabled = false;
 #if defined(ARM_COMPUTE_ENABLE_FP16) && defined(ENABLE_FP16_KERNELS)
     fp16_kernels_enabled = true;
 #endif /* defined(ARM_COMPUTE_ENABLE_FP16) && defined(ENABLE_FP16_KERNELS) */
  
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG(
         (tensor_info->data_type() == DataType::F16) && (!CPUInfo::get().has_fp16() || !fp16_kernels_enabled), function,
         file, line, "This CPU architecture does not support F16 data type, you need v8.2 or above");
     return Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, TensorInfo::data_type(), F16, CPUInfo::get(), CPUInfo::has_fp16(), and tensor_info.

Referenced by error_on_unsupported_cpu_fp16().

◆ error_on_unsupported_fp16() [1/2]

arm_compute::Status arm_compute::error_on_unsupported_fp16	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensor *	tensor,
		bool	is_fp16_supported
	)

inline

Return an error if the data type of the passed tensor is FP16 and FP16 extension is not supported by the device.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor	Tensor to validate.
[in]	is_fp16_supported	Is fp16 supported by the device.

Returns: Status

Definition at line 985 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ON_ERROR(
         ::arm_compute::error_on_unsupported_fp16(function, file, line, tensor->info(), is_fp16_supported));
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ON_ERROR, error_on_unsupported_fp16(), CLTensor::info(), and tensor.

◆ error_on_unsupported_fp16() [2/2]

arm_compute::Status arm_compute::error_on_unsupported_fp16	(	const char *	function,
		const char *	file,
		const int	line,
		const ITensorInfo *	tensor_info,
		bool	is_fp16_supported
	)

inline

Return an error if the data type of the passed tensor info is FP16 and FP16 extension is not supported by the device.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	tensor_info	Tensor info to validate.
[in]	is_fp16_supported	Is fp16 supported by the device.

Returns: Status

Definition at line 966 of file Validate.h.

 {
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(tensor_info == nullptr, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG((tensor_info->data_type() == DataType::F16 && !is_fp16_supported), function,
                                         file, line, "FP16 not supported by the device");
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG, TensorInfo::data_type(), F16, and tensor_info.

Referenced by error_on_unsupported_fp16().

◆ error_on_unsupported_int64_base_atomics()

arm_compute::Status arm_compute::error_on_unsupported_int64_base_atomics	(	const char *	function,
		const char *	file,
		const int	line
	)

inline

Return an error if int64_base_atomics extension is not supported by the device.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.

Returns: Status

Definition at line 49 of file CLValidate.h.

 {
     if (!CLKernelLibrary::get().int64_base_atomics_supported())
     {
         return ARM_COMPUTE_CREATE_ERROR_LOC(arm_compute::ErrorCode::UNSUPPORTED_EXTENSION_USE, function, file, line,
                                             "Atomic functions are not supported");
     }
     return arm_compute::Status{};
 }

References ARM_COMPUTE_CREATE_ERROR_LOC, CLKernelLibrary::get(), and UNSUPPORTED_EXTENSION_USE.

◆ error_on_window_dimensions_gte()

arm_compute::Status error_on_window_dimensions_gte	(	const char *	function,
		const char *	file,
		const int	line,
		const Window &	win,
		unsigned int	max_dim
	)

Return an error if the passed window has too many dimensions.

The window has too many dimensions if any of the dimension greater or equal to max_dim is different from 0.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	win	Window to validate
[in]	max_dim	Maximum number of dimensions allowed.

Returns: Status

Definition at line 90 of file Validate.cpp.

 {
     for (unsigned int i = max_dim; i < arm_compute::Coordinates::num_max_dimensions; ++i)
     {
         ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR(
             (win[i].start() != 0) || (win[i].end() != win[i].step()), function, file, line,
             "Maximum number of dimensions expected %u but dimension %u is not empty", max_dim, i);
     }
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC_MSG_VAR, arm_compute::mlgo::parser::end(), Dimensions< int >::num_max_dimensions, and arm_compute::cpu::step.

◆ error_on_window_not_collapsable_at_dimension()

arm_compute::Status error_on_window_not_collapsable_at_dimension	(	const char *	function,
		const char *	file,
		const int	line,
		const Window &	full,
		const Window &	window,
		const int	dim
	)

Return an error if the window can't be collapsed at the given dimension.

The window cannot be collapsed if the given dimension not equal to the full window's dimension or not start from 0.

Parameters

[in]	function	Function in which the error occurred.
[in]	file	Name of the file where the error occurred.
[in]	line	Line on which the error occurred.
[in]	full	Full size window
[in]	window	Window to be collapsed.
[in]	dim	Dimension need to be checked.

Returns: Status

Definition at line 63 of file Validate.cpp.

 {
     full.validate();
     window.validate();
  
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(window[dim].start() != 0, function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(window[dim].start() != full[dim].start(), function, file, line);
     ARM_COMPUTE_RETURN_ERROR_ON_LOC(full[dim].end() != window[dim].end(), function, file, line);
  
     return arm_compute::Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON_LOC, arm_compute::mlgo::parser::end(), and Window::validate().

◆ execute_window_loop()

void execute_window_loop	(	const Window &	w,
		L &&	lambda_function,
		Ts &&...	iterators
	)

inline

Iterate through the passed window, automatically adjusting the iterators and calling the lambda_functino for each element.

It passes the x and y positions to the lambda_function for each iteration

Parameters

[in]	w	Window to iterate through.
[in]	lambda_function	The function of type void(function)( const Coordinates & id ) to call at each iteration. Where id represents the absolute coordinates of the item to process.
[in,out]	iterators	Tensor iterators which will be updated by this function before calling lambda_function.

Definition at line 74 of file Helpers.inl.

 {
     w.validate();
  
     for (unsigned int i = 0; i < Coordinates::num_max_dimensions; ++i)
     {
         ARM_COMPUTE_ERROR_ON(w[i].step() == 0);
     }
  
     Coordinates id;
     ForEachDimension<Coordinates::num_max_dimensions>::unroll(w, id, std::forward<L>(lambda_function),
                                                               std::forward<Ts>(iterators)...);
 }

References ARM_COMPUTE_ERROR_ON, Dimensions< int >::num_max_dimensions, arm_compute::cpu::step, and arm_compute::test::validation::w.

◆ export_to_cl_image()

bool export_to_cl_image ( const ITensorInfo * tensor )

Definition at line 449 of file CLHelpers.cpp.

 {
     if (tensor->tensor_shape()[0] % 4 != 0)
     {
         return false;
     }
  
     // If not floating point
     if (!is_data_type_float(tensor->data_type()))
     {
         return false;
     }
  
     // Check if the cl_khr_image2d_from_buffer extension is supported on the target platform
     if (!image2d_from_buffer_supported(CLKernelLibrary::get().get_device()))
     {
         return false;
     }
  
     // Check cl image pitch alignment
     if (get_cl_image_pitch_alignment(CLKernelLibrary::get().get_device()) == 0)
     {
         return false;
     }
  
     const size_t image_w     = tensor->tensor_shape()[0] / 4;
     const size_t image_h     = tensor->tensor_shape().total_size() / tensor->tensor_shape()[0];
     const size_t max_image_w = CLKernelLibrary::get().get_device().getInfo<CL_DEVICE_IMAGE2D_MAX_WIDTH>();
     const size_t max_image_h = CLKernelLibrary::get().get_device().getInfo<CL_DEVICE_IMAGE2D_MAX_HEIGHT>();
  
     if (image_w > max_image_w || image_h > max_image_h)
     {
         return false;
     }
  
     return true;
 }

References CLKernelLibrary::get(), get_cl_image_pitch_alignment(), CLKernelLibrary::get_device(), image2d_from_buffer_supported(), is_data_type_float(), and tensor.

Referenced by arm_compute::opencl::kernels::gemm::configure_lhs_rhs_info(), arm_compute::cl_dwc::use_cl_image_for_weights(), and ClComponentDepthwiseConv2d::validate().

◆ finalize_quantization() [1/4]

int8_t arm_compute::finalize_quantization	(	int32_t	in_value,
		int	result_fixedpoint_multiplier,
		int32_t	result_shift,
		int32_t	result_offset_after_shift_s32,
		int8_t	min_s8,
		int8_t	max_s8,
		bool	is_bounded_relu
	)

inline

Performs final quantization step on single element.

Parameters

[in]	in_value	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	result_offset_after_shift_s32	Result offset parameter
[in]	min_s8	Relu lower bound
[in]	max_s8	Relu upper bound
[in]	is_bounded_relu	Specified if a fused bounded relu should be applied

Returns: Quantized value

Definition at line 366 of file NEAsymm.h.

 {
     int32x4_t in_s32 = vdupq_n_s32(in_value);
  
     if (result_shift < 0)
     {
         in_value = vgetq_lane_s32(
             vqrdmulhq_n_s32(vmulq_n_s32(in_s32, (1 << (-result_shift))), result_fixedpoint_multiplier), 0);
     }
     else
     {
         // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
         in_value = vgetq_lane_s32(vqrdmulhq_n_s32(in_s32, result_fixedpoint_multiplier), 0);
  
         // Shift value by result_shift_s32
         in_value = rounding_divide_by_pow2(in_value, result_shift);
     }
  
     // Add the offset term
     in_value += result_offset_after_shift_s32;
  
     // Bound the result
     int8_t out_s8 = static_cast<int8_t>(std::max<int32_t>(-128, std::min<int32_t>(127, in_value)));
     if (is_bounded_relu)
     {
         out_s8 = static_cast<int8_t>(std::max(min_s8, std::min(max_s8, out_s8)));
     }
  
     return out_s8;
 }

References rounding_divide_by_pow2().

◆ finalize_quantization() [2/4]

uint8_t arm_compute::finalize_quantization	(	int32_t	in_value,
		int	result_fixedpoint_multiplier,
		int32_t	result_shift,
		int32_t	result_offset_after_shift_s32,
		uint8_t	min_u8,
		uint8_t	max_u8,
		bool	is_bounded_relu
	)

inline

Performs final quantization step on single element.

Parameters

[in]	in_value	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	result_offset_after_shift_s32	Result offset parameter
[in]	min_u8	Relu lower bound
[in]	max_u8	Relu upper bound
[in]	is_bounded_relu	Specified if a fused bounded relu should be applied

Returns: Quantized value

Definition at line 318 of file NEAsymm.h.

 {
     int32x4_t in_s32 = vdupq_n_s32(in_value);
  
     if (result_shift < 0)
     {
         in_value = vgetq_lane_s32(
             vqrdmulhq_n_s32(vmulq_n_s32(in_s32, (1 << (-result_shift))), result_fixedpoint_multiplier), 0);
     }
     else
     {
         // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
         in_value = vgetq_lane_s32(vqrdmulhq_n_s32(in_s32, result_fixedpoint_multiplier), 0);
         // Shift value by result_shift_s32
         in_value = rounding_divide_by_pow2(in_value, result_shift);
     }
  
     // Add the offset term
     in_value += result_offset_after_shift_s32;
  
     // Bound the result
     uint8_t out_u8 = static_cast<uint8_t>(std::max<int32_t>(0, std::min<int32_t>(255, in_value)));
     if (is_bounded_relu)
     {
         out_u8 = static_cast<uint8_t>(std::max(min_u8, std::min(max_u8, out_u8)));
     }
  
     return out_u8;
 }

References rounding_divide_by_pow2().

◆ finalize_quantization() [3/4]

int8x16_t arm_compute::finalize_quantization	(	int32x4x4_t &	in_s32,
		int	result_fixedpoint_multiplier,
		int32_t	result_shift,
		int32x4_t	result_offset_after_shift_s32,
		int8x16_t	min_s8,
		int8x16_t	max_s8,
		bool	is_bounded_relu
	)

inline

Performs final quantization step on 16 elements.

Parameters

[in]	in_s32	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	result_offset_after_shift_s32	Result offset parameter
[in]	min_s8	Relu lower bound
[in]	max_s8	Relu upper bound
[in]	is_bounded_relu	Specified if a fused bounded relu should be applied

Returns: Quantized values

Definition at line 161 of file NEAsymm.h.

 {
     if (result_shift < 0)
     {
         in_s32.val[0] = vmulq_n_s32(in_s32.val[0], (1 << (-result_shift)));
         in_s32.val[1] = vmulq_n_s32(in_s32.val[1], (1 << (-result_shift)));
         in_s32.val[2] = vmulq_n_s32(in_s32.val[2], (1 << (-result_shift)));
         in_s32.val[3] = vmulq_n_s32(in_s32.val[3], (1 << (-result_shift)));
  
         in_s32.val[0] = vqrdmulhq_n_s32(in_s32.val[0], result_fixedpoint_multiplier);
         in_s32.val[1] = vqrdmulhq_n_s32(in_s32.val[1], result_fixedpoint_multiplier);
         in_s32.val[2] = vqrdmulhq_n_s32(in_s32.val[2], result_fixedpoint_multiplier);
         in_s32.val[3] = vqrdmulhq_n_s32(in_s32.val[3], result_fixedpoint_multiplier);
     }
     else
     {
         // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
         in_s32.val[0] = vqrdmulhq_n_s32(in_s32.val[0], result_fixedpoint_multiplier);
         in_s32.val[1] = vqrdmulhq_n_s32(in_s32.val[1], result_fixedpoint_multiplier);
         in_s32.val[2] = vqrdmulhq_n_s32(in_s32.val[2], result_fixedpoint_multiplier);
         in_s32.val[3] = vqrdmulhq_n_s32(in_s32.val[3], result_fixedpoint_multiplier);
  
         // Round to the nearest division by a power-of-two using result_shift_s32
         in_s32.val[0] = rounding_divide_by_pow2(in_s32.val[0], result_shift);
         in_s32.val[1] = rounding_divide_by_pow2(in_s32.val[1], result_shift);
         in_s32.val[2] = rounding_divide_by_pow2(in_s32.val[2], result_shift);
         in_s32.val[3] = rounding_divide_by_pow2(in_s32.val[3], result_shift);
     }
  
     // Add the offset terms
     in_s32.val[0] = vaddq_s32(in_s32.val[0], result_offset_after_shift_s32);
     in_s32.val[1] = vaddq_s32(in_s32.val[1], result_offset_after_shift_s32);
     in_s32.val[2] = vaddq_s32(in_s32.val[2], result_offset_after_shift_s32);
     in_s32.val[3] = vaddq_s32(in_s32.val[3], result_offset_after_shift_s32);
  
     // Convert S32 to S16
     const int16x8x2_t in_s16 = {{vcombine_s16(vqmovn_s32(in_s32.val[0]), vqmovn_s32(in_s32.val[1])),
                                  vcombine_s16(vqmovn_s32(in_s32.val[2]), vqmovn_s32(in_s32.val[3]))}};
  
     // Convert S16 to S8
     int8x16_t out_s8 = vcombine_s8(vqmovn_s16(in_s16.val[0]), vqmovn_s16(in_s16.val[1]));
  
     if (is_bounded_relu)
     {
         out_s8 = vmaxq_s8(out_s8, min_s8);
         out_s8 = vminq_s8(out_s8, max_s8);
     }
  
     return out_s8;
 }

References rounding_divide_by_pow2().

◆ finalize_quantization() [4/4]

uint8x16_t arm_compute::finalize_quantization	(	int32x4x4_t &	in_s32,
		int	result_fixedpoint_multiplier,
		int32_t	result_shift,
		int32x4_t	result_offset_after_shift_s32,
		uint8x16_t	min_u8,
		uint8x16_t	max_u8,
		bool	is_bounded_relu
	)

inline

Performs final quantization step on 16 elements.

Parameters

[in]	in_s32	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	result_offset_after_shift_s32	Result offset parameter
[in]	min_u8	Relu lower bound
[in]	max_u8	Relu upper bound
[in]	is_bounded_relu	Specified if a fused bounded relu should be applied

Returns: Quantized values

Definition at line 84 of file NEAsymm.h.

 {
     const static int32x4_t zero_s32 = vdupq_n_s32(0);
  
     if (result_shift < 0)
     {
         in_s32.val[0] = vmulq_n_s32(in_s32.val[0], (1 << (-result_shift)));
         in_s32.val[1] = vmulq_n_s32(in_s32.val[1], (1 << (-result_shift)));
         in_s32.val[2] = vmulq_n_s32(in_s32.val[2], (1 << (-result_shift)));
         in_s32.val[3] = vmulq_n_s32(in_s32.val[3], (1 << (-result_shift)));
  
         in_s32.val[0] = vqrdmulhq_n_s32(in_s32.val[0], result_fixedpoint_multiplier);
         in_s32.val[1] = vqrdmulhq_n_s32(in_s32.val[1], result_fixedpoint_multiplier);
         in_s32.val[2] = vqrdmulhq_n_s32(in_s32.val[2], result_fixedpoint_multiplier);
         in_s32.val[3] = vqrdmulhq_n_s32(in_s32.val[3], result_fixedpoint_multiplier);
     }
     else
     {
         // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
         in_s32.val[0] = vqrdmulhq_n_s32(in_s32.val[0], result_fixedpoint_multiplier);
         in_s32.val[1] = vqrdmulhq_n_s32(in_s32.val[1], result_fixedpoint_multiplier);
         in_s32.val[2] = vqrdmulhq_n_s32(in_s32.val[2], result_fixedpoint_multiplier);
         in_s32.val[3] = vqrdmulhq_n_s32(in_s32.val[3], result_fixedpoint_multiplier);
  
         // Round to the nearest division by a power-of-two using result_shift_s32
         in_s32.val[0] = rounding_divide_by_pow2(in_s32.val[0], result_shift);
         in_s32.val[1] = rounding_divide_by_pow2(in_s32.val[1], result_shift);
         in_s32.val[2] = rounding_divide_by_pow2(in_s32.val[2], result_shift);
         in_s32.val[3] = rounding_divide_by_pow2(in_s32.val[3], result_shift);
     }
  
     // Add the offset terms
     in_s32.val[0] = vaddq_s32(in_s32.val[0], result_offset_after_shift_s32);
     in_s32.val[1] = vaddq_s32(in_s32.val[1], result_offset_after_shift_s32);
     in_s32.val[2] = vaddq_s32(in_s32.val[2], result_offset_after_shift_s32);
     in_s32.val[3] = vaddq_s32(in_s32.val[3], result_offset_after_shift_s32);
  
     // Saturate negative values
     in_s32.val[0] = vmaxq_s32(in_s32.val[0], zero_s32);
     in_s32.val[1] = vmaxq_s32(in_s32.val[1], zero_s32);
     in_s32.val[2] = vmaxq_s32(in_s32.val[2], zero_s32);
     in_s32.val[3] = vmaxq_s32(in_s32.val[3], zero_s32);
  
     // Convert S32 to S16
     const int16x8x2_t in_s16 = {{vcombine_s16(vqmovn_s32(in_s32.val[0]), vqmovn_s32(in_s32.val[1])),
                                  vcombine_s16(vqmovn_s32(in_s32.val[2]), vqmovn_s32(in_s32.val[3]))}};
  
     // Convert S16 to U8
     uint8x16_t out_u8 = vcombine_u8(vqmovun_s16(in_s16.val[0]), vqmovun_s16(in_s16.val[1]));
  
     if (is_bounded_relu)
     {
         out_u8 = vmaxq_u8(out_u8, min_u8);
         out_u8 = vminq_u8(out_u8, max_u8);
     }
  
     return out_u8;
 }

References rounding_divide_by_pow2().

Referenced by arm_compute::cpu::directconv3d_quantized_neon_ndhwc().

◆ finalize_quantization_int16() [1/2]

int16_t arm_compute::finalize_quantization_int16	(	int32_t	in_value,
		int	result_fixedpoint_multiplier,
		int32_t	result_shift,
		int16_t	min_s16,
		int16_t	max_s16
	)

inline

Performs final quantization step on single signed 16-bit element.

Template Parameters

is_bounded_relu Specified if a fused bounded relu should be applied

Parameters

[in]	in_value	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	min_s16	Relu lower bound
[in]	max_s16	Relu upper bound

Returns: Quantized values

Definition at line 100 of file NESymm.h.

 {
     if (result_shift < 0)
     {
         const int64_t in_64 = static_cast<int64_t>(in_value) * (1 << (-result_shift)) *
                               static_cast<int64_t>(result_fixedpoint_multiplier);
         in_value = static_cast<int32_t>((in_64 + (1 << 30)) >> 31);
     }
     else
     {
         // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
         const int64_t in_64 = static_cast<int64_t>(in_value) * static_cast<int64_t>(result_fixedpoint_multiplier);
         // Shift value by result_shift_s32
         in_value = rounding_divide_by_pow2(static_cast<int32_t>((in_64 + (1 << 30)) >> 31), result_shift);
     }
  
     // Bound the result
     int16_t out_s16 = static_cast<int16_t>(std::max<int32_t>(-32768, std::min<int32_t>(32767, in_value)));
  
     if (is_bounded_relu)
     {
         out_s16 = static_cast<int16_t>(std::max(min_s16, std::min(max_s16, out_s16)));
     }
  
     return out_s16;
 }

References rounding_divide_by_pow2().

◆ finalize_quantization_int16() [2/2]

int16x8_t arm_compute::finalize_quantization_int16	(	int32x4x2_t &	in_s32,
		int	result_fixedpoint_multiplier,
		int32_t	result_shift,
		int16x8_t	min_s16,
		int16x8_t	max_s16
	)

Performs final quantization step on 8 signed 16-bit elements.

Template Parameters

is_bounded_relu Specified if a fused bounded relu should be applied

Parameters

[in]	in_s32	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	min_s16	Relu lower bound
[in]	max_s16	Relu upper bound

Returns: Quantized values

Definition at line 54 of file NESymm.h.

 {
     if (result_shift < 0)
     {
         in_s32.val[0] = vmulq_n_s32(in_s32.val[0], (1 << -result_shift));
         in_s32.val[1] = vmulq_n_s32(in_s32.val[1], (1 << -result_shift));
  
         in_s32.val[0] = vqrdmulhq_n_s32(in_s32.val[0], result_fixedpoint_multiplier);
         in_s32.val[1] = vqrdmulhq_n_s32(in_s32.val[1], result_fixedpoint_multiplier);
     }
     else
     {
         // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
         in_s32.val[0] = vqrdmulhq_n_s32(in_s32.val[0], result_fixedpoint_multiplier);
         in_s32.val[1] = vqrdmulhq_n_s32(in_s32.val[1], result_fixedpoint_multiplier);
         // Round to the nearest division by a power-of-two using result_shift_s32
         in_s32.val[0] = rounding_divide_by_pow2(in_s32.val[0], result_shift);
         in_s32.val[1] = rounding_divide_by_pow2(in_s32.val[1], result_shift);
     }
  
     // Convert S32 to S16
     int16x8_t out_s16 = vcombine_s16(vqmovn_s32(in_s32.val[0]), vqmovn_s32(in_s32.val[1]));
  
     if (is_bounded_relu)
     {
         out_s16 = vmaxq_s16(out_s16, min_s16);
         out_s16 = vminq_s16(out_s16, max_s16);
     }
  
     return out_s16;
 }

References rounding_divide_by_pow2().

◆ finalize_quantization_symm()

int8x16_t arm_compute::finalize_quantization_symm	(	int32x4x4_t &	in_s32,
		const int32x4x4_t &	result_fixedpoint_multiplier,
		const int32x4x4_t &	result_shift,
		const int32x4_t &	result_offset_after_shift_s32,
		const int8x16_t &	min_s8,
		const int8x16_t &	max_s8,
		const bool	is_bounded_relu
	)

inline

Performs final quantization step on 16 elements for symmetric quantization.

Parameters

[in]	in_s32	Input to be quantized.
[in]	result_fixedpoint_multiplier	Result multiplier parameter
[in]	result_shift	Result shift parameter
[in]	result_offset_after_shift_s32	Result offset parameter
[in]	min_s8	Relu lower bound
[in]	max_s8	Relu upper bound
[in]	is_bounded_relu	Specified if a fused bounded relu should be applied

Returns: Quantized values

Definition at line 230 of file NEAsymm.h.

 {
     const static int32x4_t one_s32 = vdupq_n_s32(1);
  
     // Fixed point multiplication with vector saturating rounding doubling multiply high with scalar
     int32x4x4_t res_shift_gt0 = {
         vqrdmulhq_s32(in_s32.val[0], result_fixedpoint_multiplier.val[0]),
         vqrdmulhq_s32(in_s32.val[1], result_fixedpoint_multiplier.val[1]),
         vqrdmulhq_s32(in_s32.val[2], result_fixedpoint_multiplier.val[2]),
         vqrdmulhq_s32(in_s32.val[3], result_fixedpoint_multiplier.val[3]),
     };
     // Round to the nearest division by a power-of-two using result_shift_s32
     res_shift_gt0.val[0] = rounding_divide_by_pow2(res_shift_gt0.val[0], result_shift.val[0]);
     res_shift_gt0.val[1] = rounding_divide_by_pow2(res_shift_gt0.val[1], result_shift.val[1]);
     res_shift_gt0.val[2] = rounding_divide_by_pow2(res_shift_gt0.val[2], result_shift.val[2]);
     res_shift_gt0.val[3] = rounding_divide_by_pow2(res_shift_gt0.val[3], result_shift.val[3]);
  
     int32x4x4_t res_shift_lt0 = {
         vmulq_s32(in_s32.val[0], vshlq_s32(one_s32, vnegq_s32(result_shift.val[0]))),
         vmulq_s32(in_s32.val[1], vshlq_s32(one_s32, vnegq_s32(result_shift.val[1]))),
         vmulq_s32(in_s32.val[2], vshlq_s32(one_s32, vnegq_s32(result_shift.val[2]))),
         vmulq_s32(in_s32.val[3], vshlq_s32(one_s32, vnegq_s32(result_shift.val[3]))),
     };
     res_shift_lt0.val[0] = vqrdmulhq_s32(res_shift_lt0.val[0], result_fixedpoint_multiplier.val[0]);
     res_shift_lt0.val[1] = vqrdmulhq_s32(res_shift_lt0.val[1], result_fixedpoint_multiplier.val[1]);
     res_shift_lt0.val[2] = vqrdmulhq_s32(res_shift_lt0.val[2], result_fixedpoint_multiplier.val[2]);
     res_shift_lt0.val[3] = vqrdmulhq_s32(res_shift_lt0.val[3], result_fixedpoint_multiplier.val[3]);
  
     // Select result depending on shift value
     const uint32x4x4_t mask_lt0 = {
 #ifdef __aarch64__
         vcltzq_s32(result_shift.val[0]),
         vcltzq_s32(result_shift.val[1]),
         vcltzq_s32(result_shift.val[2]),
         vcltzq_s32(result_shift.val[3]),
 #else  //__aarch64__
         vcltq_s32(result_shift.val[0], vdupq_n_s32(0)),
         vcltq_s32(result_shift.val[1], vdupq_n_s32(0)),
         vcltq_s32(result_shift.val[2], vdupq_n_s32(0)),
         vcltq_s32(result_shift.val[3], vdupq_n_s32(0)),
 #endif //__aarch64__
     };
  
     in_s32.val[0] = vbslq_s32(mask_lt0.val[0], res_shift_lt0.val[0], res_shift_gt0.val[0]);
     in_s32.val[1] = vbslq_s32(mask_lt0.val[1], res_shift_lt0.val[1], res_shift_gt0.val[1]);
     in_s32.val[2] = vbslq_s32(mask_lt0.val[2], res_shift_lt0.val[2], res_shift_gt0.val[2]);
     in_s32.val[3] = vbslq_s32(mask_lt0.val[3], res_shift_lt0.val[3], res_shift_gt0.val[3]);
  
     // Add the offset terms
     in_s32.val[0] = vaddq_s32(in_s32.val[0], result_offset_after_shift_s32);
     in_s32.val[1] = vaddq_s32(in_s32.val[1], result_offset_after_shift_s32);
     in_s32.val[2] = vaddq_s32(in_s32.val[2], result_offset_after_shift_s32);
     in_s32.val[3] = vaddq_s32(in_s32.val[3], result_offset_after_shift_s32);
  
     // Convert S32 to S16
     const int16x8x2_t in_s16 = {{vcombine_s16(vqmovn_s32(in_s32.val[0]), vqmovn_s32(in_s32.val[1])),
                                  vcombine_s16(vqmovn_s32(in_s32.val[2]), vqmovn_s32(in_s32.val[3]))}};
  
     // Convert S16 to S8
     int8x16_t out_s8 = vcombine_s8(vqmovn_s16(in_s16.val[0]), vqmovn_s16(in_s16.val[1]));
  
     if (is_bounded_relu)
     {
         out_s8 = vmaxq_s8(out_s8, min_s8);
         out_s8 = vminq_s8(out_s8, max_s8);
     }
  
     return out_s8;
 }

References rounding_divide_by_pow2().

◆ float_to_string_with_full_precision()

std::string float_to_string_with_full_precision ( float val )

Create a string with the float in full precision.

Parameters

val	Floating point value

Returns: String with the floating point value.

Definition at line 52 of file StringUtils.cpp.

 {
     std::stringstream ss;
     ss.precision(std::numeric_limits<float>::max_digits10);
     ss << val;
  
     if (val != static_cast<int>(val))
     {
         ss << "f";
     }
  
     return ss.str();
 }

References arm_compute::test::validation::ss().

◆ floor_to_multiple()

auto arm_compute::floor_to_multiple	(	S	value,
		T	divisor
	)		-> decltype((value / divisor) * divisor)

inline

Computes the largest number smaller or equal to value that is a multiple of divisor.

Parameters

[in]	value	Upper bound value
[in]	divisor	Value to compute multiple of.

Returns: the result.

Definition at line 64 of file Math.h.

 {
     ARM_COMPUTE_ERROR_ON(value < 0 || divisor <= 0);
     return (value / divisor) * divisor;
 }

References ARM_COMPUTE_ERROR_ON.

◆ fp16_supported()

bool fp16_supported ( const cl::Device & device )

Helper function to check whether the cl_khr_fp16 extension is supported.

Parameters

[in] device A CL device

Returns: True if the extension is supported

Definition at line 237 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_khr_fp16");
 }

References device_supports_extension().

◆ get_arch_from_target()

GPUTarget get_arch_from_target ( GPUTarget target )

Helper function to get the GPU arch.

Parameters

[in] target GPU target

Returns: the GPU target which shows the arch

Definition at line 247 of file GPUTarget.cpp.

 {
     return (target & GPUTarget::GPU_ARCH_MASK);
 }

References GPU_ARCH_MASK.

◆ get_cl_dot8_acc_type_from_data_type()

std::string get_cl_dot8_acc_type_from_data_type ( const DataType & dt )

Translates a tensor data type to the appropriate OpenCL dot8 accumulator type.

Parameters

[in] dt DataType to be translated to OpenCL dot8 accumulator type.

Returns: The string specifying the OpenCL dot8 accumulator type to be used.

Definition at line 176 of file CLHelpers.cpp.

 {
     switch (dt)
     {
         case DataType::U8:
         case DataType::QASYMM8:
             return "uint";
         case DataType::S8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8:
         case DataType::QSYMM8_PER_CHANNEL:
             return "int";
         default:
             ARM_COMPUTE_ERROR("Unsupported data type.");
             return "";
     }
 }

References ARM_COMPUTE_ERROR, dt, QASYMM8, QASYMM8_SIGNED, QSYMM8, QSYMM8_PER_CHANNEL, S8, and U8.

Referenced by ClGemmLowpMatrixMultiplyNativeKernel::configure(), ClGemmLowpMatrixMultiplyReshapedKernel::configure(), ClGemmLowpMatrixMultiplyReshapedOnlyRhsKernel::configure(), ClGemmLowpMatrixAReductionKernel::configure(), and ClGemmLowpMatrixBReductionKernel::configure().

◆ get_cl_image_pitch_alignment()

size_t get_cl_image_pitch_alignment ( const cl::Device & device )

Helper function to get the cl_image pitch alignment in pixels.

Parameters

[in] device A CL device

Returns: the cl_image pitch alignment in pixels. If an error occurs, the function will return 0

Definition at line 382 of file CLHelpers.cpp.

 {
     cl_uint pixel_aligment = 0;
  
     cl_int err = clGetDeviceInfo(device(), CL_DEVICE_IMAGE_PITCH_ALIGNMENT, sizeof(cl_uint), &pixel_aligment, nullptr);
  
     if (err == CL_SUCCESS)
     {
         return pixel_aligment;
     }
     else
     {
         return 0;
     }
 }

References clGetDeviceInfo().

Referenced by create_image2d_from_buffer(), export_to_cl_image(), examples::gemm_tuner_helpers::update_padding_for_cl_image(), arm_compute::opencl::kernels::gemm::update_padding_for_cl_image(), and arm_compute::opencl::kernels::gemm::validate_image2d_support_on_rhs().

◆ get_cl_non_uniform_work_group_supported()

bool get_cl_non_uniform_work_group_supported ( const cl::Device & device )

Helper function to check whether non-uniform work group is supported.

Parameters

[in] device A CL device

Returns: True if the feature is supported

Definition at line 398 of file CLHelpers.cpp.

 {
     cl_bool supported = CL_FALSE;
  
     cl_int err =
         clGetDeviceInfo(device(), CL_DEVICE_NON_UNIFORM_WORK_GROUP_SUPPORT, sizeof(cl_bool), &supported, nullptr);
  
     return (err == CL_SUCCESS && supported == CL_TRUE);
 }

References clGetDeviceInfo().

Referenced by CLDevice::is_non_uniform_workgroup_supported().

◆ get_cl_promoted_type_from_data_type()

std::string get_cl_promoted_type_from_data_type ( const DataType & dt )

Translates a tensor data type to the appropriate OpenCL promoted type.

Parameters

[in] dt DataType to be used to get the promoted OpenCL type.

Returns: The string specifying the OpenCL type to be used.

Definition at line 77 of file CLHelpers.cpp.

 {
     switch (dt)
     {
         case DataType::U8:
         case DataType::QASYMM8:
             return "ushort";
         case DataType::S8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8:
         case DataType::QSYMM8_PER_CHANNEL:
             return "short";
         case DataType::U16:
         case DataType::QASYMM16:
             return "uint";
         case DataType::S16:
         case DataType::QSYMM16:
             return "int";
         case DataType::U32:
             return "ulong";
         case DataType::S32:
             return "long";
         case DataType::F16:
             return "float";
         default:
             ARM_COMPUTE_ERROR("Cannot get promoted OpenCL type for the input data type.");
             return "";
     }
 }

References ARM_COMPUTE_ERROR, dt, F16, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S8, U16, U32, and U8.

◆ get_cl_select_type_from_data_type()

std::string get_cl_select_type_from_data_type ( const DataType & dt )

Translates a tensor data type to the appropriate OpenCL select type.

Parameters

[in] dt DataType to be translated to OpenCL select type.

Returns: The string specifying the OpenCL select type to be used.

Definition at line 143 of file CLHelpers.cpp.

 {
     switch (dt)
     {
         case DataType::U8:
         case DataType::QASYMM8:
         case DataType::S8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8:
         case DataType::QSYMM8_PER_CHANNEL:
             return "char";
         case DataType::U16:
         case DataType::QASYMM16:
             return "ushort";
         case DataType::F16:
         case DataType::S16:
         case DataType::QSYMM16:
             return "short";
         case DataType::U32:
             return "uint";
         case DataType::F32:
         case DataType::S32:
             return "int";
         case DataType::U64:
             return "ulong";
         case DataType::S64:
             return "long";
         default:
             ARM_COMPUTE_ERROR("Unsupported input data type.");
             return "";
     }
 }

References ARM_COMPUTE_ERROR, dt, F16, F32, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, U16, U32, U64, and U8.

Referenced by CLArgMinMaxLayerKernel::configure().

◆ get_cl_signed_type_from_element_size()

std::string get_cl_signed_type_from_element_size ( size_t element_size )

Translates the element size to an signed integer data type.

Parameters

[in] element_size Size in bytes of an element.

Returns: The string specifying the OpenCL type to be used.

Definition at line 125 of file CLHelpers.cpp.

 {
     switch (element_size)
     {
         case 1:
             return "char";
         case 2:
             return "short";
         case 4:
             return "int";
         case 8:
             return "long";
         default:
             ARM_COMPUTE_ERROR("Data type not supported");
             return "";
     }
 }

References ARM_COMPUTE_ERROR.

◆ get_cl_type_from_data_type()

std::string get_cl_type_from_data_type ( const DataType & dt )

Translates a tensor data type to the appropriate OpenCL type.

Parameters

[in] dt DataType to be translated to OpenCL type.

Returns: The string specifying the OpenCL type to be used.

Definition at line 41 of file CLHelpers.cpp.

 {
     switch (dt)
     {
         case DataType::U8:
         case DataType::QASYMM8:
             return "uchar";
         case DataType::S8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8:
         case DataType::QSYMM8_PER_CHANNEL:
             return "char";
         case DataType::U16:
         case DataType::QASYMM16:
             return "ushort";
         case DataType::S16:
         case DataType::QSYMM16:
             return "short";
         case DataType::U32:
             return "uint";
         case DataType::S32:
             return "int";
         case DataType::U64:
             return "ulong";
         case DataType::S64:
             return "long";
         case DataType::F16:
             return "half";
         case DataType::F32:
             return "float";
         default:
             ARM_COMPUTE_ERROR("Unsupported input data type.");
             return "";
     }
 }

References ARM_COMPUTE_ERROR, dt, F16, F32, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, U16, U32, U64, and U8.

◆ get_cl_unsigned_type_from_element_size()

std::string get_cl_unsigned_type_from_element_size ( size_t element_size )

Translates the element size to an unsigned integer data type.

Parameters

[in] element_size Size in bytes of an element.

Returns: The string specifying the OpenCL type to be used.

Definition at line 107 of file CLHelpers.cpp.

 {
     switch (element_size)
     {
         case 1:
             return "uchar";
         case 2:
             return "ushort";
         case 4:
             return "uint";
         case 8:
             return "ulong";
         default:
             ARM_COMPUTE_ERROR("Data type not supported");
             return "";
     }
 }

References ARM_COMPUTE_ERROR.

◆ get_cl_version()

CLVersion get_cl_version ( const cl::Device & device )

Helper function to get the highest OpenCL version supported.

Parameters

[in] device A CL device

Returns: the highest OpenCL version supported

Definition at line 258 of file CLHelpers.cpp.

 {
     std::string version_str = device.getInfo<CL_DEVICE_VERSION>();
     if (version_str.find("OpenCL 3") != std::string::npos)
     {
         return CLVersion::CL30;
     }
     else if (version_str.find("OpenCL 2") != std::string::npos)
     {
         return CLVersion::CL20;
     }
     else if (version_str.find("OpenCL 1.2") != std::string::npos)
     {
         return CLVersion::CL12;
     }
     else if (version_str.find("OpenCL 1.1") != std::string::npos)
     {
         return CLVersion::CL11;
     }
     else if (version_str.find("OpenCL 1.0") != std::string::npos)
     {
         return CLVersion::CL10;
     }
  
     return CLVersion::UNKNOWN;
 }

References CL10, CL11, CL12, CL20, CL30, and UNKNOWN.

Referenced by CLDevice::CLDevice().

◆ get_data_layout_dimension_index()

size_t get_data_layout_dimension_index	(	const DataLayout &	data_layout,
		const DataLayoutDimension &	data_layout_dimension
	)

inline

Get the index of the given dimension.

Parameters

[in]	data_layout	The data layout.
[in]	data_layout_dimension	The dimension which this index is requested for.

Returns: The int conversion of the requested data layout index.

Definition at line 201 of file Helpers.inl.

 {
     ARM_COMPUTE_ERROR_ON_MSG(data_layout == DataLayout::UNKNOWN,
                              "Cannot retrieve the dimension index for an unknown layout!");
     const auto &dims = get_layout_map().at(data_layout);
     const auto &it   = std::find(dims.cbegin(), dims.cend(), data_layout_dimension);
     ARM_COMPUTE_ERROR_ON_MSG(it == dims.cend(), "Invalid dimension for the given layout.");
     return it - dims.cbegin();
 }

References ARM_COMPUTE_ERROR_ON_MSG, arm_compute::cpu::data_layout, get_layout_map(), and UNKNOWN.

◆ get_data_size_from_data_type()

std::string get_data_size_from_data_type ( const DataType & dt )

Get the size of a data type in number of bits.

Parameters

[in] dt DataType.

Returns: Number of bits in the data type specified.

Definition at line 194 of file CLHelpers.cpp.

 {
     switch (dt)
     {
         case DataType::U8:
         case DataType::S8:
         case DataType::QSYMM8:
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
             return "8";
         case DataType::U16:
         case DataType::S16:
         case DataType::QSYMM16:
         case DataType::QASYMM16:
         case DataType::F16:
             return "16";
         case DataType::U32:
         case DataType::S32:
         case DataType::F32:
             return "32";
         case DataType::U64:
         case DataType::S64:
             return "64";
         default:
             ARM_COMPUTE_ERROR("Unsupported input data type.");
             return "0";
     }
 }

References ARM_COMPUTE_ERROR, dt, F16, F32, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, U16, U32, U64, and U8.

Referenced by ClDirectConv2dKernel::configure(), CLROIPoolingLayerKernel::configure(), and CLROIAlignLayerKernel::configure().

◆ get_default_lws_for_type()

cl::NDRange get_default_lws_for_type	(	CLKernelType	kernel_type,
		cl::NDRange	gws
	)

Definition at line 97 of file DefaultLWSHeuristics.cpp.

 {
     const size_t gws_x = gws[0];
     const size_t gws_y = gws[1];
     const size_t gws_z = gws[2];
  
     switch (kernel_type)
     {
         case CLKernelType::GEMM:
         {
             return get_gemm_lws(gws_x, gws_y, gws_z);
         }
         case CLKernelType::DIRECT:
         {
             return get_direct_lws(gws_x, gws_y, gws_z);
         }
         case CLKernelType::WINOGRAD:
         {
             return get_winograd_lws(gws_x, gws_y, gws_z);
         }
         case CLKernelType::DEPTHWISE:
         {
             return get_dwc_lws(gws_x, gws_y, gws_z);
         }
         default:
         {
             return CLKernelLibrary::get().default_ndrange();
         }
     }
 }

References CLKernelLibrary::default_ndrange(), DEPTHWISE, DIRECT, GEMM, CLKernelLibrary::get(), and WINOGRAD.

◆ get_index_data_layout_dimension()

DataLayoutDimension get_index_data_layout_dimension	(	const DataLayout &	data_layout,
		const size_t	index
	)

inline

Get the DataLayoutDimension of a given index and layout.

Parameters

[in]	data_layout	The data layout.
[in]	index	The data layout index.

Returns: The dimension which this index is requested for.

Definition at line 212 of file Helpers.inl.

 {
     ARM_COMPUTE_ERROR_ON_MSG(data_layout == DataLayout::UNKNOWN,
                              "Cannot retrieve the layout dimension for an unknown layout!");
     const auto &dims = get_layout_map().at(data_layout);
     ARM_COMPUTE_ERROR_ON_MSG(index >= dims.size(), "Invalid index for the given layout.");
     return dims[index];
 }

References ARM_COMPUTE_ERROR_ON_MSG, arm_compute::cpu::data_layout, get_layout_map(), and UNKNOWN.

◆ get_internal() [1/5]

IContext* arm_compute::get_internal ( AclContext ctx )

inline

Extract internal representation of a Context.

Parameters

[in] ctx Opaque context pointer

Returns: The internal representation as an IContext

Definition at line 127 of file IContext.h.

 {
     return static_cast<IContext *>(ctx);
 }

Referenced by AclActivation(), AclCreateQueue(), AclCreateTensor(), AclCreateTensorPack(), AclDestroyContext(), AclDestroyOperator(), AclDestroyQueue(), AclDestroyTensor(), AclDestroyTensorPack(), AclGetClContext(), AclGetClDevice(), AclGetClMem(), AclGetClQueue(), AclMapTensor(), AclPackTensor(), AclPackTensors(), AclQueueFinish(), AclRunOperator(), AclSetClContext(), AclSetClQueue(), AclTensorImport(), and AclUnmapTensor().

◆ get_internal() [2/5]

IOperator* arm_compute::get_internal ( AclOperator op )

inline

Extract internal representation of an Operator.

Parameters

[in] op Opaque operator pointer

Returns: The internal representation as an IOperator

Definition at line 113 of file IOperator.h.

 {
     return static_cast<IOperator *>(op);
 }

◆ get_internal() [3/5]

IQueue* arm_compute::get_internal ( AclQueue queue )

inline

Extract internal representation of a Queue.

Parameters

[in] queue Opaque queue pointer

Returns: The internal representation as an IQueue

Definition at line 76 of file IQueue.h.

 {
     return static_cast<IQueue *>(queue);
 }

◆ get_internal() [4/5]

ITensorV2* arm_compute::get_internal ( AclTensor tensor )

inline

Extract internal representation of a Tensor.

Parameters

[in] tensor Opaque tensor pointer

Returns: The internal representation as an ITensor

Definition at line 115 of file ITensorV2.h.

 {
     return static_cast<ITensorV2 *>(tensor);
 }

References tensor.

◆ get_internal() [5/5]

TensorPack* arm_compute::get_internal ( AclTensorPack pack )

inline

Extract internal representation of a TensoPack.

Parameters

[in] pack Opaque tensor pack pointer

Returns: The internal representation as an TensorPack

Definition at line 107 of file TensorPack.h.

 {
     return static_cast<TensorPack *>(pack);
 }

References arm_compute::test::validation::pack.

◆ get_layout_map()

const std::map< DataLayout, std::vector< DataLayoutDimension > > & get_layout_map ( )

Returns a static map used to find an index or dimension based on a data layout.

*** Layouts ***

*** 4D *** [N C H W] [3 2 1 0] [N H W C]

*** 5D *** [N C D H W] [4 3 2 1 0] [N D H W C]

Definition at line 107 of file Helpers.cpp.

 {
     constexpr DataLayoutDimension W = DataLayoutDimension::WIDTH;
     constexpr DataLayoutDimension H = DataLayoutDimension::HEIGHT;
     constexpr DataLayoutDimension C = DataLayoutDimension::CHANNEL;
     constexpr DataLayoutDimension D = DataLayoutDimension::DEPTH;
     constexpr DataLayoutDimension N = DataLayoutDimension::BATCHES;
  
     static const std::map<DataLayout, std::vector<DataLayoutDimension>> layout_map = {
         {DataLayout::NDHWC, {C, W, H, D, N}},
         {DataLayout::NCDHW, {W, H, D, C, N}},
         {DataLayout::NHWC, {C, W, H, N}},
         {DataLayout::NCHW, {W, H, C, N}}};
  
     return layout_map;
 }

References BATCHES, CHANNEL, DEPTH, HEIGHT, N, NCDHW, NCHW, NDHWC, NHWC, and WIDTH.

Referenced by get_data_layout_dimension_index(), and get_index_data_layout_dimension().

◆ get_min_max()

std::tuple<PixelValue, PixelValue> arm_compute::get_min_max ( DataType dt )

inline

Compute the mininum and maximum values a data type can take.

Parameters

[in] dt Data type to get the min/max bounds of

Returns: A tuple (min,max) with the minimum and maximum values respectively wrapped in PixelValue.

Definition at line 195 of file DataTypeUtils.h.

 {
     PixelValue min{};
     PixelValue max{};
     switch (dt)
     {
         case DataType::U8:
         case DataType::QASYMM8:
         {
             min = PixelValue(static_cast<int32_t>(std::numeric_limits<uint8_t>::lowest()));
             max = PixelValue(static_cast<int32_t>(std::numeric_limits<uint8_t>::max()));
             break;
         }
         case DataType::S8:
         case DataType::QSYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
         {
             min = PixelValue(static_cast<int32_t>(std::numeric_limits<int8_t>::lowest()));
             max = PixelValue(static_cast<int32_t>(std::numeric_limits<int8_t>::max()));
             break;
         }
         case DataType::U16:
         case DataType::QASYMM16:
         {
             min = PixelValue(static_cast<int32_t>(std::numeric_limits<uint16_t>::lowest()));
             max = PixelValue(static_cast<int32_t>(std::numeric_limits<uint16_t>::max()));
             break;
         }
         case DataType::S16:
         case DataType::QSYMM16:
         {
             min = PixelValue(static_cast<int32_t>(std::numeric_limits<int16_t>::lowest()));
             max = PixelValue(static_cast<int32_t>(std::numeric_limits<int16_t>::max()));
             break;
         }
         case DataType::U32:
         {
             min = PixelValue(std::numeric_limits<uint32_t>::lowest());
             max = PixelValue(std::numeric_limits<uint32_t>::max());
             break;
         }
         case DataType::S32:
         {
             min = PixelValue(std::numeric_limits<int32_t>::lowest());
             max = PixelValue(std::numeric_limits<int32_t>::max());
             break;
         }
         case DataType::BFLOAT16:
         {
             min = PixelValue(bfloat16::lowest());
             max = PixelValue(bfloat16::max());
             break;
         }
         case DataType::F16:
         {
             min = PixelValue(std::numeric_limits<half>::lowest());
             max = PixelValue(std::numeric_limits<half>::max());
             break;
         }
         case DataType::F32:
         {
             min = PixelValue(std::numeric_limits<float>::lowest());
             max = PixelValue(std::numeric_limits<float>::max());
             break;
         }
         default:
             ARM_COMPUTE_ERROR("Undefined data type!");
     }
     return std::make_tuple(min, max);
 }

References ARM_COMPUTE_ERROR, BFLOAT16, dt, F16, F32, bfloat16::lowest(), arm_compute::support::cpp11::lowest(), bfloat16::max(), QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S8, U16, U32, and U8.

Referenced by ClPool2dKernel::configure(), ClPool3dKernel::configure(), ClGemmLowpOffsetContributionOutputStageKernel::configure(), ClGemmLowpMatrixMultiplyReshapedOnlyRhsKernel::configure(), ClGemmConv2d::configure(), get_quantized_activation_min_max(), arm_compute::quantization::get_quantized_asymmetric_output_min_max(), and CpuGemmLowpOffsetContributionOutputStageKernel::run_op().

◆ get_next_power_two()

unsigned int arm_compute::get_next_power_two ( unsigned int x )

inline

Given an integer value, this function returns the next power of two.

Parameters

[in] x Input value

Returns: the next power of two

Definition at line 74 of file Utils.h.

 {
     // Decrement by 1
     x--;
  
     // Shift right by 1
     x |= x >> 1u;
     // Shift right by 2
     x |= x >> 2u;
     // Shift right by 4
     x |= x >> 4u;
     // Shift right by 8
     x |= x >> 8u;
     // Shift right by 16
     x |= x >> 16u;
  
     // Increment by 1
     x++;
  
     return x;
 }

Referenced by ICLKernel::gws_from_window().

◆ get_normalization_dimension_index()

unsigned int arm_compute::get_normalization_dimension_index	(	DataLayout	layout,
		const NormalizationLayerInfo &	info
	)

inline

Calculate the normalization dimension index for a given normalization type.

Parameters

[in]	layout	Data layout of the input and output tensor
[in]	info	Normalization info

Returns: Normalization dimension index

Definition at line 39 of file NormalizationHelpers.h.

 {
     const unsigned int width_idx   = get_data_layout_dimension_index(layout, DataLayoutDimension::WIDTH);
     const unsigned int channel_idx = get_data_layout_dimension_index(layout, DataLayoutDimension::CHANNEL);
  
     return info.is_in_map() ? width_idx : channel_idx;
 }

References CHANNEL, arm_compute::cpu::channel_idx, get_data_layout_dimension_index(), arm_compute::test::validation::info, WIDTH, and arm_compute::cpu::width_idx.

Referenced by NENormalizationLayerKernel::configure(), and CLNormalizationLayerKernel::configure().

◆ get_padding_info() [1/2]

std::unordered_map< const ITensorInfo *, PaddingSize > get_padding_info ( std::initializer_list< const ITensor * > tensors )

Stores padding information before configuring a kernel.

Parameters

[in] tensors list of tensors to store the padding info for

Returns: An unordered map where each tensor info pointer is paired with its original padding info

Definition at line 462 of file Utils.cpp.

 {
     std::unordered_map<const ITensorInfo *, PaddingSize> res;
  
     for (const ITensor *tensor : tensors)
     {
         if (tensor)
         {
             res.insert({tensor->info(), tensor->info()->padding()});
         }
     }
  
     return res;
 }

References CLTensor::info(), TensorInfo::padding(), and tensor.

◆ get_padding_info() [2/2]

std::unordered_map< const ITensorInfo *, PaddingSize > get_padding_info ( std::initializer_list< const ITensorInfo * > infos )

Stores padding information before configuring a kernel.

Parameters

[in] infos list of tensor infos to store the padding info for

Returns: An unordered map where each tensor info pointer is paired with its original padding info

Definition at line 477 of file Utils.cpp.

 {
     std::unordered_map<const ITensorInfo *, PaddingSize> res;
  
     for (const ITensorInfo *info : infos)
     {
         if (info)
         {
             res.insert({info, info->padding()});
         }
     }
  
     return res;
 }

References arm_compute::test::validation::info.

◆ get_promoted_data_type()

DataType arm_compute::get_promoted_data_type ( DataType dt )

inline

Return the promoted data type of a given data type.

Note: If promoted data type is not supported an error will be thrown

Parameters

[in] dt Data type to get the promoted type of.

Returns: Promoted data type

Definition at line 159 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::U8:
             return DataType::U16;
         case DataType::S8:
             return DataType::S16;
         case DataType::U16:
             return DataType::U32;
         case DataType::S16:
             return DataType::S32;
         case DataType::QSYMM8:
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
         case DataType::QSYMM16:
         case DataType::QASYMM16:
         case DataType::BFLOAT16:
         case DataType::F16:
         case DataType::U32:
         case DataType::S32:
         case DataType::F32:
             ARM_COMPUTE_ERROR("Unsupported data type promotions!");
         default:
             ARM_COMPUTE_ERROR("Undefined data type!");
     }
     return DataType::UNKNOWN;
 }

References ARM_COMPUTE_ERROR, BFLOAT16, dt, F16, F32, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S8, U16, U32, U8, and UNKNOWN.

◆ get_quantized_activation_min_max()

std::pair< int32_t, int32_t > get_quantized_activation_min_max	(	const ActivationLayerInfo &	act_info,
		DataType	data_type,
		UniformQuantizationInfo	oq_info
	)

Returns a pair of minimum and maximum values for a quantized activation.

Parameters

[in]	act_info	The information for activation
[in]	data_type	The used data type
[in]	oq_info	The output quantization information

Returns: The pair with minimum and maximum values

Definition at line 442 of file Utils.cpp.

 {
     const bool is_qasymm8_signed = is_data_type_quantized_asymmetric_signed(data_type);
     const auto a                 = act_info.a();
     const auto b                 = act_info.b();
     const int  a_int          = is_qasymm8_signed ? quantize_qasymm8_signed(a, oq_info) : quantize_qasymm8(a, oq_info);
     const int  b_int          = is_qasymm8_signed ? quantize_qasymm8_signed(b, oq_info) : quantize_qasymm8(b, oq_info);
     const auto type_max_value = std::get<1>(get_min_max(data_type)).get<int32_t>();
  
     const int32_t min_activation = act_info.activation() != ActivationLayerInfo::ActivationFunction::LU_BOUNDED_RELU
                                        ? std::min(oq_info.offset, type_max_value)
                                        : b_int;
     const int32_t max_activation =
         act_info.activation() == ActivationLayerInfo::ActivationFunction::RELU ? type_max_value : a_int;
  
     return std::make_pair(min_activation, max_activation);
 }

References arm_compute::test::validation::act_info, arm_compute::test::validation::b, arm_compute::test::validation::data_type, get_min_max(), is_data_type_quantized_asymmetric_signed(), UniformQuantizationInfo::offset, quantize_qasymm8(), and quantize_qasymm8_signed().

Referenced by ClMatMulLowpNativeKernel::configure(), CLDepthwiseConvolutionLayerNativeKernel::configure(), ClGemmConv2d::configure(), and ClGemmConv2d::validate().

◆ get_softmax_output_quantization_info()

QuantizationInfo get_softmax_output_quantization_info	(	DataType	input_type,
		bool	is_log
	)

Returns output quantization information for softmax layer.

Parameters

[in]	input_type	The data type of the input tensor
[in]	is_log	True for log softmax

Returns: Quantization information for the output tensor

Definition at line 421 of file Utils.cpp.

 {
     // Note: Output quantization info for softmax should always have
     // * Softmax with QASYMM8: scale = 1/256, offset = 0
     // * Softmax with QASYMM8_SIGNED: scale = 1/256, offset = -128
     // * LogSoftmax with QASYMM8: scale = 1/256, offset = 0
     // * LogSoftmax with QASYMM8_SIGNED: scale = 16/256, offset = 127
     if (is_data_type_quantized_asymmetric_signed(input_type))
     {
         if (is_log)
         {
             return QuantizationInfo(16.f / 256, 127);
         }
         else
         {
             return QuantizationInfo(1.f / 256, -128);
         }
     }
     return QuantizationInfo(1.f / 256, 0);
 }

References is_data_type_quantized_asymmetric_signed().

Referenced by CpuSoftmaxKernel::configure(), SoftmaxLayerNode::configure_output(), and ClSoftmaxKernel::validate().

◆ get_target_from_device()

GPUTarget get_target_from_device ( const cl::Device & device )

Helper function to get the GPU target from CL device.

Parameters

[in] device A CL device

Returns: the GPU target

Definition at line 224 of file CLHelpers.cpp.

 {
     // Query device name size
     std::string device_name = device.getInfo<CL_DEVICE_NAME>();
  
     return get_target_from_name(device_name);
 }

References get_target_from_name().

Referenced by CLCompileContext::default_ndrange(), CLScheduler::init(), and ICLKernel::set_target().

◆ get_target_from_name()

GPUTarget get_target_from_name ( const std::string & device_name )

Helper function to get the GPU target from a device name.

Parameters

[in] device_name A device name

Returns: the GPU target

Definition at line 188 of file GPUTarget.cpp.

 {
     std::regex  mali_regex(R"(Mali-(.*))");
     std::smatch name_parts;
     const bool  found_mali = std::regex_search(device_name, name_parts, mali_regex);
  
     if (!found_mali)
     {
         ARM_COMPUTE_LOG_INFO_MSG_CORE("Can't find valid Arm® Mali™ GPU. Target is set to default.");
         return GPUTarget::MIDGARD;
     }
  
     const char         target  = name_parts.str(1)[0];
     const std::string &version = name_parts.str(1);
  
     std::regex future_regex(R"(.*X)");
     const bool is_future_gpu = std::regex_search(version, future_regex);
  
     // Work-out gpu target
     GPUTarget gpu_target;
     if (target == 'G' || is_future_gpu)
     {
         // Check for Valhall, Bifrost or 5-th Gen
         gpu_target = get_fifth_gen_target(version);
         if (gpu_target == GPUTarget::UNKNOWN)
         {
             gpu_target = get_valhall_target(version);
         }
  
         if (gpu_target == GPUTarget::UNKNOWN)
         {
             gpu_target = get_bifrost_target(version);
         }
  
         // Default GPUTarget
         if (gpu_target == GPUTarget::UNKNOWN)
         {
             gpu_target = GPUTarget::VALHALL;
         }
     }
     else if (target == 'T')
     {
         gpu_target = get_midgard_target(version);
     }
     else
     {
         gpu_target = GPUTarget::UNKNOWN;
     }
  
     // Report in case of unknown target
     if (gpu_target == GPUTarget::UNKNOWN)
     {
         ARM_COMPUTE_LOG_INFO_MSG_CORE("Arm® Mali™ Mali GPU unknown. Target is set to the default one. (BIFROST)");
         return GPUTarget::BIFROST;
     }
  
     return gpu_target;
 }

References ARM_COMPUTE_LOG_INFO_MSG_CORE, BIFROST, MIDGARD, UNKNOWN, and VALHALL.

Referenced by CLDevice::CLDevice(), dot8_supported(), get_target_from_device(), and arm_compute::test::validation::TEST_CASE().

◆ get_wbsm_support_info()

bool get_wbsm_support_info ( const cl::Device & device )

Definition at line 429 of file CLHelpers.cpp.

 {
     cl_bitfield capabilities = 0;
     cl_int      err = clGetDeviceInfo(device.get(), CL_DEVICE_SCHEDULING_CONTROLS_CAPABILITIES_ARM, sizeof(cl_bitfield),
                                       &capabilities, nullptr);
     if ((err == CL_SUCCESS) && (capabilities & CL_KERNEL_EXEC_INFO_WORKGROUP_BATCH_SIZE_MODIFIER_ARM))
     {
         return true;
     }
     return false;
 }

References clGetDeviceInfo().

Referenced by CLCompileContext::CLCompileContext(), and CLCompileContext::set_device().

◆ gpu_target_is_in() [1/2]

bool arm_compute::gpu_target_is_in	(	GPUTarget	target_to_check,
		GPUTarget	target
	)

inline

Variant of gpu_target_is_in for comparing two targets.

Definition at line 115 of file GPUTarget.h.

 {
     return target_to_check == target;
 }

◆ gpu_target_is_in() [2/2]

bool arm_compute::gpu_target_is_in	(	GPUTarget	target_to_check,
		GPUTarget	target,
		Args...	targets
	)

Helper function to check whether a gpu target is equal to the provided targets.

Parameters

[in]	target_to_check	gpu target to check
[in]	target	First target to compare against
[in]	targets	(Optional) Additional targets to compare with

Returns: True if the target is equal with at least one of the targets.

Definition at line 109 of file GPUTarget.h.

 {
     return (target_to_check == target) | gpu_target_is_in(target_to_check, targets...);
 }

Referenced by arm_compute::test::validation::TEST_CASE().

◆ handle_cl_error()

void handle_cl_error	(	const std::string &	function_name,
		cl_int	error_code
	)

Check for CL error code and throw exception accordingly.

Parameters

[in]	function_name	The name of the CL function being called.
[in]	error_code	The error returned by the CL function.

Definition at line 120 of file CLUtils.cpp.

 {
     if (error_code != CL_SUCCESS)
     {
         std::string error_message = function_name + " - Error code: " + std::to_string(error_code);
         ARM_COMPUTE_ERROR(error_message.c_str());
     }
 }

References ARM_COMPUTE_ERROR, and to_string().

Referenced by CLMutableCommandBuffer::add_kernel(), CLMutableCommandBuffer::CLMutableCommandBuffer(), CLCompatCommandBuffer::enqueue(), CLMutableCommandBuffer::enqueue(), CLMutableCommandBuffer::finalize(), and CLMutableCommandBuffer::update().

◆ has_format_horizontal_subsampling()

bool arm_compute::has_format_horizontal_subsampling ( Format format )

inline

Return true if the given format has horizontal subsampling.

Parameters

[in] format Format to determine subsampling.

Returns: True if the format can be subsampled horizontaly.

Definition at line 304 of file Validate.h.

 {
     return (format == Format::YUYV422 || format == Format::UYVY422 || format == Format::NV12 ||
             format == Format::NV21 || format == Format::IYUV || format == Format::UV88)
                ? true
                : false;
 }

References update_supported_ops::format, IYUV, NV12, NV21, UV88, UYVY422, and YUYV422.

Referenced by adjust_odd_shape(), and calculate_subsampled_shape().

◆ has_format_vertical_subsampling()

bool arm_compute::has_format_vertical_subsampling ( Format format )

inline

Return true if the given format has vertical subsampling.

Parameters

[in] format Format to determine subsampling.

Returns: True if the format can be subsampled verticaly.

Definition at line 318 of file Validate.h.

 {
     return (format == Format::NV12 || format == Format::NV21 || format == Format::IYUV || format == Format::UV88)
                ? true
                : false;
 }

References update_supported_ops::format, IYUV, NV12, NV21, and UV88.

Referenced by adjust_odd_shape(), and calculate_subsampled_shape().

◆ has_holes() [1/2]

bool has_holes ( const ITensorInfo & info )

Check if the tensor has any holes.

A hole is defined as any gap in the tensor between two consecutive values. This can be a result of extending the paddings or manipulating the strides of the tensor

Parameters

[in] info Tensor info object defining the shape of the input tensor.

Note: This function checks for holes in all dimensions.

Definition at line 28 of file Utils.cpp.

 {
     return has_holes(info, info.num_dimensions() - 1);
 }

References arm_compute::test::validation::info.

Referenced by CpuSoftmaxKernel::configure(), CpuReshapeKernel::prepare(), and NEStackLayerKernel::prepare().

◆ has_holes() [2/2]

bool has_holes	(	const ITensorInfo &	info,
		size_t	dimension
	)

Check if the tensor has any holes.

Parameters

[in]	info	Tensor info object defining the shape of the input tensor.
[in]	dimension	Highest dimension to check.

Note: This function checks for holes in all the dimensions upto and including the highest dimension.

Definition at line 33 of file Utils.cpp.

 {
     const auto &shape          = info.tensor_shape();
     const auto &strides        = info.strides_in_bytes();
     size_t      squashed_bytes = info.element_size();
  
     for (size_t dim = 0; dim <= dimension; ++dim)
     {
         if (strides[dim] != squashed_bytes)
         {
             return true;
         }
         squashed_bytes *= shape[dim];
     }
     return false;
 }

References arm_compute::test::validation::info, and arm_compute::test::validation::shape.

◆ has_padding_changed()

bool has_padding_changed ( const std::unordered_map< const ITensorInfo *, PaddingSize > & padding_map )

Check if the previously stored padding info has changed after configuring a kernel.

Parameters

[in] padding_map an unordered map where each tensor info pointer is paired with its original padding info

Returns: true if any of the tensor infos has changed its paddings

Definition at line 492 of file Utils.cpp.

 {
     return std::find_if(padding_map.begin(), padding_map.end(),
                         [](const std::pair<const ITensorInfo *, PaddingSize> &padding_info)
                         { return (padding_info.first->padding() != padding_info.second); }) != padding_map.end();
 }

◆ ignore_unused()

void arm_compute::ignore_unused ( T && ... )

inline

Ignores unused arguments.

Template Parameters

T	Argument types

Parameters

[in] ... Ignored arguments

Definition at line 39 of file Error.h.

40 {

41 }

◆ image2d_from_buffer_supported()

bool image2d_from_buffer_supported ( const cl::Device & device )

Helper function to check whether the cl_khr_image2d_from_buffer extension is supported.

Parameters

[in] device A CL device

Returns: True if the extension is supported

Definition at line 377 of file CLHelpers.cpp.

 {
     return device_supports_extension(device, "cl_khr_image2d_from_buffer");
 }

References device_supports_extension().

Referenced by create_image2d_from_buffer(), export_to_cl_image(), arm_compute::test::validation::TEST_CASE(), and arm_compute::opencl::kernels::gemm::validate_image2d_support_on_rhs().

◆ index2coords()

Coordinates index2coords	(	const TensorShape &	shape,
		int	index
	)

inline

Convert a linear index into n-dimensional coordinates.

Parameters

[in]	shape	Shape of the n-dimensional tensor.
[in]	index	Linear index specifying the i-th element.

Returns: n-dimensional coordinates.

Definition at line 164 of file Helpers.inl.

 {
     int num_elements = shape.total_size();
  
     ARM_COMPUTE_ERROR_ON_MSG(index < 0 || index >= num_elements, "Index has to be in [0, num_elements]!");
     ARM_COMPUTE_ERROR_ON_MSG(num_elements == 0, "Cannot create coordinate from empty shape!");
  
     Coordinates coord{0};
  
     for (int d = shape.num_dimensions() - 1; d >= 0; --d)
     {
         num_elements /= shape[d];
         coord.set(d, index / num_elements);
         index %= num_elements;
     }
  
     return coord;
 }

References ARM_COMPUTE_ERROR_ON_MSG, and arm_compute::test::validation::shape.

Referenced by arm_compute::test::validation::reference::convert_fully_connected_weights().

◆ interleave_by()

int arm_compute::interleave_by ( const WeightFormat wf )

inline

Definition at line 1652 of file Types.h.

 {
     return (static_cast<int>(wf) >> 8) & 0xFFF;
 }

◆ intersect_valid_regions()

ValidRegion arm_compute::intersect_valid_regions ( const Ts &... regions )

Intersect multiple valid regions.

Parameters

[in] regions Valid regions.

Returns: Intersection of all regions.

Definition at line 64 of file WindowHelpers.h.

 {
     auto intersect = [](const ValidRegion &r1, const ValidRegion &r2) -> ValidRegion
     {
         ValidRegion region;
  
         for (size_t d = 0; d < std::min(r1.anchor.num_dimensions(), r2.anchor.num_dimensions()); ++d)
         {
             region.anchor.set(d, std::max(r1.anchor[d], r2.anchor[d]));
         }
  
         for (size_t d = 0; d < std::min(r1.shape.num_dimensions(), r2.shape.num_dimensions()); ++d)
         {
             region.shape.set(d, std::min(r1.shape[d], r2.shape[d]));
         }
  
         return region;
     };
  
     return utility::foldl(intersect, regions...);
 }

References ValidRegion::anchor, arm_compute::utility::foldl(), Dimensions< T >::num_dimensions(), Dimensions< T >::set(), TensorShape::set(), and ValidRegion::shape.

◆ is_data_type_float()

bool arm_compute::is_data_type_float ( DataType dt )

inline

Check if a given data type is of floating point type.

Parameters

[in] dt Input data type.

Returns: True if data type is of floating point type, else false.

Definition at line 304 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::F16:
         case DataType::F32:
             return true;
         default:
             return false;
     }
 }

References dt, F16, and F32.

Referenced by ClPool2dKernel::configure(), ClQuantizeKernel::configure(), ClScaleKernel::configure(), ClDirectConv2d::configure(), ClCastKernel::configure(), CLReductionOperationKernel::configure(), ClMulKernel::configure(), CLArgMinMaxLayerKernel::configure(), export_to_cl_image(), arm_compute::graph::detail::fuse_node_with_activation(), ClTemplateCast::get_component_code(), ClTemplateResize::get_component_code(), ClConv2d::get_convolution_method(), ClPool2dKernel::run_op(), ClComponentLogits1DNorm::validate(), ClSaturatedArithmeticKernel::validate(), and ClArithmeticKernel::validate().

◆ is_data_type_quantized()

bool arm_compute::is_data_type_quantized ( DataType dt )

inline

Check if a given data type is of quantized type.

Note: Quantized is considered a super-set of fixed-point and asymmetric data types.

Parameters

[in] dt Input data type.

Returns: True if data type is of quantized type, else false.

Definition at line 324 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::QSYMM8:
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
         case DataType::QSYMM16:
         case DataType::QASYMM16:
             return true;
         default:
             return false;
     }
 }

References dt, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, and QSYMM8_PER_CHANNEL.

◆ is_data_type_quantized_asymmetric()

bool arm_compute::is_data_type_quantized_asymmetric ( DataType dt )

inline

Check if a given data type is of asymmetric quantized type.

Parameters

[in] dt Input data type.

Returns: True if data type is of asymmetric quantized type, else false.

Definition at line 346 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QASYMM16:
             return true;
         default:
             return false;
     }
 }

References dt, QASYMM16, QASYMM8, and QASYMM8_SIGNED.

◆ is_data_type_quantized_asymmetric_signed()

bool arm_compute::is_data_type_quantized_asymmetric_signed ( DataType dt )

inline

Check if a given data type is of asymmetric quantized signed type.

Parameters

[in] dt Input data type.

Returns: True if data type is of asymmetric quantized signed type, else false.

Definition at line 365 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::QASYMM8_SIGNED:
             return true;
         default:
             return false;
     }
 }

References dt, and QASYMM8_SIGNED.

Referenced by CpuDirectConv2dOutputStageKernel::configure(), get_quantized_activation_min_max(), get_softmax_output_quantization_info(), and arm_compute::cpu::roi_align_1x1_qasymm8().

◆ is_data_type_quantized_per_channel()

bool arm_compute::is_data_type_quantized_per_channel ( DataType dt )

inline

Check if a given data type is of per channel type.

Parameters

[in] dt Input data type.

Returns: True if data type is of per channel type, else false.

Definition at line 401 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::QSYMM8_PER_CHANNEL:
             return true;
         default:
             return false;
     }
 }

References dt, and QSYMM8_PER_CHANNEL.

Referenced by CLTensorAllocator::allocate(), ClDequantizeKernel::configure(), CpuDepthwiseConv2dAssemblyWrapperKernel::configure(), CLDepthwiseConvolutionLayer::configure(), ClGemmLowpMatrixMultiplyCore::configure(), ClGemmConv2d::configure(), CpuGemmLowpMatrixMultiplyCore::configure(), arm_compute::test::validation::reference::dequantization_layer(), CLDeconvolutionLayer::get_deconvolution_method(), arm_compute::cpu::run_depthwise_quanitized8bit(), ClDequantizeKernel::run_op(), CpuDepthwiseConv2dAssemblyWrapperKernel::validate(), ClGemmLowpMatrixMultiplyCore::validate(), ClGemmConv2d::validate(), CLDepthwiseConvolutionLayer::validate(), CpuGemmLowpMatrixMultiplyCore::validate(), NEDeconvolutionLayer::validate(), and CpuGemmAssemblyDispatch::validate().

◆ is_data_type_quantized_symmetric()

bool arm_compute::is_data_type_quantized_symmetric ( DataType dt )

inline

Check if a given data type is of symmetric quantized type.

Parameters

[in] dt Input data type.

Returns: True if data type is of symmetric quantized type, else false.

Definition at line 382 of file DataTypeUtils.h.

 {
     switch (dt)
     {
         case DataType::QSYMM8:
         case DataType::QSYMM8_PER_CHANNEL:
         case DataType::QSYMM16:
             return true;
         default:
             return false;
     }
 }

References dt, QSYMM16, QSYMM8, and QSYMM8_PER_CHANNEL.

Referenced by ClGemmLowpMatrixMultiplyCore::configure(), and ClGemmLowpMatrixMultiplyCore::validate().

◆ is_fixed_format()

bool arm_compute::is_fixed_format ( const WeightFormat & wf )

inline

Definition at line 1660 of file Types.h.

 {
     return wf != WeightFormat::UNSPECIFIED && wf != WeightFormat::ANY;
 }

References ANY, and UNSPECIFIED.

Referenced by CpuGemmDirectConv2d::validate(), and CpuGemmConv2d::validate().

◆ is_fixed_format_fast_math()

bool arm_compute::is_fixed_format_fast_math ( const WeightFormat & wf )

inline

Definition at line 1664 of file Types.h.

 {
     return (static_cast<int>(wf) >> 4) & 0x1;
 }

Referenced by CpuFullyConnected::validate(), CpuGemm::validate(), and CpuGemmAssemblyDispatch::validate().

◆ is_pool_3d_region_entirely_outside_input()

bool is_pool_3d_region_entirely_outside_input ( const Pooling3dLayerInfo & info )

Check if the 3d pool region is entirely outside the input tensor.

Parameters

[in] info Pooling3dLayerInfo to be checked.

Returns: True if the pool region is entirely outside the input tensor, False otherwise.

Definition at line 141 of file Utils.cpp.

 {
     if (info.is_global_pooling || info.pool_size.x() == 0 || info.pool_size.y() == 0 || info.pool_size.z() == 0)
     {
         return false;
     }
     const auto ps                = info.padding;
     const auto pool_le_padding_x = info.pool_size.x() <= std::max({ps.left, ps.right});
     const auto pool_le_padding_y = info.pool_size.y() <= std::max({ps.top, ps.bottom});
     const auto pool_le_padding_z = info.pool_size.z() <= std::max({ps.front, ps.back});
     return pool_le_padding_x || pool_le_padding_y || pool_le_padding_z;
 }

References arm_compute::test::validation::info.

◆ is_pool_region_entirely_outside_input()

bool is_pool_region_entirely_outside_input ( const PoolingLayerInfo & info )

Check if the pool region is entirely outside the input tensor.

Parameters

[in] info PoolingLayerInfo to be checked.

Returns: True if the pool region is entirely outside the input tensor, False otherwise.

Definition at line 129 of file Utils.cpp.

 {
     if (info.is_global_pooling || info.exclude_padding || info.pool_size.x() == 0 || info.pool_size.y() == 0)
     {
         return false;
     }
     const auto ps                = info.pad_stride_info;
     const auto pool_le_padding_x = info.pool_size.x() <= std::max({ps.pad_left(), ps.pad_right()});
     const auto pool_le_padding_y = info.pool_size.y() <= std::max({ps.pad_top(), ps.pad_bottom()});
     return pool_le_padding_x || pool_le_padding_y;
 }

References arm_compute::test::validation::info.

Referenced by CpuPool2dAssemblyWrapperKernel::validate(), and ClComponentPool2d::validate().

◆ is_symmetric()

bool arm_compute::is_symmetric ( const Padding3D & info )

inline

Check if the 3D padding is symmetric i.e.

padding in each opposite sides are euqal (left=right, top=bottom and front=back)

Parameters

[in] info Padding3D input 3D padding object to check if it is symmetric

Returns: True if padding is symmetric

Definition at line 245 of file Utils.h.

 {
     return ((info.left == info.right) && (info.top == info.bottom) && (info.front == info.back));
 }

References arm_compute::test::validation::info.

◆ join()

std::string join	(	const std::vector< std::string >	strings,
		const std::string &	sep
	)

Join a sequence of strings with separator sep.

Parameters

[in]	strings	Strings to join
[in]	sep	Separator to join consecutive strings in the sequence

Returns: std::string

Definition at line 66 of file StringUtils.cpp.

 {
     if (strings.empty())
     {
         return "";
     }
     return std::accumulate(std::next(strings.begin()), strings.end(), strings.at(0),
                            [&sep](const std::string &a, const std::string &b) { return a + sep + b; });
 }

References arm_compute::test::validation::reference::accumulate(), and arm_compute::test::validation::b.

Referenced by GEMMParam::__str__(), NativeGEMMConfig::__str__(), ReshapedOnlyRHSGEMMConfig::__str__(), ReshapedGEMMConfig::__str__(), Measurement::__str__(), format_code::check_copyright(), format_code::check_license(), check_header_guards::find_expected_header_guard(), GpuCkwElementwiseBinary::get_name(), format_doxygen::process_comment(), format_code::run_fix_code_formatting(), and GEMMBenchmarkResultRecorder::summary().

◆ lower_string()

std::string lower_string ( const std::string & val )

Lower a given string.

Parameters

[in] val Given string to lower.

Returns: The lowered string

Definition at line 38 of file StringUtils.cpp.

 {
     std::string res = val;
     std::transform(res.begin(), res.end(), res.begin(), ::tolower);
     return res;
 }

References arm_compute::utility::tolower().

◆ manage_workspace() [1/2]

WorkspaceData<TensorType> arm_compute::manage_workspace	(	const experimental::MemoryRequirements &	mem_reqs,
		MemoryGroup &	mgroup,
		ITensorPack &	run_pack
	)

Definition at line 56 of file MemoryHelpers.h.

 {
     ITensorPack dummy_pack = ITensorPack();
     return manage_workspace<TensorType>(mem_reqs, mgroup, run_pack, dummy_pack);
 }

◆ manage_workspace() [2/2]

WorkspaceData<TensorType> arm_compute::manage_workspace	(	const experimental::MemoryRequirements &	mem_reqs,
		MemoryGroup &	mgroup,
		ITensorPack &	run_pack,
		ITensorPack &	prep_pack
	)

Definition at line 63 of file MemoryHelpers.h.

 {
     WorkspaceData<TensorType> workspace_memory;
     for (const auto &req : mem_reqs)
     {
         if (req.size == 0)
         {
             continue;
         }
  
         const auto aux_info = TensorInfo{TensorShape(req.size), 1, DataType::U8};
         workspace_memory.emplace_back(
             WorkspaceDataElement<TensorType>{req.slot, req.lifetime, std::make_unique<TensorType>()});
  
         auto aux_tensor = workspace_memory.back().tensor.get();
         ARM_COMPUTE_ERROR_ON_NULLPTR(aux_tensor);
         aux_tensor->allocator()->init(aux_info, req.alignment);
  
         if (req.lifetime == experimental::MemoryLifetime::Temporary)
         {
             mgroup.manage(aux_tensor);
         }
         else
         {
             prep_pack.add_tensor(req.slot, aux_tensor);
         }
         run_pack.add_tensor(req.slot, aux_tensor);
     }
  
     for (auto &mem : workspace_memory)
     {
         auto tensor = mem.tensor.get();
         tensor->allocator()->allocate();
     }
  
     return workspace_memory;
 }

References ITensorPack::add_tensor(), CLTensorAllocator::allocate(), CLTensor::allocator(), ARM_COMPUTE_ERROR_ON_NULLPTR, MemoryGroup::manage(), WorkspaceDataElement< TensorType >::slot, arm_compute::experimental::Temporary, tensor, and U8.

◆ multiply_by_quantized_multiplier_2row()

int32x4x2_t arm_compute::multiply_by_quantized_multiplier_2row	(	int32x4x2_t	input,
		int32_t	qmul,
		int32_t	shift
	)

inline

Multiply a neon vector using quantized multiplier and shift.

Parameters

[in]	input	Input vector to mutiply values to be quantized.
[in]	qmul	Quantized multipler
[in]	shift	Left bit shift

Returns: A neon vector holding the multiplied value

Definition at line 225 of file NESymm.h.

 {
     const auto left_shift  = shift > 0 ? shift : 0;
     const auto right_shift = shift > 0 ? 0 : -shift;
     const auto one_shifted = 1 << left_shift;
  
     int32x4x2_t result;
     result.val[0] = rounding_divide_by_pow2(vqrdmulhq_n_s32(vmulq_n_s32(input.val[0], one_shifted), qmul), right_shift);
     result.val[1] = rounding_divide_by_pow2(vqrdmulhq_n_s32(vmulq_n_s32(input.val[1], one_shifted), qmul), right_shift);
  
     return result;
 }

References arm_compute::test::validation::input, and rounding_divide_by_pow2().

◆ needs_serialized_reduction()

bool needs_serialized_reduction	(	ReductionOperation	op,
		DataType	dt,
		unsigned int	axis
	)

Check if the given reduction operation should be handled in a serial way.

Parameters

[in]	op	Reduction operation to perform
[in]	dt	Data type
[in]	axis	Axis along which to reduce

Returns: True if the given reduction operation should be handled in a serial way.

Definition at line 412 of file Utils.cpp.

 {
     const bool is_min_max        = (op == ReductionOperation::MAX || op == ReductionOperation::MIN);
     const bool is_quantized_type = is_data_type_quantized(dt);
     const bool is_first_dim      = (axis == 0);
  
     return !is_first_dim || (is_quantized_type && !is_min_max);
 }

References dt, is_data_type_quantized(), MAX, and MIN.

Referenced by CLReductionOperationKernel::configure(), and CLReductionOperationKernel::run().

◆ normalize_float()

void arm_compute::normalize_float	(	const Window &	window,
		const ITensor *	in,
		const ITensor *	in_squared,
		ITensor *	out,
		NormalizationLayerInfo	ninfo
	)

Function to perform normalization depending on the given template dimension.

The second template parameter specifies whether the normalization has to be 1D or 2D.

Note

Only supported normalizations are:

1D over X or Z
2D over X and Y

Parameters

[in]	window	Region on which to execute the kernel.
[in]	in	Source tensor. 3 lower dims represent a single input with dimensions [width, height, IFM], and an optional 4th dimension for batch of inputs. Data types supported: FP16/F32. Data layouts supported: NCHW/NHWC.
[in]	in_squared	Source with each element has been squared. 3 lower dims represent a single input with dimensions [width, height, IFM], Data type and layout supported: same as `input`.
[in]	out	Destination tensor. Output will have the same number of dimensions as input. Data type and layout supported: same as `input`.
[in]	ninfo	Normalization layer information like the normalization type, normalization size and other parameters.

SIMD vector tag type.

Definition at line 58 of file impl.h.

 {
     /** SIMD vector tag type. */
     using ExactTagType = typename wrapper::traits::neon_vector<T, S>::tag_type;
  
     Window win(window);
     win.set(Window::DimX, Window::Dimension(0, 1, 1));
  
     const auto window_start_x = static_cast<int>(window.x().start());
     const auto window_end_x   = static_cast<int>(window.x().end());
     const int  window_step_x  = S;
  
     Iterator input(in, win);
     Iterator input_squared(in_squared, win);
     Iterator output(out, win);
  
     const int dim_y                      = in->info()->data_layout() == DataLayout::NCHW ? 1 : 2;
     const int radius                     = ninfo.norm_size() / 2;
     const int input_squared_stride_x     = in_squared->info()->strides_in_bytes()[0];
     const int input_squared_stride_slice = in_squared->info()->strides_in_bytes()[dim];
     const int input_squared_stride_row   = in_squared->info()->strides_in_bytes()[dim_y];
  
     const int max_right  = in->info()->dimension(dim) - 1;
     const int max_bottom = in->info()->dimension(dim_y) - 1;
  
     const auto coeff_vec = wrapper::vdup_n(static_cast<T>(ninfo.scale_coeff()), ExactTagType{});
     const auto beta_vec  = wrapper::vdup_n(static_cast<T>(ninfo.beta()), ExactTagType{});
     const auto kappa_vec = wrapper::vdup_n(static_cast<T>(ninfo.kappa()), ExactTagType{});
  
     auto sequential_normalization = [&](const int x, const Coordinates &id, const int current_row, const int first_row,
                                         const int last_row, const T *input_ptr, const uint8_t *input_squared_start_ptr,
                                         T *output_ptr)
     {
         const int current_slice = dim == 0 ? x : id[dim];
         const int first_slice   = std::max(current_slice - radius, 0);
         const int last_slice    = std::min(current_slice + radius, max_right);
  
         const uint8_t *const input_squared_x_ptr = input_squared_start_ptr + x * input_squared_stride_x;
         // Accumulate 2D In-Map values
         auto accu = static_cast<T>(0.f);
         for (int j = first_row; j <= last_row; ++j)
         {
             // Compute row displacement
             const uint8_t *const input_squared_ptr = input_squared_x_ptr + (j - current_row) * input_squared_stride_row;
             for (int i = first_slice; i <= last_slice; ++i)
             {
                 accu +=
                     *reinterpret_cast<const T *>(input_squared_ptr + (i - current_slice) * input_squared_stride_slice);
             }
         }
  
         // Normalize
         const auto normalized =
             std::pow(accu * static_cast<T>(ninfo.scale_coeff()) + static_cast<T>(ninfo.kappa()), ninfo.beta());
         const auto normalized_pixel = (*(input_ptr + x)) / normalized;
         *(output_ptr + x)           = normalized_pixel;
     };
  
     execute_window_loop(
         win,
         [&](const Coordinates &id)
         {
             const auto input_ptr  = reinterpret_cast<const T *>(input.ptr());
             auto       output_ptr = reinterpret_cast<T *>(output.ptr());
  
             // Get range to normalize
             const int current_row = do_2D_norm ? id[dim_y] : 0;
             const int first_row   = do_2D_norm ? std::max(current_row - radius, 0) : 0;
             const int last_row    = do_2D_norm ? std::min(current_row + radius, max_bottom) : 0;
  
             int x = window_start_x;
             // Compute serially starting elements for the case x dimension is width
             for (; x < radius && x < window_end_x && dim == 0; ++x)
             {
                 sequential_normalization(x, id, current_row, first_row, last_row, input_ptr, input_squared.ptr(),
                                          output_ptr);
             }
  
             // Compute vectorized
             for (; x <= window_end_x - window_step_x - radius; x += window_step_x)
             {
                 const int current_slice = dim == 0 ? x : id[dim];
                 const int first_slice   = std::max(current_slice - radius, 0);
                 const int last_slice    = std::min(current_slice + radius, max_right);
  
                 const uint8_t *const input_squared_x_ptr = input_squared.ptr() + x * input_squared_stride_x;
                 // Accumulate 2D In-Map values
                 auto accu = wrapper::vdup_n(static_cast<T>(0.f), ExactTagType{});
                 for (int j = first_row; j <= last_row; ++j)
                 {
                     // Compute row displacement
                     const uint8_t *const input_squared_ptr =
                         input_squared_x_ptr + (j - current_row) * input_squared_stride_row;
                     for (int i = first_slice; i <= last_slice; ++i)
                     {
                         accu = wrapper::vadd(
                             accu, wrapper::vloadq(reinterpret_cast<const T *>(
                                       input_squared_ptr + (i - current_slice) * input_squared_stride_slice)));
                     }
                 }
  
                 // Normalize
                 const auto normalized       = wrapper::vpow(wrapper::vmla(kappa_vec, coeff_vec, accu), beta_vec);
                 const auto normalized_pixel = wrapper::vmul(wrapper::vloadq(input_ptr + x), wrapper::vinv(normalized));
                 wrapper::vstore(reinterpret_cast<T *>(output_ptr + x), normalized_pixel);
             }
  
             // Compute left-over elements
             for (; x < window_end_x; ++x)
             {
                 sequential_normalization(x, id, current_row, first_row, last_row, input_ptr, input_squared.ptr(),
                                          output_ptr);
             }
         },
         input, input_squared, output);
 }

◆ num_channels_from_format()

size_t arm_compute::num_channels_from_format ( Format format )

inline

Return the number of channels for a given single-planar pixel format.

Parameters

[in] format Input format

Returns: The number of channels for a given image format.

Definition at line 302 of file FormatUtils.h.

 {
     switch (format)
     {
         case Format::U8:
         case Format::U16:
         case Format::S16:
         case Format::U32:
         case Format::S32:
         case Format::BFLOAT16:
         case Format::F16:
         case Format::F32:
             return 1;
         // Because the U and V channels are subsampled
         // these formats appear like having only 2 channels:
         case Format::YUYV422:
         case Format::UYVY422:
             return 2;
         case Format::UV88:
             return 2;
         case Format::RGB888:
             return 3;
         case Format::RGBA8888:
             return 4;
         //Doesn't make sense for planar formats:
         case Format::NV12:
         case Format::NV21:
         case Format::IYUV:
         case Format::YUV444:
         default:
             return 0;
     }
 }

References BFLOAT16, F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U16, U32, U8, UV88, UYVY422, YUV444, and YUYV422.

Referenced by TensorInfo::init(), TensorInfo::init_auto_padding(), and TensorInfo::set_format().

◆ num_of_elements_in_range()

size_t arm_compute::num_of_elements_in_range	(	const float	start,
		const float	end,
		const float	step
	)

inline

Returns the number of elements required to go from start to end with the wanted step.

Parameters

[in]	start	start value
[in]	end	end value
[in]	step	step value between each number in the wanted sequence

Returns: number of elements to go from start value to end value using the wanted step

Definition at line 295 of file Utils.h.

 {
     ARM_COMPUTE_ERROR_ON_MSG(step == 0, "Range Step cannot be 0");
     return size_t(std::ceil((end - start) / step));
 }

References ARM_COMPUTE_ERROR_ON_MSG, arm_compute::mlgo::parser::end(), and arm_compute::cpu::step.

Referenced by NERangeKernel::configure().

◆ num_planes_from_format()

size_t arm_compute::num_planes_from_format ( Format format )

inline

Return the number of planes for a given format.

Parameters

[in] format Input format

Returns: The number of planes for a given image format.

Definition at line 267 of file FormatUtils.h.

 {
     switch (format)
     {
         case Format::U8:
         case Format::S16:
         case Format::U16:
         case Format::S32:
         case Format::U32:
         case Format::BFLOAT16:
         case Format::F16:
         case Format::F32:
         case Format::RGB888:
         case Format::RGBA8888:
         case Format::YUYV422:
         case Format::UYVY422:
             return 1;
         case Format::NV12:
         case Format::NV21:
             return 2;
         case Format::IYUV:
         case Format::YUV444:
             return 3;
         default:
             ARM_COMPUTE_ERROR("Not supported format");
             return 0;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U16, U32, U8, UYVY422, YUV444, and YUYV422.

◆ offset_int_vec()

int arm_compute::offset_int_vec ( int offset )

inline

Definition at line 38 of file MemoryHelpers.h.

 {
     return ACL_INT_VEC + offset;
 }

References ACL_INT_VEC, and offset().

Referenced by CpuAddMulAdd::configure(), ClSoftmax::configure(), CpuSoftmaxGeneric::configure(), CpuMatMul::configure(), CpuGemmDirectConv2d::configure(), CpuWinogradConv2d::configure(), CpuFullyConnected::configure(), ClFullyConnected::configure(), ClGemmLowpMatrixMultiplyCore::configure(), ClWinogradConv2d::configure(), CpuGemmConv2d::configure(), ClGemmConv2d::configure(), CpuGemmLowpMatrixMultiplyCore::configure(), CpuGemmDirectConv2d::prepare(), CpuWinogradConv2d::prepare(), ClFullyConnected::prepare(), ClGemmLowpMatrixMultiplyCore::prepare(), ClGemm::prepare(), ClWinogradConv2d::prepare(), CpuFullyConnected::prepare(), ClGemmConv2d::prepare(), CpuGemmLowpMatrixMultiplyCore::prepare(), CpuGemmConv2d::prepare(), ClSoftmax::run(), CpuAddMulAdd::run(), CpuSoftmaxGeneric::run(), CpuMatMul::run(), CpuWinogradConv2d::run(), ClIndirectConv2d::run(), ClFullyConnected::run(), ClGemmLowpMatrixMultiplyCore::run(), ClGemm::run(), ClWinogradConv2d::run(), CpuFullyConnected::run(), ClGemmConv2d::run(), CpuGemmLowpMatrixMultiplyCore::run(), CpuGemm::run(), and CpuGemmConv2d::run().

◆ opencl_is_available()

bool opencl_is_available ( )

Check if OpenCL is available.

Returns: True if OpenCL is available.

Definition at line 236 of file OpenCL.cpp.

 {
     CLSymbols::get().load_default();
  
     // Using static objects that rely on OpenCL in their constructor or
     // destructor is implementation defined according to the OpenCL API
     // Specification. These objects include CLScheduler.
     //
     // For compatibility with OpenCL runtimes that also use static objects to
     // hold their state, we call a harmless OpenCL function (clGetPlatformIDs
     // with invalid parameters must result in CL_INVALID_VALUE) to ensure the
     // runtimes have a chance to initialize their static objects first. Thanks
     // to C++11 rules about normal program completion (cf [basic.start]), this
     // ensures their static objects are destroyed last, i.e. after the
     // singleton CLScheduler is destroyed.
     //
     // When OpenCL is not available, this call results in CL_OUT_OF_RESOURCES,
     // which is equally harmless.
     (void)clGetPlatformIDs(0, nullptr, nullptr);
  
     return CLSymbols::get().clBuildProgram_ptr != nullptr;
 }

References CLSymbols::clBuildProgram_ptr, clGetPlatformIDs(), CLSymbols::get(), and CLSymbols::load_default().

Referenced by create_opencl_context_and_device(), CLScheduler::get(), CLDeviceBackend::is_backend_supported(), main(), Framework::run(), and arm_compute::test::sync_if_necessary().

◆ operator!=() [1/3]

bool arm_compute::operator!=	(	const Dimensions< T > &	lhs,
		const Dimensions< T > &	rhs
	)

inline

Check that given dimensions are not equal.

Parameters

[in]	lhs	Left-hand side Dimensions.
[in]	rhs	Right-hand side Dimensions.

Returns: True if the given dimensions are not equal.

Definition at line 287 of file Dimensions.h.

 {
     return !(lhs == rhs);
 }

◆ operator!=() [2/3]

bool arm_compute::operator!=	(	const QuantizationInfo &	lhs,
		const QuantizationInfo &	rhs
	)

inline

Check whether two quantization info are not equal.

Parameters

[in]	lhs	RHS quantization info.
[in]	rhs	LHS quantization info.

Returns: True if the given quantization info is the same.

Definition at line 173 of file QuantizationInfo.h.

 {
     return !(operator==(lhs, rhs));
 }

References operator==().

◆ operator!=() [3/3]

bool arm_compute::operator!=	(	const UniformQuantizationInfo &	lhs,
		const UniformQuantizationInfo &	rhs
	)

inline

Check whether two quantization info are not equal.

Parameters

[in]	lhs	RHS quantization info.
[in]	rhs	LHS quantization info.

Returns: True if the given quantization info is the same.

Definition at line 197 of file QuantizationInfo.h.

 {
     return !(operator==(lhs, rhs));
 }

References operator==().

◆ operator<<() [1/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ActivationLayerInfo *	info
	)

Formatted output of the activation function info.

Parameters

[out]	os	Output stream.
[in]	info	ActivationLayerInfo to output.

Returns: Formatted string.

Definition at line 543 of file TypePrinter.h.

 {
     if (info != nullptr)
     {
         if (info->enabled())
         {
             os << info->activation();
             os << "(";
             os << "VAL_A=" << info->a() << ",";
             os << "VAL_B=" << info->b();
             os << ")";
         }
         else
         {
             os << "disabled";
         }
     }
     else
     {
         os << "nullptr";
     }
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [2/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ActivationLayerInfo::ActivationFunction &	act_function
	)

Formatted output of the activation function type.

Parameters

[out]	os	Output stream.
[in]	act_function	Type to output.

Returns: Modified output stream.

Definition at line 460 of file TypePrinter.h.

 {
     switch (act_function)
     {
         case ActivationLayerInfo::ActivationFunction::ABS:
             os << "ABS";
             break;
         case ActivationLayerInfo::ActivationFunction::LINEAR:
             os << "LINEAR";
             break;
         case ActivationLayerInfo::ActivationFunction::LOGISTIC:
             os << "LOGISTIC";
             break;
         case ActivationLayerInfo::ActivationFunction::RELU:
             os << "RELU";
             break;
         case ActivationLayerInfo::ActivationFunction::BOUNDED_RELU:
             os << "BOUNDED_RELU";
             break;
         case ActivationLayerInfo::ActivationFunction::LEAKY_RELU:
             os << "LEAKY_RELU";
             break;
         case ActivationLayerInfo::ActivationFunction::SOFT_RELU:
             os << "SOFT_RELU";
             break;
         case ActivationLayerInfo::ActivationFunction::SQRT:
             os << "SQRT";
             break;
         case ActivationLayerInfo::ActivationFunction::LU_BOUNDED_RELU:
             os << "LU_BOUNDED_RELU";
             break;
         case ActivationLayerInfo::ActivationFunction::ELU:
             os << "ELU";
             break;
         case ActivationLayerInfo::ActivationFunction::SQUARE:
             os << "SQUARE";
             break;
         case ActivationLayerInfo::ActivationFunction::TANH:
             os << "TANH";
             break;
         case ActivationLayerInfo::ActivationFunction::IDENTITY:
             os << "IDENTITY";
             break;
         case ActivationLayerInfo::ActivationFunction::HARD_SWISH:
             os << "HARD_SWISH";
             break;
         case ActivationLayerInfo::ActivationFunction::SWISH:
             os << "SWISH";
             break;
         case ActivationLayerInfo::ActivationFunction::GELU:
             os << "GELU";
             break;
  
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR.

◆ operator<<() [3/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ArithmeticOperation &	op
	)

Formatted output of the ArithmeticOperation type.

Parameters

[out]	os	Output stream.
[in]	op	Operation to output.

Returns: Modified output stream.

Definition at line 1746 of file TypePrinter.h.

 {
     switch (op)
     {
         case ArithmeticOperation::ADD:
             os << "ADD";
             break;
         case ArithmeticOperation::SUB:
             os << "SUB";
             break;
         case ArithmeticOperation::DIV:
             os << "DIV";
             break;
         case ArithmeticOperation::MAX:
             os << "MAX";
             break;
         case ArithmeticOperation::MIN:
             os << "MIN";
             break;
         case ArithmeticOperation::SQUARED_DIFF:
             os << "SQUARED_DIFF";
             break;
         case ArithmeticOperation::POWER:
             os << "POWER";
             break;
         case ArithmeticOperation::PRELU:
             os << "PRELU";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ADD, ARM_COMPUTE_ERROR, DIV, MAX, MIN, POWER, PRELU, SQUARED_DIFF, and SUB.

◆ operator<<() [4/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const arm_compute::CpuMatMulSettings &	settings
	)

Formatted output of the arm_compute::CpuMatMulSettings type.

Parameters

[out]	os	Output stream.
[in]	settings	arm_compute::CpuMatMulSettings type to output.

Returns: Modified output stream.

Definition at line 3600 of file TypePrinter.h.

 {
     os << "CpuMatMulSettings="
        << "["
        << "fast_math=" << settings.fast_math() << "]";
  
     return os;
 }

References CpuMatMulSettings::fast_math().

◆ operator<<() [5/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const arm_compute::MatMulInfo &	matmul_info
	)

Formatted output of the arm_compute::MatMulInfo type.

Parameters

[out]	os	Output stream.
[in]	matmul_info	arm_compute::MatMulInfo type to output.

Returns: Modified output stream.

Definition at line 3539 of file TypePrinter.h.

 {
     os << "MatMulKernelInfo="
        << "["
        << "adj_lhs=" << matmul_info.adj_lhs() << ", "
        << "adj_rhs=" << matmul_info.adj_rhs() << "] ";
     return os;
 }

References MatMulInfo::adj_lhs(), and MatMulInfo::adj_rhs().

◆ operator<<() [6/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const arm_compute::MatMulKernelInfo &	matmul_info
	)

Formatted output of the arm_compute::MatMulKernelInfo type.

Parameters

[out]	os	Output stream.
[in]	matmul_info	arm_compute::MatMulKernelInfo type to output.

Returns: Modified output stream.

Definition at line 3567 of file TypePrinter.h.

 {
     os << "MatMulKernelInfo="
        << "["
        << "adj_lhs=" << matmul_info.adj_lhs << ", "
        << "adj_rhs=" << matmul_info.adj_rhs << ", "
        << "M0=" << matmul_info.m0 << ", "
        << "N0=" << matmul_info.n0 << ", "
        << "K0=" << matmul_info.k0 << ", "
        << "export_rhs_to_cl_image=" << matmul_info.export_rhs_to_cl_image << "]";
  
     return os;
 }

References MatMulKernelInfo::adj_lhs, MatMulKernelInfo::adj_rhs, MatMulKernelInfo::export_rhs_to_cl_image, MatMulKernelInfo::k0, MatMulKernelInfo::m0, and MatMulKernelInfo::n0.

◆ operator<<() [7/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const arm_compute::WeightFormat &	wf
	)

Formatted output of the arm_compute::WeightFormat type.

Parameters

[out]	os	Output stream.
[in]	wf	WeightFormat to output.

Returns: Modified output stream.

Definition at line 3241 of file TypePrinter.h.

 {
     os << to_string(wf);
     return os;
 }

References to_string().

◆ operator<<() [8/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const BorderMode &	mode
	)

Formatted output of the BorderMode type.

Parameters

[out]	os	Output stream.
[in]	mode	Type to output.

Returns: Modified output stream.

Definition at line 1014 of file TypePrinter.h.

 {
     switch (mode)
     {
         case BorderMode::UNDEFINED:
             os << "UNDEFINED";
             break;
         case BorderMode::CONSTANT:
             os << "CONSTANT";
             break;
         case BorderMode::REPLICATE:
             os << "REPLICATE";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, CONSTANT, clang_tidy_rules::mode, REPLICATE, and UNDEFINED.

◆ operator<<() [9/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const BorderSize &	border
	)

Formatted output of the BorderSize type.

Parameters

[out]	os	Output stream.
[in]	border	Type to output.

Returns: Modified output stream.

Definition at line 1041 of file TypePrinter.h.

 {
     os << border.top << "," << border.right << "," << border.bottom << "," << border.left;
  
     return os;
 }

References BorderSize::bottom, BorderSize::left, BorderSize::right, and BorderSize::top.

◆ operator<<() [10/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const BoundingBoxTransformInfo &	bbox_info
	)

Formatted output of the BoundingBoxTransformInfo type.

Parameters

[out]	os	Output stream.
[in]	bbox_info	Type to output.

Returns: Modified output stream.

Definition at line 345 of file TypePrinter.h.

 {
     auto weights = bbox_info.weights();
     os << "(" << bbox_info.img_width() << "x" << bbox_info.img_height() << ")~" << bbox_info.scale() << "(weights={"
        << weights[0] << ", " << weights[1] << ", " << weights[2] << ", " << weights[3] << "})";
     return os;
 }

References BoundingBoxTransformInfo::img_height(), BoundingBoxTransformInfo::img_width(), BoundingBoxTransformInfo::scale(), and BoundingBoxTransformInfo::weights().

◆ operator<<() [11/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const BoxNMSLimitInfo &	info
	)

Formatted output of the BoxNMSLimitInfo type.

Parameters

[out]	os	Output stream.
[in]	info	BoxNMSLimitInfo to output.

Returns: Modified output stream.

Definition at line 3102 of file TypePrinter.h.

 {
     os << "{score_thresh=" << info.score_thresh() << ", "
        << "nms=" << info.nms() << ", "
        << "detections_per_im=" << info.detections_per_im() << ", "
        << "soft_nms_enabled=" << info.soft_nms_enabled() << ", "
        << "soft_nms_min_score_thres=" << info.soft_nms_min_score_thres() << ", "
        << "suppress_size=" << info.suppress_size() << ", "
        << "min_size=" << info.min_size() << ", "
        << "im_width=" << info.im_width() << ", "
        << "im_height=" << info.im_height() << "}";
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [12/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Channel &	channel
	)

Formatted output of the Channel type.

Parameters

[out]	os	Output stream.
[in]	channel	Type to output.

Returns: Modified output stream.

Definition at line 947 of file TypePrinter.h.

 {
     switch (channel)
     {
         case Channel::UNKNOWN:
             os << "UNKNOWN";
             break;
         case Channel::C0:
             os << "C0";
             break;
         case Channel::C1:
             os << "C1";
             break;
         case Channel::C2:
             os << "C2";
             break;
         case Channel::C3:
             os << "C3";
             break;
         case Channel::R:
             os << "R";
             break;
         case Channel::G:
             os << "G";
             break;
         case Channel::B:
             os << "B";
             break;
         case Channel::A:
             os << "A";
             break;
         case Channel::Y:
             os << "Y";
             break;
         case Channel::U:
             os << "U";
             break;
         case Channel::V:
             os << "V";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References A, ARM_COMPUTE_ERROR, B, C0, C1, C2, C3, G, R, U, UNKNOWN, V, and Y.

◆ operator<<() [13/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ClComponentElementwiseBinary::Attributes::ElementwiseOp &	op
	)

Formatted output of the pute::experimental::dynamic_fusion::ClComponentElementwiseBinary::Attributes::ElementwiseOp type.

Parameters

[out]	os	Output stream.
[in]	op	arm_compute::experimental::dynamic_fusion::ClComponentElementwiseBinary::Attributes::ElementwiseOp type to output.

Returns: Modified output stream.

Definition at line 47 of file ElementwiseBinary.h.

 {
     const std::map<ClComponentElementwiseBinary::Attributes::ElementwiseOp, std::string> op_name = {
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Add, "add"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Div, "div"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Max, "max"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Min, "min"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Mul, "mul"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Power, "power"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Prelu, "prelu"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::SquaredDiff, "squareddiff"},
         {ClComponentElementwiseBinary::Attributes::ElementwiseOp::Sub, "sub"}};
     os << op_name.at(op);
     return os;
 }

References ElementwiseBinaryCommonAttributes::Add, ElementwiseBinaryCommonAttributes::Div, ElementwiseBinaryCommonAttributes::Max, ElementwiseBinaryCommonAttributes::Min, ElementwiseBinaryCommonAttributes::Mul, ElementwiseBinaryCommonAttributes::Power, ElementwiseBinaryCommonAttributes::Prelu, ElementwiseBinaryCommonAttributes::SquaredDiff, and ElementwiseBinaryCommonAttributes::Sub.

◆ operator<<() [14/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const CLTunerMode &	val
	)

[Print CLTunerMode type]

Formatted output of the CLTunerMode type.

Parameters

[out]	os	Output stream.
[in]	val	CLTunerMode to output.

Returns: Modified output stream.

Definition at line 2577 of file TypePrinter.h.

 {
     os << to_string(val);
     return os;
 }

References to_string().

◆ operator<<() [15/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ComparisonOperation &	op
	)

Formatted output of the Comparison Operations.

Parameters

[out]	os	Output stream.
[in]	op	Type to output.

Returns: Modified output stream.

Definition at line 1856 of file TypePrinter.h.

 {
     switch (op)
     {
         case ComparisonOperation::Equal:
             os << "Equal";
             break;
         case ComparisonOperation::NotEqual:
             os << "NotEqual";
             break;
         case ComparisonOperation::Greater:
             os << "Greater";
             break;
         case ComparisonOperation::GreaterEqual:
             os << "GreaterEqual";
             break;
         case ComparisonOperation::Less:
             os << "Less";
             break;
         case ComparisonOperation::LessEqual:
             os << "LessEqual";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, Equal, Greater, GreaterEqual, Less, LessEqual, and NotEqual.

◆ operator<<() [16/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ComputeAnchorsInfo &	anchors_info
	)

Formatted output of the ComputeAnchorsInfo type.

Parameters

[out]	os	Output stream.
[in]	anchors_info	Type to output.

Returns: Modified output stream.

Definition at line 380 of file TypePrinter.h.

 {
     os << "(" << anchors_info.feat_width() << "x" << anchors_info.feat_height() << ")~" << anchors_info.spatial_scale();
     return os;
 }

References ComputeAnchorsInfo::feat_height(), ComputeAnchorsInfo::feat_width(), and ComputeAnchorsInfo::spatial_scale().

◆ operator<<() [17/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Conv2dInfo &	conv_info
	)

Formatted output of the Conv2dInfo type.

Parameters

[out]	os	Output stream.
[in]	conv_info	Conv2dInfo to output.

Returns: Modified output stream.

Definition at line 2728 of file TypePrinter.h.

 {
     os << "{conv_info=" << conv_info.conv_info << ", "
        << "dilation=" << conv_info.dilation << ", "
        << "act_info=" << to_string(conv_info.act_info) << ", "
        << "enable_fast_math=" << conv_info.enable_fast_math << ", "
        << "num_groups=" << conv_info.num_groups << ","
        << "}";
     return os;
 }

References arm_compute::test::validation::conv_info, and to_string().

◆ operator<<() [18/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Conv3dInfo &	conv3d_info
	)

Formatted output of the Conv3dInfo type.

Parameters

[out]	os	Output stream.
[in]	conv3d_info	Type to output.

Returns: Modified output stream.

Definition at line 3149 of file TypePrinter.h.

 {
     os << conv3d_info.stride;
     os << ";";
     os << conv3d_info.padding;
     os << ";";
     os << to_string(conv3d_info.act_info);
     os << ";";
     os << conv3d_info.dilation;
     os << ";";
     os << conv3d_info.round_type;
     os << ";";
     os << conv3d_info.enable_fast_math;
  
     return os;
 }

References Conv3dInfo::act_info, Conv3dInfo::dilation, Conv3dInfo::enable_fast_math, Conv3dInfo::padding, Conv3dInfo::round_type, Conv3dInfo::stride, and to_string().

◆ operator<<() [19/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ConvertPolicy &	policy
	)

Formatted output of the ConvertPolicy type.

Parameters

[out]	os	Output stream.
[in]	policy	Type to output.

Returns: Modified output stream.

Definition at line 1715 of file TypePrinter.h.

 {
     switch (policy)
     {
         case ConvertPolicy::WRAP:
             os << "WRAP";
             break;
         case ConvertPolicy::SATURATE:
             os << "SATURATE";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, SATURATE, and WRAP.

◆ operator<<() [20/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ConvolutionInfo &	conv_info
	)

Formatted output of the ConvolutionInfo type.

Parameters

[out]	os	Output stream.
[in]	conv_info	ConvolutionInfo to output.

Returns: Modified output stream.

Definition at line 2590 of file TypePrinter.h.

 {
     os << "{PadStrideInfo=" << conv_info.pad_stride_info << ", "
        << "depth_multiplier=" << conv_info.depth_multiplier << ", "
        << "act_info=" << to_string(conv_info.act_info) << ", "
        << "dilation=" << conv_info.dilation << "}";
     return os;
 }

References arm_compute::test::validation::conv_info, and to_string().

◆ operator<<() [21/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ConvolutionMethod &	conv_method
	)

Formatted output of the ConvolutionMethod type.

Parameters

[out]	os	Output stream
[in]	conv_method	Type to output

Returns: Modified output stream.

Definition at line 2168 of file TypePrinter.h.

 {
     switch (conv_method)
     {
         case ConvolutionMethod::GEMM:
             os << "GEMM";
             break;
         case ConvolutionMethod::DIRECT:
             os << "DIRECT";
             break;
         case ConvolutionMethod::WINOGRAD:
             os << "WINOGRAD";
             break;
         case ConvolutionMethod::FFT:
             os << "FFT";
             break;
         case ConvolutionMethod::GEMM_CONV2D:
             os << "GEMM_CONV2D";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, DIRECT, FFT, GEMM, GEMM_CONV2D, and WINOGRAD.

◆ operator<<() [22/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Coordinates2D &	coord_2d
	)

Formatted output of the Coordinates2D type.

Parameters

[out]	os	Output stream.
[in]	coord_2d	Coordinates2D to output.

Returns: Modified output stream.

Definition at line 2908 of file TypePrinter.h.

 {
     os << "{x=" << coord_2d.x << ", "
        << "y=" << coord_2d.y << "}";
     return os;
 }

References Coordinates2D::x, and Coordinates2D::y.

◆ operator<<() [23/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DataLayout &	data_layout
	)

[Print DataLayout type]

Formatted output of the DataLayout type.

Parameters

[out]	os	Output stream.
[in]	data_layout	Type to output.

Returns: Modified output stream.

Definition at line 695 of file TypePrinter.h.

 {
     switch (data_layout)
     {
         case DataLayout::UNKNOWN:
             os << "UNKNOWN";
             break;
         case DataLayout::NHWC:
             os << "NHWC";
             break;
         case DataLayout::NCHW:
             os << "NCHW";
             break;
         case DataLayout::NDHWC:
             os << "NDHWC";
             break;
         case DataLayout::NCDHW:
             os << "NCDHW";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, arm_compute::cpu::data_layout, NCDHW, NCHW, NDHWC, NHWC, and UNKNOWN.

◆ operator<<() [24/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DataLayoutDimension &	data_layout_dim
	)

[Print DataLayout type]

Formatted output of the DataLayoutDimension type.

Parameters

[out]	os	Output stream.
[in]	data_layout_dim	Data layout dimension to print.

Returns: Modified output stream.

Definition at line 742 of file TypePrinter.h.

 {
     switch (data_layout_dim)
     {
         case DataLayoutDimension::WIDTH:
             os << "WIDTH";
             break;
         case DataLayoutDimension::HEIGHT:
             os << "HEIGHT";
             break;
         case DataLayoutDimension::CHANNEL:
             os << "CHANNEL";
             break;
         case DataLayoutDimension::DEPTH:
             os << "DEPTH";
             break;
         case DataLayoutDimension::BATCHES:
             os << "BATCHES";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
     return os;
 }

References ARM_COMPUTE_ERROR, BATCHES, CHANNEL, DEPTH, HEIGHT, and WIDTH.

◆ operator<<() [25/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DataType &	data_type
	)

Formatted output of the DataType type.

Parameters

[out]	os	Output stream.
[in]	data_type	Type to output.

Returns: Modified output stream.

Definition at line 774 of file TypePrinter.h.

 {
     switch (data_type)
     {
         case DataType::UNKNOWN:
             os << "UNKNOWN";
             break;
         case DataType::U8:
             os << "U8";
             break;
         case DataType::QSYMM8:
             os << "QSYMM8";
             break;
         case DataType::QASYMM8:
             os << "QASYMM8";
             break;
         case DataType::QASYMM8_SIGNED:
             os << "QASYMM8_SIGNED";
             break;
         case DataType::QSYMM8_PER_CHANNEL:
             os << "QSYMM8_PER_CHANNEL";
             break;
         case DataType::S8:
             os << "S8";
             break;
         case DataType::U16:
             os << "U16";
             break;
         case DataType::S16:
             os << "S16";
             break;
         case DataType::QSYMM16:
             os << "QSYMM16";
             break;
         case DataType::QASYMM16:
             os << "QASYMM16";
             break;
         case DataType::U32:
             os << "U32";
             break;
         case DataType::S32:
             os << "S32";
             break;
         case DataType::U64:
             os << "U64";
             break;
         case DataType::S64:
             os << "S64";
             break;
         case DataType::BFLOAT16:
             os << "BFLOAT16";
             break;
         case DataType::F16:
             os << "F16";
             break;
         case DataType::F32:
             os << "F32";
             break;
         case DataType::F64:
             os << "F64";
             break;
         case DataType::SIZET:
             os << "SIZET";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, BFLOAT16, arm_compute::test::validation::data_type, F16, F32, F64, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, SIZET, U16, U32, U64, U8, and UNKNOWN.

◆ operator<<() [26/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DetectionOutputLayerCodeType &	detection_code
	)

Formatted output of the DetectionOutputLayerCodeType type.

Parameters

[out]	os	Output stream
[in]	detection_code	Type to output

Returns: Modified output stream.

Definition at line 2357 of file TypePrinter.h.

 {
     switch (detection_code)
     {
         case DetectionOutputLayerCodeType::CENTER_SIZE:
             os << "CENTER_SIZE";
             break;
         case DetectionOutputLayerCodeType::CORNER:
             os << "CORNER";
             break;
         case DetectionOutputLayerCodeType::CORNER_SIZE:
             os << "CORNER_SIZE";
             break;
         case DetectionOutputLayerCodeType::TF_CENTER:
             os << "TF_CENTER";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, CENTER_SIZE, CORNER, CORNER_SIZE, and TF_CENTER.

◆ operator<<() [27/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DetectionOutputLayerInfo &	detection_info
	)

Formatted output of the DetectionOutputLayerInfo type.

Parameters

[out]	os	Output stream
[in]	detection_info	Type to output

Returns: Modified output stream.

Definition at line 2399 of file TypePrinter.h.

 {
     os << "{Classes=" << detection_info.num_classes() << ","
        << "ShareLocation=" << detection_info.share_location() << ","
        << "CodeType=" << detection_info.code_type() << ","
        << "VarianceEncodedInTarget=" << detection_info.variance_encoded_in_target() << ","
        << "KeepTopK=" << detection_info.keep_top_k() << ","
        << "NMSThreshold=" << detection_info.nms_threshold() << ","
        << "Eta=" << detection_info.eta() << ","
        << "BackgroundLabelId=" << detection_info.background_label_id() << ","
        << "ConfidenceThreshold=" << detection_info.confidence_threshold() << ","
        << "TopK=" << detection_info.top_k() << ","
        << "NumLocClasses=" << detection_info.num_loc_classes() << "}";
  
     return os;
 }

References DetectionOutputLayerInfo::background_label_id(), DetectionOutputLayerInfo::code_type(), DetectionOutputLayerInfo::confidence_threshold(), DetectionOutputLayerInfo::eta(), DetectionOutputLayerInfo::keep_top_k(), DetectionOutputLayerInfo::nms_threshold(), DetectionOutputLayerInfo::num_classes(), DetectionOutputLayerInfo::num_loc_classes(), DetectionOutputLayerInfo::share_location(), DetectionOutputLayerInfo::top_k(), and DetectionOutputLayerInfo::variance_encoded_in_target().

◆ operator<<() [28/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DetectionPostProcessLayerInfo &	detection_info
	)

Formatted output of the DetectionPostProcessLayerInfo type.

Parameters

[out]	os	Output stream
[in]	detection_info	Type to output

Returns: Modified output stream.

Definition at line 2435 of file TypePrinter.h.

 {
     os << "{MaxDetections=" << detection_info.max_detections() << ","
        << "MaxClassesPerDetection=" << detection_info.max_classes_per_detection() << ","
        << "NmsScoreThreshold=" << detection_info.nms_score_threshold() << ","
        << "NmsIouThreshold=" << detection_info.iou_threshold() << ","
        << "NumClasses=" << detection_info.num_classes() << ","
        << "ScaleValue_y=" << detection_info.scale_value_y() << ","
        << "ScaleValue_x=" << detection_info.scale_value_x() << ","
        << "ScaleValue_h=" << detection_info.scale_value_h() << ","
        << "ScaleValue_w=" << detection_info.scale_value_w() << ","
        << "UseRegularNms=" << detection_info.use_regular_nms() << ","
        << "DetectionPerClass=" << detection_info.detection_per_class() << "}";
  
     return os;
 }

◆ operator<<() [29/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DetectionWindow &	detection_window
	)

Formatted output of the DetectionWindow type.

Parameters

[out]	os	Output stream
[in]	detection_window	Type to output

Returns: Modified output stream.

Definition at line 2338 of file TypePrinter.h.

 {
     os << "{x=" << detection_window.x << ","
        << "y=" << detection_window.y << ","
        << "width=" << detection_window.width << ","
        << "height=" << detection_window.height << ","
        << "idx_class=" << detection_window.idx_class << ","
        << "score=" << detection_window.score << "}";
  
     return os;
 }

References DetectionWindow::height, DetectionWindow::idx_class, DetectionWindow::score, DetectionWindow::width, DetectionWindow::x, and DetectionWindow::y.

◆ operator<<() [30/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const DimensionRoundingType &	rounding_type
	)

Formatted output of the DimensionRoundingType type.

Parameters

[out]	os	Output stream.
[in]	rounding_type	DimensionRoundingType Dimension rounding type when down-scaling, or compute output shape of pooling(2D or 3D).

Returns: Modified output stream.

Definition at line 2062 of file TypePrinter.h.

 {
     switch (rounding_type)
     {
         case DimensionRoundingType::CEIL:
             os << "CEIL";
             break;
         case DimensionRoundingType::FLOOR:
             os << "FLOOR";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
     return os;
 }

References ARM_COMPUTE_ERROR, CEIL, and FLOOR.

◆ operator<<() [31/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Dimensions< T > &	dimensions
	)

Formatted output of the Dimensions type.

Parameters

[out]	os	Output stream.
[in]	dimensions	Type to output.

Returns: Modified output stream.

Definition at line 162 of file TypePrinter.h.

 {
     if (dimensions.num_dimensions() > 0)
     {
         os << dimensions[0];
  
         for (unsigned int d = 1; d < dimensions.num_dimensions(); ++d)
         {
             os << "," << dimensions[d];
         }
     }
  
     return os;
 }

References Dimensions< T >::num_dimensions().

◆ operator<<() [32/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ElementWiseUnary &	op
	)

Formatted output of the Elementwise unary Operations.

Parameters

[out]	os	Output stream.
[in]	op	Type to output.

Returns: Modified output stream.

Definition at line 1892 of file TypePrinter.h.

 {
     switch (op)
     {
         case ElementWiseUnary::RSQRT:
             os << "RSQRT";
             break;
         case ElementWiseUnary::EXP:
             os << "EXP";
             break;
         case ElementWiseUnary::NEG:
             os << "NEG";
             break;
         case ElementWiseUnary::LOG:
             os << "LOG";
             break;
         case ElementWiseUnary::SIN:
             os << "SIN";
             break;
         case ElementWiseUnary::ABS:
             os << "ABS";
             break;
         case ElementWiseUnary::ROUND:
             os << "ROUND";
             break;
         case ElementWiseUnary::LOGICAL_NOT:
             os << "LOGICAL_NOT";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ABS, ARM_COMPUTE_ERROR, EXP, LOG, LOGICAL_NOT, NEG, ROUND, RSQRT, and SIN.

◆ operator<<() [33/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::CastAttributes &	cast_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::CastAttributes type.

Parameters

[out]	os	Output stream.
[in]	cast_attr	arm_compute::experimental::dynamic_fusion::CastAttributes type to output.

Returns: Modified output stream.

Definition at line 3388 of file TypePrinter.h.

 {
     os << "CastAttributes="
        << "["
        << "Data Type=" << cast_attr.data_type() << ", "
        << "Convert Policy=" << cast_attr.convert_policy() << "]";
  
     return os;
 }

References CastAttributes::convert_policy(), and CastAttributes::data_type().

◆ operator<<() [34/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::ClampAttributes &	clamp_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::ClampAttributes type.

Parameters

[out]	os	Output stream.
[in]	clamp_attr	arm_compute::experimental::dynamic_fusion::ClampAttributes type to output.

Returns: Modified output stream.

Definition at line 3450 of file TypePrinter.h.

 {
     os << "ClampAttributes="
        << "["
        << "Min value=" << clamp_attr.min_val() << ", "
        << "Max value=" << clamp_attr.max_val() << "]";
     return os;
 }

References ClampAttributes::max_val(), and ClampAttributes::min_val().

◆ operator<<() [35/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::Conv2dAttributes &	conv2d_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::Conv2dAttributes type.

Parameters

[out]	os	Output stream.
[in]	conv2d_attr	arm_compute::experimental::dynamic_fusion::Conv2dAttributes type to output.

Returns: Modified output stream.

Definition at line 3357 of file TypePrinter.h.

 {
     os << "Conv2dAttributes="
        << "["
        << "Padding=" << conv2d_attr.pad() << ", "
        << "Size2D=" << conv2d_attr.stride() << ", "
        << "Dialation=" << conv2d_attr.dilation() << "]";
  
     return os;
 }

References Conv2dAttributes::dilation(), Conv2dAttributes::pad(), and Conv2dAttributes::stride().

◆ operator<<() [36/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::DepthwiseConv2dAttributes &	dw_conv2d_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::DepthwiseConv2dAttributes type.

Parameters

[out]	os	Output stream.
[in]	dw_conv2d_attr	arm_compute::experimental::dynamic_fusion::DepthwiseConv2dAttributes type to output.

Returns: Modified output stream.

Definition at line 3417 of file TypePrinter.h.

 {
     os << "DepthwiseConv2dAttributes="
        << "["
        << "Padding=" << dw_conv2d_attr.pad() << ", "
        << "Size2D=" << dw_conv2d_attr.stride() << ", "
        << "Depth Multiplier=" << dw_conv2d_attr.depth_multiplier() << ", "
        << "Dilation=" << dw_conv2d_attr.dilation() << ","
        << "DimensionRoundingType: " << dw_conv2d_attr.dimension_rounding_type() << "]";
  
     return os;
 }

References DepthwiseConv2dAttributes::depth_multiplier(), DepthwiseConv2dAttributes::dilation(), DepthwiseConv2dAttributes::dimension_rounding_type(), DepthwiseConv2dAttributes::pad(), and DepthwiseConv2dAttributes::stride().

◆ operator<<() [37/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::GpuPool2dSettings &	settings
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::GpuPool2dSettings type.

Parameters

[out]	os	Output stream
[in]	settings	arm_compute::dynamic_fusion::GpuPool2dSettings type to output

Definition at line 3329 of file TypePrinter.h.

 {
     os << "Settings="
        << "["
        << "UseInfAsLimit=" << settings.use_inf_as_limit() << "]";
     return os;
 }

References GpuPool2dSettings::use_inf_as_limit().

◆ operator<<() [38/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::Pool2dAttributes &	pool2d_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::Pool2dAttributes type.

Parameters

[out]	os	Output stream.
[in]	pool2d_attr	arm_compute::experimental::dynamic_fusion::Pool2dAttributes type to output.

Returns: Modified output stream.

Definition at line 3298 of file TypePrinter.h.

 {
     os << "Pool2dAttributes="
        << "["
        << "PoolingType=" << pool2d_attr.pool_type() << ","
        << "PoolSize=" << pool2d_attr.pool_size() << ","
        << "Padding=" << pool2d_attr.pad() << ","
        << "Stride=" << pool2d_attr.stride() << ","
        << "ExcludePadding" << pool2d_attr.exclude_padding() << "]";
  
     return os;
 }

References Pool2dAttributes::exclude_padding(), Pool2dAttributes::pad(), Pool2dAttributes::pool_size(), Pool2dAttributes::pool_type(), and Pool2dAttributes::stride().

◆ operator<<() [39/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::ResizeAttributes &	resize_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::ResizeAttributes type.

Parameters

[out]	os	Output stream.
[in]	resize_attr	arm_compute::experimental::dynamic_fusion::ResizeAttributes type to output.

Returns: Modified output stream.

Definition at line 3478 of file TypePrinter.h.

 {
     os << "ResizeAttributes="
        << "["
        << "AlignCorners=" << resize_attr.align_corners() << ", "
        << "InterpolationPolicy=" << resize_attr.interpolation_policy() << ", "
        << "OutputHeight=" << resize_attr.output_height() << ", "
        << "OutputWidth=" << resize_attr.output_width() << ", "
        << "SamplingPolicy=" << resize_attr.sampling_policy() << "]";
     return os;
 }

References ResizeAttributes::align_corners(), ResizeAttributes::interpolation_policy(), ResizeAttributes::output_height(), ResizeAttributes::output_width(), and ResizeAttributes::sampling_policy().

◆ operator<<() [40/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const experimental::dynamic_fusion::SoftmaxAttributes &	softmax_attr
	)

Formatted output of the arm_compute::experimental::dynamic_fusion::SoftmaxAttributes type.

Parameters

[out]	os	Output stream.
[in]	softmax_attr	arm_compute::experimental::dynamic_fusion::SoftmaxAttributes type to output.

Returns: Modified output stream.

Definition at line 3510 of file TypePrinter.h.

 {
     os << "SoftmaxAttributes="
        << "["
        << "Beta=" << softmax_attr.beta() << ", "
        << "Is Log Softmax=" << softmax_attr.is_log_softmax() << ", "
        << "Axis=" << softmax_attr.axis() << "]";
     return os;
 }

References SoftmaxAttributes::axis(), SoftmaxAttributes::beta(), and SoftmaxAttributes::is_log_softmax().

◆ operator<<() [41/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const FFT1DInfo &	fft1d_info
	)

Formatted output of the FFT1DInfo type.

Parameters

[out]	os	Output stream.
[in]	fft1d_info	FFT1DInfo to output.

Returns: Modified output stream.

Definition at line 2853 of file TypePrinter.h.

 {
     os << "{axis=" << fft1d_info.axis << ", "
        << "direction=" << fft1d_info.direction << "}";
     return os;
 }

References FFT1DInfo::axis, and FFT1DInfo::direction.

◆ operator<<() [42/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const FFT2DInfo &	fft2d_info
	)

Formatted output of the FFT2DInfo type.

Parameters

[out]	os	Output stream.
[in]	fft2d_info	FFT2DInfo to output.

Returns: Modified output stream.

Definition at line 2880 of file TypePrinter.h.

 {
     os << "{axis=" << fft2d_info.axis0 << ", "
        << "axis=" << fft2d_info.axis1 << ", "
        << "direction=" << fft2d_info.direction << "}";
     return os;
 }

References FFT2DInfo::axis0, FFT2DInfo::axis1, and FFT2DInfo::direction.

◆ operator<<() [43/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const FFTDirection &	fft_dir
	)

Formatted output of the FFTDirection type.

Parameters

[out]	os	Output stream.
[in]	fft_dir	FFTDirection to output.

Returns: Modified output stream.

Definition at line 2817 of file TypePrinter.h.

 {
     switch (fft_dir)
     {
         case FFTDirection::Forward:
             os << "Forward";
             break;
         case FFTDirection::Inverse:
             os << "Inverse";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
     return os;
 }

References ARM_COMPUTE_ERROR, Forward, and Inverse.

◆ operator<<() [44/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Format &	format
	)

Formatted output of the Format type.

Parameters

[out]	os	Output stream.
[in]	format	Type to output.

Returns: Modified output stream.

Definition at line 865 of file TypePrinter.h.

 {
     switch (format)
     {
         case Format::UNKNOWN:
             os << "UNKNOWN";
             break;
         case Format::U8:
             os << "U8";
             break;
         case Format::S16:
             os << "S16";
             break;
         case Format::U16:
             os << "U16";
             break;
         case Format::S32:
             os << "S32";
             break;
         case Format::U32:
             os << "U32";
             break;
         case Format::F16:
             os << "F16";
             break;
         case Format::F32:
             os << "F32";
             break;
         case Format::UV88:
             os << "UV88";
             break;
         case Format::RGB888:
             os << "RGB888";
             break;
         case Format::RGBA8888:
             os << "RGBA8888";
             break;
         case Format::YUV444:
             os << "YUV444";
             break;
         case Format::YUYV422:
             os << "YUYV422";
             break;
         case Format::NV12:
             os << "NV12";
             break;
         case Format::NV21:
             os << "NV21";
             break;
         case Format::IYUV:
             os << "IYUV";
             break;
         case Format::UYVY422:
             os << "UYVY422";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U16, U32, U8, UNKNOWN, UV88, UYVY422, YUV444, and YUYV422.

◆ operator<<() [45/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const FullyConnectedLayerInfo &	layer_info
	)

Formatted output of the FullyConnectedLayerInfo type.

Parameters

[out]	os	Output stream.
[in]	layer_info	FullyConnectedLayerInfo to output.

Returns: Modified output stream.

Definition at line 2619 of file TypePrinter.h.

 {
     os << "{activation_info=" << to_string(layer_info.activation_info) << ", "
        << "weights_trained_layout=" << layer_info.weights_trained_layout << ", "
        << "transpose_weights=" << layer_info.transpose_weights << ", "
        << "are_weights_reshaped=" << layer_info.are_weights_reshaped << ", "
        << "retain_internal_weights=" << layer_info.retain_internal_weights << ", "
        << "fp_mixed_precision=" << layer_info.fp_mixed_precision << "}";
     return os;
 }

References FullyConnectedLayerInfo::activation_info, FullyConnectedLayerInfo::are_weights_reshaped, FullyConnectedLayerInfo::fp_mixed_precision, FullyConnectedLayerInfo::retain_internal_weights, to_string(), FullyConnectedLayerInfo::transpose_weights, and FullyConnectedLayerInfo::weights_trained_layout.

◆ operator<<() [46/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const FuseBatchNormalizationType &	fuse_type
	)

Formatted output of the FuseBatchNormalizationType type.

Parameters

[out]	os	Output stream.
[in]	fuse_type	FuseBatchNormalizationType to output.

Returns: Modified output stream.

Definition at line 2935 of file TypePrinter.h.

 {
     switch (fuse_type)
     {
         case FuseBatchNormalizationType::CONVOLUTION:
             os << "CONVOLUTION";
             break;
         case FuseBatchNormalizationType::DEPTHWISECONVOLUTION:
             os << "DEPTHWISECONVOLUTION";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
     return os;
 }

References ARM_COMPUTE_ERROR, CONVOLUTION, and DEPTHWISECONVOLUTION.

◆ operator<<() [47/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMInfo &	info
	)

Formatted output of the GEMMInfo type.

Parameters

[out]	os	Output stream.
[in]	info	Type to output.

Returns: Modified output stream.

Definition at line 1411 of file TypePrinter.h.

 {
     os << "{is_a_reshaped=" << info.is_a_reshaped() << ",";
     os << "is_b_reshaped=" << info.is_b_reshaped() << ",";
     os << "reshape_b_only_on_first_run=" << info.reshape_b_only_on_first_run() << ",";
     os << "depth_output_gemm3d=" << info.depth_output_gemm3d() << ",";
     os << "reinterpret_input_as_3d=" << info.reinterpret_input_as_3d() << ",";
     os << "retain_internal_weights=" << info.retain_internal_weights() << ",";
     os << "fp_mixed_precision=" << info.fp_mixed_precision() << ",";
     os << "broadcast_bias=" << info.broadcast_bias() << ",";
     os << "pretranspose_B=" << info.pretranspose_B() << ",";
     os << "}";
  
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [48/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMKernelInfo &	gemm_info
	)

Formatted output of the GEMMKernelInfo type.

Parameters

[out]	os	Output stream.
[in]	gemm_info	Type to output.

Returns: Modified output stream.

Definition at line 253 of file TypePrinter.h.

 {
     os << "( m=" << gemm_info.m;
     os << " n=" << gemm_info.n;
     os << " k=" << gemm_info.k;
     os << " depth_output_gemm3d=" << gemm_info.depth_output_gemm3d;
     os << " reinterpret_input_as_3d=" << gemm_info.reinterpret_input_as_3d;
     os << " broadcast_bias=" << gemm_info.broadcast_bias;
     os << " fp_mixed_precision=" << gemm_info.fp_mixed_precision;
     os << " mult_transpose1xW_width=" << gemm_info.mult_transpose1xW_width;
     os << " mult_interleave4x4_height=" << gemm_info.mult_interleave4x4_height;
     os << " a_offset=" << gemm_info.a_offset;
     os << " b_offset=" << gemm_info.b_offset;
     os << ")";
     return os;
 }

References GEMMKernelInfo::a_offset, GEMMKernelInfo::b_offset, GEMMKernelInfo::broadcast_bias, GEMMKernelInfo::depth_output_gemm3d, GEMMKernelInfo::fp_mixed_precision, GEMMKernelInfo::k, GEMMKernelInfo::m, GEMMKernelInfo::mult_interleave4x4_height, GEMMKernelInfo::mult_transpose1xW_width, GEMMKernelInfo::n, and GEMMKernelInfo::reinterpret_input_as_3d.

◆ operator<<() [49/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMLHSMatrixInfo &	gemm_info
	)

Formatted output of the GEMMLHSMatrixInfo type.

Parameters

[out]	os	Output stream.
[in]	gemm_info	Type to output.

Returns: Modified output stream.

Definition at line 277 of file TypePrinter.h.

 {
     os << "( m0=" << (unsigned int)gemm_info.m0 << " k0=" << gemm_info.k0 << "  v0=" << gemm_info.v0
        << "  trans=" << gemm_info.transpose << "  inter=" << gemm_info.interleave << "})";
     return os;
 }

References GEMMLHSMatrixInfo::interleave, GEMMLHSMatrixInfo::k0, GEMMLHSMatrixInfo::m0, GEMMLHSMatrixInfo::transpose, and GEMMLHSMatrixInfo::v0.

◆ operator<<() [50/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMLowpOutputStageInfo &	gemm_info
	)

Formatted output of the GEMMLowpOutputStageInfo type.

Parameters

[out]	os	Output stream.
[in]	gemm_info	GEMMLowpOutputStageInfo to output.

Returns: Modified output stream.

Definition at line 2692 of file TypePrinter.h.

 {
     os << "{type=" << gemm_info.type << ", "
        << "gemlowp_offset=" << gemm_info.gemmlowp_offset << ", "
        << "gemmlowp_multiplier=" << gemm_info.gemmlowp_multiplier << ", "
        << "gemmlowp_shift=" << gemm_info.gemmlowp_shift << ", "
        << "gemmlowp_min_bound=" << gemm_info.gemmlowp_min_bound << ", "
        << "gemmlowp_max_bound=" << gemm_info.gemmlowp_max_bound << ", "
        << "gemmlowp_multipliers=" << gemm_info.gemmlowp_multiplier << ", "
        << "gemmlowp_shifts=" << gemm_info.gemmlowp_shift << ", "
        << "gemmlowp_real_multiplier=" << gemm_info.gemmlowp_real_multiplier << ", "
        << "is_quantized_per_channel=" << gemm_info.is_quantized_per_channel << ", "
        << "output_data_type=" << gemm_info.output_data_type << "}";
     return os;
 }

References GEMMLowpOutputStageInfo::gemmlowp_max_bound, GEMMLowpOutputStageInfo::gemmlowp_min_bound, GEMMLowpOutputStageInfo::gemmlowp_multiplier, GEMMLowpOutputStageInfo::gemmlowp_offset, GEMMLowpOutputStageInfo::gemmlowp_real_multiplier, GEMMLowpOutputStageInfo::gemmlowp_shift, GEMMLowpOutputStageInfo::is_quantized_per_channel, GEMMLowpOutputStageInfo::output_data_type, and GEMMLowpOutputStageInfo::type.

◆ operator<<() [51/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMLowpOutputStageType &	gemm_type
	)

Formatted output of the GEMMLowpOutputStageType type.

Parameters

[out]	os	Output stream.
[in]	gemm_type	GEMMLowpOutputStageType to output.

Returns: Modified output stream.

Definition at line 2650 of file TypePrinter.h.

 {
     switch (gemm_type)
     {
         case GEMMLowpOutputStageType::NONE:
             os << "NONE";
             break;
         case GEMMLowpOutputStageType::QUANTIZE_DOWN:
             os << "QUANTIZE_DOWN";
             break;
         case GEMMLowpOutputStageType::QUANTIZE_DOWN_FIXEDPOINT:
             os << "QUANTIZE_DOWN_FIXEDPOINT";
             break;
         case GEMMLowpOutputStageType::QUANTIZE_DOWN_FLOAT:
             os << "QUANTIZE_DOWN_FLOAT";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
     return os;
 }

References ARM_COMPUTE_ERROR, arm_compute::mlgo::parser::gemm_type(), NONE, QUANTIZE_DOWN, QUANTIZE_DOWN_FIXEDPOINT, and QUANTIZE_DOWN_FLOAT.

◆ operator<<() [52/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMReshapeInfo &	info
	)

Formatted output of the GEMMReshapeInfo type.

Parameters

[out]	os	Output stream.
[in]	info	Type to output.

Returns: Modified output stream.

Definition at line 1392 of file TypePrinter.h.

 {
     os << "{m=" << info.m() << ",";
     os << "n=" << info.n() << ",";
     os << "k=" << info.k() << ",";
     os << "mult_transpose1xW_width=" << info.mult_transpose1xW_width() << ",";
     os << "mult_interleave4x4_height=" << info.mult_interleave4x4_height();
     os << "}";
  
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [53/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GEMMRHSMatrixInfo &	gemm_info
	)

Formatted output of the GEMMRHSMatrixInfo type.

Parameters

[out]	os	Output stream.
[in]	gemm_info	Type to output.

Returns: Modified output stream.

Definition at line 291 of file TypePrinter.h.

 {
     os << "( n0=" << (unsigned int)gemm_info.n0 << " k0=" << gemm_info.k0 << "  h0=" << gemm_info.h0
        << "  trans=" << gemm_info.transpose << "  inter=" << gemm_info.interleave
        << " exp_img=" << gemm_info.export_to_cl_image << "})";
     return os;
 }

References GEMMRHSMatrixInfo::export_to_cl_image, GEMMRHSMatrixInfo::h0, GEMMRHSMatrixInfo::interleave, GEMMRHSMatrixInfo::k0, GEMMRHSMatrixInfo::n0, and GEMMRHSMatrixInfo::transpose.

◆ operator<<() [54/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GenerateProposalsInfo &	proposals_info
	)

Formatted output of the GenerateProposalsInfo type.

Parameters

[out]	os	Output stream.
[in]	proposals_info	Type to output.

Returns: Modified output stream.

Definition at line 406 of file TypePrinter.h.

 {
     os << "(" << proposals_info.im_width() << "x" << proposals_info.im_height() << ")~" << proposals_info.im_scale();
     return os;
 }

References GenerateProposalsInfo::im_height(), GenerateProposalsInfo::im_scale(), and GenerateProposalsInfo::im_width().

◆ operator<<() [55/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const GPUTarget &	gpu_target
	)

Formatted output of the GPUTarget type.

Parameters

[out]	os	Output stream
[in]	gpu_target	Type to output

Returns: Modified output stream.

Definition at line 2214 of file TypePrinter.h.

 {
     switch (gpu_target)
     {
         case GPUTarget::GPU_ARCH_MASK:
             os << "GPU_ARCH_MASK";
             break;
         case GPUTarget::GPU_GENERATION_MASK:
             os << "GPU_GENERATION_MASK";
             break;
         case GPUTarget::MIDGARD:
             os << "MIDGARD";
             break;
         case GPUTarget::BIFROST:
             os << "BIFROST";
             break;
         case GPUTarget::VALHALL:
             os << "VALHALL";
             break;
         case GPUTarget::FIFTHGEN:
             os << "FIFTHGEN";
             break;
         case GPUTarget::T600:
             os << "T600";
             break;
         case GPUTarget::T700:
             os << "T700";
             break;
         case GPUTarget::T800:
             os << "T800";
             break;
         case GPUTarget::G71:
             os << "G71";
             break;
         case GPUTarget::G72:
             os << "G72";
             break;
         case GPUTarget::G51:
             os << "G51";
             break;
         case GPUTarget::G51BIG:
             os << "G51BIG";
             break;
         case GPUTarget::G51LIT:
             os << "G51LIT";
             break;
         case GPUTarget::G31:
             os << "G31";
             break;
         case GPUTarget::G76:
             os << "G76";
             break;
         case GPUTarget::G52:
             os << "G52";
             break;
         case GPUTarget::G52LIT:
             os << "G52LIT";
             break;
         case GPUTarget::G77:
             os << "G77";
             break;
         case GPUTarget::G57:
             os << "G57";
             break;
         case GPUTarget::G78:
             os << "G78";
             break;
         case GPUTarget::G68:
             os << "G68";
             break;
         case GPUTarget::G78AE:
             os << "G78AE";
             break;
         case GPUTarget::G710:
             os << "G710";
             break;
         case GPUTarget::G610:
             os << "G610";
             break;
         case GPUTarget::G510:
             os << "G510";
             break;
         case GPUTarget::G310:
             os << "G310";
             break;
         case GPUTarget::G715:
             os << "G715";
             break;
         case GPUTarget::G615:
             os << "G615";
             break;
         case GPUTarget::G720:
             os << "G720";
             break;
         case GPUTarget::G620:
             os << "G620";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, BIFROST, FIFTHGEN, G31, G310, G51, G510, G51BIG, G51LIT, G52, G52LIT, G57, G610, G615, G620, G68, G71, G710, G715, G72, G720, G76, G77, G78, G78AE, GPU_ARCH_MASK, GPU_GENERATION_MASK, MIDGARD, T600, T700, T800, and VALHALL.

◆ operator<<() [56/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const InterpolationPolicy &	policy
	)

Formatted output of the InterpolationPolicy type.

Parameters

[out]	os	Output stream.
[in]	policy	Type to output.

Returns: Modified output stream.

Definition at line 1091 of file TypePrinter.h.

 {
     switch (policy)
     {
         case InterpolationPolicy::NEAREST_NEIGHBOR:
             os << "NEAREST_NEIGHBOR";
             break;
         case InterpolationPolicy::BILINEAR:
             os << "BILINEAR";
             break;
         case InterpolationPolicy::AREA:
             os << "AREA";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References AREA, ARM_COMPUTE_ERROR, BILINEAR, and NEAREST_NEIGHBOR.

◆ operator<<() [57/88]

::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const LSTMParams< T > &	lstm_params
	)

Formatted output of the ScaleKernelInfo type.

Parameters

[out]	os	Output stream.
[in]	lstm_params	LSTMParams to output.

Returns: Modified output stream.

Definition at line 3001 of file TypePrinter.h.

 {
     os << "{input_to_input_weights=" << to_string(lstm_params.input_to_input_weights()) << ", "
        << "recurrent_to_input_weights=" << to_string(lstm_params.recurrent_to_input_weights()) << ", "
        << "cell_to_input_weights=" << to_string(lstm_params.cell_to_input_weights()) << ", "
        << "input_gate_bias=" << to_string(lstm_params.input_gate_bias()) << ", "
        << "cell_to_forget_weights=" << to_string(lstm_params.cell_to_forget_weights()) << ", "
        << "cell_to_output_weights=" << to_string(lstm_params.cell_to_output_weights()) << ", "
        << "projection_weights=" << to_string(lstm_params.projection_weights()) << ", "
        << "projection_bias=" << to_string(lstm_params.projection_bias()) << ", "
        << "input_layer_norm_weights=" << to_string(lstm_params.input_layer_norm_weights()) << ", "
        << "forget_layer_norm_weights=" << to_string(lstm_params.forget_layer_norm_weights()) << ", "
        << "cell_layer_norm_weights=" << to_string(lstm_params.cell_layer_norm_weights()) << ", "
        << "output_layer_norm_weights=" << to_string(lstm_params.output_layer_norm_weights()) << ", "
        << "cell_clip=" << lstm_params.cell_clip() << ", "
        << "projection_clip=" << lstm_params.projection_clip() << ", "
        << "input_intermediate_scale=" << lstm_params.input_intermediate_scale() << ", "
        << "forget_intermediate_scale=" << lstm_params.forget_intermediate_scale() << ", "
        << "cell_intermediate_scale=" << lstm_params.cell_intermediate_scale() << ", "
        << "hidden_state_zero=" << lstm_params.hidden_state_zero() << ", "
        << "hidden_state_scale=" << lstm_params.hidden_state_scale() << ", "
        << "has_peephole_opt=" << lstm_params.has_peephole_opt() << ", "
        << "has_projection=" << lstm_params.has_projection() << ", "
        << "has_cifg_opt=" << lstm_params.has_cifg_opt() << ", "
        << "use_layer_norm=" << lstm_params.use_layer_norm() << "}";
     return os;
 }

◆ operator<<() [58/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Multiples &	multiples
	)

Formatted output of the Multiples type.

Parameters

[out]	os	Output stream.
[in]	multiples	Type to output.

Returns: Modified output stream.

Definition at line 1073 of file TypePrinter.h.

 {
     os << "(";
     for (size_t i = 0; i < multiples.size() - 1; i++)
     {
         os << multiples[i] << ", ";
     }
     os << multiples.back() << ")";
     return os;
 }

◆ operator<<() [59/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const NMSType &	nms_type
	)

Available non maxima suppression types.

Formatted output of the NMSType type.

Parameters

[out]	os	Output stream.
[in]	nms_type	NMSType to output.

Returns: Modified output stream.

Definition at line 3063 of file TypePrinter.h.

 {
     switch (nms_type)
     {
         case NMSType::LINEAR:
             os << "LINEAR";
             break;
         case NMSType::GAUSSIAN:
             os << "GAUSSIAN";
             break;
         case NMSType::ORIGINAL:
             os << "ORIGINAL";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
     return os;
 }

References ARM_COMPUTE_ERROR, GAUSSIAN, LINEAR, and ORIGINAL.

◆ operator<<() [60/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const NormalizationLayerInfo &	info
	)

Formatted output of NormalizationLayerInfo.

Parameters

[out]	os	Output stream.
[in]	info	Type to output.

Returns: Modified output stream.

Definition at line 627 of file TypePrinter.h.

 {
     os << info.type() << ":NormSize=" << info.norm_size();
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [61/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const NormType &	norm_type
	)

Formatted output of the NormType type.

Parameters

[out]	os	Output stream.
[in]	norm_type	Type to output.

Returns: Modified output stream.

Definition at line 587 of file TypePrinter.h.

 {
     switch (norm_type)
     {
         case NormType::CROSS_MAP:
             os << "CROSS_MAP";
             break;
         case NormType::IN_MAP_1D:
             os << "IN_MAP_1D";
             break;
         case NormType::IN_MAP_2D:
             os << "IN_MAP_2D";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, CROSS_MAP, IN_MAP_1D, and IN_MAP_2D.

◆ operator<<() [62/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Padding2D &	padding2d
	)

Formatted output of the Padding2D type.

Parameters

[out]	os	Output stream.
[in]	padding2d	Padding info for 2D dimension shape.

Returns: Modified output stream.

Definition at line 3272 of file TypePrinter.h.

 {
     os << padding2d.left << "," << padding2d.right << "," << padding2d.top << "," << padding2d.bottom;
     return os;
 }

References Padding2D::bottom, Padding2D::left, Padding2D::right, and Padding2D::top.

◆ operator<<() [63/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Padding3D &	padding3d
	)

Formatted output of the Padding3D type.

Parameters

[out]	os	Output stream.
[in]	padding3d	Padding info for 3D spatial dimension shape.

Returns: Modified output stream.

Definition at line 2035 of file TypePrinter.h.

 {
     os << padding3d.left << "," << padding3d.right << "," << padding3d.top << "," << padding3d.bottom << ","
        << padding3d.front << "," << padding3d.back;
     return os;
 }

References Padding3D::back, Padding3D::bottom, Padding3D::front, Padding3D::left, Padding3D::right, and Padding3D::top.

◆ operator<<() [64/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PaddingList &	padding
	)

Formatted output of the PaddingList type.

Parameters

[out]	os	Output stream.
[in]	padding	Type to output.

Returns: Modified output stream.

Definition at line 1055 of file TypePrinter.h.

 {
     os << "{";
     for (auto const &p : padding)
     {
         os << "{" << p.first << "," << p.second << "}";
     }
     os << "}";
     return os;
 }

◆ operator<<() [65/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PaddingMode &	mode
	)

Formatted output of the PaddingMode type.

Parameters

[out]	os	Output stream.
[in]	mode	Type to output.

Returns: Modified output stream.

Definition at line 1567 of file TypePrinter.h.

 {
     switch (mode)
     {
         case PaddingMode::CONSTANT:
             os << "CONSTANT";
             break;
         case PaddingMode::REFLECT:
             os << "REFLECT";
             break;
         case PaddingMode::SYMMETRIC:
             os << "SYMMETRIC";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, CONSTANT, clang_tidy_rules::mode, REFLECT, and SYMMETRIC.

◆ operator<<() [66/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PadStrideInfo &	pad_stride_info
	)

Formatted output of the PadStrideInfo type.

Parameters

[out]	os	Output stream.
[in]	pad_stride_info	Type to output.

Returns: Modified output stream.

Definition at line 1607 of file TypePrinter.h.

 {
     os << pad_stride_info.stride().first << "," << pad_stride_info.stride().second;
     os << ";";
     os << pad_stride_info.pad_left() << "," << pad_stride_info.pad_right() << "," << pad_stride_info.pad_top() << ","
        << pad_stride_info.pad_bottom();
  
     return os;
 }

References PadStrideInfo::pad_bottom(), PadStrideInfo::pad_left(), PadStrideInfo::pad_right(), PadStrideInfo::pad_top(), and PadStrideInfo::stride().

◆ operator<<() [67/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PixelValue &	pixel_value
	)

Formatted output of the PixelValue type.

Parameters

[out]	os	Output stream.
[in]	pixel_value	PixelValue to output.

Returns: Modified output stream.

Definition at line 2759 of file TypePrinter.h.

 {
     os << "{value.u64=" << pixel_value.get<uint64_t>() << "}";
     return os;
 }

References PixelValue::get().

◆ operator<<() [68/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Pooling3dLayerInfo &	info
	)

Formatted output of the Pooling 3d Layer Info.

Parameters

[out]	os	Output stream.
[in]	info	Pooling 3D layer info to print to output stream.

Returns: Modified output stream.

Definition at line 2085 of file TypePrinter.h.

 {
     os << "{Type=" << info.pool_type << ","
        << "IsGlobalPooling=" << info.is_global_pooling;
     if (!info.is_global_pooling)
     {
         os << ","
            << "PoolSize=" << info.pool_size << ", "
            << "Stride=" << info.stride << ", "
            << "Padding=" << info.padding << ", "
            << "Exclude Padding=" << info.exclude_padding << ", "
            << "fp_mixed_precision=" << info.fp_mixed_precision << ", "
            << "DimensionRoundingType=" << info.round_type;
     }
     os << "}";
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [69/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PoolingLayerInfo &	info
	)

Formatted output of PoolingLayerInfo.

Parameters

[out]	os	Output stream.
[in]	info	Type to output.

Returns: Modified output stream.

Definition at line 667 of file TypePrinter.h.

 {
     os << info.pool_type;
  
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [70/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PoolingType &	pool_type
	)

Formatted output of the PoolingType type.

Parameters

[out]	os	Output stream.
[in]	pool_type	Type to output.

Returns: Modified output stream.

Definition at line 640 of file TypePrinter.h.

 {
     switch (pool_type)
     {
         case PoolingType::AVG:
             os << "AVG";
             break;
         case PoolingType::MAX:
             os << "MAX";
             break;
         case PoolingType::L2:
             os << "L2";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, AVG, L2, and MAX.

◆ operator<<() [71/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const PriorBoxLayerInfo &	info
	)

Formatted output of PriorBoxLayerInfo.

Parameters

[out]	os	Output stream.
[in]	info	Type to output.

Returns: Modified output stream.

Definition at line 2485 of file TypePrinter.h.

 {
     os << "Clip:" << info.clip() << "Flip:" << info.flip() << "StepX:" << info.steps()[0] << "StepY:" << info.steps()[1]
        << "MinSizes:" << info.min_sizes() << "MaxSizes:" << info.max_sizes() << "ImgSizeX:" << info.img_size().x
        << "ImgSizeY:" << info.img_size().y << "Offset:" << info.offset() << "Variances:" << info.variances();
  
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [72/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const QuantizationInfo &	qinfo
	)

Formatted output of the QuantizationInfo type.

Parameters

[out]	os	Output stream.
[in]	qinfo	Type to output.

Returns: Modified output stream.

Definition at line 432 of file TypePrinter.h.

 {
     const UniformQuantizationInfo uqinfo = qinfo.uniform();
     os << "Scale:" << uqinfo.scale << "~";
     os << "Offset:" << uqinfo.offset;
     return os;
 }

References UniformQuantizationInfo::offset, arm_compute::test::validation::qinfo, UniformQuantizationInfo::scale, and QuantizationInfo::uniform().

◆ operator<<() [73/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Rectangle &	rect
	)

Formatted output of the Rectangle type.

Parameters

[out]	os	Output stream.
[in]	rect	Type to output.

Returns: Modified output stream.

Definition at line 1552 of file TypePrinter.h.

 {
     os << rect.width << "x" << rect.height;
     os << "+" << rect.x << "+" << rect.y;
  
     return os;
 }

References Rectangle::height, Rectangle::width, Rectangle::x, and Rectangle::y.

◆ operator<<() [74/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ReductionOperation &	op
	)

Formatted output of the Reduction Operations.

Parameters

[out]	os	Output stream.
[in]	op	Type to output.

Returns: Modified output stream.

Definition at line 1801 of file TypePrinter.h.

 {
     switch (op)
     {
         case ReductionOperation::SUM:
             os << "SUM";
             break;
         case ReductionOperation::SUM_SQUARE:
             os << "SUM_SQUARE";
             break;
         case ReductionOperation::MEAN_SUM:
             os << "MEAN_SUM";
             break;
         case ReductionOperation::ARG_IDX_MAX:
             os << "ARG_IDX_MAX";
             break;
         case ReductionOperation::ARG_IDX_MIN:
             os << "ARG_IDX_MIN";
             break;
         case ReductionOperation::PROD:
             os << "PROD";
             break;
         case ReductionOperation::MIN:
             os << "MIN";
             break;
         case ReductionOperation::MAX:
             os << "MAX";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARG_IDX_MAX, ARG_IDX_MIN, ARM_COMPUTE_ERROR, MAX, MEAN_SUM, MIN, PROD, SUM, and SUM_SQUARE.

◆ operator<<() [75/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ROIPoolingLayerInfo &	pool_info
	)

Formatted output of the ROIPoolingInfo type.

Parameters

[out]	os	Output stream.
[in]	pool_info	Type to output.

Returns: Modified output stream.

Definition at line 227 of file TypePrinter.h.

 {
     os << pool_info.pooled_width() << "x" << pool_info.pooled_height() << "~" << pool_info.spatial_scale();
     return os;
 }

References ROIPoolingLayerInfo::pooled_height(), ROIPoolingLayerInfo::pooled_width(), and ROIPoolingLayerInfo::spatial_scale().

◆ operator<<() [76/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const RoundingPolicy &	rounding_policy
	)

Formatted output of the RoundingPolicy type.

Parameters

[out]	os	Output stream.
[in]	rounding_policy	Type to output.

Returns: Modified output stream.

Definition at line 184 of file TypePrinter.h.

 {
     switch (rounding_policy)
     {
         case RoundingPolicy::TO_ZERO:
             os << "TO_ZERO";
             break;
         case RoundingPolicy::TO_NEAREST_UP:
             os << "TO_NEAREST_UP";
             break;
         case RoundingPolicy::TO_NEAREST_EVEN:
             os << "TO_NEAREST_EVEN";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, TO_NEAREST_EVEN, TO_NEAREST_UP, and TO_ZERO.

◆ operator<<() [77/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const SamplingPolicy &	policy
	)

Formatted output of the SamplingPolicy type.

Parameters

[out]	os	Output stream.
[in]	policy	Type to output.

Returns: Modified output stream.

Definition at line 1118 of file TypePrinter.h.

 {
     switch (policy)
     {
         case SamplingPolicy::CENTER:
             os << "CENTER";
             break;
         case SamplingPolicy::TOP_LEFT:
             os << "TOP_LEFT";
             break;
         default:
             ARM_COMPUTE_ERROR("NOT_SUPPORTED!");
     }
  
     return os;
 }

References ARM_COMPUTE_ERROR, CENTER, and TOP_LEFT.

◆ operator<<() [78/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const ScaleKernelInfo &	scale_info
	)

Formatted output of the ScaleKernelInfo type.

Parameters

[out]	os	Output stream.
[in]	scale_info	ScaleKernelInfo to output.

Returns: Modified output stream.

Definition at line 2785 of file TypePrinter.h.

 {
     os << "{interpolation_policy=" << scale_info.interpolation_policy << ", "
        << "BorderMode=" << scale_info.border_mode << ", "
        << "PixelValue=" << scale_info.constant_border_value << ", "
        << "SamplingPolicy=" << scale_info.sampling_policy << ", "
        << "use_padding=" << scale_info.use_padding << ", "
        << "align_corners=" << scale_info.align_corners << ", "
        << "data_layout=" << scale_info.data_layout << "}";
     return os;
 }

References ScaleKernelInfo::align_corners, ScaleKernelInfo::border_mode, ScaleKernelInfo::constant_border_value, ScaleKernelInfo::data_layout, ScaleKernelInfo::interpolation_policy, ScaleKernelInfo::sampling_policy, and ScaleKernelInfo::use_padding.

◆ operator<<() [79/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Size2D &	size
	)

Formatted output of the Size2D type.

Parameters

[out]	os	Output stream
[in]	size	Type to output

Returns: Modified output stream.

Definition at line 2141 of file TypePrinter.h.

 {
     os << size.width << "x" << size.height;
  
     return os;
 }

References Size2D::height, and Size2D::width.

◆ operator<<() [80/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Size3D &	size
	)

Formatted output of the Size3D type.

Parameters

[out]	os	Output stream
[in]	size	Type to output

Returns: Modified output stream.

Definition at line 2008 of file TypePrinter.h.

 {
     os << size.width << "x" << size.height << "x" << size.depth;
  
     return os;
 }

References Size3D::depth, Size3D::height, and Size3D::width.

◆ operator<<() [81/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const SoftmaxKernelInfo &	info
	)

Formatted output of the SoftmaxKernelInfo type.

Parameters

[out]	os	Output stream.
[in]	info	SoftmaxKernelInfo to output.

Returns: Modified output stream.

Definition at line 2971 of file TypePrinter.h.

 {
     os << "{beta=" << info.beta << ", "
        << "is_log=" << info.is_log << ", "
        << "input_data_type=" << info.input_data_type << ", "
        << "axis=" << info.axis << "}";
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [82/88]

::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const std::vector< T > &	args
	)

Formatted output of a vector of objects.

Note: : Using the overloaded to_string() instead of overloaded operator<<(), because to_string() functions are overloaded for all types, where two or more of them can use the same operator<<(), ITensor is an example.

Parameters

[out]	os	Output stream
[in]	args	Vector of objects to print

Returns: Modified output stream.

Definition at line 109 of file TypePrinter.h.

 {
     const size_t max_print_size = 5U;
  
     os << "[";
     bool   first = true;
     size_t i;
     for (i = 0; i < args.size(); ++i)
     {
         if (i == max_print_size)
         {
             break;
         }
         if (first)
         {
             first = false;
         }
         else
         {
             os << ", ";
         }
         os << to_string(args[i]);
     }
     if (i < args.size())
     {
         os << ", ...";
     }
     os << "]";
     return os;
 }

References GemmTuner::args, to_string(), and arm_compute::utils::cast::U.

◆ operator<<() [83/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const TensorInfo &	info
	)

Formatted output of the const TensorInfo& type.

Parameters

[out]	os	Output stream.
[in]	info	Type to output.

Returns: Modified output stream.

Definition at line 1172 of file TypePrinter.h.

 {
     os << &info;
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [84/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const WeightsInfo &	weights_info
	)

Formatted output of the WeightsInfo type.

Parameters

[out]	os	Output stream.
[in]	weights_info	Type to output.

Returns: Modified output stream.

Definition at line 211 of file TypePrinter.h.

 {
     os << weights_info.are_reshaped() << ";";
     os << weights_info.num_kernels() << ";" << weights_info.kernel_size().first << ","
        << weights_info.kernel_size().second;
  
     return os;
 }

References arm_compute::test::validation::weights_info.

◆ operator<<() [85/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Window &	win
	)

Formatted output of the Window type.

Parameters

[out]	os	Output stream.
[in]	win	Type to output.

Returns: Modified output stream.

Definition at line 1447 of file TypePrinter.h.

 {
     os << "{";
     for (unsigned int i = 0; i < Coordinates::num_max_dimensions; i++)
     {
         if (i > 0)
         {
             os << ", ";
         }
         os << win[i];
     }
     os << "}";
  
     return os;
 }

References Dimensions< int >::num_max_dimensions.

◆ operator<<() [86/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const Window::Dimension &	dim
	)

Formatted output of the Window::Dimension type.

Parameters

[out]	os	Output stream.
[in]	dim	Type to output.

Returns: Modified output stream.

Definition at line 1434 of file TypePrinter.h.

 {
     os << "{start=" << dim.start() << ", end=" << dim.end() << ", step=" << dim.step() << "}";
  
     return os;
 }

References Window::Dimension::end(), Window::Dimension::start(), and Window::Dimension::step().

◆ operator<<() [87/88]

inline ::std::ostream& arm_compute::operator<<	(	::std::ostream &	os,
		const WinogradInfo &	info
	)

Formatted output of the WinogradInfo type.

Definition at line 2495 of file TypePrinter.h.

 {
     os << "{OutputTileSize=" << info.output_tile_size << ","
        << "KernelSize=" << info.kernel_size << ","
        << "PadStride=" << info.convolution_info << ","
        << "OutputDataLayout=" << info.output_data_layout << "}";
  
     return os;
 }

References arm_compute::test::validation::info.

◆ operator<<() [88/88]

inline ::std::ostream& arm_compute::operator<<	(	std::ostream &	os,
		const ITensorInfo *	info
	)

Formatted output of the ITensorInfo type.

Parameters

[out]	os	Output stream.
[in]	info	Tensor information.

Returns: Modified output stream.

Definition at line 1142 of file TypePrinter.h.

 {
     const DataType   data_type   = info->data_type();
     const DataLayout data_layout = info->data_layout();
  
     os << "Shape=" << info->tensor_shape() << ","
        << "DataLayout=" << string_from_data_layout(data_layout) << ","
        << "DataType=" << string_from_data_type(data_type);
  
     if (is_data_type_quantized(data_type))
     {
         const QuantizationInfo qinfo   = info->quantization_info();
         const auto             scales  = qinfo.scale();
         const auto             offsets = qinfo.offset();
  
         os << ", QuantizationInfo={"
            << "scales.size=" << scales.size() << ", scale(s)=" << scales << ", ";
  
         os << "offsets.size=" << offsets.size() << ", offset(s)=" << offsets << "}";
     }
     return os;
 }

References arm_compute::cpu::data_layout, ScaleKernelInfo::data_layout, arm_compute::test::validation::data_type, arm_compute::test::validation::info, is_data_type_quantized(), QuantizationInfo::offset(), arm_compute::test::validation::qinfo, QuantizationInfo::scale(), string_from_data_layout(), and string_from_data_type().

◆ operator==() [1/6]

bool arm_compute::operator==	(	const Dimensions< T > &	lhs,
		const Dimensions< T > &	rhs
	)

inline

Check that given dimensions are equal.

Parameters

[in]	lhs	Left-hand side Dimensions.
[in]	rhs	Right-hand side Dimensions.

Returns: True if the given dimensions are equal.

Definition at line 275 of file Dimensions.h.

 {
     return ((lhs.num_dimensions() == rhs.num_dimensions()) && std::equal(lhs.cbegin(), lhs.cend(), rhs.cbegin()));
 }

References Dimensions< T >::cbegin(), Dimensions< T >::cend(), and Dimensions< T >::num_dimensions().

Referenced by operator!=().

◆ operator==() [2/6]

bool arm_compute::operator==	(	const QuantizationInfo &	lhs,
		const QuantizationInfo &	rhs
	)

inline

Check whether two quantization info are equal.

Parameters

[in]	lhs	RHS quantization info.
[in]	rhs	LHS quantization info.

Returns: True if the given quantization info is the same.

Definition at line 161 of file QuantizationInfo.h.

 {
     return (lhs.scale() == rhs.scale()) && (lhs.offset() == rhs.offset());
 }

References QuantizationInfo::offset(), and QuantizationInfo::scale().

◆ operator==() [3/6]

bool arm_compute::operator==	(	const TensorInfo &	lhs,
		const TensorInfo &	rhs
	)

inline

Check whether two tensor info are equal.

Parameters

[in]	lhs	LHS tensor info.
[in]	rhs	RHS tensor info.

Returns: True if the given tensor infos are the same.

Definition at line 355 of file TensorInfo.h.

 {
     return (lhs._total_size == rhs._total_size) &&
            (lhs._offset_first_element_in_bytes == rhs._offset_first_element_in_bytes) &&
            (lhs._strides_in_bytes == rhs._strides_in_bytes) && (lhs._num_channels == rhs._num_channels) &&
            (lhs._tensor_shape == rhs._tensor_shape) && (lhs._dims_state == rhs._dims_state) &&
            (lhs._data_type == rhs._data_type) && (lhs._format == rhs._format) &&
            (lhs._is_resizable == rhs._is_resizable) && (lhs._valid_region == rhs._valid_region) &&
            (lhs._padding == rhs._padding) && (lhs._quantization_info == rhs._quantization_info) &&
            (lhs._data_layout == rhs._data_layout) && (lhs._are_values_constant == rhs._are_values_constant) &&
            (lhs._id == rhs._id);
 }

◆ operator==() [4/6]

bool arm_compute::operator==	(	const UniformQuantizationInfo &	lhs,
		const UniformQuantizationInfo &	rhs
	)

inline

Check whether two quantization info are equal.

Parameters

[in]	lhs	RHS quantization info.
[in]	rhs	LHS quantization info.

Returns: True if the given quantization info is the same.

Definition at line 185 of file QuantizationInfo.h.

 {
     return (lhs.scale == rhs.scale) && (lhs.offset == rhs.offset);
 }

References UniformQuantizationInfo::offset, and UniformQuantizationInfo::scale.

◆ operator==() [5/6]

bool arm_compute::operator==	(	const ValidRegion &	lhs,
		const ValidRegion &	rhs
	)

inline

Parameters

[in]	lhs	LHS valid region
[in]	rhs	RHS valid region

Returns: True if the valid regions are the same.

Definition at line 225 of file Types.h.

 {
     return (lhs.anchor == rhs.anchor) && (lhs.shape == rhs.shape);
 }

References ValidRegion::anchor, and ValidRegion::shape.

◆ operator==() [6/6]

bool arm_compute::operator==	(	const Window &	lhs,
		const Window &	rhs
	)

inline

Parameters

[in]	lhs	LHS window
[in]	rhs	RHS window

Returns: True if the given windows are the same.

Definition at line 323 of file Window.inl.

 {
     return (lhs._dims == rhs._dims) && (lhs._is_broadcasted == rhs._is_broadcasted);
 }

◆ operator>>() [1/4]

inline ::std::istream& arm_compute::operator>>	(	::std::istream &	is,
		BorderMode &	mode
	)

Formatted input of the BorderMode type.

Parameters

[out]	is	Input stream.
[in]	mode	Border mode.

Returns: the modified input stream.

Definition at line 42 of file TypeReader.h.

 {
     std::string value;
  
     is >> value;
  
     std::transform(value.begin(), value.end(), value.begin(), [](unsigned char c)
     {
         return std::toupper(c);
     });
  
     if(value == "UNDEFINED")
     {
         mode = BorderMode::UNDEFINED;
     }
     else if(value == "CONSTANT")
     {
         mode = BorderMode::CONSTANT;
     }
     else if(value == "REPLICATE")
     {
         mode = BorderMode::REPLICATE;
     }
     else
     {
         throw std::invalid_argument("Unsupported value '" + value + "' for border mode");
     }
  
     return is;
 }

References CONSTANT, clang_tidy_rules::mode, REPLICATE, and UNDEFINED.

◆ operator>>() [2/4]

inline ::std::istream& arm_compute::operator>>	(	::std::istream &	stream,
		arm_compute::DataLayout &	data_layout
	)

Input Stream operator for DataLayout.

Parameters

[in]	stream	Stream to parse
[out]	data_layout	Output data layout

Returns: Updated stream

Definition at line 48 of file TypeLoader.h.

 {
     std::string value;
     stream >> value;
     data_layout = data_layout_from_name(value);
     return stream;
 }

References arm_compute::cpu::data_layout, and data_layout_from_name().

◆ operator>>() [3/4]

inline ::std::istream& arm_compute::operator>>	(	::std::istream &	stream,
		CLTunerMode &	tuner_mode
	)

Input Stream operator for CLTunerMode.

Parameters

[in]	stream	Stream to parse
[out]	tuner_mode	Output tuner mode

Returns: Updated stream

Definition at line 87 of file CLTunerTypes.h.

 {
     std::string value;
     stream >> value;
     tuner_mode = tuner_mode_from_name(value);
     return stream;
 }

References tuner_mode_from_name().

◆ operator>>() [4/4]

inline ::std::istream& arm_compute::operator>>	(	::std::istream &	stream,
		DataType &	data_type
	)

Input Stream operator for DataType.

Parameters

[in]	stream	Stream to parse
[out]	data_type	Output data type

Returns: Updated stream

Definition at line 290 of file DataTypeUtils.h.

 {
     std::string value;
     stream >> value;
     data_type = data_type_from_name(value);
     return stream;
 }

References arm_compute::test::validation::data_type, and data_type_from_name().

◆ permute() [1/2]

void arm_compute::permute	(	Dimensions< T > &	dimensions,
		const PermutationVector &	perm
	)

inline

Permutes given Dimensions according to a permutation vector.

Warning: Validity of permutation is not checked

Parameters

[in,out]	dimensions	Dimensions to permute
[in]	perm	Permutation vector

Definition at line 144 of file Helpers.h.

 {
     auto dimensions_copy = utility::make_array<Dimensions<T>::num_max_dimensions>(dimensions.begin(), dimensions.end());
     for (unsigned int i = 0; i < perm.num_dimensions(); ++i)
     {
         T dimension_val = (perm[i] < dimensions.num_dimensions()) ? dimensions_copy[perm[i]] : 0;
         dimensions.set(i, dimension_val);
     }
 }

References Dimensions< T >::begin(), Dimensions< T >::end(), Dimensions< T >::num_dimensions(), and Dimensions< T >::set().

Referenced by arm_compute::misc::shape_calculator::compute_permutation_output_shape(), PermuteLayerNode::configure_output(), NPYLoader::fill_tensor(), AssetsLibrary::fill_with_generator(), arm_compute::graph_utils::permute_shape(), arm_compute::test::validation::TEST_CASE(), CLDepthwiseConvolutionLayer::validate(), CLGEMMDeconvolutionLayer::validate(), arm_compute::test::validation::validate(), and arm_compute::test::validation::validate_wrap().

◆ permute() [2/2]

void arm_compute::permute	(	TensorShape &	shape,
		const PermutationVector &	perm
	)

inline

Permutes given TensorShape according to a permutation vector.

Warning: Validity of permutation is not checked

Parameters

[in,out]	shape	Shape to permute
[in]	perm	Permutation vector

Definition at line 161 of file Helpers.h.

 {
     TensorShape shape_copy = shape;
     for (unsigned int i = 0; i < perm.num_dimensions(); ++i)
     {
         size_t dimension_val = (perm[i] < shape.num_dimensions()) ? shape_copy[perm[i]] : 1;
         shape.set(i, dimension_val, false, false); // Avoid changes in _num_dimension
     }
 }

References Dimensions< T >::num_dimensions(), and arm_compute::test::validation::shape.

◆ permute_strides()

void arm_compute::permute_strides	(	Dimensions< T > &	dimensions,
		const PermutationVector &	perm
	)

inline

Permutes the given dimensions according the permutation vector.

Parameters

[in,out]	dimensions	Dimensions to be permuted.
[in]	perm	Vector describing the permutation.

Definition at line 69 of file Utils.h.

 {
     const auto old_dim = utility::make_array<Dimensions<T>::num_max_dimensions>(dimensions.begin(), dimensions.end());
     for (unsigned int i = 0; i < perm.num_dimensions(); ++i)
     {
         T dimension_val = old_dim[i];
         dimensions.set(perm[i], dimension_val);
     }
 }

References Dimensions< T >::begin(), Dimensions< T >::end(), Dimensions< T >::num_dimensions(), and Dimensions< T >::set().

◆ pixel_size_from_format()

size_t arm_compute::pixel_size_from_format ( Format format )

inline

The size in bytes of the pixel format.

Parameters

[in] format Input format

Returns: The size in bytes of the pixel format

Definition at line 38 of file FormatUtils.h.

 {
     switch (format)
     {
         case Format::U8:
             return 1;
         case Format::U16:
         case Format::S16:
         case Format::BFLOAT16:
         case Format::F16:
         case Format::UV88:
         case Format::YUYV422:
         case Format::UYVY422:
             return 2;
         case Format::RGB888:
             return 3;
         case Format::RGBA8888:
             return 4;
         case Format::U32:
         case Format::S32:
         case Format::F32:
             return 4;
         //Doesn't make sense for planar formats:
         case Format::NV12:
         case Format::NV21:
         case Format::IYUV:
         case Format::YUV444:
         default:
             ARM_COMPUTE_ERROR("Undefined pixel size for given format");
             return 0;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U16, U32, U8, UV88, UYVY422, YUV444, and YUYV422.

◆ plane_idx_from_channel()

int arm_compute::plane_idx_from_channel	(	Format	format,
		Channel	channel
	)

inline

Return the plane index of a given channel given an input format.

Parameters

[in]	format	Input format
[in]	channel	Input channel

Returns: The plane index of the specific channel of the specific format

Definition at line 78 of file FormatUtils.h.

 {
     switch (format)
     {
         // Single planar formats have a single plane
         case Format::U8:
         case Format::U16:
         case Format::S16:
         case Format::U32:
         case Format::S32:
         case Format::BFLOAT16:
         case Format::F16:
         case Format::F32:
         case Format::UV88:
         case Format::RGB888:
         case Format::RGBA8888:
         case Format::YUYV422:
         case Format::UYVY422:
             return 0;
         // Multi planar formats
         case Format::NV12:
         case Format::NV21:
         {
             // Channel U and V share the same plane of format UV88
             switch (channel)
             {
                 case Channel::Y:
                     return 0;
                 case Channel::U:
                 case Channel::V:
                     return 1;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         case Format::IYUV:
         case Format::YUV444:
         {
             switch (channel)
             {
                 case Channel::Y:
                     return 0;
                 case Channel::U:
                     return 1;
                 case Channel::V:
                     return 2;
                 default:
                     ARM_COMPUTE_ERROR("Not supported channel");
                     return 0;
             }
         }
         default:
             ARM_COMPUTE_ERROR("Not supported format");
             return 0;
     }
 }

References ARM_COMPUTE_ERROR, BFLOAT16, F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U, U16, U32, U8, UV88, UYVY422, V, Y, YUV444, and YUYV422.

◆ preferred_dummy_work_items_support()

bool preferred_dummy_work_items_support ( const cl::Device & device )

Helper function to check if "dummy work-items" are preferred to have a power of two NDRange In case dummy work-items is enabled, it is OpenCL kernel responsibility to check if the work-item is out-of range or not.

Parameters

[in] device A CL device

Returns: True if dummy work-items should be preferred to dispatch the NDRange

Definition at line 370 of file CLHelpers.cpp.

 {
     ARM_COMPUTE_UNUSED(device);
     // TODO (COMPMID-2044)
     return true;
 }

References ARM_COMPUTE_UNUSED.

Referenced by ClGemmLowpMatrixMultiplyNativeKernel::configure(), ClGemmMatrixMultiplyNativeKernel::configure(), ClGemmLowpMatrixMultiplyReshapedKernel::configure(), ClGemmLowpMatrixMultiplyReshapedOnlyRhsKernel::configure(), ClGemmMatrixMultiplyReshapedOnlyRhsKernel::configure(), and ClGemmMatrixMultiplyReshapedKernel::configure().

◆ preferred_vector_width()

size_t preferred_vector_width	(	const cl::Device &	device,
		DataType	dt
	)

Helper function to get the preferred native vector width size for built-in scalar types that can be put into vectors.

Parameters

[in]	device	A CL device
[in]	dt	data type

Returns: preferred vector width

Definition at line 340 of file CLHelpers.cpp.

 {
     switch (dt)
     {
         case DataType::U8:
         case DataType::S8:
         case DataType::QASYMM8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8:
         case DataType::QSYMM8_PER_CHANNEL:
             return device.getInfo<CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR>();
         case DataType::U16:
         case DataType::S16:
         case DataType::QSYMM16:
         case DataType::QASYMM16:
             return device.getInfo<CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT>();
         case DataType::U32:
         case DataType::S32:
             return device.getInfo<CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT>();
         case DataType::F16:
         case DataType::F32:
             return device.getInfo<CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT>();
         case DataType::U64:
         case DataType::S64:
             return device.getInfo<CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG>();
         default:
             return 1;
     }
 }

References dt, F16, F32, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, U16, U32, U64, and U8.

◆ quantize_qasymm16() [1/2]

uint16_t arm_compute::quantize_qasymm16	(	float	value,
		const QuantizationInfo &	qinfo
	)

inline

Quantize a value given a 16-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing

Returns: Quantized value

Definition at line 522 of file QuantizationInfo.h.

 {
     return quantize_qasymm16(value, qinfo.uniform());
 }

References arm_compute::test::validation::qinfo, quantize_qasymm16(), and QuantizationInfo::uniform().

◆ quantize_qasymm16() [2/2]

uint16_t arm_compute::quantize_qasymm16	(	float	value,
		const UniformQuantizationInfo &	qinfo,
		RoundingPolicy	rounding_policy = `RoundingPolicy::TO_NEAREST_UP`
	)

inline

Quantize a value given a 16-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing
[in]	rounding_policy	(Optional) Rounding policy to use. Default: nearest up

Returns: Quantized value

Definition at line 494 of file QuantizationInfo.h.

 {
     int quantized = arm_compute::round(value / qinfo.scale, rounding_policy) + qinfo.offset;
     quantized     = arm_compute::utility::clamp<int, uint16_t>(quantized);
     return quantized;
 }

References QuantizationInfo::offset(), arm_compute::test::validation::qinfo, round(), and QuantizationInfo::scale().

Referenced by arm_compute::cpu::bounding_box_transform_qsymm16(), PixelValue::PixelValue(), arm_compute::test::validation::reference::quantization_layer(), and quantize_qasymm16().

◆ quantize_qasymm8()

uint8_t arm_compute::quantize_qasymm8	(	float	value,
		const INFO_TYPE &	qinfo,
		RoundingPolicy	rounding_policy = `RoundingPolicy::TO_NEAREST_UP`
	)

inline

Quantize a value given an unsigned 8-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing
[in]	rounding_policy	(Optional) Rounding policy to use. Default: nearest up

Returns: Quantized value

Definition at line 295 of file QuantizationInfo.h.

 {
     return Qasymm8QuantizationHelper<uint8_t>::quantize(value, qinfo, rounding_policy);
 }

References arm_compute::test::validation::qinfo, and Qasymm8QuantizationHelper< QUANTIZED_TYPE >::quantize().

Referenced by ClActivationKernel::configure(), arm_compute::test::validation::convert_to_asymmetric(), arm_compute::scale_helpers::delta_bilinear_c1_quantized(), arm_compute::test::validation::reference::depthconcatenate_layer(), arm_compute::cpu::elementwise_arithm_op_quantized_scalar(), arm_compute::cpu::elementwise_op< uint8_t >(), VerifyAccessor< D >::fill_tensor(), get_quantized_activation_min_max(), arm_compute::quantization::get_quantized_asymmetric_output_min_max(), arm_compute::test::validation::get_quantized_bounds(), arm_compute::cpu::neon_qasymm8_activation(), PixelValue::PixelValue(), arm_compute::test::validation::reference::quantization_layer(), quantize_values(), arm_compute::cpu::roi_align_1x1_qasymm8(), CpuConcatenateHeightKernel::run_op(), CpuConcatenateWidthKernel::run_op(), and arm_compute::cpu::sve2_qasymm8_activation().

◆ quantize_qasymm8_signed()

int8_t arm_compute::quantize_qasymm8_signed	(	float	value,
		const INFO_TYPE &	qinfo,
		RoundingPolicy	rounding_policy = `RoundingPolicy::TO_NEAREST_UP`
	)

inline

Quantize a value given a signed 8-bit asymmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing
[in]	rounding_policy	(Optional) Rounding policy to use. Default: nearest up

Returns: Quantized value

Definition at line 309 of file QuantizationInfo.h.

 {
     return Qasymm8QuantizationHelper<int8_t>::quantize(value, qinfo, rounding_policy);
 }

References arm_compute::test::validation::qinfo, and Qasymm8QuantizationHelper< QUANTIZED_TYPE >::quantize().

Referenced by ClActivationKernel::configure(), arm_compute::scale_helpers::delta_bilinear_c1_quantized(), arm_compute::cpu::elementwise_arithm_op_quantized_signed_scalar(), arm_compute::cpu::elementwise_op< int8_t >(), get_quantized_activation_min_max(), arm_compute::quantization::get_quantized_asymmetric_output_min_max(), arm_compute::test::validation::get_quantized_qasymm8_signed_bounds(), arm_compute::cpu::neon_qasymm8_signed_activation(), PixelValue::PixelValue(), arm_compute::test::validation::reference::quantization_layer(), arm_compute::cpu::roi_align_1x1_qasymm8(), CpuConcatenateWidthKernel::run_op(), CpuConcatenateHeightKernel::run_op(), arm_compute::cpu::sve2_qasymm8_signed_activation(), NEQLSTMLayer::validate(), and CLQLSTMLayer::validate().

◆ quantize_qsymm16() [1/2]

int16_t arm_compute::quantize_qsymm16	(	float	value,
		const QuantizationInfo &	qinfo
	)

inline

Quantize a value given a 16-bit symmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing

Returns: Quantized value

Definition at line 469 of file QuantizationInfo.h.

 {
     return quantize_qsymm16(value, qinfo.uniform());
 }

References arm_compute::test::validation::qinfo, quantize_qsymm16(), and QuantizationInfo::uniform().

◆ quantize_qsymm16() [2/2]

int16_t arm_compute::quantize_qsymm16	(	float	value,
		const UniformQuantizationInfo &	qinfo,
		RoundingPolicy	rounding_policy = `RoundingPolicy::TO_NEAREST_UP`
	)

inline

Quantize a value given a 16-bit symmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing
[in]	rounding_policy	(Optional) Rounding policy to use. Default: nearest up

Returns: Quantized value

Definition at line 441 of file QuantizationInfo.h.

 {
     int quantized = arm_compute::round(value / qinfo.scale, rounding_policy);
     quantized     = arm_compute::utility::clamp<int, int16_t>(quantized);
     return quantized;
 }

References arm_compute::test::validation::qinfo, round(), and QuantizationInfo::scale().

Referenced by arm_compute::cpu::add_qsymm16_neon(), arm_compute::cpu::compute_all_anchors_qasymm16(), ClActivationKernel::configure(), NEQLSTMLayer::configure(), CLQLSTMLayer::configure(), arm_compute::test::validation::convert_to_symmetric(), arm_compute::cpu::neon_qsymm16_activation(), PixelValue::PixelValue(), quantize_qsymm16(), arm_compute::cpu::sub_qsymm16_neon(), NEQLSTMLayer::validate(), and CLQLSTMLayer::validate().

◆ quantize_qsymm8()

int8_t arm_compute::quantize_qsymm8	(	float	value,
		const QuantizationInfo &	qinfo
	)

inline

Quantize a value given a 8-bit symmetric quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing

Returns: Quantized value

Definition at line 323 of file QuantizationInfo.h.

 {
     int quantized = arm_compute::round(value / qinfo.uniform().scale, RoundingPolicy::TO_NEAREST_UP);
     quantized     = std::max(-128, std::min(quantized, 127));
     return quantized;
 }

References arm_compute::test::validation::qinfo, round(), UniformQuantizationInfo::scale, TO_NEAREST_UP, and QuantizationInfo::uniform().

Referenced by PixelValue::PixelValue().

◆ quantize_qsymm8_per_channel()

int8_t arm_compute::quantize_qsymm8_per_channel	(	float	value,
		const QuantizationInfo &	qinfo,
		size_t	channel_id = `0`
	)

inline

Quantize a value given a 8-bit symmetric per channel quantization scheme.

Parameters

[in]	value	Value to quantize
[in]	qinfo	Quantization information to use for quantizing
[in]	channel_id	channel index into the scale vector of quantization info

Returns: Quantized value

Definition at line 338 of file QuantizationInfo.h.

 {
     int quantized = arm_compute::round(value / qinfo.scale()[channel_id], RoundingPolicy::TO_NEAREST_UP);
     quantized     = std::max(-128, std::min(quantized, 127));
     return quantized;
 }

References arm_compute::test::validation::qinfo, round(), QuantizationInfo::scale(), and TO_NEAREST_UP.

Referenced by arm_compute::test::validation::get_symm_quantized_per_channel_bounds().

◆ read_file()

std::string read_file	(	const std::string &	filename,
		bool	binary
	)

Load an entire file in memory.

Parameters

[in]	filename	Name of the file to read.
[in]	binary	Is it a binary file ?

Returns: The content of the file.

Definition at line 40 of file Utils.cpp.

 {
     std::string   out;
     std::ifstream fs;
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     try
     {
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
         fs.exceptions(std::ifstream::failbit | std::ifstream::badbit);
         std::ios_base::openmode mode = std::ios::in;
  
         if (binary)
         {
             mode |= std::ios::binary;
         }
  
         fs.open(filename, mode);
  
         // Go to the end of the file
         fs.seekg(0, std::ios::end);
         // Reserve the memory required to store the file's content
         out.reserve(fs.tellg());
         // Go back to the beginning of the file
         fs.seekg(0, std::ios::beg);
         // Copy the content of the file
         out.assign(std::istreambuf_iterator<char>(fs), std::istreambuf_iterator<char>());
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     }
     catch (const std::ifstream::failure &e)
     {
         ARM_COMPUTE_ERROR_VAR("Accessing %s: %s", filename.c_str(), e.what());
     }
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
  
     return out;
 }

References ARM_COMPUTE_ERROR_VAR, arm_compute::mlgo::parser::end(), and clang_tidy_rules::mode.

Referenced by ClKernelLibrary::program().

◆ release_prepare_tensors()

void arm_compute::release_prepare_tensors	(	WorkspaceData< TensorType > &	workspace,
		ITensorPack &	prep_pack
	)

Definition at line 105 of file MemoryHelpers.h.

 {
     workspace.erase(std::remove_if(workspace.begin(), workspace.end(),
                                    [&prep_pack](auto &wk)
                                    {
                                        const bool to_erase = wk.lifetime == experimental::MemoryLifetime::Prepare;
                                        if (to_erase)
                                        {
                                            prep_pack.remove_tensor(wk.slot);
                                        }
                                        return to_erase;
                                    }),
                     workspace.end());
 }

Referenced by CLWinogradConvolutionLayer::prepare().

◆ release_temporaries()

void arm_compute::release_temporaries	(	const experimental::MemoryRequirements &	mem_reqs,
		WorkspaceData< TensorType > &	workspace
	)

Utility function to release tensors with lifetime marked as Prepare.

Definition at line 122 of file MemoryHelpers.h.

 {
     for (auto &ws : workspace)
     {
         const int slot = ws.slot;
         for (auto &m : mem_reqs)
         {
             if (m.slot == slot && m.lifetime == experimental::MemoryLifetime::Prepare)
             {
                 auto tensor = ws.tensor.get();
                 tensor->allocator()->free();
                 break;
             }
         }
     }
 }

References tensor.

Referenced by CLGEMMLowpMatrixMultiplyCore::prepare(), CLGEMMConvolutionLayer::prepare(), and CLConvolutionLayer::prepare().

◆ restore_program_cache_from_file()

void restore_program_cache_from_file ( const std::string & filename = "cache.bin" )

This function loads prebuilt opencl kernels from a file.

Parameters

[in] filename Name of the file to be used to load the kernels

Definition at line 35 of file Utils.cpp.

 {
     std::ifstream cache_file(filename, std::ios::binary);
     if (cache_file.is_open())
     {
         if (!CLScheduler::get().is_initialised())
         {
             arm_compute::CLScheduler::get().default_init();
         }
  
         while (!cache_file.eof())
         {
             size_t name_len   = 0;
             size_t binary_len = 0;
             cache_file.read(reinterpret_cast<char *>(&name_len), sizeof(size_t));
             cache_file.read(reinterpret_cast<char *>(&binary_len), sizeof(size_t));
             if (name_len == 0 || binary_len == 0)
             {
                 break;
             }
             std::vector<char>          tmp(name_len);
             std::vector<unsigned char> binary(binary_len);
             std::string                name;
             cache_file.read(tmp.data(), name_len);
             name.assign(tmp.data(), name_len);
             tmp.resize(binary_len);
             cache_file.read(reinterpret_cast<char *>(binary.data()), binary_len);
             cl::Context             context = arm_compute::CLScheduler::get().context();
             cl::Program::Binaries   binaries{binary};
             std::vector<cl::Device> devices = context.getInfo<CL_CONTEXT_DEVICES>();
             cl::Program             program(context, devices, binaries);
             program.build();
             CLKernelLibrary::get().add_built_program(name, program);
         }
         cache_file.close();
     }
 }

References CLKernelLibrary::add_built_program(), CLScheduler::context(), arm_compute::test::validation::context, CLScheduler::default_init(), CLKernelLibrary::get(), CLScheduler::get(), and name.

◆ round()

int round	(	float	x,
		RoundingPolicy	rounding_policy
	)

Return a rounded value of x.

Rounding is done according to the rounding_policy.

Parameters

[in]	x	Float value to be rounded.
[in]	rounding_policy	Policy determining how rounding is done.

Returns: Rounded value of the argument x.

Definition at line 36 of file Rounding.cpp.

 {
     using namespace std;
     int rounded = 0;
     switch (rounding_policy)
     {
         case RoundingPolicy::TO_ZERO:
         {
             rounded = static_cast<int>(x);
             break;
         }
         case RoundingPolicy::TO_NEAREST_UP:
         {
             rounded = static_cast<int>(support::cpp11::round(x));
             break;
         }
         case RoundingPolicy::TO_NEAREST_EVEN:
         {
 #ifdef __aarch64__
             asm("fcvtns %x[res], %s[value]" : [res] "=r"(rounded) : [value] "w"(x));
 #else  // __aarch64__
             ARM_COMPUTE_ERROR("TO_NEAREST_EVEN rounding policy is not supported.");
 #endif // __aarch64__
             break;
         }
         default:
         {
             ARM_COMPUTE_ERROR("Unsupported rounding policy.");
             break;
         }
     }
  
     return rounded;
 }

References ARM_COMPUTE_ERROR, arm_compute::support::cpp11::round(), TO_NEAREST_EVEN, TO_NEAREST_UP, and TO_ZERO.

Referenced by Qasymm8QuantizationHelper< QUANTIZED_TYPE >::quantize(), quantize_qasymm16(), quantize_qsymm16(), quantize_qsymm8(), quantize_qsymm8_per_channel(), roi_pooling_layer(), scale_nearest_neighbour_nchw(), arm_compute::scheduler_utils::split_2d(), arm_compute::cpu::sve2_qasymm8_activation(), and arm_compute::cpu::sve2_qasymm8_signed_activation().

◆ rounding_divide_by_pow2() [1/3]

int32_t rounding_divide_by_pow2	(	int32_t	x,
		int	exponent
	)

inline

Round to the nearest division by a power-of-two using exponent.

Note: This function calculates the following expression: (x + 2^n -1 ) / 2^n where n = exponent

Parameters

[in]	x	Element to divide.
[in]	exponent	Integer value used to round to nearest division by a power-of-two

Returns: the nearest division by a power-of-two using exponent

Definition at line 441 of file NEMath.inl.

 {
     const int32_t mask      = (1 << exponent) - 1;
     const int32_t threshold = (mask >> 1) + (x < 0 ? 1 : 0);
     return (x >> exponent) + ((x & mask) > threshold ? 1 : 0);
 }

◆ rounding_divide_by_pow2() [2/3]

int32x4_t rounding_divide_by_pow2	(	int32x4_t	x,
		int	exponent
	)

inline

Round to the nearest division by a power-of-two using exponent.

Note: This function calculates the following expression: (x + 2^n -1 ) / 2^n where n = exponent

Parameters

[in]	x	Vector of 4 elements
[in]	exponent	Integer value used to round to nearest division by a power-of-two

Returns: the nearest division by a power-of-two using exponent

Definition at line 433 of file NEMath.inl.

 {
     const int32x4_t shift_vec  = vdupq_n_s32(-exponent);
     const int32x4_t fixup      = vshrq_n_s32(vandq_s32(x, shift_vec), 31);
     const int32x4_t fixed_up_x = vqaddq_s32(x, fixup);
     return vrshlq_s32(fixed_up_x, shift_vec);
 }

◆ rounding_divide_by_pow2() [3/3]

int32x4_t rounding_divide_by_pow2	(	int32x4_t	x,
		int32x4_t	exponent
	)

inline

Round to the nearest division by a power-of-two using exponent.

Note: This function calculates the following expression: (x + 2^n -1 ) / 2^n where n = exponent

Parameters

[in]	x	Vector of 4 elements
[in]	exponent	Vector of 4 elements with integer value used to round to nearest division by a power-of-two

Returns: the nearest division by a power-of-two using exponent

Definition at line 425 of file NEMath.inl.

 {
     const int32x4_t shift_vec  = vnegq_s32(exponent);
     const int32x4_t fixup      = vshrq_n_s32(vandq_s32(x, shift_vec), 31);
     const int32x4_t fixed_up_x = vqaddq_s32(x, fixup);
     return vrshlq_s32(fixed_up_x, shift_vec);
 }

Referenced by finalize_quantization(), finalize_quantization_int16(), finalize_quantization_symm(), and multiply_by_quantized_multiplier_2row().

◆ run_reverse()

void arm_compute::run_reverse	(	const Window &	window,
		const ITensor *	input,
		const ITensor *	axis,
		ITensor *	output,
		bool	use_inverted_axis
	)

Definition at line 96 of file NEReverseKernel.cpp.

 {
     unsigned int axis_bit = 0;
     const int    rank     = input->info()->num_dimensions();
  
     for (unsigned int i = 0; i < axis->info()->dimension(0); ++i)
     {
         int axis_i = *(reinterpret_cast<const int *>(axis->buffer()) + i);
  
         // The values of axis tensor must be between [-rank, rank-1].
         if ((axis_i < -rank) || (axis_i >= rank))
         {
             ARM_COMPUTE_ERROR("the values of the axis tensor must be within [-rank, rank-1].");
         }
  
         // In case of negative axis value i.e targeted axis(i) = rank + axis(i)
         if (axis_i < 0)
         {
             axis_i = rank + axis_i;
         }
  
         // Reverse ACL axis indices convention i.e. (inverted)axis = (tensor_rank - 1) - axis
         if (use_inverted_axis)
         {
             axis_i = (rank - 1) - axis_i;
         }
  
         axis_bit |= 1 << axis_i;
     }
  
     // Check if we need a left-over loop for the y dimension
     const int window_step_x  = 16 / input->info()->element_size();
     const int window_start_x = window.x().start();
     const int window_end_x   = window.x().end();
  
     Window win(window);
     win.set(Window::DimX, Window::Dimension(0, 1, 1));
  
     Iterator input_it(input, win);
     execute_window_loop(
         win,
         [&](const Coordinates &id)
         {
             int x = window_start_x;
             for (; x <= (window_end_x - window_step_x); x += window_step_x)
             {
                 auto in = wrapper::vloadq(reinterpret_cast<T *>(input_it.ptr()) + x);
  
                 // Reverse 0 axis
                 if (axis_bit & 0x1)
                 {
                     in = wrapper::vrev64(in);
                     in = wrapper::vcombine(wrapper::vgethigh(in), wrapper::vgetlow(in));
                 }
  
                 const int offset_x = (axis_bit & 0x1) ? output->info()->dimension(0) - x - window_step_x : x;
                 const int offset_y = (axis_bit & 0x2) ? output->info()->dimension(1) - id.y() - 1 : id.y();
                 const int offset_z = (axis_bit & 0x4) ? output->info()->dimension(2) - id.z() - 1 : id.z();
                 const int offset_w = (axis_bit & 0x8) ? output->info()->dimension(3) - id[3] - 1 : id[3];
  
                 auto out_ptr =
                     reinterpret_cast<T *>(output->ptr_to_element(Coordinates(offset_x, offset_y, offset_z, offset_w)));
                 wrapper::vstore(out_ptr, in);
             }
  
             // Compute left-over elements
             for (; x < window_end_x; ++x)
             {
                 const auto in = *(reinterpret_cast<T *>(input_it.ptr()) + x);
  
                 const int offset_x = (axis_bit & 0x1) ? output->info()->dimension(0) - x - 1 : x;
                 const int offset_y = (axis_bit & 0x2) ? output->info()->dimension(1) - id.y() - 1 : id.y();
                 const int offset_z = (axis_bit & 0x4) ? output->info()->dimension(2) - id.z() - 1 : id.z();
                 const int offset_w = (axis_bit & 0x8) ? output->info()->dimension(3) - id[3] - 1 : id[3];
  
                 *reinterpret_cast<T *>(output->ptr_to_element(Coordinates(offset_x, offset_y, offset_z, offset_w))) =
                     in;
             }
         },
         input_it);
 }

References ARM_COMPUTE_ERROR, ITensor::buffer(), ITensorInfo::dimension(), Window::DimX, Window::Dimension::end(), execute_window_loop(), ITensor::info(), arm_compute::test::validation::input, Iterator::ptr(), ITensor::ptr_to_element(), Window::set(), Window::Dimension::start(), arm_compute::wrapper::vcombine(), arm_compute::wrapper::vgethigh(), arm_compute::wrapper::vgetlow(), arm_compute::wrapper::vloadq(), arm_compute::wrapper::vrev64(), arm_compute::wrapper::vstore(), and Window::x().

◆ save_program_cache_to_file()

void save_program_cache_to_file ( const std::string & filename = "cache.bin" )

This function saves opencl kernels library to a file.

Parameters

[in] filename Name of the file to be used to save the library

Definition at line 73 of file Utils.cpp.

 {
     if (CLScheduler::get().is_initialised())
     {
         std::ofstream cache_file(filename, std::ios::binary);
         if (cache_file.is_open())
         {
             for (const auto &it : CLKernelLibrary::get().get_built_programs())
             {
                 std::vector<std::vector<unsigned char>> binaries = it.second.getInfo<CL_PROGRAM_BINARIES>();
                 ARM_COMPUTE_ERROR_ON(binaries.size() != 1);
                 const std::string kernel_name      = it.first;
                 size_t            kernel_name_size = kernel_name.length();
                 size_t            binary_size      = binaries[0].size();
                 cache_file.write(reinterpret_cast<char *>(&kernel_name_size), sizeof(size_t));
                 cache_file.write(reinterpret_cast<char *>(&binary_size), sizeof(size_t));
                 cache_file.write(kernel_name.c_str(), kernel_name_size);
                 cache_file.write(reinterpret_cast<const char *>(binaries[0].data()), binaries[0].size());
             }
             cache_file.close();
         }
         else
         {
             ARM_COMPUTE_ERROR("Cannot open cache file");
         }
     }
 }

References ARM_COMPUTE_ERROR, ARM_COMPUTE_ERROR_ON, CLKernelLibrary::get(), CLScheduler::get(), and kernel_name.

◆ scaled_3d_dimensions_signed()

std::tuple< int, int, int > scaled_3d_dimensions_signed	(	int	width,
		int	height,
		int	depth,
		int	kernel_width,
		int	kernel_height,
		int	kernel_depth,
		const Pooling3dLayerInfo &	pool3d_info
	)

Returns calculated width, height and depth of output scaled tensor depending on dimensions rounding mode.

Parameters

[in]	width	Width of input tensor
[in]	height	Height of input tensor
[in]	depth	Depth of input tensor
[in]	kernel_width	Kernel width.
[in]	kernel_height	Kernel height.
[in]	kernel_depth	Kernel depth.
[in]	pool3d_info	Pad and stride and round information for 3d pooling

Returns: A tuple with the new width in the first position, the new height in the second, and the new depth in the third. Returned values can be < 1

Definition at line 366 of file Utils.cpp.

 {
     const int pad_left   = pool3d_info.padding.left;
     const int pad_top    = pool3d_info.padding.top;
     const int pad_right  = pool3d_info.padding.right;
     const int pad_bottom = pool3d_info.padding.bottom;
     const int pad_front  = pool3d_info.padding.front;
     const int pad_back   = pool3d_info.padding.back;
     const int stride_x   = pool3d_info.stride.x();
     const int stride_y   = pool3d_info.stride.y();
     const int stride_z   = pool3d_info.stride.z();
     int       w          = 0;
     int       h          = 0;
     int       d          = 0;
  
     switch (pool3d_info.round_type)
     {
         case DimensionRoundingType::FLOOR:
             w = static_cast<int>(
                 std::floor((static_cast<float>(width + pad_left + pad_right - kernel_width) / stride_x) + 1));
             h = static_cast<int>(
                 std::floor((static_cast<float>(height + pad_top + pad_bottom - kernel_height) / stride_y) + 1));
             d = static_cast<int>(
                 std::floor((static_cast<float>(depth + pad_front + pad_back - kernel_depth) / stride_z) + 1));
             break;
         case DimensionRoundingType::CEIL:
             w = static_cast<int>(
                 std::ceil((static_cast<float>(width + pad_left + pad_right - kernel_width) / stride_x) + 1));
             h = static_cast<int>(
                 std::ceil((static_cast<float>(height + pad_top + pad_bottom - kernel_height) / stride_y) + 1));
             d = static_cast<int>(
                 std::ceil((static_cast<float>(depth + pad_front + pad_back - kernel_depth) / stride_z) + 1));
             break;
         default:
             ARM_COMPUTE_ERROR("Unsupported rounding type");
     }
  
     return std::make_tuple(static_cast<int>(w), static_cast<int>(h), static_cast<int>(d));
 }

References ARM_COMPUTE_ERROR, Padding3D::back, Padding3D::bottom, CEIL, FLOOR, Padding3D::front, Padding3D::left, Pooling3dLayerInfo::padding, Padding3D::right, Pooling3dLayerInfo::round_type, Pooling3dLayerInfo::stride, Padding3D::top, arm_compute::test::validation::w, Size3D::x(), Size3D::y(), and Size3D::z().

Referenced by arm_compute::misc::shape_calculator::compute_pool3d_shape().

◆ scaled_dimensions()

std::pair< unsigned int, unsigned int > scaled_dimensions	(	int	width,
		int	height,
		int	kernel_width,
		int	kernel_height,
		const PadStrideInfo &	pad_stride_info,
		const Size2D &	dilation = `Size2D(1U, 1U)`
	)

Returns expected width and height of output scaled tensor depending on dimensions rounding mode.

Parameters

[in]	width	Width of input tensor (Number of columns)
[in]	height	Height of input tensor (Number of rows)
[in]	kernel_width	Kernel width.
[in]	kernel_height	Kernel height.
[in]	pad_stride_info	Pad and stride information.
[in]	dilation	(Optional) Dilation, in elements, across x and y. Defaults to (1, 1).

Returns: A pair with the new width in the first position and the new height in the second.

Definition at line 288 of file Utils.cpp.

 {
     const int dilation_x = dilation.x();
     const int dilation_y = dilation.y();
     const int pad_left   = pad_stride_info.pad_left();
     const int pad_top    = pad_stride_info.pad_top();
     const int pad_right  = pad_stride_info.pad_right();
     const int pad_bottom = pad_stride_info.pad_bottom();
     const int stride_x   = pad_stride_info.stride().first;
     const int stride_y   = pad_stride_info.stride().second;
     int       w          = 0;
     int       h          = 0;
     switch (pad_stride_info.round())
     {
         case DimensionRoundingType::FLOOR:
             w = static_cast<int>(std::floor(
                 (static_cast<float>(width + pad_left + pad_right - (dilation_x * (kernel_width - 1) + 1)) / stride_x) +
                 1));
             h = static_cast<int>(
                 std::floor((static_cast<float>(height + pad_top + pad_bottom - (dilation_y * (kernel_height - 1) + 1)) /
                             stride_y) +
                            1));
             break;
         case DimensionRoundingType::CEIL:
             w = static_cast<int>(std::ceil(
                 (static_cast<float>(width + pad_left + pad_right - (dilation_x * (kernel_width - 1) + 1)) / stride_x) +
                 1));
             h = static_cast<int>(
                 std::ceil((static_cast<float>(height + pad_top + pad_bottom - (dilation_y * (kernel_height - 1) + 1)) /
                            stride_y) +
                           1));
             break;
         default:
             ARM_COMPUTE_ERROR("Unsupported rounding type");
     }
  
     w = std::max(1, w);
     h = std::max(1, h);
     return std::make_pair(static_cast<unsigned int>(w), static_cast<unsigned int>(h));
 }

References ARM_COMPUTE_ERROR, CEIL, FLOOR, PadStrideInfo::pad_bottom(), PadStrideInfo::pad_left(), PadStrideInfo::pad_right(), PadStrideInfo::pad_top(), PadStrideInfo::round(), PadStrideInfo::stride(), arm_compute::test::validation::w, Size2D::x(), and Size2D::y().

Referenced by calculate_same_pad(), arm_compute::misc::shape_calculator::compute_deep_convolution_shape(), arm_compute::misc::shape_calculator::compute_depthwise_convolution_shape(), arm_compute::misc::shape_calculator::compute_im2col_conv_shape(), PoolingLayerNode::compute_output_descriptor(), FusedConvolutionBatchNormalizationNode::compute_output_descriptor(), FusedDepthwiseConvolutionBatchNormalizationNode::compute_output_descriptor(), DepthwiseConvolutionLayerNode::compute_output_descriptor(), ConvolutionLayerNode::compute_output_descriptor(), arm_compute::misc::shape_calculator::compute_winograd_output_transform_shape(), ClIm2ColKernel::configure(), CpuIm2ColKernel::configure(), CpuGemmConv2d::configure(), ClGemmConv2d::configure(), arm_compute::test::validation::reference::convolution_layer_nchw(), CpuGemmConv2d::has_opt_impl(), arm_compute::test::validation::reference::im2col_nchw(), arm_compute::test::validation::reference::im2col_nhwc(), ClGemmConv2d::validate(), and CpuGemmConv2d::validate().

◆ scaled_dimensions_signed()

std::pair< int, int > scaled_dimensions_signed	(	int	width,
		int	height,
		int	kernel_width,
		int	kernel_height,
		const PadStrideInfo &	pad_stride_info
	)

Returns calculated width and height of output scaled tensor depending on dimensions rounding mode.

Parameters

[in]	width	Width of input tensor (Number of columns)
[in]	height	Height of input tensor (Number of rows)
[in]	kernel_width	Kernel width.
[in]	kernel_height	Kernel height.
[in]	pad_stride_info	Pad and stride information.

Returns: A pair with the new width in the first position and the new height in the second, returned values can be < 1

Definition at line 334 of file Utils.cpp.

 {
     const int pad_left   = pad_stride_info.pad_left();
     const int pad_top    = pad_stride_info.pad_top();
     const int pad_right  = pad_stride_info.pad_right();
     const int pad_bottom = pad_stride_info.pad_bottom();
     const int stride_x   = pad_stride_info.stride().first;
     const int stride_y   = pad_stride_info.stride().second;
     int       w          = 0;
     int       h          = 0;
     switch (pad_stride_info.round())
     {
         case DimensionRoundingType::FLOOR:
             w = static_cast<int>(
                 std::floor((static_cast<float>(width + pad_left + pad_right - kernel_width) / stride_x) + 1));
             h = static_cast<int>(
                 std::floor((static_cast<float>(height + pad_top + pad_bottom - kernel_height) / stride_y) + 1));
             break;
         case DimensionRoundingType::CEIL:
             w = static_cast<int>(
                 std::ceil((static_cast<float>(width + pad_left + pad_right - kernel_width) / stride_x) + 1));
             h = static_cast<int>(
                 std::ceil((static_cast<float>(height + pad_top + pad_bottom - kernel_height) / stride_y) + 1));
             break;
         default:
             ARM_COMPUTE_ERROR("Unsupported rounding type");
     }
  
     return std::make_pair(static_cast<int>(w), static_cast<int>(h));
 }

References ARM_COMPUTE_ERROR, CEIL, FLOOR, PadStrideInfo::pad_bottom(), PadStrideInfo::pad_left(), PadStrideInfo::pad_right(), PadStrideInfo::pad_top(), PadStrideInfo::round(), PadStrideInfo::stride(), and arm_compute::test::validation::w.

Referenced by arm_compute::misc::shape_calculator::compute_pool_shape().

◆ schedule_kernel_on_ctx()

void schedule_kernel_on_ctx	(	CLRuntimeContext *	ctx,
		ICLKernel *	kernel,
		bool	flush = `true`
	)

Schedules a kernel using the context if not nullptr else uses the legacy scheduling flow.

Parameters

[in]	ctx	Context to use.
[in]	kernel	Kernel to schedule.
[in]	flush	(Optional) Specifies if the command queue will be flushed after running the kernel.

Definition at line 138 of file CLHelpers.cpp.

 {
     ARM_COMPUTE_ERROR_ON_NULLPTR(kernel);
     if (ctx)
     {
         ARM_COMPUTE_ERROR_ON(ctx->gpu_scheduler() == nullptr);
         ctx->gpu_scheduler()->enqueue(*kernel, flush);
     }
     else
     {
         CLScheduler::get().enqueue(*kernel, flush);
     }
 }

References ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_ERROR_ON_NULLPTR, CLScheduler::enqueue(), CLScheduler::get(), and CLRuntimeContext::gpu_scheduler().

Referenced by ICLSimpleFunction::run(), and CLInstanceNormalizationLayer::run().

◆ select_preferable_platform()

cl::Platform select_preferable_platform ( CLBackendType cl_backend_type )

This function selects the OpenCL platform based on the backend type.

Parameters

[in] cl_backend_type The OpenCL backend type to use.

Returns: A cl::Platform object.

Definition at line 84 of file CLHelpers.cpp.

 {
     std::vector<cl::Platform> platforms;
     cl::Platform::get(&platforms);
     ARM_COMPUTE_ERROR_ON_MSG(platforms.size() == 0, "Couldn't find any OpenCL platform");
  
     cl::Platform selected_platform{nullptr};
  
     // If the user has selected the Native platform, return the first available.
     switch (cl_backend_type)
     {
         case CLBackendType::Native:
             selected_platform = platforms[0];
             break;
         case CLBackendType::Clvk:
             for (auto p : platforms)
             {
                 std::string res = p.getInfo<CL_PLATFORM_NAME>();
                 if (res.find("clvk") != std::string::npos)
                 {
                     selected_platform = p;
                     break;
                 }
             }
             break;
         default:
             ARM_COMPUTE_ERROR("Unsupported backend type");
     }
  
     if (!selected_platform())
     {
         ARM_COMPUTE_ERROR("No valid platform found");
     }
  
     return selected_platform;
 }

References ARM_COMPUTE_ERROR, ARM_COMPUTE_ERROR_ON_MSG, Clvk, and Native.

Referenced by create_opencl_context_and_device().

◆ set_data_layout_if_unknown()

bool arm_compute::set_data_layout_if_unknown	(	ITensorInfo &	info,
		DataLayout	data_layout
	)

inline

Set the data layout to the specified value if the current data layout is unknown.

Parameters

[in,out]	info	Tensor info used to check and assign.
[in]	data_layout	New data layout.

Returns: True if the data type has been changed.

Definition at line 152 of file AutoConfiguration.h.

 {
     if (info.data_layout() == DataLayout::UNKNOWN)
     {
         info.set_data_layout(data_layout);
         return true;
     }
  
     return false;
 }

References arm_compute::cpu::data_layout, ScaleKernelInfo::data_layout, arm_compute::test::validation::info, and UNKNOWN.

◆ set_data_type_if_unknown()

bool arm_compute::set_data_type_if_unknown	(	ITensorInfo &	info,
		DataType	data_type
	)

inline

Set the data type and number of channels to the specified value if the current data type is unknown.

Parameters

[in,out]	info	Tensor info used to check and assign.
[in]	data_type	New data type.

Returns: True if the data type has been changed.

Definition at line 133 of file AutoConfiguration.h.

 {
     if (info.data_type() == DataType::UNKNOWN)
     {
         info.set_data_type(data_type);
         return true;
     }
  
     return false;
 }

References arm_compute::test::validation::data_type, arm_compute::test::validation::info, and UNKNOWN.

Referenced by NELogicalKernel::configure(), CpuSubKernel::configure(), CpuAddMulAddKernel::configure(), and CpuAddKernel::configure().

◆ set_format_if_unknown()

bool arm_compute::set_format_if_unknown	(	ITensorInfo &	info,
		Format	format
	)

inline

Set the format, data type and number of channels to the specified value if the current data type is unknown.

Parameters

[in,out]	info	Tensor info used to check and assign.
[in]	format	New format.

Returns: True if the format has been changed.

Definition at line 114 of file AutoConfiguration.h.

 {
     if (info.data_type() == DataType::UNKNOWN)
     {
         info.set_format(format);
         return true;
     }
  
     return false;
 }

References update_supported_ops::format, arm_compute::test::validation::info, and UNKNOWN.

Referenced by NEBitwiseNotKernel::configure(), NEBitwiseXorKernel::configure(), NEBitwiseOrKernel::configure(), and NEBitwiseAndKernel::configure().

◆ set_quantization_info_if_empty()

bool arm_compute::set_quantization_info_if_empty	(	ITensorInfo &	info,
		QuantizationInfo	quantization_info
	)

inline

Set the quantization info to the specified value if the current quantization info is empty and the data type of asymmetric quantized type.

Parameters

[in,out]	info	Tensor info used to check and assign.
[in]	quantization_info	Quantization info

Returns: True if the quantization info has been changed.

Definition at line 171 of file AutoConfiguration.h.

 {
     if (info.quantization_info().empty() && (is_data_type_quantized_asymmetric(info.data_type())))
     {
         info.set_quantization_info(quantization_info);
         return true;
     }
  
     return false;
 }

References arm_compute::test::validation::info, and is_data_type_quantized_asymmetric().

◆ set_shape_if_empty()

bool arm_compute::set_shape_if_empty	(	ITensorInfo &	info,
		const TensorShape &	shape
	)

inline

Set the shape to the specified value if the current assignment is empty.

Parameters

[in,out]	info	Tensor info used to check and assign.
[in]	shape	New shape.

Returns: True if the shape has been changed.

Definition at line 95 of file AutoConfiguration.h.

 {
     if (info.tensor_shape().total_size() == 0)
     {
         info.set_tensor_shape(shape);
         return true;
     }
  
     return false;
 }

References arm_compute::test::validation::info, and arm_compute::test::validation::shape.

Referenced by NELogicalKernel::configure(), NEBitwiseNotKernel::configure(), NEBitwiseXorKernel::configure(), NEBitwiseAndKernel::configure(), NEBitwiseOrKernel::configure(), CpuSubKernel::configure(), ClCastKernel::configure(), CpuAddMulAddKernel::configure(), CpuCastKernel::configure(), CpuAddKernel::configure(), and CpuMulKernel::configure().

◆ set_unroll_with_pragma()

void set_unroll_with_pragma	(	CLBuildOptions &	built_opts,
		std::initializer_list< int >	values
	)

Definition at line 487 of file CLHelpers.cpp.

 {
     for (const int value : values)
     {
         if (value > max_manual_loop_unrolling)
         {
             built_opts.add_option("-DUNROLL_WITH_PRAGMA");
             return;
         }
     }
 }

References CLBuildOptions::add_option().

Referenced by CLDepthwiseConvolutionLayerNativeKernel::configure().

◆ set_wbsm()

void set_wbsm	(	cl::Kernel &	kernel,
		cl_int	wbsm_hint
	)

Definition at line 441 of file CLHelpers.cpp.

 {
     cl_int err = clSetKernelExecInfo(kernel.get(), CL_KERNEL_EXEC_INFO_WORKGROUP_BATCH_SIZE_MODIFIER_ARM,
                                      sizeof(cl_int), &wbsm_hint);
     ARM_COMPUTE_UNUSED(err);
     ARM_COMPUTE_ERROR_ON(err != CL_SUCCESS);
 }

References ARM_COMPUTE_ERROR_ON, ARM_COMPUTE_UNUSED, and clSetKernelExecInfo().

Referenced by enqueue().

◆ string_from_activation_func()

const std::string & string_from_activation_func ( const ActivationFunction & act )

Translates a given activation function to a string.

Parameters

[in] act ActivationLayerInfo::ActivationFunction to be translated to string.

Returns: The string describing the activation function.

Definition at line 31 of file ActivationFunctionUtils.cpp.

 {
     static std::map<ActivationFunction, const std::string> act_map = {{ActivationFunction::ABS, "ABS"},
                                                                       {ActivationFunction::LINEAR, "LINEAR"},
                                                                       {ActivationFunction::LOGISTIC, "LOGISTIC"},
                                                                       {ActivationFunction::RELU, "RELU"},
                                                                       {ActivationFunction::BOUNDED_RELU, "BRELU"},
                                                                       {ActivationFunction::LU_BOUNDED_RELU, "LU_BRELU"},
                                                                       {ActivationFunction::LEAKY_RELU, "LRELU"},
                                                                       {ActivationFunction::SOFT_RELU, "SRELU"},
                                                                       {ActivationFunction::ELU, "ELU"},
                                                                       {ActivationFunction::SQRT, "SQRT"},
                                                                       {ActivationFunction::SQUARE, "SQUARE"},
                                                                       {ActivationFunction::TANH, "TANH"},
                                                                       {ActivationFunction::IDENTITY, "IDENTITY"},
                                                                       {ActivationFunction::HARD_SWISH, "HARD_SWISH"},
                                                                       {ActivationFunction::SWISH, "SWISH"},
                                                                       {ActivationFunction::GELU, "GELU"}
  
     };
  
     return act_map[act];
 }

References ABS, BOUNDED_RELU, ELU, GELU, HARD_SWISH, IDENTITY, LEAKY_RELU, LINEAR, LOGISTIC, LU_BOUNDED_RELU, RELU, SOFT_RELU, SQRT, SQUARE, SWISH, and TANH.

◆ string_from_border_mode()

const std::string & string_from_border_mode ( BorderMode border_mode )

Translates a given border mode policy to a string.

Parameters

[in] border_mode BorderMode to be translated to string.

Returns: The string describing the border mode.

Definition at line 96 of file Utils.cpp.

 {
     static std::map<BorderMode, const std::string> border_mode_map = {
         {BorderMode::UNDEFINED, "UNDEFINED"},
         {BorderMode::CONSTANT, "CONSTANT"},
         {BorderMode::REPLICATE, "REPLICATE"},
     };
  
     return border_mode_map[border_mode];
 }

References CONSTANT, REPLICATE, and UNDEFINED.

Referenced by CLFillBorderKernel::configure().

◆ string_from_channel()

const std::string & string_from_channel ( Channel channel )

Convert a channel identity into a string.

Parameters

[in] channel Channel to be translated to string.

Returns: The string describing the channel.

Definition at line 78 of file Utils.cpp.

 {
     static std::map<Channel, const std::string> channels_map = {{Channel::UNKNOWN, "UNKNOWN"},
                                                                 {Channel::R, "R"},
                                                                 {Channel::G, "G"},
                                                                 {Channel::B, "B"},
                                                                 {Channel::A, "A"},
                                                                 {Channel::Y, "Y"},
                                                                 {Channel::U, "U"},
                                                                 {Channel::V, "V"},
                                                                 {Channel::C0, "C0"},
                                                                 {Channel::C1, "C1"},
                                                                 {Channel::C2, "C2"},
                                                                 {Channel::C3, "C3"}};
  
     return channels_map[channel];
 }

References A, B, C0, C1, C2, C3, G, R, U, UNKNOWN, V, and Y.

◆ string_from_data_layout()

const std::string & string_from_data_layout ( DataLayout dl )

Convert a data layout identity into a string.

Parameters

[in] dl DataLayout to be translated to string.

Returns: The string describing the data layout.

Definition at line 30 of file DataLayoutUtils.cpp.

 {
     static std::map<DataLayout, const std::string> dl_map = {
         {DataLayout::UNKNOWN, "UNKNOWN"},
         {DataLayout::NCHW, "NCHW"},
         {DataLayout::NHWC, "NHWC"},
     };
  
     return dl_map[dl];
 }

References dl, NCHW, NHWC, and UNKNOWN.

◆ string_from_data_type()

const std::string & string_from_data_type ( DataType dt )

Convert a data type identity into a string.

Parameters

[in] dt DataType to be translated to string.

Returns: The string describing the data type.

Definition at line 31 of file DataTypeUtils.cpp.

 {
     static std::map<DataType, const std::string> dt_map = {
         {DataType::UNKNOWN, "UNKNOWN"},
         {DataType::S8, "S8"},
         {DataType::U8, "U8"},
         {DataType::S16, "S16"},
         {DataType::U16, "U16"},
         {DataType::S32, "S32"},
         {DataType::U32, "U32"},
         {DataType::S64, "S64"},
         {DataType::U64, "U64"},
         {DataType::F16, "F16"},
         {DataType::F32, "F32"},
         {DataType::F64, "F64"},
         {DataType::SIZET, "SIZET"},
         {DataType::QSYMM8, "QSYMM8"},
         {DataType::QSYMM8_PER_CHANNEL, "QSYMM8_PER_CHANNEL"},
         {DataType::QASYMM8, "QASYMM8"},
         {DataType::QASYMM8_SIGNED, "QASYMM8_SIGNED"},
         {DataType::QSYMM16, "QSYMM16"},
         {DataType::QASYMM16, "QASYMM16"},
     };
  
     return dt_map[dt];
 }

References dt, F16, F32, F64, QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8, QSYMM8_PER_CHANNEL, S16, S32, S64, S8, SIZET, U16, U32, U64, U8, and UNKNOWN.

◆ string_from_format()

const std::string & string_from_format ( Format format )

Convert a tensor format into a string.

Parameters

[in] format Format to be translated to string.

Returns: The string describing the format.

Definition at line 31 of file FormatUtils.cpp.

 {
     static std::map<Format, const std::string> formats_map = {
         {Format::UNKNOWN, "UNKNOWN"},   {Format::U8, "U8"},
         {Format::S16, "S16"},           {Format::U16, "U16"},
         {Format::S32, "S32"},           {Format::U32, "U32"},
         {Format::F16, "F16"},           {Format::F32, "F32"},
         {Format::UV88, "UV88"},         {Format::RGB888, "RGB888"},
         {Format::RGBA8888, "RGBA8888"}, {Format::YUV444, "YUV444"},
         {Format::YUYV422, "YUYV422"},   {Format::NV12, "NV12"},
         {Format::NV21, "NV21"},         {Format::IYUV, "IYUV"},
         {Format::UYVY422, "UYVY422"}};
  
     return formats_map[format];
 }

References F16, F32, update_supported_ops::format, IYUV, NV12, NV21, RGB888, RGBA8888, S16, S32, U16, U32, U8, UNKNOWN, UV88, UYVY422, YUV444, and YUYV422.

Referenced by error_on_format_not_in().

◆ string_from_gemmlowp_output_stage()

const std::string & string_from_gemmlowp_output_stage ( GEMMLowpOutputStageType output_stage )

Translates a given GEMMLowp output stage to a string.

Parameters

[in] output_stage GEMMLowpOutputStageInfo to be translated to string.

Returns: The string describing the GEMMLowp output stage

Definition at line 154 of file Utils.cpp.

 {
     static std::map<GEMMLowpOutputStageType, const std::string> output_stage_map = {
         {GEMMLowpOutputStageType::NONE, ""},
         {GEMMLowpOutputStageType::QUANTIZE_DOWN, "quantize_down"},
         {GEMMLowpOutputStageType::QUANTIZE_DOWN_FIXEDPOINT, "quantize_down_fixedpoint"},
         {GEMMLowpOutputStageType::QUANTIZE_DOWN_FLOAT, "quantize_down_float"}};
  
     return output_stage_map[output_stage];
 }

References NONE, output_stage, QUANTIZE_DOWN, QUANTIZE_DOWN_FIXEDPOINT, and QUANTIZE_DOWN_FLOAT.

Referenced by ClGemmLowpOffsetContributionOutputStageKernel::configure().

◆ string_from_interpolation_policy()

const std::string & string_from_interpolation_policy ( InterpolationPolicy policy )

Translates a given interpolation policy to a string.

Parameters

[in] policy InterpolationPolicy to be translated to string.

Returns: The string describing the interpolation policy.

Definition at line 30 of file InterpolationPolicyUtils.cpp.

 {
     static std::map<InterpolationPolicy, const std::string> interpolation_policy_map = {
         {InterpolationPolicy::AREA, "AREA"},
         {InterpolationPolicy::BILINEAR, "BILINEAR"},
         {InterpolationPolicy::NEAREST_NEIGHBOR, "NEAREST_NEIGHBOUR"},
     };
  
     return interpolation_policy_map[policy];
 }

References AREA, BILINEAR, and NEAREST_NEIGHBOR.

Referenced by ClScaleKernel::configure(), and CpuScaleKernel::configure().

◆ string_from_norm_type()

const std::string & string_from_norm_type ( NormType type )

Translates a given normalization type to a string.

Parameters

[in] type NormType to be translated to string.

Returns: The string describing the normalization type.

Definition at line 107 of file Utils.cpp.

 {
     static std::map<NormType, const std::string> norm_type_map = {
         {NormType::IN_MAP_1D, "IN_MAP_1D"},
         {NormType::IN_MAP_2D, "IN_MAP_2D"},
         {NormType::CROSS_MAP, "CROSS_MAP"},
     };
  
     return norm_type_map[type];
 }

References CROSS_MAP, IN_MAP_1D, IN_MAP_2D, and type.

◆ string_from_pixel_value()

std::string string_from_pixel_value	(	const PixelValue &	value,
		const DataType	data_type
	)

Convert a PixelValue to a string, represented through the specific data type.

Parameters

[in]	value	The PixelValue to convert
[in]	data_type	The type to be used to convert the `value`

Returns: String representation of the PixelValue through the given data type.

Definition at line 165 of file Utils.cpp.

 {
     std::stringstream ss;
     std::string       converted_string;
  
     switch (data_type)
     {
         case DataType::U8:
         case DataType::QASYMM8:
             // Needs conversion to 32 bit, otherwise interpreted as ASCII values
             ss << uint32_t(value.get<uint8_t>());
             converted_string = ss.str();
             break;
         case DataType::S8:
         case DataType::QASYMM8_SIGNED:
         case DataType::QSYMM8_PER_CHANNEL:
             // Needs conversion to 32 bit, otherwise interpreted as ASCII values
             ss << int32_t(value.get<int8_t>());
             converted_string = ss.str();
             break;
         case DataType::U16:
         case DataType::QASYMM16:
             ss << value.get<uint16_t>();
             converted_string = ss.str();
             break;
         case DataType::S16:
         case DataType::QSYMM16:
             ss << value.get<int16_t>();
             converted_string = ss.str();
             break;
         case DataType::U32:
             ss << value.get<uint32_t>();
             converted_string = ss.str();
             break;
         case DataType::S32:
             ss << value.get<int32_t>();
             converted_string = ss.str();
             break;
         case DataType::F32:
             converted_string = float_to_string_with_full_precision(value.get<float>());
             break;
         case DataType::F16:
             static_assert(sizeof(half) == 2, "Half must be 16 bit");
             ss << value.get<half>();
             converted_string = ss.str();
             break;
         default:
             ARM_COMPUTE_ERROR("Not handled");
     }
  
     return converted_string;
 }

References ARM_COMPUTE_ERROR, arm_compute::test::validation::data_type, F16, F32, float_to_string_with_full_precision(), PixelValue::get(), QASYMM16, QASYMM8, QASYMM8_SIGNED, QSYMM16, QSYMM8_PER_CHANNEL, S16, S32, S8, arm_compute::test::validation::ss(), U16, U32, and U8.

Referenced by ClFillKernel::configure(), ClScaleKernel::configure(), CLPadLayerKernel::configure(), and ClTemplateResize::get_tag_lut().

◆ string_from_pooling_type()

const std::string & string_from_pooling_type ( PoolingType type )

Translates a given pooling type to a string.

Parameters

[in] type PoolingType to be translated to string.

Returns: The string describing the pooling type.

Definition at line 118 of file Utils.cpp.

 {
     static std::map<PoolingType, const std::string> pool_type_map = {
         {PoolingType::MAX, "MAX"},
         {PoolingType::AVG, "AVG"},
         {PoolingType::L2, "L2"},
     };
  
     return pool_type_map[type];
 }

References AVG, L2, MAX, and type.

Referenced by ClPool2dKernel::configure(), and ClPool3dKernel::configure().

◆ string_from_target()

const std::string & string_from_target ( GPUTarget target )

Translates a given gpu device target to string.

Parameters

[in] target Given gpu target.

Returns: The string describing the target.

Definition at line 169 of file GPUTarget.cpp.

 {
     static std::map<GPUTarget, const std::string> gpu_target_map = {
         {GPUTarget::MIDGARD, "midgard"},   {GPUTarget::BIFROST, "bifrost"}, {GPUTarget::VALHALL, "valhall"},
         {GPUTarget::FIFTHGEN, "fifthgen"},
  
         {GPUTarget::T600, "t600"},         {GPUTarget::T700, "t700"},       {GPUTarget::T800, "t800"},
         {GPUTarget::G71, "g71"},           {GPUTarget::G72, "g72"},         {GPUTarget::G51, "g51"},
         {GPUTarget::G51BIG, "g51big"},     {GPUTarget::G51LIT, "g51lit"},   {GPUTarget::G31, "g31"},
         {GPUTarget::G76, "g76"},           {GPUTarget::G52, "g52"},         {GPUTarget::G52LIT, "g52lit"},
         {GPUTarget::G77, "g77"},           {GPUTarget::G57, "g57"},         {GPUTarget::G78, "g78"},
         {GPUTarget::G68, "g68"},           {GPUTarget::G78AE, "g78ae"},     {GPUTarget::G710, "g710"},
         {GPUTarget::G610, "g610"},         {GPUTarget::G510, "g510"},       {GPUTarget::G310, "g310"},
         {GPUTarget::G715, "g715"},         {GPUTarget::G615, "g615"},       {GPUTarget::G720, "g720"},
         {GPUTarget::G620, "g620"}};
  
     return gpu_target_map[target];
 }

References BIFROST, FIFTHGEN, G31, G310, G51, G510, G51BIG, G51LIT, G52, G52LIT, G57, G610, G615, G620, G68, G71, G710, G715, G72, G720, G76, G77, G78, G78AE, MIDGARD, T600, T700, T800, and VALHALL.

Referenced by arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_config_native(), arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_config_reshaped(), arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_config_reshaped_only_rhs(), and arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_kernel().

◆ swap()

void arm_compute::swap	(	Window &	lhs,
		Window &	rhs
	)

inline

Parameters

[in]	lhs	First window to swap.
[in]	rhs	Second window to swap.

Definition at line 318 of file Window.inl.

 {
     lhs._dims.swap(rhs._dims);
 }

Referenced by CaffePreproccessor::CaffePreproccessor().

◆ throw_error()

void throw_error ( Status err )

Throw an std::runtime_error.

Parameters

[in] err Error status

Definition at line 47 of file Error.cpp.

 {
     ARM_COMPUTE_THROW(std::runtime_error(err.error_description()));
 }

References ARM_COMPUTE_THROW, and Status::error_description().

◆ to_string() [1/99]

std::string arm_compute::to_string ( CLGEMMKernelType val )

inline

Converts a CLGEMMKernelType to string.

Parameters

[in] val CLGEMMKernelType value to be converted

Returns: String representing the corresponding CLGEMMKernelType

Definition at line 2547 of file TypePrinter.h.

 {
     switch (val)
     {
         case CLGEMMKernelType::NATIVE:
         {
             return "Native";
         }
         case CLGEMMKernelType::RESHAPED_ONLY_RHS:
         {
             return "Reshaped_Only_RHS";
         }
         case CLGEMMKernelType::RESHAPED:
         {
             return "Reshaped";
         }
         default:
         {
             return "Unknown";
         }
     }
 }

References NATIVE, RESHAPED, and RESHAPED_ONLY_RHS.

◆ to_string() [2/99]

std::string arm_compute::to_string ( const ArithmeticOperation & op )

inline

Formatted output of the Arithmetic Operation.

Parameters

[in] op Type to output.

Returns: Formatted string.

Definition at line 1787 of file TypePrinter.h.

 {
     std::stringstream str;
     str << op;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [3/99]

std::string arm_compute::to_string ( const arm_compute::ActivationLayerInfo & info )

inline

Formatted output of the activation function info type.

Parameters

[in] info ActivationLayerInfo to output.

Returns: Formatted string.

Definition at line 526 of file TypePrinter.h.

 {
     std::stringstream str;
     if (info.enabled())
     {
         str << info.activation();
     }
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [4/99]

std::string arm_compute::to_string ( const arm_compute::ActivationLayerInfo::ActivationFunction & function )

inline

Formatted output of the activation function type.

Parameters

[in] function Type to output.

Returns: Formatted string.

Definition at line 573 of file TypePrinter.h.

 {
     std::stringstream str;
     str << function;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [5/99]

std::string arm_compute::to_string ( const arm_compute::CpuMatMulSettings & settings )

inline

Formatted output of the arm_compute::CpuMatMulSettings type.

Parameters

[in] settings arm_compute::CpuMatMulSettings type to output.

Returns: Formatted string.

Definition at line 3614 of file TypePrinter.h.

 {
     std::stringstream str;
     str << settings;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

Referenced by to_string().

◆ to_string() [6/99]

std::string arm_compute::to_string ( const arm_compute::DataLayout & data_layout )

inline

Formatted output of the DataLayout type.

Parameters

[in] data_layout Type to output.

Returns: Formatted string.

Definition at line 727 of file TypePrinter.h.

 {
     std::stringstream str;
     str << data_layout;
     return str.str();
 }

References arm_compute::cpu::data_layout, and caffe_mnist_image_extractor::str.

◆ to_string() [7/99]

std::string arm_compute::to_string ( const arm_compute::DataType & data_type )

inline

Formatted output of the DataType type.

Parameters

[in] data_type Type to output.

Returns: Formatted string.

Definition at line 851 of file TypePrinter.h.

 {
     std::stringstream str;
     str << data_type;
     return str.str();
 }

References arm_compute::test::validation::data_type, and caffe_mnist_image_extractor::str.

◆ to_string() [8/99]

std::string arm_compute::to_string ( const arm_compute::MatMulInfo & matmul_info )

inline

Formatted output of the arm_compute::MatMulInfo type.

Parameters

[in] matmul_info arm_compute::MatMulInfo type to output.

Returns: Formatted string.

Definition at line 3553 of file TypePrinter.h.

 {
     std::stringstream str;
     str << matmul_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [9/99]

std::string arm_compute::to_string ( const arm_compute::MatMulKernelInfo & matmul_info )

inline

Formatted output of the arm_compute::MatMulKernelInfo type.

Parameters

[in] matmul_info arm_compute::MatMulKernelInfo type to output.

Returns: Formatted string.

Definition at line 3586 of file TypePrinter.h.

 {
     std::stringstream str;
     str << matmul_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [10/99]

std::string arm_compute::to_string ( const arm_compute::NormalizationLayerInfo & info )

inline

Formatted output of NormalizationLayerInfo.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 613 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info.type() << ":NormSize=" << info.norm_size();
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [11/99]

std::string arm_compute::to_string ( const BorderMode & mode )

inline

Formatted output of the BorderMode type.

Parameters

[in] mode Type to output.

Returns: Formatted string.

Definition at line 1636 of file TypePrinter.h.

 {
     std::stringstream str;
     str << mode;
     return str.str();
 }

References clang_tidy_rules::mode, and caffe_mnist_image_extractor::str.

◆ to_string() [12/99]

std::string arm_compute::to_string ( const BorderSize & border )

inline

Formatted output of the BorderSize type.

Parameters

[in] border Type to output.

Returns: Formatted string.

Definition at line 1649 of file TypePrinter.h.

 {
     std::stringstream str;
     str << border;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [13/99]

std::string arm_compute::to_string ( const BoundingBoxTransformInfo & bbox_info )

inline

Formatted output of the BoundingBoxTransformInfo type.

Parameters

[in] bbox_info Type to output.

Returns: Formatted string.

Definition at line 366 of file TypePrinter.h.

 {
     std::stringstream str;
     str << bbox_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [14/99]

std::string arm_compute::to_string ( const BoxNMSLimitInfo & info )

inline

Converts a BoxNMSLimitInfo to string.

Parameters

[in] info BoxNMSLimitInfo value to be converted

Returns: String representing the corresponding BoxNMSLimitInfo

Definition at line 3122 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [15/99]

std::string arm_compute::to_string ( const Channel & channel )

inline

Formatted output of the Channel type.

Parameters

[in] channel Type to output.

Returns: Formatted string.

Definition at line 1000 of file TypePrinter.h.

 {
     std::stringstream str;
     str << channel;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [16/99]

std::string arm_compute::to_string ( const ClComponentElementwiseBinary::Attributes::ElementwiseOp & op )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::ClComponentElementwiseBinary::Attributes::ElementwiseOp type.

Parameters

[in] op arm_compute::experimental::dynamic_fusion::ClComponentElementwiseBinary::Attributes::ElementwiseOp type to output.

Returns: Formatted string.

Definition at line 68 of file ElementwiseBinary.h.

 {
     std::stringstream str;
     str << op;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

Referenced by ClSoftmaxKernel::configure(), arm_compute::graph::backends::detail::create_batch_normalization_layer(), arm_compute::graph::backends::detail::create_convolution_layer(), arm_compute::graph::backends::detail::create_depthwise_convolution_layer(), arm_compute::graph::backends::detail::create_fused_convolution_batch_normalization_layer(), arm_compute::graph::backends::detail::create_fused_depthwise_convolution_batch_normalization_layer(), InitializerListDataset< T >::iterator::description(), GpuCkwElementwiseBinary::get_name(), handle_cl_error(), arm_compute::test::framework::make_printable(), operator<<(), arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_config_native(), arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_config_reshaped(), arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_config_reshaped_only_rhs(), arm_compute::cl_gemm::auto_heuristics::select_mlgo_gemm_kernel(), to_string(), to_string_if_not_null(), CpuGemm::validate(), DotGraphVisitor::visit(), and CLMutableCommandBuffer::~CLMutableCommandBuffer().

◆ to_string() [17/99]

std::string arm_compute::to_string ( const CLTunerMode val )

inline

Convert a CLTunerMode value to a string.

Parameters

val	CLTunerMode value to be converted

Returns: String representing the corresponding CLTunerMode.

Definition at line 2518 of file TypePrinter.h.

 {
     switch (val)
     {
         case CLTunerMode::EXHAUSTIVE:
         {
             return std::string("Exhaustive");
         }
         case CLTunerMode::NORMAL:
         {
             return std::string("Normal");
         }
         case CLTunerMode::RAPID:
         {
             return std::string("Rapid");
         }
         default:
         {
             ARM_COMPUTE_ERROR("Invalid tuner mode.");
             return std::string("UNDEFINED");
         }
     }
 }

References ARM_COMPUTE_ERROR, EXHAUSTIVE, NORMAL, and RAPID.

◆ to_string() [18/99]

std::string arm_compute::to_string ( const ComparisonOperation & op )

inline

Formatted output of the Comparison Operations.

Parameters

[in] op Type to output.

Returns: Formatted string.

Definition at line 1933 of file TypePrinter.h.

 {
     std::stringstream str;
     str << op;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [19/99]

std::string arm_compute::to_string ( const ComputeAnchorsInfo & anchors_info )

inline

Formatted output of the ComputeAnchorsInfo type.

Parameters

[in] anchors_info Type to output.

Returns: Formatted string.

Definition at line 392 of file TypePrinter.h.

 {
     std::stringstream str;
     str << anchors_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [20/99]

std::string arm_compute::to_string ( const Conv2dInfo & conv_info )

inline

Converts a Conv2dInfo to string.

Parameters

[in] conv_info Conv2dInfo value to be converted

Returns: String representing the corresponding Conv2dInfo

Definition at line 2745 of file TypePrinter.h.

 {
     std::stringstream str;
     str << conv_info;
     return str.str();
 }

References arm_compute::test::validation::conv_info, and caffe_mnist_image_extractor::str.

◆ to_string() [21/99]

std::string arm_compute::to_string ( const Conv3dInfo & conv3d_info )

inline

Formatted output of the Conv3dInfo type.

Parameters

[in] conv3d_info Type to output.

Returns: Formatted string.

Definition at line 3172 of file TypePrinter.h.

 {
     std::stringstream str;
     str << conv3d_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [22/99]

std::string arm_compute::to_string ( const ConvertPolicy & policy )

inline

Definition at line 1732 of file TypePrinter.h.

 {
     std::stringstream str;
     str << policy;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [23/99]

std::string arm_compute::to_string ( const ConvolutionInfo & info )

inline

Converts a ConvolutionInfo to string.

Parameters

[in] info ConvolutionInfo value to be converted

Returns: String representing the corresponding ConvolutionInfo

Definition at line 2605 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [24/99]

std::string arm_compute::to_string ( const ConvolutionMethod & conv_method )

inline

Formatted output of the ConvolutionMethod type.

Parameters

[in] conv_method Type to output

Returns: Formatted string.

Definition at line 2200 of file TypePrinter.h.

 {
     std::stringstream str;
     str << conv_method;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [25/99]

std::string arm_compute::to_string ( const Coordinates & coord )

inline

Formatted output of the Coordinates type.

Parameters

[in] coord Type to output.

Returns: Formatted string.

Definition at line 1378 of file TypePrinter.h.

 {
     std::stringstream str;
     str << coord;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [26/99]

std::string arm_compute::to_string ( const Coordinates2D & coord_2d )

inline

Converts a Coordinates2D to string.

Parameters

[in] coord_2d Coordinates2D value to be converted

Returns: String representing the corresponding Coordinates2D

Definition at line 2921 of file TypePrinter.h.

 {
     std::stringstream str;
     str << coord_2d;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [27/99]

std::string arm_compute::to_string ( const DetectionOutputLayerCodeType & detection_code )

inline

Formatted output of the DetectionOutputLayerCodeType type.

Parameters

[in] detection_code Type to output

Returns: Formatted string.

Definition at line 2385 of file TypePrinter.h.

 {
     std::stringstream str;
     str << detection_code;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [28/99]

std::string arm_compute::to_string ( const DetectionOutputLayerInfo & detection_info )

inline

Formatted output of the DetectionOutputLayerInfo type.

Parameters

[in] detection_info Type to output

Returns: Formatted string.

Definition at line 2422 of file TypePrinter.h.

 {
     std::stringstream str;
     str << detection_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [29/99]

std::string arm_compute::to_string ( const DetectionPostProcessLayerInfo & detection_info )

inline

Formatted output of the DetectionPostProcessLayerInfo type.

Parameters

[in] detection_info Type to output

Returns: Formatted string.

Definition at line 2458 of file TypePrinter.h.

 {
     std::stringstream str;
     str << detection_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [30/99]

std::string arm_compute::to_string ( const DetectionWindow & detection_window )

inline

Formatted output of the DetectionWindow type.

Parameters

[in] detection_window Type to output

Returns: Formatted string.

Definition at line 2471 of file TypePrinter.h.

 {
     std::stringstream str;
     str << detection_window;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [31/99]

std::string arm_compute::to_string ( const DimensionRoundingType & rounding_type )

inline

Converts a DimensionRoundingType to string.

Parameters

[in] rounding_type DimensionRoundingType value to be converted

Returns: String representing the corresponding DimensionRoundingType

Definition at line 3135 of file TypePrinter.h.

 {
     std::stringstream str;
     str << rounding_type;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [32/99]

std::string arm_compute::to_string ( const Dimensions< T > & dimensions )

inline

Formatted output of the Dimensions type.

Parameters

[in] dimensions Type to output.

Returns: Formatted string.

Definition at line 1339 of file TypePrinter.h.

 {
     std::stringstream str;
     str << dimensions;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [33/99]

std::string arm_compute::to_string ( const ElementWiseUnary & op )

inline

Formatted output of the Elementwise unary Operations.

Parameters

[in] op Type to output.

Returns: Formatted string.

Definition at line 1946 of file TypePrinter.h.

 {
     std::stringstream str;
     str << op;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [34/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::CastAttributes & cast_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::CastAttributes type.

Parameters

[in] cast_attr arm_compute::experimental::dynamic_fusion::CastAttributes type to output.

Returns: Formatted string.

Definition at line 3403 of file TypePrinter.h.

 {
     std::stringstream str;
     str << cast_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [35/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::ClampAttributes & clamp_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::ClampAttributes type.

Parameters

[in] clamp_attr arm_compute::experimental::dynamic_fusion::ClampAttributes type to output.

Returns: Formatted string.

Definition at line 3464 of file TypePrinter.h.

 {
     std::stringstream str;
     str << clamp_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [36/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::Conv2dAttributes & conv2d_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::Conv2dAttributes type.

Parameters

[in] conv2d_attr arm_compute::experimental::dynamic_fusion::Conv2dAttributes type to output.

Returns: Formatted string.

Definition at line 3374 of file TypePrinter.h.

 {
     std::stringstream str;
     str << conv2d_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [37/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::DepthwiseConv2dAttributes & dw_conv2d_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::DepthwiseConv2dAttributes type.

Parameters

[in] dw_conv2d_attr arm_compute::experimental::dynamic_fusion::DepthwiseConv2dAttributes type to output.

Returns: Formatted string.

Definition at line 3436 of file TypePrinter.h.

 {
     std::stringstream str;
     str << dw_conv2d_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [38/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::GpuPool2dSettings & settings )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::GpuPool2dSettings type.

Parameters

[in] settings arm_compute::experimental::dynamic_fusion::GpuPool2dSettings type to output.

Returns: Formatted string.

Definition at line 3343 of file TypePrinter.h.

 {
     std::stringstream str;
     str << settings;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [39/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::Pool2dAttributes & pool2d_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::Pool2dAttributes type.

Parameters

[in] pool2d_attr arm_compute::experimental::dynamic_fusion::Pool2dAttributes type to output.

Returns: Formatted string.

Definition at line 3317 of file TypePrinter.h.

 {
     std::stringstream str;
     str << pool2d_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [40/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::ResizeAttributes & resize_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::ResizeAttributes type.

Parameters

[in] resize_attr arm_compute::experimental::dynamic_fusion::ResizeAttributes type to output.

Returns: Formatted string.

Definition at line 3496 of file TypePrinter.h.

 {
     std::stringstream str;
     str << resize_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [41/99]

std::string arm_compute::to_string ( const experimental::dynamic_fusion::SoftmaxAttributes & softmax_attr )

inline

Formatted output of the arm_compute::experimental::dynamic_fusion::SoftmaxAttributes type.

Parameters

[in] softmax_attr arm_compute::experimental::dynamic_fusion::SoftmaxAttributes type to output.

Returns: Formatted string.

Definition at line 3526 of file TypePrinter.h.

 {
     std::stringstream str;
     str << softmax_attr;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [42/99]

std::string arm_compute::to_string ( const FFT1DInfo & fft1d_info )

inline

Converts a FFT1DInfo to string.

Parameters

[in] fft1d_info FFT1DInfo value to be converted

Returns: String representing the corresponding FFT1DInfo

Definition at line 2866 of file TypePrinter.h.

 {
     std::stringstream str;
     str << fft1d_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [43/99]

std::string arm_compute::to_string ( const FFT2DInfo & fft2d_info )

inline

Converts a FFT2DInfo to string.

Parameters

[in] fft2d_info FFT2DInfo value to be converted

Returns: String representing the corresponding FFT2DInfo

Definition at line 2894 of file TypePrinter.h.

 {
     std::stringstream str;
     str << fft2d_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [44/99]

std::string arm_compute::to_string ( const FFTDirection & fft_dir )

inline

Converts a FFT1DInfo to string.

Parameters

[in] fft_dir FFT1DInfo value to be converted

Returns: String representing the corresponding FFT1DInfo

Definition at line 2839 of file TypePrinter.h.

 {
     std::stringstream str;
     str << "{" << fft_dir << "}";
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [45/99]

std::string arm_compute::to_string ( const Format & format )

inline

Formatted output of the Format type.

Parameters

[in] format Type to output.

Returns: Formatted string.

Definition at line 933 of file TypePrinter.h.

 {
     std::stringstream str;
     str << format;
     return str.str();
 }

References update_supported_ops::format, and caffe_mnist_image_extractor::str.

◆ to_string() [46/99]

std::string arm_compute::to_string ( const FullyConnectedLayerInfo & info )

inline

Converts a FullyConnectedLayerInfo to string.

Parameters

[in] info FullyConnectedLayerInfo value to be converted

Returns: String representing the corresponding FullyConnectedLayerInfo

Definition at line 2636 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [47/99]

std::string arm_compute::to_string ( const FuseBatchNormalizationType & fuse_type )

inline

Converts a FuseBatchNormalizationType to string.

Parameters

[in] fuse_type FuseBatchNormalizationType value to be converted

Returns: String representing the corresponding FuseBatchNormalizationType

Definition at line 2957 of file TypePrinter.h.

 {
     std::stringstream str;
     str << fuse_type;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [48/99]

std::string arm_compute::to_string ( const GEMMInfo & info )

inline

Formatted output of the GEMMInfo type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1495 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [49/99]

std::string arm_compute::to_string ( const GEMMKernelInfo & gemm_info )

inline

Formatted output of the GEMMKernelInfo type.

Parameters

[in] gemm_info GEMMKernelInfo Type to output.

Returns: Formatted string.

Definition at line 331 of file TypePrinter.h.

 {
     std::stringstream str;
     str << gemm_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [50/99]

std::string arm_compute::to_string ( const GEMMLHSMatrixInfo & gemm_info )

inline

Formatted output of the GEMMLHSMatrixInfo type.

Parameters

[in] gemm_info GEMMLHSMatrixInfo to output.

Returns: Formatted string.

Definition at line 318 of file TypePrinter.h.

 {
     std::stringstream str;
     str << gemm_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [51/99]

std::string arm_compute::to_string ( const GEMMLowpOutputStageInfo & gemm_info )

inline

Converts a GEMMLowpOutputStageInfo to string.

Parameters

[in] gemm_info GEMMLowpOutputStageInfo value to be converted

Returns: String representing the corresponding GEMMLowpOutputStageInfo

Definition at line 2714 of file TypePrinter.h.

 {
     std::stringstream str;
     str << gemm_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [52/99]

std::string arm_compute::to_string ( const GEMMLowpOutputStageType & gemm_type )

inline

Converts a GEMMLowpOutputStageType to string.

Parameters

[in] gemm_type GEMMLowpOutputStageType value to be converted

Returns: String representing the corresponding GEMMLowpOutputStageType

Definition at line 2678 of file TypePrinter.h.

 {
     std::stringstream str;
     str << gemm_type;
     return str.str();
 }

References arm_compute::mlgo::parser::gemm_type(), and caffe_mnist_image_extractor::str.

◆ to_string() [53/99]

std::string arm_compute::to_string ( const GEMMReshapeInfo & info )

inline

Formatted output of the GEMMReshapeInfo type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1482 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [54/99]

std::string arm_compute::to_string ( const GEMMRHSMatrixInfo & gemm_info )

inline

Formatted output of the GEMMRHSMatrixInfo type.

Parameters

[in] gemm_info GEMMRHSMatrixInfo to output.

Returns: Formatted string.

Definition at line 305 of file TypePrinter.h.

 {
     std::stringstream str;
     str << gemm_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [55/99]

std::string arm_compute::to_string ( const GenerateProposalsInfo & proposals_info )

inline

Formatted output of the GenerateProposalsInfo type.

Parameters

[in] proposals_info Type to output.

Returns: Formatted string.

Definition at line 418 of file TypePrinter.h.

 {
     std::stringstream str;
     str << proposals_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [56/99]

std::string arm_compute::to_string ( const GPUTarget & gpu_target )

inline

Formatted output of the GPUTarget type.

Parameters

[in] gpu_target Type to output

Returns: Formatted string.

Definition at line 2324 of file TypePrinter.h.

 {
     std::stringstream str;
     str << gpu_target;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [57/99]

std::string arm_compute::to_string ( const InterpolationPolicy & policy )

inline

Formatted output of the InterpolationPolicy type.

Parameters

[in] policy Type to output.

Returns: Formatted string.

Definition at line 1688 of file TypePrinter.h.

 {
     std::stringstream str;
     str << policy;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [58/99]

std::string arm_compute::to_string ( const ITensor * tensor )

inline

Formatted output of the ITensorInfo type obtained from const ITensor* type.

Parameters

[in] tensor Type to output.

Returns: Formatted string.

Definition at line 1239 of file TypePrinter.h.

 {
     std::string ret_str = "nullptr";
     if (tensor != nullptr)
     {
         std::stringstream str;
         str << "ITensor->info(): " << tensor->info();
         ret_str = str.str();
     }
     return ret_str;
 }

References CLTensor::info(), caffe_mnist_image_extractor::str, and tensor.

◆ to_string() [59/99]

std::string arm_compute::to_string ( const ITensorInfo & info )

inline

Formatted output of the const ITensorInfo& type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1197 of file TypePrinter.h.

 {
     std::stringstream str;
     str << &info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [60/99]

std::string arm_compute::to_string ( const ITensorInfo * info )

inline

Formatted output of the const ITensorInfo* type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1210 of file TypePrinter.h.

 {
     std::string ret_str = "nullptr";
     if (info != nullptr)
     {
         std::stringstream str;
         str << info;
         ret_str = str.str();
     }
     return ret_str;
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [61/99]

std::string arm_compute::to_string ( const LSTMParams< T > & lstm_params )

Converts a LSTMParams to string.

Parameters

[in] lstm_params LSTMParams<T> value to be converted

Returns: String representing the corresponding LSTMParams

Definition at line 3036 of file TypePrinter.h.

 {
     std::stringstream str;
     str << lstm_params;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [62/99]

std::string arm_compute::to_string ( const Multiples & multiples )

inline

Formatted output of the Multiples type.

Parameters

[in] multiples Type to output.

Returns: Formatted string.

Definition at line 1675 of file TypePrinter.h.

 {
     std::stringstream str;
     str << multiples;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [63/99]

std::string arm_compute::to_string ( const NMSType nms_type )

inline

Converts a NMSType to string.

Parameters

[in] nms_type NMSType value to be converted

Returns: String representing the corresponding NMSType

Definition at line 3088 of file TypePrinter.h.

 {
     std::stringstream str;
     str << nms_type;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [64/99]

std::string arm_compute::to_string ( const NormType & type )

inline

Formatted output of the Norm Type.

Parameters

[in] type Type to output.

Returns: Formatted string.

Definition at line 1959 of file TypePrinter.h.

 {
     std::stringstream str;
     str << type;
     return str.str();
 }

References caffe_mnist_image_extractor::str, and type.

◆ to_string() [65/99]

std::string arm_compute::to_string ( const Padding2D & padding2d )

inline

Converts a Padding2D to string.

Parameters

[in] padding2d Padding2D value to be converted

Returns: String representing the corresponding Padding2D

Definition at line 3284 of file TypePrinter.h.

 {
     std::stringstream str;
     str << padding2d;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [66/99]

std::string arm_compute::to_string ( const Padding3D & padding3d )

inline

Converts a Padding3D to string.

Parameters

[in] padding3d Padding3D value to be converted

Returns: String representing the corresponding Padding3D

Definition at line 2048 of file TypePrinter.h.

 {
     std::stringstream str;
     str << padding3d;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [67/99]

std::string arm_compute::to_string ( const PaddingList & padding )

inline

Formatted output of the PaddingList type.

Parameters

[in] padding Type to output.

Returns: Formatted string.

Definition at line 1662 of file TypePrinter.h.

 {
     std::stringstream str;
     str << padding;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [68/99]

std::string arm_compute::to_string ( const PaddingMode & mode )

inline

Formatted output of the PaddingMode type.

Parameters

[in] mode Type to output.

Returns: Formatted string.

Definition at line 1593 of file TypePrinter.h.

 {
     std::stringstream str;
     str << mode;
     return str.str();
 }

References clang_tidy_rules::mode, and caffe_mnist_image_extractor::str.

◆ to_string() [69/99]

std::string arm_compute::to_string ( const PadStrideInfo & pad_stride_info )

inline

Formatted output of the PadStrideInfo type.

Parameters

[in] pad_stride_info Type to output.

Returns: Formatted string.

Definition at line 1623 of file TypePrinter.h.

 {
     std::stringstream str;
     str << pad_stride_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [70/99]

std::string arm_compute::to_string ( const PixelValue & pixel_value )

inline

Converts a PixelValue to string.

Parameters

[in] pixel_value PixelValue value to be converted

Returns: String representing the corresponding PixelValue

Definition at line 2771 of file TypePrinter.h.

 {
     std::stringstream str;
     str << pixel_value;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [71/99]

std::string arm_compute::to_string ( const Pooling3dLayerInfo & info )

inline

Formatted output of the Pooling 3d Layer Info.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 2109 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [72/99]

std::string arm_compute::to_string ( const PoolingLayerInfo & info )

inline

Formatted output of the Pooling Layer Info.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1985 of file TypePrinter.h.

 {
     std::stringstream str;
     str << "{Type=" << info.pool_type << ","
         << "DataLayout=" << info.data_layout << ","
         << "IsGlobalPooling=" << info.is_global_pooling;
     if (!info.is_global_pooling)
     {
         str << ","
             << "PoolSize=" << info.pool_size.width << "," << info.pool_size.height << ","
             << "PadStride=" << info.pad_stride_info;
     }
     str << "}";
     return str.str();
 }

References ScaleKernelInfo::data_layout, arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [73/99]

std::string arm_compute::to_string ( const PoolingType & type )

inline

Formatted output of the Pooling Type.

Parameters

[in] type Type to output.

Returns: Formatted string.

Definition at line 1972 of file TypePrinter.h.

 {
     std::stringstream str;
     str << type;
     return str.str();
 }

References caffe_mnist_image_extractor::str, and type.

◆ to_string() [74/99]

std::string arm_compute::to_string ( const PriorBoxLayerInfo & info )

inline

Formatted output of the PriorBoxLayerInfo.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 2122 of file TypePrinter.h.

 {
     std::stringstream str;
     str << "{";
     str << "Clip:" << info.clip() << "Flip:" << info.flip() << "StepX:" << info.steps()[0]
         << "StepY:" << info.steps()[1] << "MinSizes:" << info.min_sizes().size()
         << "MaxSizes:" << info.max_sizes().size() << "ImgSizeX:" << info.img_size().x
         << "ImgSizeY:" << info.img_size().y << "Offset:" << info.offset() << "Variances:" << info.variances().size();
     str << "}";
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [75/99]

std::string arm_compute::to_string ( const QuantizationInfo & quantization_info )

inline

Formatted output of the QuantizationInfo type.

Parameters

[in] quantization_info Type to output.

Returns: Formatted string.

Definition at line 446 of file TypePrinter.h.

 {
     std::stringstream str;
     str << quantization_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [76/99]

std::string arm_compute::to_string ( const ReductionOperation & op )

inline

Formatted output of the Reduction Operations.

Parameters

[in] op Type to output.

Returns: Formatted string.

Definition at line 1842 of file TypePrinter.h.

 {
     std::stringstream str;
     str << op;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [77/99]

std::string arm_compute::to_string ( const ROIPoolingLayerInfo & pool_info )

inline

Formatted output of the ROIPoolingInfo type.

Parameters

[in] pool_info Type to output.

Returns: Formatted string.

Definition at line 239 of file TypePrinter.h.

 {
     std::stringstream str;
     str << pool_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [78/99]

std::string arm_compute::to_string ( const RoundingPolicy & rounding_policy )

inline

Formatted output of RoundingPolicy.

Parameters

[in] rounding_policy Type to output.

Returns: Formatted string.

Definition at line 680 of file TypePrinter.h.

 {
     std::stringstream str;
     str << rounding_policy;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [79/99]

std::string arm_compute::to_string ( const SamplingPolicy & policy )

inline

Formatted output of the SamplingPolicy type.

Parameters

[in] policy Type to output.

Returns: Formatted string.

Definition at line 1701 of file TypePrinter.h.

 {
     std::stringstream str;
     str << policy;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [80/99]

std::string arm_compute::to_string ( const ScaleKernelInfo & scale_info )

inline

Converts a ScaleKernelInfo to string.

Parameters

[in] scale_info ScaleKernelInfo value to be converted

Returns: String representing the corresponding ScaleKernelInfo

Definition at line 2803 of file TypePrinter.h.

 {
     std::stringstream str;
     str << scale_info;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [81/99]

std::string arm_compute::to_string ( const Size2D & type )

inline

Formatted output of the Size2D type.

Parameters

[in] type Type to output

Returns: Formatted string.

Definition at line 2154 of file TypePrinter.h.

 {
     std::stringstream str;
     str << type;
     return str.str();
 }

References caffe_mnist_image_extractor::str, and type.

◆ to_string() [82/99]

std::string arm_compute::to_string ( const Size3D & type )

inline

Formatted output of the Size3D type.

Parameters

[in] type Type to output

Returns: Formatted string.

Definition at line 2021 of file TypePrinter.h.

 {
     std::stringstream str;
     str << type;
     return str.str();
 }

References caffe_mnist_image_extractor::str, and type.

◆ to_string() [83/99]

std::string arm_compute::to_string ( const SoftmaxKernelInfo & info )

inline

Converts a SoftmaxKernelInfo to string.

Parameters

[in] info SoftmaxKernelInfo value to be converted

Returns: String representing the corresponding SoftmaxKernelInfo

Definition at line 2986 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [84/99]

std::string arm_compute::to_string ( const std::tuple< TensorShape, TensorShape, arm_compute::WeightFormat > values )

inline

Formatted output of the std::tuple<TensorShape, TensorShape, arm_compute::WeightFormat> tuple.

Parameters

[in] values tuple of input and output tensor shapes and WeightFormat used.

Returns: Formatted string.

Definition at line 3253 of file TypePrinter.h.

 {
     std::stringstream str;
     str << "[Input shape = " << std::get<0>(values);
     str << ", ";
     str << "Expected output shape = " << std::get<1>(values);
  
     str << ", ";
     str << "WeightFormat = " << std::get<2>(values) << "]";
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [85/99]

std::string arm_compute::to_string ( const std::vector< T > & args )

Formatted output of a vector of objects.

Parameters

[in] args Vector of objects to print

Returns: String representing args.

Definition at line 147 of file TypePrinter.h.

 {
     std::stringstream str;
     str << args;
     return str.str();
 }

References GemmTuner::args, and caffe_mnist_image_extractor::str.

◆ to_string() [86/99]

std::string arm_compute::to_string ( const Strides & stride )

inline

Formatted output of the Strides type.

Parameters

[in] stride Type to output.

Returns: Formatted string.

Definition at line 1352 of file TypePrinter.h.

 {
     std::stringstream str;
     str << stride;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [87/99]

std::string arm_compute::to_string ( const T & val )

inline

Fallback method: try to use std::to_string:

Parameters

[in] val Value to convert to string

Returns: String representing val.

Definition at line 93 of file TypePrinter.h.

 {
     return support::cpp11::to_string(val);
 }

References arm_compute::support::cpp11::to_string().

◆ to_string() [88/99]

std::string arm_compute::to_string ( const TensorInfo & info )

inline

Formatted output of the const TensorInfo& type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1184 of file TypePrinter.h.

 {
     std::stringstream str;
     str << &info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [89/99]

std::string arm_compute::to_string ( const TensorShape & shape )

inline

Formatted output of the TensorShape type.

Parameters

[in] shape Type to output.

Returns: Formatted string.

Definition at line 1365 of file TypePrinter.h.

 {
     std::stringstream str;
     str << shape;
     return str.str();
 }

References arm_compute::test::validation::shape, and caffe_mnist_image_extractor::str.

◆ to_string() [90/99]

std::string arm_compute::to_string ( const uint8_t num )

inline

Converts a LSTMParams to string.

Parameters

[in] num uint8_t value to be converted

Returns: String representing the corresponding uint8_t

Definition at line 3049 of file TypePrinter.h.

 {
     // Explicity cast the uint8_t to signed integer and call the corresponding overloaded to_string() function.
     return ::std::to_string(static_cast<int>(num));
 }

References to_string().

◆ to_string() [91/99]

std::string arm_compute::to_string ( const WeightFormat wf )

inline

Formatted output of the arm_compute::WeightFormat type.

Parameters

[in] wf arm_compute::WeightFormat Type to output.

Returns: Formatted string.

Definition at line 3185 of file TypePrinter.h.

 {
 #define __CASE_WEIGHT_FORMAT(wf) \
     case WeightFormat::wf:       \
         return #wf;
     switch (wf)
     {
         __CASE_WEIGHT_FORMAT(UNSPECIFIED)
         __CASE_WEIGHT_FORMAT(ANY)
         __CASE_WEIGHT_FORMAT(OHWI)
         __CASE_WEIGHT_FORMAT(OHWIo2)
         __CASE_WEIGHT_FORMAT(OHWIo4)
         __CASE_WEIGHT_FORMAT(OHWIo8)
         __CASE_WEIGHT_FORMAT(OHWIo16)
         __CASE_WEIGHT_FORMAT(OHWIo32)
         __CASE_WEIGHT_FORMAT(OHWIo64)
         __CASE_WEIGHT_FORMAT(OHWIo128)
         __CASE_WEIGHT_FORMAT(OHWIo4i2)
         __CASE_WEIGHT_FORMAT(OHWIo4i2_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo8i2)
         __CASE_WEIGHT_FORMAT(OHWIo8i2_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo16i2)
         __CASE_WEIGHT_FORMAT(OHWIo16i2_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo32i2)
         __CASE_WEIGHT_FORMAT(OHWIo32i2_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo64i2)
         __CASE_WEIGHT_FORMAT(OHWIo64i2_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo4i4)
         __CASE_WEIGHT_FORMAT(OHWIo4i4_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo8i4)
         __CASE_WEIGHT_FORMAT(OHWIo8i4_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo16i4)
         __CASE_WEIGHT_FORMAT(OHWIo16i4_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo32i4)
         __CASE_WEIGHT_FORMAT(OHWIo32i4_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo64i4)
         __CASE_WEIGHT_FORMAT(OHWIo64i4_bf16)
         __CASE_WEIGHT_FORMAT(OHWIo2i8)
         __CASE_WEIGHT_FORMAT(OHWIo4i8)
         __CASE_WEIGHT_FORMAT(OHWIo8i8)
         __CASE_WEIGHT_FORMAT(OHWIo16i8)
         __CASE_WEIGHT_FORMAT(OHWIo32i8)
         __CASE_WEIGHT_FORMAT(OHWIo64i8)
         default:
             return "invalid value";
     }
 #undef __CASE_WEIGHT_FORMAT
 }

References __CASE_WEIGHT_FORMAT, ANY, OHWI, OHWIo128, OHWIo16, OHWIo16i2, OHWIo16i2_bf16, OHWIo16i4, OHWIo16i4_bf16, OHWIo16i8, OHWIo2, OHWIo2i8, OHWIo32, OHWIo32i2, OHWIo32i2_bf16, OHWIo32i4, OHWIo32i4_bf16, OHWIo32i8, OHWIo4, OHWIo4i2, OHWIo4i2_bf16, OHWIo4i4, OHWIo4i4_bf16, OHWIo4i8, OHWIo64, OHWIo64i2, OHWIo64i2_bf16, OHWIo64i4, OHWIo64i4_bf16, OHWIo64i8, OHWIo8, OHWIo8i2, OHWIo8i2_bf16, OHWIo8i4, OHWIo8i4_bf16, OHWIo8i8, and UNSPECIFIED.

◆ to_string() [92/99]

std::string arm_compute::to_string ( const WeightsInfo & info )

inline

Formatted output of the WeightsInfo type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1469 of file TypePrinter.h.

 {
     std::stringstream str;
     str << info;
     return str.str();
 }

References arm_compute::test::validation::info, and caffe_mnist_image_extractor::str.

◆ to_string() [93/99]

std::string arm_compute::to_string ( const Window & win )

inline

Formatted output of the Window& type.

Parameters

[in] win Type to output.

Returns: Formatted string.

Definition at line 1520 of file TypePrinter.h.

 {
     std::stringstream str;
     str << win;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [94/99]

std::string arm_compute::to_string ( const Window::Dimension & dim )

inline

Formatted output of the Window::Dimension type.

Parameters

[in] dim Type to output.

Returns: Formatted string.

Definition at line 1508 of file TypePrinter.h.

 {
     std::stringstream str;
     str << dim;
     return str.str();
 }

References caffe_mnist_image_extractor::str.

◆ to_string() [95/99]

std::string arm_compute::to_string ( const WinogradInfo & type )

inline

Definition at line 2505 of file TypePrinter.h.

 {
     std::stringstream str;
     str << type;
     return str.str();
 }

References caffe_mnist_image_extractor::str, and type.

◆ to_string() [96/99]

std::string arm_compute::to_string ( ITensor & tensor )

inline

Formatted output of the ITensorInfo type obtained from the ITensor& type.

Parameters

[in] tensor Type to output.

Returns: Formatted string.

Definition at line 1268 of file TypePrinter.h.

 {
     std::stringstream str;
     str << "ITensor.info(): " << tensor.info();
     return str.str();
 }

References CLTensor::info(), caffe_mnist_image_extractor::str, and tensor.

◆ to_string() [97/99]

std::string arm_compute::to_string ( ITensor * tensor )

inline

Formatted output of the ITensorInfo type obtained from the ITensor* type.

Parameters

[in] tensor Type to output.

Returns: Formatted string.

Definition at line 1257 of file TypePrinter.h.

 {
     return to_string(static_cast<const ITensor *>(tensor));
 }

References tensor, and to_string().

◆ to_string() [98/99]

std::string arm_compute::to_string ( ITensorInfo * info )

inline

Formatted output of the ITensorInfo* type.

Parameters

[in] info Type to output.

Returns: Formatted string.

Definition at line 1228 of file TypePrinter.h.

 {
     return to_string(static_cast<const ITensorInfo *>(info));
 }

References arm_compute::test::validation::info, and to_string().

◆ to_string() [99/99]

std::string arm_compute::to_string ( Window * win )

inline

Formatted output of the Window* type.

Parameters

[in] win Type to output.

Returns: Formatted string.

Definition at line 1533 of file TypePrinter.h.

 {
     std::string ret_str = "nullptr";
     if (win != nullptr)
     {
         std::stringstream str;
         str << *win;
         ret_str = str.str();
     }
     return ret_str;
 }

References caffe_mnist_image_extractor::str.

◆ to_string_if_not_null()

std::string arm_compute::to_string_if_not_null ( T * arg )

Formatted output if arg is not null.

Parameters

[in] arg Object to print

Returns: String representing arg.

Definition at line 74 of file TypePrinter.h.

 {
     if (arg == nullptr)
     {
         return "nullptr";
     }
     else
     {
         return to_string(*arg);
     }
 }

References to_string().

◆ tuner_mode_from_name()

CLTunerMode arm_compute::tuner_mode_from_name ( const std::string & name )

inline

Converts a string to a strong types enumeration CLTunerMode.

Parameters

[in] name String to convert

Returns: Converted CLTunerMode enumeration

Definition at line 57 of file CLTunerTypes.h.

 {
     static const std::map<std::string, CLTunerMode> tuner_modes = {
         {"exhaustive", CLTunerMode::EXHAUSTIVE},
         {"normal", CLTunerMode::NORMAL},
         {"rapid", CLTunerMode::RAPID},
     };
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     try
     {
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
         return tuner_modes.at(arm_compute::utility::tolower(name));
  
 #ifndef ARM_COMPUTE_EXCEPTIONS_DISABLED
     }
     catch (const std::out_of_range &)
     {
         throw std::invalid_argument(name);
     }
 #endif /* ARM_COMPUTE_EXCEPTIONS_DISABLED */
 }

References EXHAUSTIVE, name, NORMAL, RAPID, and arm_compute::utility::tolower().

Referenced by operator>>().

◆ update_window_and_padding()

bool arm_compute::update_window_and_padding	(	Window &	win,
		Ts &&...	patterns
	)

Update window and padding size for each of the access patterns.

First the window size is reduced based on all access patterns that are not allowed to modify the padding of the underlying tensor. Then the padding of the remaining tensors is increased to match the window.

Parameters

[in]	win	Window that is used by the kernel.
[in]	patterns	Access patterns used to calculate the final window and padding.

Returns: True if the window has been changed. Changes to the padding do not influence the returned value.

Definition at line 46 of file WindowHelpers.h.

 {
     bool window_changed = false;
  
     utility::for_each([&](const IAccessWindow &w) { window_changed |= w.update_window_if_needed(win); }, patterns...);
  
     utility::for_each([&](IAccessWindow &w) { w.update_padding_if_needed(win); }, patterns...);
  
     return window_changed;
 }

References arm_compute::utility::for_each(), and arm_compute::test::validation::w.

Referenced by ICLSimpleKernel::configure(), NEBitwiseNotKernel::configure(), NEBitwiseOrKernel::configure(), NEBitwiseXorKernel::configure(), and NEBitwiseAndKernel::configure().

◆ upper_string()

std::string upper_string ( const std::string & val )

Raise a given string to upper case.

Parameters

[in] val Given string to lower.

Returns: The upper case string

Definition at line 45 of file StringUtils.cpp.

 {
     std::string res = val;
     std::transform(res.begin(), res.end(), res.begin(), ::toupper);
     return res;
 }

◆ validate()

Status arm_compute::validate	(	const ITensorInfo *	scores_in,
		const ITensorInfo *	boxes_in,
		const ITensorInfo *	batch_splits_in,
		const ITensorInfo *	scores_out,
		const ITensorInfo *	boxes_out,
		const ITensorInfo *	classes,
		const ITensorInfo *	batch_splits_out,
		const ITensorInfo *	keeps,
		const ITensorInfo *	keeps_size,
		const BoxNMSLimitInfo	info
	)

Definition at line 243 of file CPPBoxWithNonMaximaSuppressionLimit.cpp.

 {
     ARM_COMPUTE_UNUSED(batch_splits_in, batch_splits_out, keeps, keeps_size, info);
     ARM_COMPUTE_RETURN_ERROR_ON_NULLPTR(scores_in, boxes_in, scores_out, boxes_out, classes);
     ARM_COMPUTE_RETURN_ERROR_ON_DATA_TYPE_CHANNEL_NOT_IN(scores_in, 1, DataType::QASYMM8, DataType::QASYMM8_SIGNED,
                                                          DataType::F16, DataType::F32);
  
     const bool is_qasymm8 =
         scores_in->data_type() == DataType::QASYMM8 || scores_in->data_type() == DataType::QASYMM8_SIGNED;
     if (is_qasymm8)
     {
         ARM_COMPUTE_RETURN_ERROR_ON_DATA_TYPE_CHANNEL_NOT_IN(boxes_in, 1, DataType::QASYMM16);
         ARM_COMPUTE_RETURN_ERROR_ON_MISMATCHING_DATA_TYPES(boxes_in, boxes_out);
         ARM_COMPUTE_RETURN_ERROR_ON_MISMATCHING_QUANTIZATION_INFO(boxes_in, boxes_out);
         const UniformQuantizationInfo boxes_qinfo = boxes_in->quantization_info().uniform();
         ARM_COMPUTE_RETURN_ERROR_ON(boxes_qinfo.scale != 0.125f);
         ARM_COMPUTE_RETURN_ERROR_ON(boxes_qinfo.offset != 0);
     }
  
     return Status{};
 }

References ARM_COMPUTE_RETURN_ERROR_ON, ARM_COMPUTE_RETURN_ERROR_ON_DATA_TYPE_CHANNEL_NOT_IN, ARM_COMPUTE_RETURN_ERROR_ON_MISMATCHING_DATA_TYPES, ARM_COMPUTE_RETURN_ERROR_ON_MISMATCHING_QUANTIZATION_INFO, ARM_COMPUTE_RETURN_ERROR_ON_NULLPTR, ARM_COMPUTE_UNUSED, ITensorInfo::data_type(), F16, F32, arm_compute::test::validation::info, UniformQuantizationInfo::offset, QASYMM16, QASYMM8, QASYMM8_SIGNED, ITensorInfo::quantization_info(), UniformQuantizationInfo::scale, and QuantizationInfo::uniform().

◆ vdequantize() [1/9]

float32x4x4_t arm_compute::vdequantize	(	const int16x8x2_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Dequantize a neon vector holding 16 16-bit quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	qi	Quantization information to be used in the computation.

Returns: Dequantized values in a neon vector

Definition at line 171 of file NESymm.h.

 {
     const float         scale              = qi.scale;
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(qv.val[0]))), vscale),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(qv.val[0]))), vscale),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(qv.val[1]))), vscale),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(qv.val[1]))), vscale),
     }};
     return vdequantized_input;
 }

References UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

◆ vdequantize() [2/9]

float32x4x4_t arm_compute::vdequantize	(	const int8x16_t &	qv,
		const float32x4x4_t	vscale
	)

inline

Dequantize following symmetric quantization scheme a neon vector holding 16 quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	vscale	Vector containing quantization scaling factors.

Returns: Dequantized values in a neon vector

Definition at line 554 of file NEAsymm.h.

 {
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_low_s8(qv))))), vscale.val[0]),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_low_s8(qv))))), vscale.val[1]),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_high_s8(qv))))), vscale.val[2]),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_high_s8(qv))))), vscale.val[3]),
     }};
     return vdequantized_input;
 }

◆ vdequantize() [3/9]

float32x4x4_t arm_compute::vdequantize	(	const int8x16_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Dequantize a neon vector holding 16 signed quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	qi	Quantization information to be used in the computation.

Returns: Dequantized values in a neon vector

Definition at line 482 of file NEAsymm.h.

 {
     const float         scale              = qi.scale;
     const int           offset             = qi.offset;
     const int32x4_t     voffset            = vdupq_n_s32(offset);
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_low_s8(qv)))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_low_s8(qv)))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_high_s8(qv)))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_high_s8(qv)))), voffset)), vscale),
     }};
     return vdequantized_input;
 }

References UniformQuantizationInfo::offset, offset(), UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

◆ vdequantize() [4/9]

float32x4x4_t arm_compute::vdequantize	(	const int8x16_t &	qv,
		float	scale
	)

inline

Dequantize following a symmetric quantization scheme a neon vector holding 16 quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	scale	Quantization scaling factor.

Returns: Dequantized values in a neon vector

Definition at line 572 of file NEAsymm.h.

 {
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_low_s8(qv))))), vscale),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_low_s8(qv))))), vscale),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_high_s8(qv))))), vscale),
         vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_high_s8(qv))))), vscale),
     }};
     return vdequantized_input;
 }

References arm_compute::test::validation::scale.

◆ vdequantize() [5/9]

float32x4x4_t arm_compute::vdequantize	(	const int8x16_t &	qv,
		float	scale,
		int32_t	offset
	)

inline

Dequantize a vector of 16 values stored as signed asymmetric.

Parameters

[in]	qv	Input values to be dequantized.
[in]	scale	Quantization scaling factor.
[in]	offset	Zero quantization offset.

Returns: Dequantized values in a neon vector

Definition at line 534 of file NEAsymm.h.

 {
     const int32x4_t     voffset            = vdupq_n_s32(offset);
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_low_s8(qv)))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_low_s8(qv)))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_low_s16(vmovl_s8(vget_high_s8(qv)))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_high_s16(vmovl_s8(vget_high_s8(qv)))), voffset)), vscale),
     }};
     return vdequantized_input;
 }

References offset(), and arm_compute::test::validation::scale.

◆ vdequantize() [6/9]

float32x4x2_t arm_compute::vdequantize	(	const int8x8_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Dequantize a neon vector holding 8 singed quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	qi	Quantization information to be used in the computation.

Returns: Dequantized values in a neon vector

Definition at line 432 of file NEAsymm.h.

 {
     const float         scale              = qi.scale;
     const int           offset             = qi.offset;
     const int32x4_t     voffset            = vdupq_n_s32(offset);
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x2_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_low_s16(vmovl_s8(qv))), voffset)), vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vmovl_s16(vget_high_s16(vmovl_s8(qv))), voffset)), vscale),
     }};
     return vdequantized_input;
 }

References UniformQuantizationInfo::offset, offset(), UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

◆ vdequantize() [7/9]

float32x4x4_t arm_compute::vdequantize	(	const uint8x16_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Dequantize a neon vector holding 16 quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	qi	Quantization information to be used in the computation.

Returns: Dequantized values in a neon vector

Definition at line 452 of file NEAsymm.h.

 {
     const float         scale              = qi.scale;
     const int           offset             = qi.offset;
     const int32x4_t     voffset            = vdupq_n_s32(offset);
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_low_u16(vmovl_u8(vget_low_u8(qv))))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_high_u16(vmovl_u8(vget_low_u8(qv))))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_low_u16(vmovl_u8(vget_high_u8(qv))))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_high_u16(vmovl_u8(vget_high_u8(qv))))), voffset)),
                   vscale),
     }};
     return vdequantized_input;
 }

References UniformQuantizationInfo::offset, offset(), UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

◆ vdequantize() [8/9]

float32x4x4_t arm_compute::vdequantize	(	const uint8x16_t &	qv,
		float	scale,
		int32_t	offset
	)

inline

Dequantize following an asymmetric quantization scheme a neon vector holding 16 quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	scale	Quantization scaling factor.
[in]	offset	Zero quantization offset.

Returns: Dequantized values in a neon vector

Definition at line 505 of file NEAsymm.h.

 {
     const int32x4_t     voffset            = vdupq_n_s32(offset);
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x4_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_low_u16(vmovl_u8(vget_low_u8(qv))))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_high_u16(vmovl_u8(vget_low_u8(qv))))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_low_u16(vmovl_u8(vget_high_u8(qv))))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(
                       vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_high_u16(vmovl_u8(vget_high_u8(qv))))), voffset)),
                   vscale),
     }};
     return vdequantized_input;
 }

References offset(), and arm_compute::test::validation::scale.

◆ vdequantize() [9/9]

float32x4x2_t arm_compute::vdequantize	(	const uint8x8_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Dequantize a neon vector holding 8 quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	qi	Quantization information to be used in the computation.

Returns: Dequantized values in a neon vector

Definition at line 410 of file NEAsymm.h.

 {
     const float         scale              = qi.scale;
     const int           offset             = qi.offset;
     const int32x4_t     voffset            = vdupq_n_s32(offset);
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x2_t vdequantized_input = {{
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_low_u16(vmovl_u8(qv)))), voffset)),
                   vscale),
         vmulq_f32(vcvtq_f32_s32(vsubq_s32(vreinterpretq_s32_u32(vmovl_u16(vget_high_u16(vmovl_u8(qv)))), voffset)),
                   vscale),
     }};
     return vdequantized_input;
 }

References UniformQuantizationInfo::offset, offset(), UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

Referenced by arm_compute::cpu::elementwise_comp_quantized_signed(), arm_compute::cpu::elementwise_op< int8_t >(), arm_compute::cpu::elementwise_op< uint8_t >(), arm_compute::cpu::elementwise_op_quantized(), arm_compute::cpu::elementwise_op_quantized_signed(), arm_compute::cpu::neon_qasymm8_activation(), arm_compute::cpu::neon_qasymm8_signed_activation(), CpuConcatenateWidthKernel::run_op(), and CpuConcatenateHeightKernel::run_op().

◆ vdequantize_int16()

float32x4x2_t arm_compute::vdequantize_int16	(	const int16x8_t &	qv,
		float	scale
	)

inline

Dequantize a neon vector holding 8 16-bit quantized values.

Parameters

[in]	qv	Input values to be dequantized.
[in]	scale	Quantization scale

Returns: Dequantized values in a neon vector

Definition at line 135 of file NESymm.h.

 {
     const float32x4_t   vscale             = vdupq_n_f32(scale);
     const float32x4x2_t vdequantized_input = {{vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_low_s16(qv))), vscale),
                                                vmulq_f32(vcvtq_f32_s32(vmovl_s16(vget_high_s16(qv))), vscale)}};
     return vdequantized_input;
 }

References arm_compute::test::validation::scale.

Referenced by arm_compute::cpu::neon_qsymm16_activation().

◆ verfq_f32()

float32x4_t arm_compute::verfq_f32 ( float32x4_t x )

Calculate error function.

Parameters

[in] x Input vector in F32 format.

Returns: The calculated erf.

◆ vexpq_f32()

float32x4_t arm_compute::vexpq_f32 ( float32x4_t x )

Calculate exponential.

Parameters

[in] x Input vector value in F32 format.

Returns: The calculated exponent.

Referenced by arm_compute::cpu::neon_softmax_quantized().

◆ vfloorq_f32()

float32x4_t arm_compute::vfloorq_f32 ( float32x4_t val )

Calculate floor of a vector.

Parameters

[in] val Input vector value in F32 format.

Returns: The calculated floor vector.

Referenced by arm_compute::cpu::elementwise_arithm_op< ArithmeticOperation::DIV, typename wrapper::traits::neon_vector< int32_t, 4 > >(), and arm_compute::cpu::fp32_neon_floor().

◆ vinv_f32()

float32x2_t arm_compute::vinv_f32 ( float32x2_t x )

Calculate reciprocal.

Parameters

[in] x Input value.

Returns: The calculated reciprocal.

◆ vinvq_f32()

float32x4_t arm_compute::vinvq_f32 ( float32x4_t x )

Calculate reciprocal.

Parameters

[in] x Input value.

Returns: The calculated reciprocal.

◆ vinvsqrt_f32()

float32x2_t arm_compute::vinvsqrt_f32 ( float32x2_t x )

Calculate inverse square root.

Parameters

[in] x Input value.

Returns: The calculated inverse square root.

◆ vinvsqrtq_f32()

float32x4_t arm_compute::vinvsqrtq_f32 ( float32x4_t x )

Calculate inverse square root.

Parameters

[in] x Input value.

Returns: The calculated inverse square root.

◆ vlogq_f32()

float32x4_t arm_compute::vlogq_f32 ( float32x4_t x )

Calculate logarithm.

Parameters

[in] x Input vector value in F32 format.

Returns: The calculated logarithm.

◆ vmax2q_f32()

float32x4x2_t arm_compute::vmax2q_f32	(	float32x4x2_t	a,
		float32x4x2_t	b
	)

Compute lane-by-lane maximum between elements of a float vector with 4x2 elements.

Parameters

[in]	a	Float input vector
[in]	b	Float input vector

Returns: The lane-by-lane maximum -> float32x4x2

◆ vmlaq_qasymm8()

qasymm8x16_t vmlaq_qasymm8	(	qasymm8x16_t	vd,
		float32x4_t	vs,
		float32x4_t	vo
	)

inline

Perform a multiply-accumulate on all 16 components of a QASYMM8 vector.

vd*vs + vo

Parameters

[in]	vd	Input vector value in QASYMM8 format
[in]	vs	Vector multiplier in F32 format. The multiplier value must be duplicated across all four lanes.
[in]	vo	Vector addend in F32 format. The addend value must be duplicated across all four lanes.

Returns: A 16-component vector in QASYMM8 format, saturated to fit

Definition at line 30 of file NEAsymm.inl.

 {
     // Convert uint8 vectors to uint16 vectors
     const uint8x8_t vd_low        = vget_low_u8(vd);
     const uint8x8_t vd_high       = vget_high_u8(vd);
     uint16x8_t      vd_low_u16x8  = vmovl_u8(vd_low);
     uint16x8_t      vd_high_u16x8 = vmovl_u8(vd_high);
     // Convert uint16 vectors to uint32 vectors
     uint32x4_t A_u32x4 = vmovl_u16(vget_low_u16(vd_low_u16x8));
     uint32x4_t B_u32x4 = vmovl_u16(vget_high_u16(vd_low_u16x8));
     uint32x4_t C_u32x4 = vmovl_u16(vget_low_u16(vd_high_u16x8));
     uint32x4_t D_u32x4 = vmovl_u16(vget_high_u16(vd_high_u16x8));
     // Convert uint32 vectors to float32 vectors
     float32x4_t A_f32x4 = vcvtq_f32_u32(A_u32x4);
     float32x4_t B_f32x4 = vcvtq_f32_u32(B_u32x4);
     float32x4_t C_f32x4 = vcvtq_f32_u32(C_u32x4);
     float32x4_t D_f32x4 = vcvtq_f32_u32(D_u32x4);
     // vd = vd*vs + vo
     A_f32x4 = vmlaq_f32(vo, A_f32x4, vs);
     B_f32x4 = vmlaq_f32(vo, B_f32x4, vs);
     C_f32x4 = vmlaq_f32(vo, C_f32x4, vs);
     D_f32x4 = vmlaq_f32(vo, D_f32x4, vs);
     // Convert float32 vectors to uint32 vectors
 #if __aarch64__
     if (round_policy == RoundingPolicy::TO_NEAREST_EVEN)
     {
         A_u32x4 = vcvtnq_u32_f32(A_f32x4);
         B_u32x4 = vcvtnq_u32_f32(B_f32x4);
         C_u32x4 = vcvtnq_u32_f32(C_f32x4);
         D_u32x4 = vcvtnq_u32_f32(D_f32x4);
     }
     else if (round_policy == RoundingPolicy::TO_NEAREST_UP)
     {
         A_u32x4 = vcvtaq_u32_f32(A_f32x4);
         B_u32x4 = vcvtaq_u32_f32(B_f32x4);
         C_u32x4 = vcvtaq_u32_f32(C_f32x4);
         D_u32x4 = vcvtaq_u32_f32(D_f32x4);
     }
     else
     {
         A_u32x4 = vcvtq_u32_f32(A_f32x4);
         B_u32x4 = vcvtq_u32_f32(B_f32x4);
         C_u32x4 = vcvtq_u32_f32(C_f32x4);
         D_u32x4 = vcvtq_u32_f32(D_f32x4);
     }
 #else  // #if __aarch64__
     // rounding mode only supported in aarch64
     A_u32x4 = vcvtq_u32_f32(A_f32x4);
     B_u32x4 = vcvtq_u32_f32(B_f32x4);
     C_u32x4 = vcvtq_u32_f32(C_f32x4);
     D_u32x4 = vcvtq_u32_f32(D_f32x4);
 #endif // #if __aarch64__
     // Convert uint32 vectors to uint16 vectors (with saturation)
     vd_low_u16x8  = vcombine_u16(vqmovn_u32(A_u32x4), vqmovn_u32(B_u32x4));
     vd_high_u16x8 = vcombine_u16(vqmovn_u32(C_u32x4), vqmovn_u32(D_u32x4));
     // convert uint16 vectors to uint8 vectors (with saturation)
     return vcombine_u8(vqmovn_u16(vd_low_u16x8), vqmovn_u16(vd_high_u16x8));
 }

References TO_NEAREST_EVEN, and TO_NEAREST_UP.

◆ vmlaq_qasymm8_signed()

qasymm8x16_signed_t vmlaq_qasymm8_signed	(	qasymm8x16_signed_t	vd,
		float32x4_t	vs,
		float32x4_t	vo
	)

inline

Perform a multiply-accumulate on all 16 components of a QASYMM8_SIGNED vector.

vd*vs + vo

Parameters

[in]	vd	Input vector value in QASYMM8_SIGNED format
[in]	vs	Vector multiplier in F32 format. The multiplier value must be duplicated across all four lanes.
[in]	vo	Vector addend in F32 format. The addend value must be duplicated across all four lanes.

Returns: A 16-component vector in QASYMM8_SIGNED format, saturated to fit

Definition at line 90 of file NEAsymm.inl.

 {
     // Convert uint8 vectors to int16 vectors
     const int8x8_t vd_low        = vget_low_s8(vd);
     const int8x8_t vd_high       = vget_high_s8(vd);
     int16x8_t      vd_low_s16x8  = vmovl_s8(vd_low);
     int16x8_t      vd_high_s16x8 = vmovl_s8(vd_high);
     // Convert int16 vectors to int32 vectors
     int32x4_t A_s32x4 = vmovl_s16(vget_low_s16(vd_low_s16x8));
     int32x4_t B_s32x4 = vmovl_s16(vget_high_s16(vd_low_s16x8));
     int32x4_t C_s32x4 = vmovl_s16(vget_low_s16(vd_high_s16x8));
     int32x4_t D_s32x4 = vmovl_s16(vget_high_s16(vd_high_s16x8));
     // Convert int32 vectors to float32 vectors
     float32x4_t A_f32x4 = vcvtq_f32_s32(A_s32x4);
     float32x4_t B_f32x4 = vcvtq_f32_s32(B_s32x4);
     float32x4_t C_f32x4 = vcvtq_f32_s32(C_s32x4);
     float32x4_t D_f32x4 = vcvtq_f32_s32(D_s32x4);
     // vd = vd*vs + vo
     A_f32x4 = vmlaq_f32(vo, A_f32x4, vs);
     B_f32x4 = vmlaq_f32(vo, B_f32x4, vs);
     C_f32x4 = vmlaq_f32(vo, C_f32x4, vs);
     D_f32x4 = vmlaq_f32(vo, D_f32x4, vs);
 #if __aarch64__
     if (round_policy == RoundingPolicy::TO_NEAREST_EVEN)
     {
         A_s32x4 = vcvtnq_s32_f32(A_f32x4);
         B_s32x4 = vcvtnq_s32_f32(B_f32x4);
         C_s32x4 = vcvtnq_s32_f32(C_f32x4);
         D_s32x4 = vcvtnq_s32_f32(D_f32x4);
     }
     else if (round_policy == RoundingPolicy::TO_NEAREST_UP)
     {
         A_s32x4 = vcvtaq_s32_f32(A_f32x4);
         B_s32x4 = vcvtaq_s32_f32(B_f32x4);
         C_s32x4 = vcvtaq_s32_f32(C_f32x4);
         D_s32x4 = vcvtaq_s32_f32(D_f32x4);
     }
     else
     {
         A_s32x4 = vcvtq_s32_f32(A_f32x4);
         B_s32x4 = vcvtq_s32_f32(B_f32x4);
         C_s32x4 = vcvtq_s32_f32(C_f32x4);
         D_s32x4 = vcvtq_s32_f32(D_f32x4);
     }
 #else  // #if __aarch64__
     // rounding mode only supported in aarch64
     A_s32x4 = vcvtq_s32_f32(A_f32x4);
     B_s32x4 = vcvtq_s32_f32(B_f32x4);
     C_s32x4 = vcvtq_s32_f32(C_f32x4);
     D_s32x4 = vcvtq_s32_f32(D_f32x4);
 #endif // #if __aarch64__
  
     // Convert int32 vectors to int16 vectors (with saturation)
     vd_low_s16x8  = vcombine_s16(vqmovn_s32(A_s32x4), vqmovn_s32(B_s32x4));
     vd_high_s16x8 = vcombine_s16(vqmovn_s32(C_s32x4), vqmovn_s32(D_s32x4));
     // convert int16 vectors to int8 vectors (with saturation)
     return vcombine_s8(vqmovn_s16(vd_low_s16x8), vqmovn_s16(vd_high_s16x8));
 }

References TO_NEAREST_EVEN, and TO_NEAREST_UP.

◆ vpowq_f32()

float32x4_t arm_compute::vpowq_f32	(	float32x4_t	val,
		float32x4_t	n
	)

Calculate n power of a number.

pow(x,n) = e^(n*log(x))

Parameters

[in]	val	Input vector value in F32 format.
[in]	n	Powers to raise the input to.

Returns: The calculated power.

◆ vquantize() [1/2]

uint8x8_t arm_compute::vquantize	(	const float32x4x2_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Quantize a neon vector holding 8 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	qi	Quantization information to be used in the computation.

Returns: A neon vector holding the quantized values

Definition at line 591 of file NEAsymm.h.

 {
     const float       scale     = qi.scale;
     const int         offset    = qi.offset;
     const float32x4_t voffset   = vdupq_n_f32(offset);
     const float32x4_t vinvscale = vdupq_n_f32(1.f / scale);
     const int32x4x4_t rf        = {{
 #ifdef __aarch64__
         vcvtnq_s32_f32(vmlaq_f32(voffset, qv.val[0], vinvscale)),
         vcvtnq_s32_f32(vmlaq_f32(voffset, qv.val[1], vinvscale)),
 #else  //__aarch64__
         vcvtq_s32_f32(vmlaq_f32(voffset, qv.val[0], vinvscale)),
         vcvtq_s32_f32(vmlaq_f32(voffset, qv.val[1], vinvscale)),
 #endif //__aarch64__
     }};
     return vqmovun_s16(vcombine_s16(vqmovn_s32(rf.val[0]), vqmovn_s32(rf.val[1])));
 }

References UniformQuantizationInfo::offset, offset(), UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

Referenced by arm_compute::cpu::elementwise_op< uint8_t >(), arm_compute::cpu::neon_qasymm8_activation(), CpuConcatenateHeightKernel::run_op(), and CpuConcatenateWidthKernel::run_op().

◆ vquantize() [2/2]

uint8x16_t arm_compute::vquantize	(	const float32x4x4_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Quantize a neon vector holding 16 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	qi	Quantization information to be used in the computation.

Returns: A neon vector holding the quantized values

Definition at line 661 of file NEAsymm.h.

 {
     auto            rf = vquantize_internal(qv, qi.scale, qi.offset);
     const uint8x8_t pa = vqmovun_s16(vcombine_s16(vqmovn_s32(rf.val[0]), vqmovn_s32(rf.val[1])));
     const uint8x8_t pb = vqmovun_s16(vcombine_s16(vqmovn_s32(rf.val[2]), vqmovn_s32(rf.val[3])));
     return vcombine_u8(pa, pb);
 }

References UniformQuantizationInfo::offset, UniformQuantizationInfo::scale, and vquantize_internal().

◆ vquantize_int16()

int16x8_t arm_compute::vquantize_int16	(	const float32x4x2_t &	qv,
		float	scale
	)

inline

Quantize a neon vector holding 8 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	scale	Quantization scale

Returns: A neon vector holding the quantized values

Definition at line 150 of file NESymm.h.

 {
     const float32x4_t vinvscale = vdupq_n_f32(1.f / scale);
  
     const int32x4x2_t rf = {{
 #ifdef __aarch64__
         vcvtnq_s32_f32(vmulq_f32(qv.val[0], vinvscale)), vcvtnq_s32_f32(vmulq_f32(qv.val[1], vinvscale))
 #else  //__aarch64__
         vcvtq_s32_f32(vmulq_f32(qv.val[0], vinvscale)), vcvtq_s32_f32(vmulq_f32(qv.val[1], vinvscale))
 #endif //__aarch64__
     }};
     return vcombine_s16(vqmovn_s32(rf.val[0]), vqmovn_s32(rf.val[1]));
 }

References arm_compute::test::validation::scale.

Referenced by arm_compute::cpu::neon_qsymm16_activation().

◆ vquantize_internal()

int32x4x4_t arm_compute::vquantize_internal	(	const float32x4x4_t &	qv,
		float	scale,
		int32_t	offset
	)

inline

Definition at line 634 of file NEAsymm.h.

 {
     const int32x4_t   voffset   = vdupq_n_s32(offset);
     const float32x4_t vinvscale = vdupq_n_f32(1.f / scale);
     const int32x4x4_t rf        = {{
 #ifdef __aarch64__
         vaddq_s32(vcvtaq_s32_f32(vmulq_f32(qv.val[0], vinvscale)), voffset),
         vaddq_s32(vcvtaq_s32_f32(vmulq_f32(qv.val[1], vinvscale)), voffset),
         vaddq_s32(vcvtaq_s32_f32(vmulq_f32(qv.val[2], vinvscale)), voffset),
         vaddq_s32(vcvtaq_s32_f32(vmulq_f32(qv.val[3], vinvscale)), voffset),
 #else  //__aarch64__
         vaddq_s32(vcvtq_s32_f32(vmulq_f32(qv.val[0], vinvscale)), voffset),
         vaddq_s32(vcvtq_s32_f32(vmulq_f32(qv.val[1], vinvscale)), voffset),
         vaddq_s32(vcvtq_s32_f32(vmulq_f32(qv.val[2], vinvscale)), voffset),
         vaddq_s32(vcvtq_s32_f32(vmulq_f32(qv.val[3], vinvscale)), voffset),
 #endif //__aarch64__
     }};
     return rf;
 }

References offset(), and arm_compute::test::validation::scale.

Referenced by vquantize(), vquantize_qasymm16(), and vquantize_signed().

◆ vquantize_qasymm16()

uint16x8x2_t arm_compute::vquantize_qasymm16	(	const float32x4x4_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Quantize to QASYMM16 a neon vector holding 16 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	qi	Quantization information to be used in the computation.

Returns: A neon vector holding the quantized values

Definition at line 691 of file NEAsymm.h.

 {
     auto             rf = vquantize_internal(qv, qi.scale, qi.offset);
     const uint16x8_t pa = vcombine_u16(vqmovun_s32(rf.val[0]), vqmovun_s32(rf.val[1]));
     const uint16x8_t pb = vcombine_u16(vqmovun_s32(rf.val[2]), vqmovun_s32(rf.val[3]));
     return {pa, pb};
 }

References UniformQuantizationInfo::offset, UniformQuantizationInfo::scale, and vquantize_internal().

◆ vquantize_qsymm16()

qsymm16x8x2_t arm_compute::vquantize_qsymm16	(	const float32x4x4_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Quantize a neon vector holding 16 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	qi	Quantization information to be used in the computation.

Returns: A neon vector holding the quantized values

Definition at line 191 of file NESymm.h.

 {
     const float scale = qi.scale;
     ARM_COMPUTE_ERROR_ON(scale == 0.f);
     const float32x4_t vinvscale = vdupq_n_f32(1.f / scale);
     const int32x4x4_t rf        = {{
 #ifdef __aarch64__
         vcvtnq_s32_f32(vmulq_f32(qv.val[0], vinvscale)),
         vcvtnq_s32_f32(vmulq_f32(qv.val[1], vinvscale)),
         vcvtnq_s32_f32(vmulq_f32(qv.val[2], vinvscale)),
         vcvtnq_s32_f32(vmulq_f32(qv.val[3], vinvscale)),
 #else  //__aarch64__
         vcvtq_s32_f32(vmulq_f32(qv.val[0], vinvscale)),
         vcvtq_s32_f32(vmulq_f32(qv.val[1], vinvscale)),
         vcvtq_s32_f32(vmulq_f32(qv.val[2], vinvscale)),
         vcvtq_s32_f32(vmulq_f32(qv.val[3], vinvscale)),
 #endif //__aarch64__
     }};
     const qsymm16x8x2_t res = {
         vcombine_s16(vqmovn_s32(rf.val[0]), vqmovn_s32(rf.val[1])),
         vcombine_s16(vqmovn_s32(rf.val[2]), vqmovn_s32(rf.val[3])),
     };
  
     return res;
 }

References ARM_COMPUTE_ERROR_ON, UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

◆ vquantize_signed() [1/2]

int8x8_t arm_compute::vquantize_signed	(	const float32x4x2_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Quantize a neon vector holding 8 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	qi	Quantization information to be used in the computation.

Returns: A neon vector holding the singed quantized values

Definition at line 616 of file NEAsymm.h.

 {
     const float       scale     = qi.scale;
     const int         offset    = qi.offset;
     const float32x4_t voffset   = vdupq_n_f32(offset);
     const float32x4_t vinvscale = vdupq_n_f32(1.f / scale);
     const int32x4x4_t rf        = {{
 #ifdef __aarch64__
         vcvtnq_s32_f32(vmlaq_f32(voffset, qv.val[0], vinvscale)),
         vcvtnq_s32_f32(vmlaq_f32(voffset, qv.val[1], vinvscale)),
 #else  //__aarch64__
         vcvtq_s32_f32(vmlaq_f32(voffset, qv.val[0], vinvscale)),
         vcvtq_s32_f32(vmlaq_f32(voffset, qv.val[1], vinvscale)),
 #endif //__aarch64__
     }};
     return vqmovn_s16(vcombine_s16(vqmovn_s32(rf.val[0]), vqmovn_s32(rf.val[1])));
 }

References UniformQuantizationInfo::offset, offset(), UniformQuantizationInfo::scale, and arm_compute::test::validation::scale.

Referenced by arm_compute::cpu::elementwise_op< int8_t >(), arm_compute::cpu::neon_qasymm8_signed_activation(), CpuConcatenateHeightKernel::run_op(), and CpuConcatenateWidthKernel::run_op().

◆ vquantize_signed() [2/2]

int8x16_t arm_compute::vquantize_signed	(	const float32x4x4_t &	qv,
		const UniformQuantizationInfo &	qi
	)

inline

Signed quantize a neon vector holding 16 floating point values.

Parameters

[in]	qv	Input values to be quantized.
[in]	qi	Quantization information to be used in the computation.

Returns: A neon vector holding the quantized values

Definition at line 676 of file NEAsymm.h.

 {
     auto           rf = vquantize_internal(qv, qi.scale, qi.offset);
     const int8x8_t pa = vqmovn_s16(vcombine_s16(vqmovn_s32(rf.val[0]), vqmovn_s32(rf.val[1])));
     const int8x8_t pb = vqmovn_s16(vcombine_s16(vqmovn_s32(rf.val[2]), vqmovn_s32(rf.val[3])));
     return vcombine_s8(pa, pb);
 }

References UniformQuantizationInfo::offset, UniformQuantizationInfo::scale, and vquantize_internal().

◆ vreduce()

float vreduce ( const float32x4_t & v )

inline

Reduce a vector to be a scalar by accumulating all lanes in the vector.

Parameters

[in] v Vector to be reduced.

Returns: the wrapped-around number.

Definition at line 537 of file NEMath.inl.

 {
     const float32x2_t v0    = vget_high_f32(v);
     const float32x2_t v1    = vget_low_f32(v);
     const float32x2_t v_out = vadd_f32(v0, v1);
  
     const float a = vget_lane_f32(v_out, 0);
     const float b = vget_lane_f32(v_out, 1);
  
     return a + b;
 }

References arm_compute::test::validation::b.

Referenced by arm_compute::cpu::kernels::convolve_nchw(), arm_compute::cpu::kernels::convolve_nhwc(), and arm_compute::cpu::directconv3d_float_neon_ndhwc().

◆ vroundq_rte_f32()

float32x4_t arm_compute::vroundq_rte_f32 ( float32x4_t val )

Calculate round value of a vector to nearest with ties to even.

Parameters

[in] val Input vector value in F32 format.

Returns: The calculated round vector.

◆ vsin_f32()

float32x2_t arm_compute::vsin_f32 ( float32x2_t val )

Calculate sine.

Parameters

[in] val Input vector value in radians, F32 format.

Returns: The calculated sine.

◆ vsinq_f32()

float32x4_t arm_compute::vsinq_f32 ( float32x4_t val )

Calculate sine.

Parameters

[in] val Input vector value in radians, F32 format.

Returns: The calculated sine.

◆ vtanhq_f32()

float32x4_t arm_compute::vtanhq_f32 ( float32x4_t val )

Calculate hyperbolic tangent.

tanh(x) = (e^2x - 1)/(e^2x + 1)

Note: We clamp x to [-5,5] to avoid overflowing issues.

Parameters

[in] val Input vector value in F32 format.

Returns: The calculated Hyperbolic Tangent.

◆ vtaylor_polyq_f32()

float32x4_t arm_compute::vtaylor_polyq_f32	(	float32x4_t	x,
		const std::array< float32x4_t, 8 > &	coeffs
	)

Perform a 7th degree polynomial approximation using Estrin's method.

Parameters

[in]	x	Input vector value in F32 format.
[in]	coeffs	Polynomial coefficients table.

Returns: The calculated approximation.

◆ wrap_around()

T arm_compute::wrap_around	(	T	x,
		T	m
	)

inline

Wrap-around a number within the range 0 <= x < m.

Parameters

[in]	x	Input value
[in]	m	Range

Returns: the wrapped-around number

Definition at line 277 of file Helpers.h.

 {
     return x >= 0 ? x % m : (x % m + m) % m;
 }

Referenced by SplitLayerNode::compute_output_descriptor(), ClSoftmaxKernel::configure(), CLGatherKernel::configure(), CpuSoftmaxGeneric::configure(), CLL2NormalizeLayerKernel::configure(), NEStackLayer::configure(), NEL2NormalizeLayer::configure(), CLStackLayer::configure(), CLL2NormalizeLayer::configure(), SplitLayerNode::configure_output(), convert_negative_axis(), arm_compute::test::validation::reference::softmax_layer_generic(), arm_compute::test::validation::reference::unstack(), SplitLayerNode::validate(), CpuSoftmaxGeneric::validate(), NEStackLayer::validate(), NEL2NormalizeLayer::validate(), CLStackLayer::validate(), CLL2NormalizeLayer::validate(), and GpuSoftmax::validate_op().

Variable Documentation

◆ erf_f32_lut

const std::array< ErfLutEntry< float >, 513 > erf_f32_lut

The lookup table for FP32 erf(x) calculation.

Definition at line 30 of file Math.cpp.

◆ log_tab

const std::array<float32x4_t, 8> log_tab

Initial value:

= {{
    vdupq_n_f32(-2.29561495781f),
    vdupq_n_f32(-2.47071170807f),
    vdupq_n_f32(-5.68692588806f),
    vdupq_n_f32(-0.165253549814f),
    vdupq_n_f32(5.17591238022f),
    vdupq_n_f32(0.844007015228f),
    vdupq_n_f32(4.58445882797f),
    vdupq_n_f32(0.0141278216615f),
}}

Logarithm polynomial coefficients.

Definition at line 34 of file NEMath.inl.

◆ MAX_DIMS

constexpr size_t MAX_DIMS = 6

constexpr

Constant value used to indicate maximum dimensions of a Window, TensorShape and Coordinates.

Definition at line 38 of file Dimensions.h.

Referenced by arm_compute::misc::shape_calculator::calculate_concatenate_shape().

◆ te_sin_coeff2

constexpr float te_sin_coeff2 = 0.166666666666f

constexpr

Sin polynomial coefficients.

Definition at line 46 of file NEMath.inl.

◆ te_sin_coeff3

constexpr float te_sin_coeff3 = 0.05f

constexpr

Definition at line 47 of file NEMath.inl.

◆ te_sin_coeff4

constexpr float te_sin_coeff4 = 0.023809523810f

constexpr

Definition at line 48 of file NEMath.inl.

◆ te_sin_coeff5

constexpr float te_sin_coeff5 = 0.013888888889f

constexpr

Definition at line 49 of file NEMath.inl.

Namespaces

Data Structures

Typedefs

Enumerations

Functions

Variables

Detailed Description

Typedef Documentation

◆ BBox

◆ BiStrides

◆ CLEqual

◆ CLFloatArray

◆ CLGreater

◆ CLGreaterEqual

◆ CLImage

◆ CLInt16Array

◆ CLInt32Array

◆ CLLess

◆ CLLessEqual

◆ CLLogSoftmaxLayer

◆ CLNotEqual

◆ CLSoftmaxLayer

◆ CLUInt16Array

◆ CLUInt32Array

◆ CLUInt8Array

◆ CropInfo

◆ float32_t

◆ FloatArray

◆ GroupMappings

◆ half

◆ ICLFloatArray

◆ ICLImage

◆ ICLInt16Array

◆ ICLInt32Array

◆ ICLUInt16Array

◆ ICLUInt32Array

◆ ICLUInt8Array

◆ IFloatArray

◆ IImage

◆ IInt16Array

◆ IInt32Array

◆ Image

◆ INEKernel

◆ Int16Array

◆ Int32Array

◆ IUInt16Array

◆ IUInt32Array

◆ IUInt8Array

◆ LabelBBox

◆ lock_guard

◆ MemoryMappings

◆ MemoryRequirements

◆ Multiples

◆ Mutex

◆ NEAbsLayer

◆ NEEqual

◆ NEExpLayer

◆ NEGreater

◆ NEGreaterEqual

◆ NELess

◆ NELessEqual

◆ NELogLayer

◆ NELogSoftmaxLayer

◆ NENegLayer

◆ NENotEqual

◆ NERoundLayer

◆ NERsqrtLayer

◆ NEScheduler

◆ NESinLayer

◆ NESoftmaxLayer

◆ OperatorType

◆ PaddingInfo

◆ PaddingList

◆ PaddingSize

◆ PermutationVector

◆ qasymm16_t

◆ qasymm8_signed_t

◆ qasymm8_t

◆ qasymm8x16_signed_t

◆ qasymm8x16_t