25 #ifndef ARM_COMPUTE_GCGEMM_H 26 #define ARM_COMPUTE_GCGEMM_H 54 GCGEMM(std::shared_ptr<IMemoryManager> memory_manager =
nullptr);
109 bool _is_interleaved_transposed;
111 bool _reshape_b_only_on_first_run;
Base class for all functions.
void run() override
Run the kernels contained in the function.
Store the tensor's metadata.
Interface for GLES Compute tensor.
OpenGL ES kernel to perform the in-place matrix addition between 2 matrices, taking into account that...
Interface for OpenGL ES tensor.
Copyright (c) 2017-2021 Arm Limited.
OpenGLES kernel which transposes the elements of a matrix in chunks of 1xW, where W is equal to (16 /...
GCGEMM(std::shared_ptr< IMemoryManager > memory_manager=nullptr)
Default constructor.
static Status validate(const ITensorInfo *a, const ITensorInfo *b, const IGCTensor *c, const ITensorInfo *output, const float alpha, const float beta, const GEMMInfo &gemm_info=GEMMInfo())
Static function to check if given info will lead to a valid configuration of GCGEMM.
void prepare() override
Prepare the function for executing.
GCGEMM & operator=(const GCGEMM &)=delete
Prevent instances of this class from being copied (As this class contains pointers) ...
Basic function to execute GEMM on OpenGLES Compute.
void configure(const IGCTensor *a, const IGCTensor *b, const IGCTensor *c, IGCTensor *output, float alpha, float beta, const GEMMInfo &gemm_info=GEMMInfo())
Initialise the kernel's inputs and output.
OpenGL ES kernel which interleaves the elements of a matrix A in chunk of 4x4.
GLES Compute kernel to multiply two input matrices "A" and "B" or to multiply a vector "A" by a matri...