24.04
|
Go to the documentation of this file.
24 #ifndef ACL_SRC_DYNAMIC_FUSION_SKETCH_GPU_COMPONENTS_CL_CLCOMPONENTDEPTHWISECONV2D_H
25 #define ACL_SRC_DYNAMIC_FUSION_SKETCH_GPU_COMPONENTS_CL_CLCOMPONENTDEPTHWISECONV2D_H
37 namespace experimental
39 namespace dynamic_fusion
44 class DepthwiseConv2dAttributes;
47 class GpuCkwDepthwiseConv2d;
77 unsigned int n0()
const;
82 unsigned int m0()
const;
85 bool _export_input_to_cl_image{
false};
86 bool _export_weights_to_cl_image{
false};
87 bool _fast_relaxed_math{
true};
88 bool _is_fma_available{
false};
94 class ClTemplateDepthwiseConv2d;
169 std::unique_ptr<GpuCkwDepthwiseConv2d> _component_writer;
174 #endif // ACL_SRC_DYNAMIC_FUSION_SKETCH_GPU_COMPONENTS_CL_CLCOMPONENTDEPTHWISECONV2D_H
bool export_weights_to_cl_image() const
Get export_weights_to_cl_image flag.
unsigned int m0() const
Set M0: number of rows processed by each thread.
GpuComponentType
Component type in the context of fusion Its main purpose is to inform the optimizer how to perform fu...
bool export_input_to_cl_image() const
Get export_input_to_cl_image flag.
This is a generic class that packs the arguments of an operator.
const IGpuCkwComponentDriver * ckw_component_driver() const override
Get writer for the component.
ClComponentDepthwiseConv2d(ComponentId id, const Properties &properties, const ArgumentPack< ITensorInfo > &tensors, const Attributes &attributes, const Settings &settings)
Constructor.
Component specific settings.
Properties common to all kernel component types.
Properties properties() const
Get properties.
bool fast_relaxed_math() const
Get fast_relaxed_math flag.
An interface used by GpuCkwDriver to write source code for a kernel component.
~ClComponentDepthwiseConv2d() override
Destructor.
int32_t ComponentId
Uniquely identifies a kernel component within a workload.
ClComponentDepthwiseConv2d & operator=(const ClComponentDepthwiseConv2d &component)=delete
Prevent instances of this class from being copied.
Copyright (c) 2017-2024 Arm Limited.
An abstract interface of a component.
static Status validate(const Properties &properties, const ArgumentPack< ITensorInfo > &tensors, const Attributes &attributes, const Settings &settings)
Validate the component.
Attributes are backend-agnostic parameters (in addition to the input/output tensors) of an operator.
GpuComponentType type() const override
Get component type.
ArgumentPack< ITensorInfo > tensors() const
Get tensor arguments.
unsigned int n0() const
Get N0: number of columns processed by each thread.
bool is_fma_available() const
Get is_fma_available flag.