pub struct SDDMMBufferSizeOperation<'c> { /* private fields */ }
Expand description
A sddmm_buffer_size
operation. Precompute buffersize for SDDMM operation.
The gpu.sddmm_buffer_size
operation returns the buffer size required
to perform the SDDMM operation on the given sparse and dense matrices.
The operation expects handles returned by previous sparse operations
to construct an environment and the operands for SDDMM.
If the async
keyword is present, the op is executed asynchronously (i.e.
it does not block until the execution has finished on the device). In
that case, it returns a !gpu.async.token in addition to the environment.
Example:
%buffersz, %token = gpu.sddmm_buffer_size async [%dep] %dnmatA{TRANSPOSE}, %dnmatB{TRANSPOSE}, %spmatC into f32
The matrix arguments can also be associated with one of the following operators: NON_TRANSPOSE, TRANSPOSE, CONJUGATE_TRANSPOSE. The default value is NON_TRANSPOSE.
Implementations§
source§impl<'c> SDDMMBufferSizeOperation<'c>
impl<'c> SDDMMBufferSizeOperation<'c>
sourcepub fn as_operation(&self) -> &Operation<'c>
pub fn as_operation(&self) -> &Operation<'c>
Returns a generic operation.
sourcepub fn builder(
context: &'c Context,
location: Location<'c>
) -> SDDMMBufferSizeOperationBuilder<'c, Unset, Unset, Unset, Unset, Unset, Unset>
pub fn builder( context: &'c Context, location: Location<'c> ) -> SDDMMBufferSizeOperationBuilder<'c, Unset, Unset, Unset, Unset, Unset, Unset>
Creates a builder.