monolish  0.17.1
MONOlithic LInear equation Solvers for Highly-parallel architecture
Functions
monolish::vml::add

element by element addition Dense matrix A and Dense matrix B. More...

Functions

void monolish::vml::add (const matrix::Dense< double > &A, const matrix::Dense< double > &B, matrix::Dense< double > &C)
 element by element addition Dense matrix A and Dense matrix B. More...
 
void monolish::vml::add (const matrix::Dense< float > &A, const matrix::Dense< float > &B, matrix::Dense< float > &C)
 
void monolish::vml::add (const tensor::tensor_Dense< double > &A, const tensor::tensor_Dense< double > &B, tensor::tensor_Dense< double > &C)
 element by element addition tensor_Dense tensor A and tensor_Dense tensor B. More...
 
void monolish::vml::add (const tensor::tensor_Dense< float > &A, const tensor::tensor_Dense< float > &B, tensor::tensor_Dense< float > &C)
 

Detailed Description

element by element addition Dense matrix A and Dense matrix B.

element by element addition tensor_Dense tensor A and tensor_Dense tensor B.

Function Documentation

◆ add() [1/4]

void monolish::vml::add ( const matrix::Dense< double > &  A,
const matrix::Dense< double > &  B,
matrix::Dense< double > &  C 
)

element by element addition Dense matrix A and Dense matrix B.

Parameters
Amonolish Dense Matrix (size M x N)
Bmonolish Dense Matrix (size M x N)
Cmonolish Dense Matrix (size M x N)
Note
  • # of computation: M*N
  • Multi-threading: true
  • GPU acceleration: true
    • # of data transfer: 0

◆ add() [2/4]

void monolish::vml::add ( const matrix::Dense< float > &  A,
const matrix::Dense< float > &  B,
matrix::Dense< float > &  C 
)

◆ add() [3/4]

void monolish::vml::add ( const tensor::tensor_Dense< double > &  A,
const tensor::tensor_Dense< double > &  B,
tensor::tensor_Dense< double > &  C 
)

element by element addition tensor_Dense tensor A and tensor_Dense tensor B.

Parameters
Amonolish tensor_Dense tensor
Bmonolish tensor_Dense tensor
Cmonolish tensor_Dense tensor
Note
  • # of computation: size
  • Multi-threading: true
  • GPU acceleration: true
    • # of data transfer: 0

◆ add() [4/4]

void monolish::vml::add ( const tensor::tensor_Dense< float > &  A,
const tensor::tensor_Dense< float > &  B,
tensor::tensor_Dense< float > &  C 
)