Skip to content
Snippets Groups Projects
Commit a0195132 authored by Lorenzo Moneta's avatar Lorenzo Moneta
Browse files

Use fast zero GPU initialisation for Cuda matrices

parent 47167e85
No related branches found
No related tags found
No related merge requests found
......@@ -158,17 +158,8 @@ void TCuda<AFloat>::InitializeIdentity(TCudaMatrix<AFloat> & A)
template<typename AFloat>
void TCuda<AFloat>::InitializeZero(TCudaMatrix<AFloat> & A)
{
size_t m,n;
m = A.GetNrows();
n = A.GetNcols();
TMatrixT<AFloat> B(m, n);
for (size_t i = 0; i < m; i++) {
for (size_t j = 0; j < n ; j++) {
B(i,j) = 0.0;
}
}
A = B;
// use fast zero initialization on the device
A.Zero();
}
} // namespace DNN
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment