Feeds
Question
Can I get faster matrix multiplication using CUDA than with MATLAB's internal GPU implementation?
I have to multiply a matrix A (size about 400 x 400) with M matrices B, where M is also about 500. N=400;M=500; A=rand(...
12 years ago | 4 answers | 0
Question
gather takes really long after using a PTX file / CUDA
I am trying to implement a matrix multiplication using CUDA via a PTX file to gain an advantage over the MATLAB internal functions. My .cu codes ...
12 years ago | 1 answer | 0
Question
CUDA: number of tasks exceeds number of threads times blocks
I have a problem if my number of tasks exceeds the total number of available threads. Let's imagine I want to add two vectors of l...
12 years ago | 0 answers | 0