Feeds
Question
Sum of squares profiling on GPU
I was profiling some code that runs on my GPU and came across something rather puzzling that I haven't been able to sort out... ...
11 years ago | 1 answer | 2
Question
3D gpuArray vs cells of 2D gpuArrays major speed difference!
Can anybody explain why these codes have drastically different runtimes? I have a shared setup routine clear all y = ...
11 years ago | 1 answer | 0
Question
Using multiple GPUs in a parfor type of loop
I am working on a machine learning problem where I am training my classifier using a GPU and the parallel computing toolbox. I ...
11 years ago | 1 answer | 0
Question
Breaking up a computation vs "..." to continue line - Huge Speed Difference !?
I was profiling an ode solver today and found something very strange. The system has 9 states and the computation of the ode RH...
12 years ago | 1 answer | 0
Question
Matrix multiply slices of 3d Matrices
Given two 3d matrices, A and B with size(A) = (n, m, k) and size(B) = (m, p, k) perform matrix multipl...
12 years ago | 3 answers | 0
Question
Linear combination of cell arrays
Is there a compact way (without loops) to take linear combinations of cell arrays that contain the same type of data (matrices o...
12 years ago | 1 answer | 0
Question
How do you make a callable object?
I want to define a class (much like griddedInterpolant) that has callable objects. Is there a particular method name that is in...
12 years ago | 1 answer | 2
Question
Fastest way to dot product all columns of two matrices of same size
I have come up against this problem over and over, and I have a nice solution, but it seems non-optimal from a speed sense. Does...
13 years ago | 2 answers | 0