photo

Joss Knight

MathWorks

Last seen: 1 day 前 自 2013 起处于活动状态

Followers: 0   Following: 0

Although I cannot be contacted directly, if you would like to ask me a question all you have to do is mention "GPU" somewhere in your MATLAB Answers question.

统计学

  • Guiding Light
  • 36 Month Streak
  • Knowledgeable Level 5
  • Revival Level 2
  • First Answer

查看徽章

Feeds

排序方式:

已回答
GPU memory usage for Hadamard product
I can't reproduce this. You say you have 48Gb of GPU memory available...have you checked this? Try running gpuDevice and lookin...

4 days 前 | 0

已回答
Initializing LSTM which is imported using ONNX
This code is suspect % initialize input arrays obs = dlarray(rand(obs_size),"BS"); state = dlarray(rand(state_size),"SBS"); ...

4 days 前 | 0

已回答
trainnet gives training loss is NaN
Do your network weights contain NaNs? Try this nansInMyNetwork = ~(all(cellfun(@allfinite, net.Learnables.Value)) && all(cellfu...

11 days 前 | 0

已回答
how to use printf inside a CUDA kernel?
Just use it, and launch MATLAB from a terminal. On Linux, the output will appear in the terminal window. On Windows you will nee...

16 days 前 | 0

| 已接受

已回答
mexcuda compiler error: "__global__" does not apply here
You cannot call a function declared __global__ from another function declared __global__. Declare the second function as __devic...

28 days 前 | 0

| 已接受

已回答
CUSOLVER_STATUS_INTERNAL_ERROR with RTX 4080 Super in MATLAB 2022a C MEX Function
This is a known issue with CUDA 11.2, Ada cards, and cusolver. The solution is to upgrade MATLAB to R2023a or above, because it ...

28 days 前 | 1

| 已接受

已回答
CUSOLVER_STATUS_INTERNAL_ERROR with RTX 4080 Super in MATLAB 2022a C MEX Function
It's probably necessary for you to provide your code, since any number of things could have gone wrong prior to intializing cuso...

29 days 前 | 0

已回答
why do I got such a low score when i use bench function in matlab2024a
Do check your computer's power settings, to make sure they are set to "best performance".

29 days 前 | 0

已回答
Matab is very slow in processing figures: might it be caused by NVidia Optimus?
Check your laptop's power settings and make sure they are in 'best performance' mode.

1 month 前 | 0

| 已接受

已回答
Which is the difference between 'multi-gpu' and 'parallel-gpu' in 'trainingOption' function of the DeepLearning Toolbox?
The purpose of 'multi-gpu' is effectively to try to ensure you are using a local pool with numGpus workers, rather than needing ...

1 month 前 | 1

已回答
MATLAB does not detect NVIDIA T400 GPU despite updated drivers
The 516 driver is too old for R2024a, which is on CUDA 12.2. Download the recommended 552 driver for your T400 from https://www....

1 month 前 | 0

已回答
MATLAB GPU UBUNTU 20.04 compatibility CUDA driver
Try the 555 and 550 drivers recommended for your device available from https://www.nvidia.com/en-gb/drivers/, rather than relyin...

1 month 前 | 0

已回答
Memory leak occurring when repeatedly performing matrix left division on GPU
Thank you very much for reporting this. This appears to be a bug in MATLAB's gpuArray support for overdetermined solves. For now...

2 months 前 | 0

| 已接受

已回答
How do I use multiple inputs in mexcuda?
You have passed in two inputs but you have a check if (nrhs!=1), so you are erroring because you don't have one input. You need ...

3 months 前 | 1

已回答
Why training stops showing error (out of memory) during implementing transfer learning with pre-trained network despite having a laptop of well configuration?
You have a very small GPU I'm afraid. You need to lower your |MiniBatchSize| until it works. Also, set your |ExecutionEnvironmen...

3 months 前 | 1

| 已接受

已回答
Parallel Processing is better than buying another MATLAB licence?
There may be a misunderstanding here. Your typical license is to a user or a computer. That user, or that computer, can open as ...

3 months 前 | 0

已回答
I can`t solve out this problem, there is always Output argument "varargout{2}" (and possibly others) not assigned a value in the execution with "dlarray/dlgradient" function.
Your mistake is |dlfeval(@dlgradient,...)|. You need to put your code that computes the loss and the loss gradients into a funct...

3 months 前 | 0

| 已接受

已回答
AlexNet (filter groups) cannot be well resolved inside dlfeval (dimension mismatch error)
Sorry about this. This is because you have an old version of alexnet (although yes, it shouldn't error). For now, please get you...

3 months 前 | 0

| 已接受

已回答
arrayfun vs loops again
I wish it were a cleverer answer, but I'm afraid that it's simply that MATLAB has been heavily optimized for |for| loops over th...

3 months 前 | 3

已回答
Extreme memory usage when using gather
As others have worked out, it looks like the issue is the indexed assignment into u: u(TERRAIN:end,m) = abs(gather(head(ffield,...

4 months 前 | 0

| 已接受

已回答
How can i fix the error mismatch format when convert multiplication layer from LayerGraph to dlnetwork in prune and quantize network example
For one solution, replace the fully connected layers with convolution layers with filters the size of the input and num filters ...

4 months 前 | 0

已回答
How can i fix the error mismatch format when convert multiplication layer from LayerGraph to dlnetwork in prune and quantize network example
That's an annoying limitation which is hopefully fixed in current releases. Is there any particular reason why you can't use |mu...

4 months 前 | 0

已回答
Arrayfun/gpuArray CUDA kernel need to be able to remember previous steps
This is a bit too vague to answer. Without indexing, how can each subproblem retrieve its subset of the data? If you just mean i...

4 months 前 | 0

已回答
Loop through list of vectors (ie, the rows of a matrix), applying same "simple" function to each one, on GPU? Should I use arrayfun somehow?
If your calculation is truly sequential then by definition you cannot parallelize along the sequence. But it sounds like you def...

4 months 前 | 0

已回答
Parallel Computing in C when using Matlab Coder (parpool and gpuArray)
GPU Coder will generate CUDA code for you. It can even automatically vectorize |for| loops. For a multithreaded parallel for loo...

6 months 前 | 1

已回答
Can parfor run a series of GPU programs simultaneously?
It looks like you just have a bug in your CUDAKernel implementation, probably accessing unallocated memory. This is putting the ...

7 months 前 | 0

已回答
error of GPU, net = trainNetwork(datastore, lgraph, options);
You are using your display GPU for computation and it does not have enough capacity. Try disabling all hardware acceleration for...

7 months 前 | 1

| 已接受

已回答
How to train a sequence to classification network on GPU
This performance discrepancy is normal. Small sequence networks often cannot benefit from GPU parallelism, especially if they us...

7 months 前 | 0

已回答
gpuArray large sparse arrays. Error codes: "CUSPARSE_INTERNAL_ERROR" / "UNKNOWN_ERROR"
Hi Joseph. It's hard to be definitive. There were some problems with cusparse and also Windows drivers when supporting the newes...

7 months 前 | 0

已回答
NVIDIA A2 performance in Matlab R2023b is lousy
The A2's spec says its double precision performance is 140 GFLOPS vs the V100's 7 TFLOPS, so this is pretty much expected. Even ...

9 months 前 | 3

| 已接受

加载更多