CUDA ptxas fatal : Unresolved extern function '_Z22mwGetGlobalThreadIndexv'

Hi
I used GPU Coder to generate the CUDA file corresponding to my original function. After that I built a .ptx file from the CUDA file, and now I want to create a CUDA kernel with parallel.gpu.CUDAKernel to speed up my function. When I call k = parallel.gpu.CUDAKernel('myfunction.ptx','float *, const double *','_Z15myfunction_kernel1dP15emxArray_real_T'), I get the error below:
Error using parallel.gpu.CUDAKernel
An error occurred during PTX compilation of <image>.
The information log was:
The error log was:
ptxas fatal : Unresolved extern function '_Z22mwGetGlobalThreadIndexv'
The CUDA error code was: CUDA_ERROR_INVALID_PTX.
How can I get rid of this error? Any help will be much appreciated. Regards

Answers (1)

Linda Koletsou Soulti
Hello Riffat,
It seems that the kernel calls a helper function that was not compiled together with the kernel.
The following example may be helpful:
Cheers,
Linda
3 Comments
Linda Koletsou Soulti
Hello Riffat,
In the folder of the generated code there should be a file called MWCudaDimUtility.cu. Your device code calls a function named mwGetGlobalThreadIndex that is defined in this file, and the PTX build fails because that definition is missing. There are a couple of ways to resolve this dependency. I believe the easiest is to copy the contents of MWCudaDimUtility.cu into your own source file, before the definition of your function, and then build that file to PTX.
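As a rough sketch of that layout, the example below puts a device helper and the kernel that calls it in one source file, so ptxas finds the definition when the PTX is compiled. Everything in it is an assumption made for illustration: the file name myfunction.cu, the helper globalThreadIndex (a simplified stand-in, not the actual contents of MWCudaDimUtility.cu), and the kernel scaleKernel. In a real project the pasted text would be the generated file's own contents.

```cuda
// myfunction.cu : hypothetical file name, used for illustration only.
// One way to build it to PTX:  nvcc -ptx myfunction.cu -o myfunction.ptx

// Stand-in for the helper that lives in MWCudaDimUtility.cu.
// ASSUMPTION: this body is NOT GPU Coder's implementation; it only handles
// a 1-D grid of 1-D blocks. Paste the real file contents here instead.
__device__ unsigned int globalThreadIndex()
{
    return blockIdx.x * blockDim.x + threadIdx.x;
}

// Hypothetical kernel that depends on the helper above. Because the helper
// is defined earlier in the same file, the PTX has no unresolved extern.
// extern "C" disables C++ name mangling, so the entry point is "scaleKernel".
extern "C" __global__ void scaleKernel(float *out, const double *in, int n)
{
    unsigned int i = globalThreadIndex();
    if (i < static_cast<unsigned int>(n)) {
        out[i] = static_cast<float>(2.0 * in[i]);
    }
}
```

With the helper defined in the same translation unit, the resulting PTX contains its definition (or has it inlined), so no unresolved extern remains for this function.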
Cheers,
Linda
Riffat Tehseen Amjad
I solved the problem by replacing the MWCudaDimUtility.h file created by GPU Coder with a copy of MWCudaDimUtility.h that I found on the web, and it worked. Thanks for your help, Linda!
Best,
Riffat

Release

R2019b
