SGE and matlab_metadata.mat

2 次查看(过去 30 天)
Marcin
Marcin 2011-2-26
回答: Ayush 2025-7-19
I have the following problem with running distributed jobs under Sun Grid Engine (MATLAB R2010b). When I submit multiple jobs and one of them finishes, it gets deleted from the cluster, which is fine. What is not fine, is that the matlab_metadata.mat file, which is common to all the jobs is deleted too, which makes the remaining jobs to fail. How can I fix it?
many thanks

回答(1 个)

Ayush
Ayush 2025-7-19
Hi Marcin,
I understand you are encountering a known issue with distributed jobs in MATLAB R2010b under Sun Grid Engine (SGE). The shared "matlab_metadata.mat" file is being deleted when one job finishes, causing the remaining jobs to fail.
Actually as per my understanding, in MATLAB R2010b, each worker cleans up job-related files upon completion, and the logic does not account for other workers potentially still needing the shared "matlab_metadata.mat" file. This is a race condition and file lifecycle management issue in older MATLAB releases.
You can try the following solutions:
  1. You can try using a separate "JobStorageLocation" for each job. Try to ensure that each job has a dedicated folder for its metadata. This way, jobs won't interfere with each other's files.
2. You can try modifying the cleanup behavior if its possible in your workflow.
3. You can try upgrading the MATLAB versions as this issue is fixed in later releases of MATLAB (starting from R2012a and onward).
Hope it helps!

类别

Help CenterFile Exchange 中查找有关 MATLAB 的更多信息

标签

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by