Parallel pool failed to start - Job test failed during validation MATLAB 2021b
50 次查看(过去 30 天)
显示 更早的评论
Hello everyone,
when i start the parallel pool i get the following error:
Parallel pool failed to start with the following error. For more detailed information, validate the profile 'local' in the Cluster Profile Manager.
The validation fails the "Job test" and "Parallel pool test". The validation report is posted below.
Can anyone help me please?
Kind regards and thanks in advance!
Alexander
VALIDATION REPORT
Profile: local
Scheduler Type: Local
Stage: Cluster connection test (parcluster)
Status: Passed
Start Time: Tue Dec 21 16:38:41 CET 2021
Finish Time: Tue Dec 21 16:38:41 CET 2021
Running Duration: 0 min 0 sec
Description:
Error Report:
Command Line Output:
Debug Log:
Stage: Job test (createJob)
Status: Failed
Start Time: Tue Dec 21 16:38:41 CET 2021
Finish Time: Tue Dec 21 16:38:41 CET 2021
Running Duration: 0 min 1 sec
Description: Job errored or did not reach the state 'finished'.
Error Report: Job errored or did not reach the state 'finished'.
Command Line Output:
Debug Log: LOG FILE OUTPUT:
Stage: SPMD job test (createCommunicatingJob)
Status: Passed
Start Time: Tue Dec 21 16:38:41 CET 2021
Finish Time: Tue Dec 21 16:39:10 CET 2021
Running Duration: 0 min 28 sec
Description: Job ran with 8 workers.
Error Report:
Command Line Output:
Debug Log:
Stage: Pool job test (createCommunicatingJob)
Status: Passed
Start Time: Tue Dec 21 16:39:12 CET 2021
Finish Time: Tue Dec 21 16:39:44 CET 2021
Running Duration: 0 min 33 sec
Description: Job ran with 8 workers.
Error Report:
Command Line Output:
Debug Log:
Stage: Parallel pool test (parpool)
Status: Failed
Start Time: Tue Dec 21 16:39:45 CET 2021
Finish Time: Tue Dec 21 16:39:57 CET 2021
Running Duration: 0 min 11 sec
Description: Failed to initialize the interactive session.
Error Report: Failed to initialize the interactive session.
Caused by:
Error using parallel.internal.pool.AbstractInteractiveClient>iThrowIfBadParallelJobStatus (line 426)
The interactive communicating job failed with no message.
Command Line Output:
Debug Log: CLIENT LOG OUTPUT
Checking communicating job status.
Job state is 'failed'.
No task error messages.
Communicating job debug log:
LOG FILE OUTPUT:
Session failed to start when creating InteractiveClient. Error: Error using parallel.internal.pool.AbstractInteractiveClient>iThrowWithCause (line 305)
Failed to initialize the interactive session.
Error in parallel.internal.pool.AbstractInteractiveClient/start (line 142)
iThrowWithCause( 'parallel:convenience:FailedToInitializeInteractiveSession', err );
Error in parallel.internal.pool.AbstractClusterPool>iStartClient (line 831)
spmdInitialized = client.start(sessionBuildFcn, sessionInfo, numWorkers, cluster, ...
Error in parallel.internal.pool.AbstractClusterPool.hBuildPool (line 585)
iStartClient(client, sessionInfo, forceSpmdEnabled, cluster, supportRestart, argsList);
Error in parallel.internal.types.ValidationStages>iOpenPoolForCluster (line 456)
aPool = parallel.internal.pool.AbstractClusterPool.hBuildPool('Cluster', cluster, 'NumWorkers', numWorkers);
Error in parallel.internal.types.ValidationStages>@()iOpenPoolForCluster(runInfo)
Error in parallel.internal.types.ValidationStages>iCallWithNoHotlinks (line 336)
[varargout{1:nargout}] = fcn();
Error in parallel.internal.types.ValidationStages>iRunParpoolStage (line 247)
[commandWindowOutput, aPool] = evalc(iWrapForEvalc(openPoolFcn));
Error in parallel.internal.types.ValidationStages/run (line 68)
[eventData, runInfo] = obj.RunFunction(obj, runInfo);
Error in parallel.internal.validator.Validator/runValidationSuite (line 191)
[eventData, stageRunInfo] = currentStage.run(stageRunInfo);
Error in parallel.internal.validator.Validator/validate (line 103)
status = obj.runValidationSuite(profileName, suite);
Error in parallel.internal.ui.AbstractValidationManager/validate (line 36)
obj.Validator.validate(profileName, validationSuite);
Error in parallel.internal.ui.ValidationManager.validateProfile (line 36)
parallel.internal.ui.ValidationManager.getOrCreateInstance().validate(profileName, suite);
Caused by:
Error using parallel.internal.pool.AbstractInteractiveClient>iThrowIfBadParallelJobStatus (line 426)
The interactive communicating job failed with no message.
Failed to run the DisarmableOncleanup callback due to the following error:
Dot indexing is not supported for variables of this type.
0 个评论
回答(1 个)
Raymond Norris
2021-12-21
This feels like an install issue. It's odd that Stage 2 would fail, but Stage 3 & 4 pass. And then especially have Stage 5 fail (Stage 5 has similarities with Stage 3 & 4). I would suggest contacting Technical Support (support@mathworks.com) to troubleshoot this.
2 个评论
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Clusters and Clouds 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!