Hey Ari,
I tried running the example on my end and got a different graph in the training compared to both what you have seen and the screenshot in the example guide. But my output was reproduceable across multiple runs of the example:
However, while the path taken during training does seem to vary, the results logged in the scope are inline with that the example expects, which showcases that the training was successful and predictions are in line: