Hi,
I believe the reason is that traditional weighted regression computes R-square based on transformed variables, the better R-square is indicative of weighting addressing the heteroskedastic nature of the data. However, ordinary least squares does give you the best possible RMSE, which is computed based on the original variables.
In fact, this paper talks about the very same issue.
Please read the bottom left paragraph on page 237 ( Inserted as an image here ).