Hi Joan,
As per my understanding, we use half mean squared error only because we need to because when you take the derivative of the cost function, the multiplier is cancelled, and the derivation is cleaner. It makes the math easier to handle and adding a half or not shouldn't matter since minimizing is unaffected by constants.