Generate a deterministic fp16 masked mean over the last dimension, using the mask x > 0 (only elements greater than zero contribute to the mean).
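One way to read this request is sketched below in NumPy. The sketch makes several assumptions the prompt does not settle: the function name masked_mean_fp16 is hypothetical, the input and output are float16 while the reduction accumulates in float32, and a row with no positive elements yields 0 rather than NaN. Determinism here means that repeated calls on the same input, on the same machine and NumPy build, should return bit-identical results, since the CPU reduction runs in a fixed order.

```python
import numpy as np


def masked_mean_fp16(x):
    """Mean over the last axis of the elements where x > 0, returned as float16.

    Hypothetical helper: the name, the float32 accumulator, and the
    empty-mask behaviour are assumptions, not part of the original request.
    """
    x = np.asarray(x, dtype=np.float16)

    # Mask from the request: keep strictly positive elements only.
    # NaN compares False against 0, so NaN entries are excluded as well.
    mask = x > 0

    # Zero out masked-off entries, then upcast before reducing. A float16
    # accumulator overflows above 65504 and loses precision quickly.
    kept = np.where(mask, x, np.float16(0)).astype(np.float32)
    total = kept.sum(axis=-1, dtype=np.float32)

    # Number of contributing elements per row.
    count = mask.sum(axis=-1)

    # Assumption: rows with no positive entries return 0 instead of NaN,
    # so the divisor is clamped to at least 1 (the numerator is 0 there).
    safe_count = np.maximum(count, 1).astype(np.float32)

    # Divide in float32 and round once to float16 at the very end.
    return (total / safe_count).astype(np.float16)


x = np.array(
    [[1.0, -2.0, 3.0, 0.0],
     [-1.0, -1.0, -1.0, -1.0],
     [0.5, 0.25, -4.0, 0.0]],
    dtype=np.float16,
)
out = masked_mean_fp16(x)
```

For the sample input, the three rows average to 2.0, 0.0 (empty mask), and 0.375. If NaN is preferred for an empty mask, the clamp on the divisor can be dropped. On a GPU backend, reductions built on atomic adds may not preserve a fixed order, so this sketch makes no claim beyond the CPU NumPy case.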
