This fixes some failures on Turing where "round to zero" rounds to the max f16 value but the CPU reference value is infinite.
Co-authored-by: aeseulgi <kim2h7903@gmail.com>