Rounding is done according to the rounding[3:0] parameter, which can be one of:
_MM_FROUND_TO_NEAREST_INT
// round to nearest_MM_FROUND_TO_NEG_INF
// round down_MM_FROUND_TO_POS_INF
// round up_MM_FROUND_TO_ZERO
// truncate_MM_FROUND_CUR_DIRECTION
// use MXCSR.RC; see _MM_SET_ROUNDING_MODE
_MM_FROUND_TO_NEAREST_INT
| _MM_FROUND_NO_EXC
// round to nearest, and suppress exceptions_MM_FROUND_TO_NEG_INF
| _MM_FROUND_NO_EXC
// round down, and suppress exceptions_MM_FROUND_TO_POS_INF
| _MM_FROUND_NO_EXC
// round up, and suppress exceptions_MM_FROUND_TO_ZERO
| _MM_FROUND_NO_EXC
// truncate, and suppress exceptions_MM_FROUND_CUR_DIRECTION
| _MM_FROUND_NO_EXC
// use MXCSR.RC and suppress exceptions; see _MM_SET_ROUNDING_MODE