rtn in ternary way (#632)

* rtn in ternary way * Check both flags to preserve NaN * Format * Rearrange flag1 * Apply suggestions from code review Co-authored-by: Ronan Keryell <ronan@keryell.fr> --------- Co-authored-by: Rosty Geyyer <rosty.geyyer@amd.com> Co-authored-by: Rostyslav Geyyer <46627076+geyyer@users.noreply.github.com> Co-authored-by: Ronan Keryell <ronan@keryell.fr>

rtn in ternary way (#632)
* rtn in ternary way * Check both flags to preserve NaN * Format * Rearrange flag1 * Apply suggestions from code review Co-authored-by: Ronan Keryell <ronan@keryell.fr> --------- Co-authored-by: Rosty Geyyer <rosty.geyyer@amd.com> Co-authored-by: Rostyslav Geyyer <46627076+geyyer@users.noreply.github.com> Co-authored-by: Ronan Keryell <ronan@keryell.fr>
8a659a2e · Dan Yao · GitHub · 6ae12434 · 8a659a2e
Unverified Commit 8a659a2e authored Mar 21, 2023 by Dan Yao Committed by GitHub Mar 20, 2023
Hide whitespace changes
Inline Side-by-side

Showing with 30 additions and 32 deletions

include/ck/utility/data_type.hpp include/ck/utility/data_type.hpp +30 -32

No files found.
--- a/include/ck/utility/data_type.hpp
+++ b/include/ck/utility/data_type.hpp
@@ -974,38 +974,36 @@ inline __host__ __device__ constexpr bhalf_t type_convert<bhalf_t, float>(float
        uint32_t int32;
    } u = {x};
-    if(~u.int32 & 0x7f800000)
+    // When the exponent bits are not all 1s, then the value is zero, normal,
-    {
+    // or subnormal. We round the bfloat16 mantissa up by adding 0x7FFF, plus
-        // When the exponent bits are not all 1s, then the value is zero, normal,
+    // 1 if the least significant bit of the bfloat16 mantissa is 1 (odd).
-        // or subnormal. We round the bfloat16 mantissa up by adding 0x7FFF, plus
+    // This causes the bfloat16's mantissa to be incremented by 1 if the 16
-        // 1 if the least significant bit of the bfloat16 mantissa is 1 (odd).
+    // least significant bits of the float mantissa are greater than 0x8000,
-        // This causes the bfloat16's mantissa to be incremented by 1 if the 16
+    // or if they are equal to 0x8000 and the least significant bit of the
-        // least significant bits of the float mantissa are greater than 0x8000,
+    // bfloat16 mantissa is 1 (odd). This causes it to be rounded to even when
-        // or if they are equal to 0x8000 and the least significant bit of the
+    // the lower 16 bits are exactly 0x8000. If the bfloat16 mantissa already
-        // bfloat16 mantissa is 1 (odd). This causes it to be rounded to even when
+    // has the value 0x7f, then incrementing it causes it to become 0x00 and
-        // the lower 16 bits are exactly 0x8000. If the bfloat16 mantissa already
+    // the exponent is incremented by one, which is the next higher FP value
-        // has the value 0x7f, then incrementing it causes it to become 0x00 and
+    // to the unrounded bfloat16 value. When the bfloat16 value is subnormal
-        // the exponent is incremented by one, which is the next higher FP value
+    // with an exponent of 0x00 and a mantissa of 0x7f, it may be rounded up
-        // to the unrounded bfloat16 value. When the bfloat16 value is subnormal
+    // to a normal value with an exponent of 0x01 and a mantissa of 0x00.
-        // with an exponent of 0x00 and a mantissa of 0x7F, it may be rounded up
+    // When the bfloat16 value has an exponent of 0xFE and a mantissa of 0x7F,
-        // to a normal value with an exponent of 0x01 and a mantissa of 0x00.
+    // incrementing it causes it to become an exponent of 0xFF and a mantissa
-        // When the bfloat16 value has an exponent of 0xFE and a mantissa of 0x7F,
+    // of 0x00, which is Inf, the next higher value to the unrounded value.
-        // incrementing it causes it to become an exponent of 0xFF and a mantissa
+    bool flag0 = ~u.int32 & 0x7f800000;
-        // of 0x00, which is Inf, the next higher value to the unrounded value.
-        u.int32 += 0x7fff + ((u.int32 >> 16) & 1); // Round to nearest, round to even
+    // When all of the exponent bits are 1, the value is Inf or NaN.
-    }
+    // Inf is indicated by a zero mantissa. NaN is indicated by any nonzero
-    else if(u.int32 & 0xffff)
+    // mantissa bit. Quiet NaN is indicated by the most significant mantissa
-    {
+    // bit being 1. Signaling NaN is indicated by the most significant
-        // When all of the exponent bits are 1, the value is Inf or NaN.
+    // mantissa bit being 0 but some other bit(s) being 1. If any of the
-        // Inf is indicated by a zero mantissa. NaN is indicated by any nonzero
+    // lower 16 bits of the mantissa are 1, we set the least significant bit
-        // mantissa bit. Quiet NaN is indicated by the most significant mantissa
+    // of the bfloat16 mantissa, in order to preserve signaling NaN in case
-        // bit being 1. Signaling NaN is indicated by the most significant
+    // the bfloat16's mantissa bits are all 0.
-        // mantissa bit being 0 but some other bit(s) being 1. If any of the
+    bool flag1 = !flag0 && (u.int32 & 0xffff);
-        // lower 16 bits of the mantissa are 1, we set the least significant bit
-        // of the bfloat16 mantissa, in order to preserve signaling NaN in case
+    u.int32 += flag0 ? 0x7fff + ((u.int32 >> 16) & 1) : 0; // Round to nearest, round to even
-        // the bloat16's mantissa bits are all 0.
+    u.int32 |= flag1 ? 0x10000 : 0x0;                      // Preserve signaling NaN
-        u.int32 |= 0x10000; // Preserve signaling NaN
-    }
    return uint16_t(u.int32 >> 16);
 }