Half base type #1706

yhmtsai · 2024-10-22T15:50:26Z

This pr moves the half out of extended_float.hpp with some modification and half is available in the public interface.

Currently, we can still mark some conversion/operation with GKO_ATTRIBUTE and GKO_INLINE.
However, I think it is good to ensure we only use the vendor's impl on device side.
To achieve it, Jacobi needs to use __half not half now.
This also removes the undefined behavior from reinterpret_cast to std::memcpy

~~one question: should we mark the half precision in experimental namespace?~~ in usual interface from discussion

MarcelKoch

Just to reiterate, I'm in favor of putting this directly into the gko namespace.

include/ginkgo/core/base/half.hpp

core/test/base/half.cpp

yhmtsai · 2024-10-24T13:32:05Z

@MarcelKoch I have made the half without types.hpp dependence such that we can include it directly without circular dependence and the instantiate function have the definition of half now

core/test/base/floating_bit_helper.hpp

upsj

LGTM!

upsj · 2024-11-22T08:04:43Z

accessor/cuda_helper.hpp

@@ -17,7 +17,15 @@
 #include "utils.hpp"


+struct __half;


I don't think there's a way around the issue, but we should be aware that __half is a reserved identifier according to the C++ standard, so we should technically not be defining anything with that name. Also there might be a tiny potential of name mangling issues if this is ever handled as a class instead of a struct with MSVC.

upsj · 2024-11-22T08:07:07Z

core/test/base/half.cpp

+    half x = create_from_bits("0" "01111111" "00000000000000000000000");
+
+    ASSERT_EQ(get_bits(x), get_bits("0" "01111" "0000000000"));


FYI, since C++14 we can use binary literals 0b01111111

upsj · 2024-11-22T08:08:01Z

cuda/base/types.hpp

+
+template <>
+struct culibs_type_impl<std::complex<half>> {
+    using type = __half2;


Just for my understanding, __half2 is a vector type derived from __half?

you are right. only one function __hcmadd does complex operation, but the other are all vector type operation.

I have check cusparse they use __half2 to hold complex<__half>, too. https://docs.nvidia.com/cuda/cusparse/#cudadatatype-t

upsj · 2024-11-22T08:09:09Z

dev_tools/scripts/gdb-ginkgo.py

+        # GDB doesn't seem to consider the user-defined conversion in its Value.cast,
+        # so we need to call the conversion operator explicitly
+        address = hex(val.address)
+        self.float_val = gdb.parse_and_eval(f"reinterpret_cast<gko::half*>({address})->operator float()")


If that function is ever not exported, we may need to implement an xfunction for it

I am not familar with that. This is done by @MarcelKoch
could you implement that?

We likely don't need it now, just for future reference

.pre-commit-config.yaml

… half to another test

Co-authored-by: Marcel Koch <marcel.koch@kit.edu>

This PR implements the half precision for matrices and components Related PR: #1706

yhmtsai added the 1:ST:ready-for-review This PR is ready for review label Oct 22, 2024

yhmtsai self-assigned this Oct 22, 2024

yhmtsai force-pushed the half_type branch 2 times, most recently from 265cb47 to b0c488b Compare October 22, 2024 16:06

yhmtsai added 1:ST:WIP This PR is a work in progress. Not ready for review. and removed 1:ST:ready-for-review This PR is ready for review labels Oct 22, 2024

yhmtsai force-pushed the half_type branch 4 times, most recently from 8bd8d1c to 2ab1acd Compare October 23, 2024 12:45

yhmtsai added 1:ST:ready-for-review This PR is ready for review and removed 1:ST:WIP This PR is a work in progress. Not ready for review. labels Oct 23, 2024

yhmtsai mentioned this pull request Oct 23, 2024

Build fails with strict-aliasing violations #1657

Open

yhmtsai added this to the Ginkgo 1.9.0 milestone Oct 24, 2024

MarcelKoch self-requested a review October 24, 2024 08:14

MarcelKoch approved these changes Oct 24, 2024

View reviewed changes

include/ginkgo/core/base/half.hpp Outdated Show resolved Hide resolved

include/ginkgo/core/base/half.hpp Show resolved Hide resolved

core/test/base/half.cpp Outdated Show resolved Hide resolved

yhmtsai force-pushed the half_type branch from da716c7 to 7c2461f Compare October 24, 2024 12:16

MarcelKoch requested a review from thoasm October 24, 2024 12:54

yhmtsai requested a review from MarcelKoch October 24, 2024 13:30

MarcelKoch approved these changes Oct 24, 2024

View reviewed changes

core/test/base/floating_bit_helper.hpp Outdated Show resolved Hide resolved

core/test/base/floating_bit_helper.hpp Outdated Show resolved Hide resolved

yhmtsai force-pushed the half_type branch from 7c2461f to 0f9df36 Compare October 24, 2024 13:55

This was referenced Oct 24, 2024

Half matrix and components #1708

Merged

Half precision support #1257

Closed

yhmtsai force-pushed the half_type branch 2 times, most recently from 7eaf547 to d4f3893 Compare November 5, 2024 18:03

yhmtsai force-pushed the half_type branch 2 times, most recently from f924ff5 to 5a7ae0c Compare November 18, 2024 11:16

yhmtsai force-pushed the half_type branch from adfc8ac to 1368a21 Compare November 20, 2024 17:33

upsj approved these changes Nov 22, 2024

View reviewed changes

yhmtsai force-pushed the half_type branch 3 times, most recently from 8cc9759 to 0fa6dc9 Compare November 28, 2024 18:11

yhmtsai and others added 8 commits November 29, 2024 14:56

half base type

f39aeab

half does not have constexpr constructor

c37b7be

fix the undefined behavior and the issue from big-endian, and extract…

afb108b

… half to another test

jacobi use __half in device not gko::half now

3b46e41

type map

685331c

fix error: non-constant-expression cannot be narrowed

9c51790

update gdb-ginkgo

6317e3f

Co-authored-by: Marcel Koch <marcel.koch@kit.edu>

make half not rely on type

b8f4584

yhmtsai force-pushed the half_type branch from 0fa6dc9 to 3b05fc9 Compare November 29, 2024 15:22

yhmtsai and others added 5 commits November 30, 2024 00:27

collect the reused part and undef after usage

8508fbb

Co-authored-by: Marcel Koch <marcel.koch@kit.edu>

use memcpy not std::memcpy in hip

03eb022

add alignment

1f0f619

delete the sycl half test as we do not enable it directly

acbae4a

use reference for half when it is possible

b5afcac

yhmtsai force-pushed the half_type branch from 3b05fc9 to b5afcac Compare November 30, 2024 01:30

yhmtsai added 1:ST:ready-to-merge This PR is ready to merge. 1:ST:skip-full-test and removed 1:ST:ready-for-review This PR is ready for review labels Dec 3, 2024

yhmtsai merged commit a144043 into develop Dec 3, 2024
10 of 11 checks passed

yhmtsai deleted the half_type branch December 3, 2024 01:08

yhmtsai added a commit that referenced this pull request Dec 3, 2024

Merge #1708 Add Half matrix and components implementation

76ef161

This PR implements the half precision for matrices and components Related PR: #1706

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Half base type #1706

Half base type #1706

yhmtsai commented Oct 22, 2024 •

edited

Loading

MarcelKoch left a comment

yhmtsai commented Oct 24, 2024

upsj left a comment

upsj Nov 22, 2024

upsj Nov 22, 2024

upsj Nov 22, 2024

yhmtsai Nov 29, 2024

yhmtsai Nov 30, 2024

upsj Nov 22, 2024

yhmtsai Nov 29, 2024

upsj Nov 29, 2024

		half x = create_from_bits("0" "01111111" "00000000000000000000000");

		ASSERT_EQ(get_bits(x), get_bits("0" "01111" "0000000000"));

Half base type #1706

Half base type #1706

Conversation

yhmtsai commented Oct 22, 2024 • edited Loading

MarcelKoch left a comment

Choose a reason for hiding this comment

yhmtsai commented Oct 24, 2024

upsj left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yhmtsai commented Oct 22, 2024 •

edited

Loading