
FastGelu float16 #621

Merged: 13 commits merged into main from rashuai/MFloat16 on Dec 11, 2023

Conversation

@RandySheriffH (Contributor) commented on Dec 7, 2023:

Add float16 support for contrib cuda ops.

@RandySheriffH RandySheriffH marked this pull request as ready for review December 9, 2023 05:39
@RandySheriffH RandySheriffH requested a review from a team as a code owner December 9, 2023 05:39
#include "onnxruntime_c_api.h"
#if ORT_API_VERSION >= 16

#include "onnxruntime_float16.h"
A reviewer (Member) commented:

Is this file shipped with the ORT C++ package?

@RandySheriffH (Contributor, Author) replied:

yes it is, since 1.16.

const T* bias_data = bias.has_value() ? (*bias)->Data() : nullptr;
auto bias_length = bias.has_value() ? (*bias)->NumberOfElement() : 0;
using TT = typename CudaT<T>::MappedType;
LaunchFastGeluKernel<TT>(reinterpret_cast<cudaStream_t>(ctx.cuda_stream),
A reviewer (Member) commented:

should the return error code be handled here?

@RandySheriffH (Contributor, Author) replied:

Good point, will report it in a coming iteration.

]

input0 = helper.make_tensor_value_info(
    'x', onnx_proto.TensorProto.FLOAT16, [])
A reviewer (Member) commented:

Dumb question: which fp16 was tested here, MFloat16 or BFloat16?

@RandySheriffH (Contributor, Author) replied on Dec 11, 2023:

It is MFloat16.
For BFloat16, we need to test it with native test cases, since the type is not exposed via Python.

@RandySheriffH RandySheriffH merged commit 1ccc405 into main Dec 11, 2023
@RandySheriffH RandySheriffH deleted the rashuai/MFloat16 branch December 11, 2023 22:31
3 participants