
Fix Triton mini batch bug #688

Conversation


@dagardner-nv (Contributor) commented Feb 9, 2023

  • Fixes an issue that can occur when the number of input tensors is greater than the number of rows in the dataframe.
  • Refactors the on_next method of the InferenceClientStage, which was getting long and hard to follow, moving some logic into its own methods.
  • Waits until all mini-batches have returned before calculating the reduction.

Fixes #680

… mapping back to the same source row in the dataframe.

Example:
model max batch size = 4
seq_ids = [0, 0, 0, 1, 1, 2, 3, 3]

This would cause the inference inputs for dataframe row 1
to occur in two non-adjacent batches.

This is a non-ideal fix, as it doesn't handle the case where a single
inference input is so large that it spans more rows than the model's max batch size.
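The splitting logic described above can be sketched as follows. This is a minimal illustration, not the actual Morpheus implementation: the function name `batch_boundaries` and its interface are hypothetical, assuming only that each input tensor carries a `seq_ids` entry mapping it back to a dataframe row.

```python
def batch_boundaries(seq_ids, max_batch_size):
    """Yield (start, stop) slices over the input tensors such that
    stop - start <= max_batch_size and no dataframe row (identified
    by its seq_id) is split across two mini-batches.

    Hypothetical sketch; not the Morpheus API.
    """
    start = 0
    n = len(seq_ids)
    while start < n:
        stop = min(start + max_batch_size, n)
        # Shrink the batch until it does not cut a row in half.
        while stop < n and seq_ids[stop] == seq_ids[stop - 1]:
            stop -= 1
        if stop == start:
            # A single row spans more inputs than max_batch_size;
            # the PR notes this case is still unhandled.
            raise ValueError("row spans more inputs than the model's max batch size")
        yield start, stop
        start = stop

# The example from the commit message: model max batch size = 4.
# Naive slicing would produce [0:4] and [4:8], splitting row 1's
# inputs across two batches; shrinking to a row boundary keeps
# each row's inputs together.
print(list(batch_boundaries([0, 0, 0, 1, 1, 2, 3, 3], 4)))
# → [(0, 3), (3, 6), (6, 8)]
```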
@dagardner-nv dagardner-nv requested a review from a team as a code owner February 9, 2023 19:13
@dagardner-nv dagardner-nv changed the title David fix triton mini batch Fix Triton mini batch bug #680 Feb 9, 2023
@dagardner-nv dagardner-nv changed the title Fix Triton mini batch bug #680 Fix Triton mini batch bug Feb 9, 2023
@dagardner-nv dagardner-nv added bug Something isn't working non-breaking Non-breaking change 3 - Ready for Review labels Feb 9, 2023
@dagardner-nv dagardner-nv mentioned this pull request Feb 9, 2023
rapids-bot bot pushed a commit that referenced this pull request Feb 22, 2023
* Remove usage of `tensorShape_t`, which was deprecated and later removed.
* Replace usage of the tensor constructor in favor of the recommended `make_tensor` helper method.
* Add more C++ unit tests.
* Mark RMMTensor as a public symbol so the C++ tests can use it.
* Add the `cuda-nvtx` package to the CI driver build, needed for matx-0.3.0.

Includes changes from PR #688.
Fixes #317

Authors:
  - David Gardner (https://github.com/dagardner-nv)

Approvers:
  - Michael Demoret (https://github.com/mdemoret-nv)

URL: #667
@dagardner-nv
Contributor Author

PR #667 contained this code and was merged.

jjacobelli pushed a commit to jjacobelli/Morpheus that referenced this pull request Mar 7, 2023
Successfully merging this pull request may close these issues.

[BUG]: C++ impl for Triton inference can incorrectly split inference inputs