Computing LSTM forward layer without allocating #3351

robertbastian · 2023-04-19T14:46:11Z

Based on #3349

There's no visible performance difference on our benchmarks, however this avoids an allocation of length codepoints x hidden units, and we only bench on small strings. For longer strings it's probably good to avoid this allocation.

#3305

zbraniecki · 2023-04-20T19:11:48Z

I can report perf improvement on th lstm from icu-perf

icu4x/th/baked/segmenter/word/lstm/overview
                        time:   [2.6583 ms 2.6646 ms 2.6730 ms]
                        change: [-1.4063% -1.1068% -0.7734%] (p = 0.00 < 0.05)
                        Change within noise threshold.

icu4x/th/baked/segmenter/line/lstm/overview
                        time:   [2.6445 ms 2.6487 ms 2.6530 ms]
                        change: [-1.7483% -1.4729% -1.2153%] (p = 0.00 < 0.05)
                        Performance has improved.

robertbastian · 2023-04-20T19:15:32Z

Yeah I got the 1% as well but it's nothing to write home about 😀

zbraniecki · 2023-04-21T17:48:37Z

I'm all for celebrating little wins.

sffc

Thanks for making this change

components/segmenter/src/complex/lstm/mod.rs

sffc · 2023-05-04T08:35:19Z

components/segmenter/src/complex/lstm/mod.rs

                    self.dic
-                        .get_copied(UnvalidatedStr::from_bytes(&buf[..i]))
+                        .get_copied_by(|key| {
+                            key.as_bytes().iter().copied().cmp(


Suggestion (optional): UTF-8 byte order is equivalent to UTF-32 order; it may be cleaner/smaller/faster code if you compared an iterator of char rather than an iterator of u8.

Separately, I'm starting to get a bit worried that the grapheme cluster code may bloat the code size of the LSTM segmenter. We should probably delete it at some point. Maybe as part of the next round of ML model upgrades.

The keys are UnvalidatedStrs aka [u8], we cannot use them as anything that's strongly typed.

There is https://docs.rs/utf8_iter/latest/utf8_iter/ for that

We don't like deps

Segmenter already depends on utf8_iter.

icu4x/components/segmenter/Cargo.toml

Line 35 in b6c4018

utf8_iter = "1.0.3"

robertbastian added 5 commits April 19, 2023 14:52

x

e8de12b

vis

dd061ef

pubs

bb1067b

fix

3705319

segiter

d620b8a

robertbastian force-pushed the segiter branch from ff5a237 to d620b8a Compare April 19, 2023 14:54

This comment was marked as spam.

Sign in to view

Merge branch 'main' into segiter

f089002

robertbastian added the C-segmentation Component: Segmentation label Apr 20, 2023

clippy

a7c5cb3

robertbastian requested review from Manishearth and removed request for Manishearth April 20, 2023 15:32

Merge branch 'main' into segiter

0716570

robertbastian marked this pull request as ready for review April 20, 2023 16:48

robertbastian requested review from aethanyc, makotokato and sffc as code owners April 20, 2023 16:48

robertbastian requested a review from Manishearth April 20, 2023 16:48

sffc previously approved these changes May 2, 2023

View reviewed changes

Merge branch 'main' into segiter

20a4655

robertbastian requested a review from sffc May 3, 2023 10:16

robertbastian dismissed sffc’s stale review via 20a4655 May 3, 2023 10:16

robertbastian mentioned this pull request May 3, 2023

Continue investigating LSTM speedups #3305

Open

4 tasks

sffc previously approved these changes May 3, 2023

View reviewed changes

robertbastian commented May 3, 2023

View reviewed changes

components/segmenter/src/complex/lstm/mod.rs Outdated Show resolved Hide resolved

Manishearth removed their request for review May 3, 2023 16:34

cmp iters

0bee697

robertbastian dismissed sffc’s stale review via 0bee697 May 3, 2023 18:10

robertbastian requested a review from Manishearth as a code owner May 3, 2023 18:10

test

b8b1387

robertbastian requested a review from sffc May 3, 2023 18:38

sffc approved these changes May 4, 2023

View reviewed changes

robertbastian merged commit 5ab0f5f into unicode-org:main May 4, 2023

robertbastian deleted the segiter branch May 4, 2023 08:46

Manishearth mentioned this pull request Sep 21, 2023

1.3 utils release list #4066

Closed

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Computing LSTM forward layer without allocating #3351

Computing LSTM forward layer without allocating #3351

robertbastian commented Apr 19, 2023 •

edited

Loading

This comment was marked as spam.

zbraniecki commented Apr 20, 2023

robertbastian commented Apr 20, 2023

zbraniecki commented Apr 21, 2023

sffc left a comment

sffc May 4, 2023

robertbastian May 4, 2023

sffc May 4, 2023

robertbastian May 4, 2023

aethanyc May 4, 2023

Computing LSTM forward layer without allocating #3351

Computing LSTM forward layer without allocating #3351

Conversation

robertbastian commented Apr 19, 2023 • edited Loading

This comment was marked as spam.

zbraniecki commented Apr 20, 2023

robertbastian commented Apr 20, 2023

zbraniecki commented Apr 21, 2023

sffc left a comment

Choose a reason for hiding this comment

sffc May 4, 2023

Choose a reason for hiding this comment

robertbastian May 4, 2023

Choose a reason for hiding this comment

sffc May 4, 2023

Choose a reason for hiding this comment

robertbastian May 4, 2023

Choose a reason for hiding this comment

aethanyc May 4, 2023

Choose a reason for hiding this comment

robertbastian commented Apr 19, 2023 •

edited

Loading