This repository has been archived by the owner on Jun 7, 2023. It is now read-only.

JRNN batch size to speed up inference? #69

Open
max-markov opened this issue May 2, 2022 · 0 comments


The out-of-the-box version provided for JRNN (building on the originally published model) seems very slow: inference for a single sentence takes around a minute, even on a GPU.
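
For reference, the per-sentence loop I'm describing is roughly the pattern below. This is only a minimal sketch of driving a TF1 text-format graph; the tensor names (`inputs_in:0`, `softmax_out:0`) and input shape are illustrative rather than the repo's actual ones, and checkpoint restoration is omitted:

```python
import numpy as np
import tensorflow.compat.v1 as tf
from google.protobuf import text_format

tf.disable_eager_execution()

# Parse the text-format graph definition once, up front.
with tf.gfile.GFile("graph-2016-09-10.pbtxt") as f:
    graph_def = tf.GraphDef()
    text_format.Merge(f.read(), graph_def)

graph = tf.Graph()
with graph.as_default():
    tf.import_graph_def(graph_def, name="")

# One session kept alive across all sentences (checkpoint restore omitted).
sess = tf.Session(graph=graph)

def score_sentence(token_ids):
    # The batch dimension is fixed at 1, so every call processes exactly
    # one sentence; this is the ~1 minute/sentence path, even on GPU.
    feed = {graph.get_tensor_by_name("inputs_in:0"):
            np.asarray(token_ids, dtype=np.int32)[None, :]}
    return sess.run(graph.get_tensor_by_name("softmax_out:0"), feed)
```

Keeping the session alive across sentences already amortizes the graph-loading cost, but each `sess.run` call still scores a single sentence.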

Increasing the batch size doesn't help, since the inference code is buggy and only supports single samples, and changing the dimensionality in the graph definition file (graph-2016-09-10.pbtxt) leads to dimensionality mismatches downstream (see the sketch after this paragraph).
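
Concretely, what I mean by "changing the dimensionality" is roughly the programmatic equivalent of the edit below; the `GraphDef`/`Placeholder` field names are the standard protobuf ones, and the batch size is just an example value:

```python
import tensorflow.compat.v1 as tf
from google.protobuf import text_format

BATCH_SIZE = 32  # example value; anything other than 1 breaks downstream

with tf.gfile.GFile("graph-2016-09-10.pbtxt") as f:
    graph_def = tf.GraphDef()
    text_format.Merge(f.read(), graph_def)

for node in graph_def.node:
    if node.op == "Placeholder" and node.attr["shape"].shape.dim:
        # Rewrite the leading (batch) dimension of each input placeholder.
        # Ops further down the graph still expect a batch of one, hence
        # the dimensionality mismatches.
        node.attr["shape"].shape.dim[0].size = BATCH_SIZE
```

Rewriting the leading dimension of the input placeholders this way is exactly what triggers the downstream shape errors, since other ops in the graph appear to have a batch size of 1 baked in.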

Has anyone been able to make inference faster for this model?

Thank you :)
