Fix split when a nbsp character is present #1073

FredTreg · 2024-10-14T10:08:22Z

When computing the minReadableWidth of a cell, the code does not account for non-breaking space characters (also known as nbsp or \u00A0).

nbsp characters should be treated as non-space characters when calculating the string length, as this is the intended function of such characters.

Failing to do so results in suboptimal output, particularly for languages like French, where punctuation marks such as colons (:) are always preceded by a space and should remain on the same line as the preceding word.

This PR fixes that issue.

…character

umaganesan · 2024-10-14T16:48:18Z

How times do I need to unsubscribe from this website, thread and conversation? Thanks

…

On Mon, 14 Oct 2024, 3:38 pm Frederic Tregon, ***@***.***> wrote: When computing the minReadableWidth of a cell, the code does not account for non-breaking space characters (also known as nbsp or \u00A0). nbsp characters should be treated as non-space characters when calculating the string length, as this is the intended function of such characters. Failing to do so results in suboptimal output, particularly for languages like French, where punctuation marks such as colons (:) are always preceded by a space and should remain on the same line as the preceding word. This PR fixes that issue. ------------------------------ You can view, comment on, or merge this pull request online at: #1073 Commit Summary - 5657768 <5657768> fix(split): when computing longest word, nbsp should be considered a character File Changes (1 file <https://github.com/simonbengtsson/jsPDF-AutoTable/pull/1073/files>) - *M* src/widthCalculator.ts <https://github.com/simonbengtsson/jsPDF-AutoTable/pull/1073/files#diff-0aff2bfd71ecbb850b40ffe226854fd5421dc8aec1515a784c8d76e48e153f9a> (2) Patch Links: - https://github.com/simonbengtsson/jsPDF-AutoTable/pull/1073.patch - https://github.com/simonbengtsson/jsPDF-AutoTable/pull/1073.diff — Reply to this email directly, view it on GitHub <#1073>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADSU6DPBFLPVETFACP7TYDDZ3OJ23AVCNFSM6AAAAABP4UOYXGVHI2DSMVQWIX3LMV43ASLTON2WKOZSGU4DKNJQHA2DOMY> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

simonbengtsson · 2024-10-14T20:50:41Z

Interesting! Based on my very limited understanding shouldn't a cell with the content d'où viens-tu ? be considered two words then? Ie wouldn't it be better to calculate the "longestWordWidth" based on the entire viens-tu ? instead of viens-tu and "?" separately?

FredTreg · 2024-10-14T21:43:59Z

This is exactly the goal of this fix, the French would be pre-processed as viens-tu\u00A0?, so with my fix it would be one word only.
Another example is with currencies (still in French), where we would write 12 € preprocessed as 12\u00A0€. Without the fix it would be appear as 2 lines, with the fix it stays on one line

simonbengtsson · 2024-10-14T23:37:59Z

Got it! I thought since \s does not match non breaking spaces as far as I understand it would work with this. But I'll try it tomorrow considering you have experienced issues with it.

FredTreg · 2024-10-15T05:48:19Z

since \s does not match non breaking spaces

I do not know about that, I did the following on the Chrome console:

> const words = 'two\u00A0words'
  words.split(/\s+/)
  > (2) ['two', 'words']

So it does split on nbsp and the character is indeed listed as whitespace on MDN: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_expressions/Character_classes

simonbengtsson · 2024-10-15T07:24:03Z

You are right. I tried with a regex tester yesterday, but apparently incorrectly. Can you add a comment or a named variable for the regex? Then I'll merge promptly.

FredTreg · 2024-10-15T07:53:45Z

Done!

simonbengtsson · 2024-10-15T13:31:24Z

Thanks! Merged and released in v3.8.4

FredTreg · 2024-10-15T13:36:17Z

Thank you, and thanks for making it easy to contribute.

fix(split): when computing longest word, nbsp should be considered a …

5657768

…character

doc(split): adding comment on nbsp splitting

e1896e1

doc(split): fixing comment on nbsp splitting

e4f512a

simonbengtsson merged commit 70b66de into simonbengtsson:master Oct 15, 2024

cyb3rn3t mentioned this pull request Nov 6, 2024

[Snyk] Upgrade jspdf-autotable from 3.8.3 to 3.8.4 sofia-lpz/sandersCRM#50

Open

snipe mentioned this pull request Nov 8, 2024

[Snyk] Upgrade jspdf-autotable from 3.8.3 to 3.8.4 snipe/snipe-it#15786

Closed

omaration21 mentioned this pull request Nov 8, 2024

[Snyk] Upgrade jspdf-autotable from 3.8.3 to 3.8.4 omaration21/sandersapp#14

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix split when a nbsp character is present #1073

Fix split when a nbsp character is present #1073

FredTreg commented Oct 14, 2024 •

edited

Loading

umaganesan commented Oct 14, 2024 via email

simonbengtsson commented Oct 14, 2024

FredTreg commented Oct 14, 2024 •

edited

Loading

simonbengtsson commented Oct 14, 2024

FredTreg commented Oct 15, 2024

simonbengtsson commented Oct 15, 2024 •

edited

Loading

FredTreg commented Oct 15, 2024 •

edited

Loading

simonbengtsson commented Oct 15, 2024

FredTreg commented Oct 15, 2024

Fix split when a nbsp character is present #1073

Fix split when a nbsp character is present #1073

Conversation

FredTreg commented Oct 14, 2024 • edited Loading

umaganesan commented Oct 14, 2024 via email

simonbengtsson commented Oct 14, 2024

FredTreg commented Oct 14, 2024 • edited Loading

simonbengtsson commented Oct 14, 2024

FredTreg commented Oct 15, 2024

simonbengtsson commented Oct 15, 2024 • edited Loading

FredTreg commented Oct 15, 2024 • edited Loading

simonbengtsson commented Oct 15, 2024

FredTreg commented Oct 15, 2024

FredTreg commented Oct 14, 2024 •

edited

Loading

FredTreg commented Oct 14, 2024 •

edited

Loading

simonbengtsson commented Oct 15, 2024 •

edited

Loading

FredTreg commented Oct 15, 2024 •

edited

Loading