Arabic Numeric Shaping Support #2156

samarsultan · 2017-02-07T19:13:36Z

Arabic and many other languages (such as Thai and Bengali) have classical digit shapes, called “National Digits”, that differ from the conventional Western (European) digits.
National digits have the same semantic meaning as European digits, and the numbers they form are read from left to right (most significant digit on the left). The only difference is in the glyphs.

From the Arabic user's point of view, Arabic-Indic numerals are the basic numerals used in almost all kinds of documents, such as most government documents (IDs, birth certificates, driver's licenses, passports and household bills), bank statements, newspapers, calendars, road signs and menus.

Options for Arabic Numeric Shaping

Three options should be considered when implementing national numeric shaping support in any framework or technology:
• None: No shaping is performed, and the value appears as it is in the data source.
• National: Digit shapes are determined from the user’s language.
• Contextual: Digit shapes are determined from the preceding characters in the buffer. European digits follow a strong Latin character and Arabic-Indic digits follow a strong Arabic character. When there are no preceding strong characters, the base text direction attribute determines the digit shaping (Arabic-Indic digits in an RTL context and European digits in an LTR context).
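
The three options above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Jupyter or any particular framework implements shaping: the function name shape_digits and its parameters (mode, language, base_rtl) are hypothetical, "strong Arabic" is approximated by the Unicode bidirectional class AL, and any other strong character (class L or R) is treated as a European-digit context.

```python
import unicodedata

EUROPEAN = "0123456789"
# Arabic-Indic digits U+0660..U+0669, built from code points to avoid
# any ambiguity about ordering in bidirectional editors.
ARABIC_INDIC = "".join(chr(0x0660 + i) for i in range(10))

TO_ARABIC_INDIC = str.maketrans(EUROPEAN, ARABIC_INDIC)
TO_EUROPEAN = str.maketrans(ARABIC_INDIC, EUROPEAN)


def shape_digits(text, mode="none", language="ar", base_rtl=False):
    """Hypothetical helper illustrating the None/National/Contextual options."""
    if mode == "none":
        # None: the value appears exactly as it is in the data source.
        return text
    if mode == "national":
        # National: shape follows the user's language. Only Arabic is
        # handled here (an assumption made to keep the sketch short).
        if language.startswith("ar"):
            return text.translate(TO_ARABIC_INDIC)
        return text
    if mode != "contextual":
        raise ValueError("mode must be 'none', 'national' or 'contextual'")

    # Contextual: start from the base text direction, then let each strong
    # character switch the context for the digits that follow it.
    arabic_context = base_rtl
    shaped = []
    for ch in text:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class == "AL":           # strong Arabic letter
            arabic_context = True
        elif bidi_class in ("L", "R"):   # other strong character (assumption)
            arabic_context = False
        elif ch in EUROPEAN or ch in ARABIC_INDIC:
            ch = ch.translate(TO_ARABIC_INDIC if arabic_context else TO_EUROPEAN)
        shaped.append(ch)
    return "".join(shaped)


# Mixed English and Arabic data: each number follows its preceding text.
print(shape_digits("file 12 \u0645\u0644\u0641 34", "contextual"))
# No strong characters: the base direction decides.
print(shape_digits("2017", "contextual", base_rtl=True))
```

In the first call the digits after "file" stay European, while the digits after the Arabic word are shaped as Arabic-Indic. In the second call there is no strong character, so the RTL base direction selects Arabic-Indic digits.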

Problem Statement

Most of the available frameworks and technologies lack the contextual shaping option for national digits. Contextual digit shaping is a very important feature, because Arabic users do not expect to see only Arabic-Indic numerals or only European numerals when their data mixes English and Arabic.
For example, if a document has many paragraphs, some in Arabic and others in English, Arabic users expect to see national (Arabic-Indic) numerals in the Arabic paragraphs and European numerals in the English paragraphs.
Since mixed English and Arabic data is very common in the Arabic region, mixed numerals are very common too.

Figure 1: Notebook file name

Figure 2: Item names (files & running tab)

Figure 3: Output from markdown cell.

Figure 4: Pagination in the generated pdf

Carreau · 2017-02-09T20:10:48Z

Thanks for the detailed description! The conversion to PDF will likely also require changes to https://github.com/jupyter/nbconvert.

samarsultan · 2017-02-12T14:58:58Z

Yes, I see.

We can defer it slightly until the current notebook is fully supported.

samarsultan mentioned this issue Feb 7, 2017

Adding GUI Mirroring- issue #1852 #1860

Merged

samarsultan mentioned this issue Feb 12, 2017

Bidi Support design proposal for (Numeric Shaping , National Calendar and UI Mirroring) #2178

Open

jasongrout mentioned this issue Feb 9, 2018

Jupyter misdisplying Python lists with Arabic and alphanumeric elements jupyterlab/jupyterlab#3846

Open
