Allow \u2e12 in variable names #34835

ChrisRackauckas · 2020-02-21T09:59:31Z

This came up in https://github.com/JuliaDiffEq/ModelingToolkit.jl/issues/247 where u₁⸒₂ would be a great name for a generated symbolic variable in a matrix, but it is not allowed with the current punctuation rules in the parser. I would advocate for allowing this inside variable names, since right now it doesn't have another use.


JeffreySarnoff · 2020-02-21T18:12:02Z

mᵢ⸒ⱼ⸒ₖ

JeffBezanson · 2020-02-21T19:16:50Z

Is there a latex sequence for this?

JeffreySarnoff · 2020-02-21T19:25:20Z

Not per se afaik. ($$m_{i,j,k}$$) ($_,$) though

ChrisRackauckas · 2020-02-21T19:59:35Z

Indeed, the LaTeX we generate as output does use m_{i,j,k}, but we can't have that as a variable name, so we are trying to get as close as possible:

https://github.com/JuliaDiffEq/ModelingToolkit.jl/blob/v1.2.7/test/latexify.jl#L38-L51

Essentially, what we're doing is taking in data and then spitting out the LaTeX for the physical laws that would generate the data. So we're trying to make our symbolic variables on the Julia side be symbols that are close to the LaTeX, to make it easier to read and relate (since we're generating Julia code for the functions as well). What we're missing is a unicode subscript comma.

StefanKarpinski · 2020-02-21T20:27:44Z

It seems like it should be input as \_,<tab>.

JeffreySarnoff · 2020-02-21T21:00:14Z

There are also \u02cf 'ˏ' (mᵢˏⱼˏₖ) and \u201a '‚' (mᵢ‚ⱼ‚ₖ), versus '⸒' (mᵢ⸒ⱼ⸒ₖ) using \u2e12. Either is more comma-like and available in many more fonts (these counts do not include recently released fonts):
~10 font families supporting \u2e12 (mᵢ⸒ⱼ⸒ₖ)
~100 font families supporting \u201a (mᵢ‚ⱼ‚ₖ)
~40 font families supporting \u02cf (mᵢˏⱼˏₖ)

IBM Plex Mono has \u201a \u02cf, does not have \u2e12
FiraMono, Roboto Mono, Hasklig have \u201a, do not have \u02cf \u2e12

Given the lesser availability of \u2e12, one of the alternatives is preferable.

knuesel · 2021-02-25T10:27:45Z

u+201a actually looks too nice, i.e. it's too hard to distinguish from a regular comma: mᵢ‚ⱼ‚ₖ vs mᵢ,ⱼ,ₖ.

In a sense, that u+02cf looks a bit weird is an advantage, as it makes it clear that it's not a regular comma: mᵢˏⱼˏₖ vs mᵢ,ⱼ,ₖ (this is also true of u+2e12). This is what ModelingToolkit uses now.

There is also u+02cc "modifier letter low vertical line": mᵢˌⱼˌₖ. It is distinguishable from a regular comma, present in many fonts, and already allowed in variable names (this is all also true of u+02cf).
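For readers comparing the candidates above, here is a small sketch (in Python, using only the standard `unicodedata` module) that prints the Unicode name and general category of each character discussed. It is not Julia's parser and makes no claim about its exact rules. The assumption, based on the Julia manual's description of allowed identifier characters, is that letter categories such as Lm are accepted while punctuation categories generally are not, which would be consistent with u+02cf and u+02cc being allowed and u+2e12 being rejected. The helper name `describe` is made up for this illustration.

```python
import unicodedata

# Candidate "subscript comma" characters mentioned in this thread.
CANDIDATES = ["\u2e12", "\u201a", "\u02cf", "\u02cc"]


def describe(ch):
    """Return (code point label, Unicode name, general category) for one character."""
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))


if __name__ == "__main__":
    for ch in CANDIDATES:
        code, name, category = describe(ch)
        # Lm = modifier letter; Po / Ps = punctuation categories.
        print(f"{code}  {category}  {name}")
```

The two modifier letters report category Lm, while u+2e12 and u+201a report punctuation categories. Whether a given character is actually accepted in an identifier is decided by the Julia parser, so treat the category only as a hint.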

JeffBezanson added the parser (Language parsing and surface syntax) label Feb 21, 2020

JeffBezanson added the unicode (Related to unicode characters and encodings) label Feb 21, 2020

tkf mentioned this issue Feb 22, 2020

Define g ⨟ f = f ∘ g #34832

Open
