You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We could try to include the dash in the list of allowed word characters. Maybe only need to do some preprocesssing to remove it at the beginning and end, so fragments like "-word" or "word-" (happens a lot e.g. in German language) are processed correctly.
Seems like we're splitting words such as
green-yellow
orT-rex
. I wonder if there's a good solution to this.The text was updated successfully, but these errors were encountered: