Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cb changes 20241021 #10960

Merged
merged 6 commits into from
Oct 21, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -26432,6 +26432,10 @@ Piazza del Campo
Diario Libre
Community Notes
Safri Duo
Israel Defense Forces
International Trade Administration
Kronen Zeitung
Neue Kronen Zeitung
Copa Sudamericana
CONMEBOL Sudamericana
Copa Libertadores
Expand Down Expand Up @@ -26548,6 +26552,11 @@ Jean Paul Richter
Jean Paul
Sapir-Whorf
Jared Isaacman
Bain & Company
Trusted Shops
Amphitheatre Parkway
Emily in Paris
Steve Urkel
Polaris
Polaris Dawn
Internet Archive
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3704,6 +3704,7 @@ protected List<SuggestedReplacement> getOnlySuggestions(String word) {
case "Dnake": return topMatch("Danke");
case "Muehe": return topMatch("Mühe");
case "Muehen": return topMatch("Mühen");
case "Kuhdamm": return topMatch("Ku’damm");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Consider using a straight apostrophe or including multiple suggestions for "Ku’damm".

The suggestion "Ku’damm" uses a typographic apostrophe (U+2019). For consistency and to avoid potential encoding issues, you might use the standard apostrophe (U+0027) as in "Ku'damm". Additionally, since "Kudamm" without an apostrophe is also a common spelling, consider including it as an alternative suggestion.

Apply the following diff to update the suggestion:

-          case "Kuhdamm": return topMatch("Ku’damm");
+          case "Kuhdamm": return topMatch("Ku'damm", "Kudamm");

Committable suggestion was skipped due to low confidence.

case "Torschusspanik": return topMatch("Torschlusspanik");
case "ggf": return topMatch("ggf.");
case "Ggf": return topMatch("Ggf.");
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -11593,6 +11593,13 @@ Mocro-Mafia*
Burning-Man-Festival*
Higgs-Teilchen*
Higgs-Teilchens*
Scripted-Reality-Star*
Scripted-Reality-Stars*
Scripted-Reality-Sendung*
Scripted-Reality-Sendungen*
Scripted-Reality-Show*
Scripted-Reality-Shows*
Chagos-Archipel*
Drive-by-Shooting*
Drive-by-Shootings*
Reenactor-Messe$
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -243668,6 +243668,11 @@ Bouteiller/S
Boutella/S
Face-to-face-Kommunikation
Globusse/N
Kickl/S #name
Menendez #name
SDKs
ITA
fremd_ficken
Kompany/S #name
In-Dubio-pro-Reo-Grundsatz
In-Dubio-pro-Reo-Grundsatzes
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -572,4 +572,3 @@ Würstel/SN
Förderansuchen
Mocca/S
obgenannt/A
Kickl/S #name
Original file line number Diff line number Diff line change
Expand Up @@ -64659,6 +64659,13 @@ superschön/A
darlehensgleich/A
ultrahart/A
steuerschädlich/A
steuerunschädlich/A
Fressflash
alkoholselig/A
tot_rasen
elfmeterwürdig/A
freistoßwürdig/A
homemade
quotenschwächer/A
quotenstärker/A
quotenstark/A
Expand Down Expand Up @@ -64818,6 +64825,8 @@ raus_verhandeln
heraus_verhandeln
hinaus_verhandeln
rein_verhandeln
herum_verhandeln
rum_verhandeln
herein_verhandeln
hinein_verhandeln
Burning-Man-Festival/S
Expand Down Expand Up @@ -64853,6 +64862,7 @@ Sokratismus
Yars-Rakete/N
Yars-Raketensystem/S
Yars-Raketensysteme/N
anbringbar/A
isoperimetrisch/A
bruchstabil/A
einbringbar/A
Expand Down Expand Up @@ -64893,6 +64903,39 @@ Feuchteabdichtungen
kaputt_laufen
wichtig_machen
Yars-Atomrakete/N
Taşan-Funke/S #name
Scripted-Reality-Star/S
Scripted-Reality-Sendung
Scripted-Reality-Sendungen
Scripted-Reality-Show/S
WEEE-Reg.-Nr.
suchtähnlich/A
muschelähnlich/A
leitzinsrelevant/A
neidzerfressen/A
existenzbedrohend/A
debattierfreudig/A
menschheitsgeschichtlich/A
Dropshipper/SN
Dropshipperin
Dropshipperinnen
Drive-in-Fenster/SN
Drive-in-Schalter/SN
Drive-through-Fenster/SN
Drive-through-Schalter/SN
Drive-through/S
fremdflirten
fremdflirtend
fremdflirte
fremdflirtest
fremdzuflirten
fremdgeflirtet
fremd_vögeln
fremd_kuscheln
zusammen_kaufen
herbstblond/A
tot_trampeln
Chagos-Archipel/S
Kaputtheit
Kaputtheiten
Emailleplatte/N
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -138391,6 +138391,7 @@ Scaled Composites
Scaled Agile Framework
Scaled Agile©
Scaled Agile
Happy Path/S
Skalden- und Prosatexte/N
Sören Skalicks
Henderson Lake/S
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3078,6 +3078,14 @@ Individuelle Gesundheitsleistung
Auricula auris
Auricula atrii
Keith Jarrett/S
Schin Bet/S
Samu Haber/S
Scripted Reality
Trusted Flagging/S
Trusted Flagger/S
Trusted Flaggerin
Trusted Flaggerinnen
Sinem Taşan-Funke/S
Guardia Civil
Loose Cannon/S
French Press
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -758,6 +758,8 @@ Pro und Kontra Liste=Pro-und-Kontra-Liste
Pro & Kontra Liste=Pro-und-Kontra-Liste
Pro und Kontra Listen=Pro-und-Kontra-Listen
Pro & Kontra Listen=Pro-und-Kontra-Listen
Tasan-Funke=Taşan-Funke
Tasan-Funkes=Taşan-Funkes
EUfeindlich=EU-feindlich
EUfeindliche=EU-feindliche
EUfeindlichem=EU-feindlichem
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -1081,6 +1081,8 @@ Computershare Computershare NNP
pax pax NNS
torr torr NNS
plankton plankton NNS
Kochi Kochi NNP
Kerala Kerala NNP
nanoplankton nanoplankton NNS
Choctaw Choctaw JJ
Brangelina Brangelina NNP
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -10934,6 +10934,17 @@ AC PRIV
A/C PRIV
AC Priv
A/C Priv
IC
ICs
PDE
PDEs
XT
offsite
offsites
onsite
onsites
Kochi
Kerala
ENT
ENTs
silastic
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -475,6 +475,29 @@ unionist
unionists
boulangerie
boulangeries
pyrography
pyrograph
pyrographed
pyrographing
pyrographs
pyrographies
pyrographic
misandry
misandric
misandrist
misandrists
transmisandry
misandrism
misandristic
misandrous
Moldovan
Moldovans
Majorcan
Majorcans
Monacan
Monacans
Monegasque
Monegasques
indemnity
indemnities
indemnitee
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -8020,6 +8020,8 @@ Daffy Duck NNP
rima glottidis NN:U
Pavel Durov NNP
Ahmed Aboutaleb NNP
Chagos Archipelago NNP
Shin Bet NNP
Iron Crown NNP
Harold Shipton NNP
David Aaker NNP
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -86445,8 +86445,23 @@ USA
<token postag="PRP" skip="-1"></token>
<token>?</token>
</antipattern>
<antipattern>
<token regexp="yes" case_sensitive="yes">can|ca|have|should|could|will|wo|may|might</token>
<token regexp="yes">not|n't</token>
<token postag="PRP_S.*" postag_regexp="yes" />
<token postag="VB" />
</antipattern>
<antipattern>
<token regexp="yes">@.+</token>
<token regexp="yes">.+</token>
<token regexp="yes" case_sensitive="yes">can|ca|have|should|could|will|wo|won|may|might</token>
<token regexp="yes">not|n't</token>
</antipattern>
<pattern>
<token><exception postag="SENT_START"/></token>
<token>
<exception postag="SENT_START|PCT" postag_regexp="yes" />
<exception regexp="yes">@.+</exception>
</token>
<marker>
<token regexp="yes" case_sensitive="yes">can|ca|have|should|could|will|wo|won|may|might</token>
<token regexp="yes">not|n't</token>
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3350,6 +3350,11 @@ For more detailed information, including tag definitions, please check out our d
<example>... yields are around 2 t/ha, African under 1 t/ha, <marker>Egypt and</marker> Arabia up to 3.5 to 4 t/ha with irrigation.</example>
</rule>
<rule><!-- 3 adjectives -->
<antipattern>
<token>first</token>
<token>and</token>
<token>foremost</token>
</antipattern>
<pattern>
<token chunk_re=".-VP" postag="VB.*" postag_regexp="yes" />
<token postag="RB" min="0" chunk_re=".-AD.*" />
Expand Down Expand Up @@ -3386,6 +3391,11 @@ For more detailed information, including tag definitions, please check out our d
<token>,</token>
<token postag="JJ.*" postag_regexp="yes" regexp="yes">[A-Z].+</token>
</antipattern>
<antipattern>
<token>first</token>
<token>and</token>
<token>foremost</token>
</antipattern>
<pattern>
<token inflected="yes" regexp="yes">be|seem|appear|look</token>
<token postag="RB" min="0" max="2" chunk_re=".-AD.*" />
Expand Down