-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore(nlp): Dutch added to contenful #1249
Conversation
6ec0b53
to
19a055b
Compare
Codecov Report
@@ Coverage Diff @@
## contentful/hu #1249 +/- ##
=================================================
+ Coverage 64.58% 64.62% +0.03%
=================================================
Files 231 232 +1
Lines 6436 6443 +7
Branches 1115 1115
=================================================
+ Hits 4157 4164 +7
Misses 1966 1966
Partials 313 313
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
packages/botonic-plugin-contentful/src/nlp/stopwords/stopwords-nl.ts
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Take a look at @dpinol comments and then you're good to go!
@@ -76,7 +76,6 @@ export const nlDefaultStopWords = [ | |||
'toch', | |||
'toen', | |||
'tot', | |||
'u', | |||
'uit', |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I didn't mean to remove all 1 letters, only the full alphabet below
Depends on #1248. Please, review it first.⚠️
Description
Dutch tokenizer, stemmer, and stopwords have been added to contentful nlp.
Context
Adding them will make it possible to process Dutch text.
Approach taken / Explain the design
The tokenizer, stemmer and stopwords used are from the nlpjs library.
Testing
The pull request...