-
-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DOI matching in duplicate check #6897
Conversation
Add zero weights for otes and comments.
small improvements
Moving cleanup's logic to DOI constructor.Add results of generate-authors.sh.
Moving cleanup's logic to DOI constructor.Add results of generate-authors.sh. (revert AUTHORS)
into KunAndrew-fix-for-issue-6707 * 'fix-for-issue-6707' of https://github.com/KunAndrew/jabref: issue 6707 Moving cleanup's logic to DOI constructor.Add results of generate-authors.sh. (revert AUTHORS) issue 6707 Moving cleanup's logic to DOI constructor.Add results of generate-authors.sh. issue 6707 small improvements issue 6707 Add zero weights for otes and comments. Add DOI matching with not full coincidence.
The JabRef maintainers will add the following name to the AUTHORS file. In case you want to use a different one, please comment here and adjust your name in your git configuration for future commits.
|
@KunAndrew Your problems probably came from the fact that your repo's master branch is behind the naster branch of jabref.
|
@@ -108,12 +108,12 @@ public DOI(String doi) { | |||
Matcher matcher = EXACT_DOI_PATT.matcher(trimmedDoi); | |||
if (matcher.find()) { | |||
// match only group \1 | |||
this.doi = matcher.group(1); | |||
this.doi = matcher.group(1).replaceAll("[^\\w,/,:,-,.,-]|[_]", ""); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@KunAndrew I think you need to adjust the DOI regex pattern:
One CrossRef test fails, because of the missing underscore:
The DOI cannot be resolved wihtout underscore
org.opentest4j.AssertionFailedError: expected: <10.1007/11538394_20> but was: <10.1007/1153839420>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There should be a comment added which characters are removed and why. Without any comment, this is a "magic number" which should be extraced to a constant.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I removed this addtional regex because it was not really making sense. The DOI class already trims DOIs and also removes other encoding components.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Small code comments
@@ -108,12 +108,12 @@ public DOI(String doi) { | |||
Matcher matcher = EXACT_DOI_PATT.matcher(trimmedDoi); | |||
if (matcher.find()) { | |||
// match only group \1 | |||
this.doi = matcher.group(1); | |||
this.doi = matcher.group(1).replaceAll("[^\\w,/,:,-,.,-]|[_]", ""); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There should be a comment added which characters are removed and why. Without any comment, this is a "magic number" which should be extraced to a constant.
Co-authored-by: Oliver Kopp <kopp.dev@gmail.com>
Co-authored-by: Oliver Kopp <kopp.dev@gmail.com>
Co-authored-by: Oliver Kopp <kopp.dev@gmail.com>
…-issue-6707 * upstream/master: (35 commits) Fix a fetcher test for the ShortDOIService (#6945) Fixes Shared Database: Changes filtering in CoarseChangeFilter to attribute property (#6868) Changed default value of "search and store files relative to bibtex file" to true (#6928) Replace comment by just a failure (#6943) Fix: in entry types editor selected field is not removed after first click (#6941) Fix remove actions for entry types in the editor (#6933) Always use Java 15 (#6929) Update DevDocs: workaround for issues with local openjfx maven libraries (#6931) Fixes bugs in the `regex` cite key pattern modifier (#6893) Add missing author Readability for citation key patterns (#6706) Add new author Reset to master and add default case to switch (#6847) Bump mockito-core from 3.5.10 to 3.5.11 (#6924) Bump byte-buddy-parent from 1.10.14 to 1.10.15 (#6923) Bump org.beryx.jlink from 2.21.4 to 2.22.0 (#6925) Bump xmpbox from 2.0.20 to 2.0.21 (#6926) Bump pascalgn/automerge-action from v0.9.0 to v0.10.0 (#6927) Improve parsing of short DOIs (#6920) Bump junit-vintage-engine from 5.6.2 to 5.7.0 (#6910) ...
remove additional regex we already are trimming and normalizing DOIs
Fixes #6707
Follow up from #6756