forked from elastic/elasticsearch
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ML] expand allowed NER labels to be any I-O-B tagged labels (elastic…
…#87091) Named entity recognition (NER) is a special form of token classification. The specific kind of labelling we support is Inside-Outside-Beginning (IOB) tagging. These labels indicate if they are the inside of a token (with a `I-` or `I_`), the beginning (`B-` or `B_`) or outside (`O`). Each valid token classification label starts with the require prefix or `O`. Before this commit, we restricted the labels to a specific set: ``` O(Entity.NONE), // Outside a named entity B_MISC(Entity.MISC), // Beginning of a miscellaneous entity right after another miscellaneous entity I_MISC(Entity.MISC), // Miscellaneous entity B_PER(Entity.PER), // Beginning of a person's name right after another person's name I_PER(Entity.PER), // Person's name B_ORG(Entity.ORG), // Beginning of an organization right after another organization I_ORG(Entity.ORG), // Organisation B_LOC(Entity.LOC), // Beginning of a location right after another location I_LOC(Entity.LOC); // Location ``` But now, any entity is allowed, as long as the naming of the labels adhere to IOB tagging rules.
- Loading branch information
Showing
5 changed files
with
230 additions
and
198 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
pr: 87091 | ||
summary: Expand allowed NER labels to be any I-O-B tagged labels | ||
area: Machine Learning | ||
type: enhancement | ||
issues: [] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.