Pynder repo (Py-Finder)

The pynder repo is a showcase of how to implement hundreds, if not thousands, of custom spaCy pipeline components in production. It is useful when analyzing large volumes of documents from which one wants to mine many specific fields.

The implemented BaseMatchers:

  • BaseRegex
  • BaseSpacyMatcher (tokenized matching)
  • BaseTFIDF (Term Frequency-Inverse Document Frequency)
  • BaseNormalizedCounter
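
To give a feel for the base-matcher idea, here is a minimal sketch of how many field extractors could share one small interface. It uses only the Python standard library and does not reproduce the repo's classes or spaCy integration: `Match`, `SketchMatcher`, `SketchRegexMatcher`, `run_matchers`, and the example field names and patterns are all hypothetical names chosen for illustration.

```python
import re
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Match:
    """One extracted field value and its character span in the text."""
    field: str
    text: str
    start: int
    end: int


class SketchMatcher(ABC):
    """Hypothetical shared interface; NOT the repo's actual base class."""

    def __init__(self, field: str) -> None:
        self.field = field

    @abstractmethod
    def find(self, text: str) -> List[Match]:
        """Return every occurrence of this matcher's field in `text`."""


class SketchRegexMatcher(SketchMatcher):
    """Regex-backed matcher, loosely analogous to the BaseRegex idea."""

    def __init__(self, field: str, pattern: str) -> None:
        super().__init__(field)
        # Compile once so the pattern can be reused across many documents.
        self.pattern = re.compile(pattern)

    def find(self, text: str) -> List[Match]:
        return [
            Match(self.field, m.group(0), m.start(), m.end())
            for m in self.pattern.finditer(text)
        ]


def run_matchers(text: str, matchers: List[SketchMatcher]) -> List[Match]:
    """Apply each matcher in order and collect all matches."""
    results: List[Match] = []
    for matcher in matchers:
        results.extend(matcher.find(text))
    return results


# Each field to mine is one small matcher instance (assumed example fields).
matchers = [
    SketchRegexMatcher("invoice_number", r"INV-\d{4}"),
    SketchRegexMatcher("iso_date", r"\d{4}-\d{2}-\d{2}"),
]
found = run_matchers("Invoice INV-0042 was issued on 2021-03-15.", matchers)
for match in found:
    print(match)
```

Because every matcher exposes the same `find` method, adding another field means adding one more instance to the list, which is what makes the pattern scale to a large number of fields.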