refinr 0.3.0
NEW FEATURES
- Rewrote some of the cpp functions to incorporate
std::unordered_map()
, resulting in a substantial speed improvement when passing large character vectors (length 100,000+) to either of the exported functions (#8).
PKG API CHANGES
- In function
n_gram_merge()
, renamed argedit_dist_weights
toweight
. The only purpose of this arg is to be passed along to functionstringdistmatrix
from the stringdist package (which uses the nameweight
, so this change is simply to match that).
BUG FIXES
-
Fixed issue in which input strings that contained accent marks were not being properly handled/clustered (#9). The fix involved adding stringi to
Imports
and usingstringi::stri_trans_general()
. -
Fixed issue in
n_gram_merge()
in which incorrect values were being return when input argignore_strings
was notNULL
, and argbus_suffix = FALSE
(#7). -
Fixed issue in which input strings that contained punctuation that was NOT surrounded by spaces was returning incorrect values (#6).
-
Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster (#5).