Releases: hrs/docsim
Releases · hrs/docsim
v0.1.7
v0.1.6
Changelog
- 0c5e2a3 Build deb, rpm, and homebrew packages on release
- 40640c9 Add short flags for file, stdin, limit, & verbose
- a586fed Log parsing files when --verbose
- 0e833ad Remove --omit-query flag (always omit the query)
- 76efdbd Extract findParsableFiles func from ParseCorpus
- 4f93fd8 Yield 0.0 score if query has a zero norm
- 6d75471 Reword README
v0.1.5
v0.1.4
v0.1.3
Changelog
- 21e8fad Extract
hasPermittedExtension
function - 0b4b3b6 Warn:
docsim
doesn't understand.ignore
files - c763c34 Add a -version flag
- 357bef5 Brag about reading from STDIN
- 7685e70 Explicitly allow certain file extensions
- bae7ca9 Handle missing mime.types database
- d9f9410 Set up CI action to run tests
- 1265de9 Backfill some document parsing tests
- e5dc8e5 Populate config directly from flags
- 88c3ea3 Don't bother writing to a tempfile
- 2be7753 Ensure
make build
searches subdirectories
v0.1.2
v0.1.1
v0.1.0
Changelog
- 474d74c Add a build release task
- 97208f8 Write a simple README
- ed0c97d Don't include symlinks in the corpus
- a8ae6a2 Split words manually instead of by regexp
- cf33283 Check that this is a text file before opening
- 45f20a4 Document use with non-English documents
- 2b745c5 Replace deprecated use of ioutil with io
- 71d71f7 Backfill stoplist tests
- 911c082 Move corpus parsing out of main
- 85c23ad Backfill similarity tests
- 9b10bb2 If no query file's provided, read from STDIN
- 0913d52 Add --no-stemming flag to skip stemming
- b41825e Add --no-stoplist flag to skip stoplist
- adb900a Default to searching current directory
- 1fa579d License with the GPLv3
- 68c9692 Add a simple Makefile
- 69dee47 Include a simple manual page
- 2be26f1 Move code into a lib directory
- da14b80 Save memory by clearing term freqs after TF-IDF
- 30caf51 Recursively search directories for files
- 25f1c88 Add a --verbose flag
- 750d5f2 Add --omit-target flag to skip target in results
- f81720b Just print errors to stderr, don't log
- 08612d3 Only search files that seem to contain text
- 144d473 Add flags for sort order, limit, showing scores
- 43be014 Sort results, low-to-high
- 3cf6192 Display search results more readably
- 27eff9a Search the corpus with a query document
- 371c11d Maintain TF-IDF weights for each document
- 471fcd1 Corpus stores its inverse document frequency
- 4b7b1ac Documents store their term frequency
- 5358a2d Maintain a corpus of documents
- e023141 Track count of term occurrences
- cc1a73d Memoize stemming results in a local cache
- 765d3af Stem words with the Porter Stemmer
- bee8fd0 Don't include words in a standard English stoplist
- 7959b68 Split tokens but retain contractions
- c5a08da Instantiate a document containing words
- e36485e Get target file and search files from args
- 8cfa20f Hello, docsim.