Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MLE-1053 rebase asapp fixes #4

Open
wants to merge 29 commits into
base: ASAPP-fixes
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
29 commits
Select commit Hold shift + click to select a range
c0e7a13
merge computeSubwords functions
Celebio Sep 13, 2018
711f513
added FAQ on how to get reproducible results (#633)
Sep 13, 2018
a5d22ab
Fix broken link (#590)
EmilStenstrom Sep 13, 2018
8e68462
Conforming to Facebook c++ style
Celebio Oct 24, 2018
be1e597
Compute precision/recall for each label
Celebio Oct 24, 2018
25c3994
fixing python binding for `predict` function
Celebio Oct 26, 2018
6efad35
duplicate import removed
edenbaus Nov 2, 2018
58fe650
fixing missing include in productquantizer.cc that is causing compila…
Celebio Nov 2, 2018
2e52f53
Refactor model testing and metrics code. (#672)
Nov 6, 2018
4a3b5af
meter class refactoring for per-label stats, some function deprecatio…
Celebio Nov 6, 2018
d759dd1
adding python binding for `test-label`
Celebio Nov 6, 2018
0ddcd5f
adding coverage option for Makefile and setup.py
Celebio Nov 6, 2018
41a0f39
putting back the usage of vector to loop in C++ in multiline prediction
Celebio Nov 6, 2018
c180783
fix circleci errors
Celebio Nov 7, 2018
5c229ab
Fixed typos at readme.md (#662)
schneiderl Nov 8, 2018
ead7911
fix support for older C++11 compilers for python bindings
Celebio Nov 20, 2018
4aee63d
Add circleci build badges to the README.md
Celebio Nov 21, 2018
256032b
remove printing functions from fasttext class
Celebio Nov 23, 2018
b8022b5
python install, a more robust pybind11 include
Celebio Nov 27, 2018
a84a6a4
add argument names in fasttext.h
Celebio Nov 27, 2018
71b4101
Normalize buffer vector in analogy queries
Celebio Nov 27, 2018
8850c51
One-vs-all cross-entropy loss
Celebio Nov 27, 2018
7deac6d
adding ova loss option to python bindings
Celebio Dec 4, 2018
501b9b1
Better default for number of threads
whiletruelearn Dec 4, 2018
7842495
Re-licensing fasttext to MIT
Dec 18, 2018
3c4a3ea
footer language : default to EN (#581)
Dec 20, 2018
67e8950
set version to have an ASAPP suffix, add Cython to install_requires
fwph May 22, 2018
f74aad6
bump version after publish script change
cdfox-asapp Sep 7, 2018
b7fa4e7
Update setup.py
cdfox-asapp Oct 18, 2018
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
added FAQ on how to get reproducible results (facebookresearch#633)
Summary:
Hi everyone, and thanks for this wonderful library. I'm relatively new to it, and I found myself struggling a bit when trying to obtain reproducible results, e.g. in order to find the the best parameters.
I found the perfect answer in a 2016 issue here on your repo (facebookresearch#116)  and I though it could be useful to add it to the FAQs.

I'm sending you two PR:
- this one, in which I added the FAQ
- a second one, in which I modified the description in src/args.cc for the "thread" param

Of course feel free to choose which one to keep (or eventually to trash both of them).

Thanks!
Leonardo
Pull Request resolved: facebookresearch#633

Differential Revision: D9814563

Pulled By: EdouardGrave

fbshipit-source-id: 83e4b7a7163b9013aef144dedd9b4bd5945bafdf
Leonardo Foderaro authored and facebook-github-bot committed Sep 13, 2018
commit 711f513bc6bfa17bc77f21b4cb1305a81acb3cdd
3 changes: 3 additions & 0 deletions docs/faqs.md
Original file line number Diff line number Diff line change
@@ -53,3 +53,6 @@ You'll likely see this behavior because your learning rate is too high. Try redu

## My compiler / architecture can't build fastText. What should I do?
Try a newer version of your compiler. We try to maintain compatibility with older versions of gcc and many platforms, however sometimes maintaining backwards compatibility becomes very hard. In general, compilers and tool chains that ship with LTS versions of major linux distributions should be fair game. In any case, create an issue with your compiler version and architecture and we'll try to implement compatibility.

## How do I run fastText in a fully reproducible way? Each time I run it I get different results.
If you run fastText multiple times you'll obtain slightly different results each time due to the optimization algorithm (asynchronous stochastic gradient descent, or Hogwild). If you need to get the same results (e.g. to confront different input params set) you have to set the 'thread' parameter to 1. In this way you'll get exactly the same performances at each run (with the same input params).