Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About Hyper-paramters #32

Open
maple-zhou opened this issue Apr 3, 2023 · 1 comment
Open

About Hyper-paramters #32

maple-zhou opened this issue Apr 3, 2023 · 1 comment

Comments

@maple-zhou
Copy link

Hi, sorry to bother.
I really appreciate that this is a wonderful work, but I'm wondering how do you confirm the hyper-parameters of the distill.py?
e.g. lr_img, lr_lr, lr_teacher, max_start_epoch, and so on.
Do you use the grid search on the hyper parameters? If so, how do you implement it? Since I notice that there are a number of hyper-parameters needing to be set, I think it would take quite a few time to get a good setting.
Thanks!

@GeorgeCazenavette
Copy link
Owner

Hi, apologies for the late reply.

For the smaller experiments, we did a decently sized grid search. However, we could not afford to do this for the larger experiments.

I just ran a bunch of experiments and tracked them with wandb.

Ideally, the method should not be as sensitive to hparams. Hopefully future work can resolve this :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants