Feature_perc #20

amissarova · 2022-07-19T21:09:23Z

Hi!

I was wondering about the rationale for feature_perc = 0.5 rather than 1. Are there any particular reasons to randomly select features (besides computational complexity)?

Thanks!

skinnider · 2022-07-19T21:10:20Z

Nope, just a way to reduce the runtime.

amissarova · 2022-07-19T21:11:04Z

cool, thanks

amissarova · 2022-07-19T21:13:58Z

Related question: I just tried running Augur with feature_perc = 1. I would have expected to now get an importance score for each gene in each subsample and each fold, but that is not the case (for some subsamples there is no entry for a given gene). Why?
Thanks!

skinnider · 2022-07-20T03:21:25Z

By default, 50% of genes will be filtered out with select_variance. Are they there when setting var_quantile = 0?

amissarova · 2022-07-20T09:43:27Z

Hey,
I have now set feature_perc = 1 and var_quantile = 0. For some genes, I still don't have an importance score entry for some of the subsamples.

amissarova · 2022-07-20T09:56:07Z

I guess that possibly happens because of the initial hard-coded filtering of genes with no variance (for a given subsample)? Or are there some other reasons?

skinnider · 2022-07-20T14:04:40Z

That seems plausible, yes.
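
To illustrate the explanation discussed above, here is a minimal sketch in Python of how a sparsely expressed gene can lose all variance in some random subsamples and so be dropped from those subsamples only. This is not Augur's code: the function name, the toy expression vector, and the subsample size of 20 cells are all hypothetical, chosen purely for illustration.

```python
import random
from statistics import pvariance


def subsample_has_variance(expr, n_cells, rng):
    """Return True if the gene still varies within a random subsample of cells.

    Hypothetical helper: mimics a "drop genes with zero variance" filter
    applied separately to each subsample.
    """
    sample = rng.sample(expr, n_cells)  # draw cells without replacement
    return pvariance(sample) > 0


# Toy gene (assumption): expressed in only 3 of 200 cells.
expr = [0.0] * 197 + [1.0, 2.0, 5.0]

rng = random.Random(0)
n_subsamples = 50

# Count the subsamples in which the gene survives the zero-variance filter.
kept = sum(subsample_has_variance(expr, 20, rng) for _ in range(n_subsamples))
dropped = n_subsamples - kept

print(f"gene kept in {kept} subsamples, dropped in {dropped}")
```

In this toy setup, a subsample of 20 cells misses all three expressing cells roughly 73% of the time, so the gene would have an importance score in only a minority of subsamples, even with no feature selection and no variance-quantile filtering.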
