add normal and wald types #648

strengejacke · 2022-09-25T10:01:09Z

Usually, parameters::degrees_of_freedom() and insight::get_df() are expected to do the same thing. However, currently the design of those two functions differ. parameters::degrees_of_freedom() offers more options to extract df's. This PR aims at bringing insight::get_df() on par with parameters::degrees_of_freedom(), so the latter can be fully replaced by insight::get_df(). I think it doesn't make sense and is rather confusing to have two methods which do different things for certain models.

@easystats/core-team

codecov-commenter · 2022-09-25T10:22:56Z

Codecov Report

Merging #648 (404839f) into main (572e53a) will increase coverage by 0.65%.
The diff coverage is 70.49%.

❗ Current head 404839f differs from pull request most recent head 6abada1. Consider uploading reports for the commit 6abada1 to get more accurate results

@@            Coverage Diff             @@
##             main     #648      +/-   ##
==========================================
+ Coverage   54.32%   54.98%   +0.65%     
==========================================
  Files         119      124       +5     
  Lines       14101    14334     +233     
==========================================
+ Hits         7661     7881     +220     
- Misses       6440     6453      +13

Impacted Files	Coverage Δ
R/get_df_betwithin.R	`0.00% <0.00%> (ø)`
R/get_predicted.R	`71.28% <ø> (ø)`
R/get_predicted_args.R	`72.22% <0.00%> (-4.33%)`	⬇️
R/get_predicted_gam.R	`73.91% <0.00%> (+3.07%)`	⬆️
R/get_varcov.R	`23.29% <ø> (ø)`
R/get_df_residual.r	`22.22% <22.22%> (ø)`
R/get_predicted_se.R	`68.67% <33.33%> (+0.81%)`	⬆️
R/get_varcov_sandwich.R	`84.14% <33.33%> (ø)`
R/get_df_satterthwaite.R	`54.54% <54.54%> (ø)`
R/get_sigma.R	`28.28% <62.50%> (-1.63%)`	⬇️
... and 7 more

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

R/get_df.R

bwiernik · 2022-09-25T11:41:05Z

R/get_df.R

+#' - `"wald"` for models with z-statistic, returns `"Inf"`. Else, tries to
+#'   extract residual degrees of freedoms. If residual degrees of freedom could
+#'   not be extracted, returns `"Inf"`.
+#' - `"analytical"` returns analytical degrees of freedom, i.e. `n-k`


What's the difference between residual and analytical? They sound the same to me from this description

analytical is always n_obs() - n_param() (n-k). not sure if which cases this may differ from residual df?

Co-authored-by: Brenton M. Wiernik <bwiernik@users.noreply.github.com>

…nto get_df_typey

strengejacke · 2022-09-25T19:13:33Z

@bwiernik @mattansb what should "analytical" df return for models with z-statistic? Still n-k, or Inf? If the latter, "analytical" would be almost the same "wald". See docs:

- `"residual"` tries to extract residual degrees of freedoms. If residual
   degrees of freedom cannot be extracted, returns analytical degrees of
   freedom, i.e. `n-k` (number of observations minus number of parameters).
 - `"wald"` for models with z-statistic, returns `"Inf"`. Else, tries to
   extract residual degrees of freedoms. If residual degrees of freedom
   cannot be extracted, returns `"Inf"`.
 - `"analytical"` for models with z-statistic, returns `"Inf"`. Else, returns
   analytical degrees of freedom, i.e. `n-k` (number of observations minus
   number of parameters).
 - `"normal"` always returns `"Inf"`.
 - `"model"` returns model-based degrees of freedom, i.e. the number of
   (estimated) parameters.

bwiernik · 2022-09-25T19:14:53Z

I think the only issue is that residual is poorly described. It's whatever the model returns as the df, whereas analytical is always exactly n - params.

strengejacke · 2022-09-25T20:05:15Z

Should we be more strict/consistent here?

strengejacke · 2022-09-25T20:51:34Z

I think analytical and residual df are actually the same, so maybe we should simplify this.

Then we would have

"normal" returns Inf.
"wald" returns analytical (aka residual) df for models with t-statistic, and Inf for all other models. Also returns Inf if analytical df cannot be extracted.
"analytical" (aka "residual") returns n-k (number of observations minus number of parameters) and Inf if analytical df cannot be extracted.

mattansb · 2022-09-26T19:48:48Z

For some models it depends how you define "number of parameters" - thinking of mixed models. If I have 1000 obs from 10 subjects and y ~ 1 + time + (1 + time | Subject) lme4 has 6 parameters (2 fixed, 2 variance, 1 random covariance, 1 sigma), but arguably there are more - the 2 * 10 individual random parameters.

Something to think about?

strengejacke added 3 commits September 25, 2022 12:00

add normal and wald types

a3da984

add more type options

a197b1d

news, descripton

dff4c9c

strengejacke added 3 commits September 25, 2022 12:45

add tests

5d6df5d

add more tests

3f72ab6

fix typo

16eac8f

bwiernik requested changes Sep 25, 2022

View reviewed changes

strengejacke and others added 11 commits September 25, 2022 14:04

Update R/get_df.R

bfe55ac

Co-authored-by: Brenton M. Wiernik <bwiernik@users.noreply.github.com>

typo

52247e3

Merge branch 'get_df_typey' of https://github.com/easystats/insight i…

801a1b4

…nto get_df_typey

use correct arg name

aaa8a05

more type -> method

2fc815b

add more tests

7cf5864

add test glm.nb

ef39651

fix test

08946ae

get_df is enough, we just need methods for residual df

5fc2d50

fix tests

6ed28fa

less verbose, fix some tests

edfb0df

strengejacke added 2 commits September 25, 2022 22:20

fix satterthwaite and KR df

87c0927

docs

f374992

strengejacke added 5 commits September 25, 2022 23:12

simplify, more consistent

bc5352b

add pkg to suggest

e519372

simplify tests

71ace44

add ml1-df

cda0a29

fix

93bdeec

strengejacke added 27 commits September 26, 2022 08:38

typo

1c1e1ce

minor

17a7c57

fix test issues

e6c969a

fix

d98930e

no need for "dots"

2d0344c

minor, docs

bd96b83

comment

12717a1

fix test issues

fb7cb28

test against pbkrtest

995d6d9

fix test

57806f3

fix test issues

8f2aeb2

outcomment test for now

321a1e5

outcomment test

99d9c2e

take chi2 into account

6abada1

fix

f946c84

fix fixest-df

cc4907e

minor

09b5ea8

Merge branch 'main' into get_df_typey

0f7b999

test

d793720

now captured by default

2cf5551

update namespace

8781783

...

08258f7

dof.gls

e671fe5

n_parameters for gls

8917a46

add tests

a997519

fix test

39da400

fix test

f49ac27

strengejacke merged commit 03e3978 into main Sep 26, 2022

strengejacke deleted the get_df_typey branch September 26, 2022 19:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add normal and wald types #648

add normal and wald types #648

strengejacke commented Sep 25, 2022 •

edited

Loading

codecov-commenter commented Sep 25, 2022 •

edited

Loading

bwiernik Sep 25, 2022

strengejacke Sep 25, 2022

strengejacke commented Sep 25, 2022

bwiernik commented Sep 25, 2022

strengejacke commented Sep 25, 2022

strengejacke commented Sep 25, 2022 •

edited

Loading

mattansb commented Sep 26, 2022

add normal and wald types #648

add normal and wald types #648

Conversation

strengejacke commented Sep 25, 2022 • edited Loading

codecov-commenter commented Sep 25, 2022 • edited Loading

Codecov Report

bwiernik Sep 25, 2022

Choose a reason for hiding this comment

strengejacke Sep 25, 2022

Choose a reason for hiding this comment

strengejacke commented Sep 25, 2022

bwiernik commented Sep 25, 2022

strengejacke commented Sep 25, 2022

strengejacke commented Sep 25, 2022 • edited Loading

mattansb commented Sep 26, 2022

strengejacke commented Sep 25, 2022 •

edited

Loading

codecov-commenter commented Sep 25, 2022 •

edited

Loading

strengejacke commented Sep 25, 2022 •

edited

Loading