Adding B1 balance index and tests #2281

jeremyguez · 2022-05-19T11:47:56Z

Here is the PR for B1 balance index. Compared to #2251 I just added a round function at the end. @jeromekelleher

codecov · 2022-05-19T12:00:46Z

Codecov Report

Merging #2281 (72474ec) into main (3e70389) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main    #2281   +/-   ##
=======================================
  Coverage   93.29%   93.30%           
=======================================
  Files          27       27           
  Lines       26089    26097    +8     
  Branches     1172     1175    +3     
=======================================
+ Hits        24341    24349    +8     
  Misses       1718     1718           
  Partials       30       30

Flag	Coverage Δ
c-tests	`92.23% <ø> (ø)`
lwt-tests	`89.05% <ø> (ø)`
python-c-tests	`71.86% <12.50%> (-0.04%)`	⬇️
python-tests	`98.88% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
python/tskit/trees.py	`98.05% <100.00%> (+<0.01%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 3e70389...72474ec. Read the comment docs.

jeromekelleher · 2022-05-19T12:35:58Z

Looks great! I don't understand why we're rounding though?

jeremyguez · 2022-05-19T12:42:23Z

Looks great! I don't understand why we're rounding though?

It's because of the tests, having something like this didn't look clean (and I felt it could lead to errors):

def test_b1(self):
        assert self.tree().b1_index() == 1.8333333333333333

And I wasn't sure this was clean either, since it's not testing the output of the method itself:

def test_b1(self):
        assert round(self.tree().b1_index(),3) == 1.833

But I guess there is a clean general way of working around tests in that case?

jeremyguez · 2022-05-19T12:59:53Z

By the way: I wrote in the doc that the output is a float, but actually in some cases, like for the empty tree, the output is 0 which is an int. Should we force the result to be a float, so it's always the same output type?

jeromekelleher · 2022-05-19T13:16:55Z

Ah, I see. I think the usual way to handle this would be something like

def test_b1(self):
        assert self.tree().b1_index() == pytest.approx(1.8333)

By the way: I wrote in the doc that the output is a float, but actually in some cases, like for the empty tree, the output is 0 which is an int. Should we force the result to be a float, so it's always the same output type?

Good point. The cleanest way to fix this would be to set total = 0.0 at the top of the loop, which will force it to be a float.

jeremyguez · 2022-05-19T14:21:39Z

By removing the rounding I get now this kind of errors:

assert tree.b1_index() == b1_index_definition(tree)
E           assert 19.4718253968254 == 19.471825396825395

I guess I should use python.approx() in all tests?

jeromekelleher · 2022-05-19T14:24:19Z

Use pytest.approx, like

assert tree.b1_index() == pytest.approx(b1_index_definition(tree))

jeremyguez · 2022-05-20T06:30:25Z

I put pytest.approx(1.833, rel=1e-3), as the default is using rel=1e-6.

Is it an issue if the definition method output is not always a float, or should I force it to float too? The tests do pass without it.

jeromekelleher · 2022-05-20T08:03:04Z

Do the tests not pass at the default tolerance? I would have expected the values to be very close.

Is it an issue if the definition method output is not always a float, or should I force it to float too? The tests do pass without it.

that's fine, it doesn't really matter.

jeremyguez · 2022-05-20T08:15:37Z

Do the tests not pass at the default tolerance? I would have expected the values to be very close.

For this test the default relative tolerance is indeed working, as the difference is very small:

assert tree.b1_index() == pytest.approx(b1_index_definition(tree))

But for this one, the default 1e-6 relative tolerance is too small compared to the difference between 1.833333.. and 1.833:

assert self.tree().b1_index() == pytest.approx(1.833)

So I had two solutions that pass the test:

assert self.tree().b1_index() == pytest.approx(1.833, rel=1e-3)

or

assert self.tree().b1_index() == pytest.approx(1.8333333)

The first one seemed cleaner to me.

jeromekelleher · 2022-05-20T08:16:48Z

Ah, I see. When you're comparing with a fixed value either way is good. Whatever you prefer.

jeromekelleher

LGTM, spotted one issue in docstring

jeromekelleher · 2022-05-20T08:38:49Z

python/tskit/trees.py

+        .. seealso:: See `Shao and Sokal (1990)
+            <https://www.jstor.org/stable/2992186>`_ for details.
+
+        :return: The B1 balance index (rounded).


Delete (rounded)

jeremyguez force-pushed the b1_index_python branch from 5089530 to 5aa6337 Compare May 19, 2022 12:50

jeremyguez force-pushed the b1_index_python branch from 5aa6337 to 9db822c Compare May 20, 2022 08:27

jeromekelleher approved these changes May 20, 2022

View reviewed changes

Adding B1 balance index and tests

72474ec

jeremyguez force-pushed the b1_index_python branch from 9db822c to 72474ec Compare May 20, 2022 08:46

jeromekelleher added the AUTOMERGE-REQUESTED Ask Mergify to merge this PR label May 20, 2022

jeromekelleher mentioned this pull request May 20, 2022

B1 Balance index (Python) #2251

Closed

mergify bot merged commit 9968e62 into tskit-dev:main May 20, 2022

mergify bot removed the AUTOMERGE-REQUESTED Ask Mergify to merge this PR label May 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding B1 balance index and tests #2281

Adding B1 balance index and tests #2281

jeremyguez commented May 19, 2022

codecov bot commented May 19, 2022 •

edited

Loading

jeromekelleher commented May 19, 2022

jeremyguez commented May 19, 2022

jeremyguez commented May 19, 2022

jeromekelleher commented May 19, 2022

jeremyguez commented May 19, 2022

jeromekelleher commented May 19, 2022

jeremyguez commented May 20, 2022

jeromekelleher commented May 20, 2022

jeremyguez commented May 20, 2022

jeromekelleher commented May 20, 2022

jeromekelleher left a comment

jeromekelleher May 20, 2022

Adding B1 balance index and tests #2281

Adding B1 balance index and tests #2281

Conversation

jeremyguez commented May 19, 2022

codecov bot commented May 19, 2022 • edited Loading

Codecov Report

jeromekelleher commented May 19, 2022

jeremyguez commented May 19, 2022

jeremyguez commented May 19, 2022

jeromekelleher commented May 19, 2022

jeremyguez commented May 19, 2022

jeromekelleher commented May 19, 2022

jeremyguez commented May 20, 2022

jeromekelleher commented May 20, 2022

jeremyguez commented May 20, 2022

jeromekelleher commented May 20, 2022

jeromekelleher left a comment

Choose a reason for hiding this comment

jeromekelleher May 20, 2022

Choose a reason for hiding this comment

codecov bot commented May 19, 2022 •

edited

Loading