
Feature/multifiltertest #142

Merged: 35 commits, Apr 3, 2018

Conversation

EmanueleLucrezia (Contributor)

This pull request adds the multiple filter test for the detection of rate change points in single unit spike trains. The algorithm follows the paper:

Messer, M., Kirchner, M., Schiemann, J., Roeper, J., Neininger, R., & Schneider, G. (2014). A multiple filter test for the detection of rate changes in renewal processes with varying variance. The Annals of Applied Statistics, 8(4), 2027–2067.

The code is adapted from the published R implementation:
DOI: 10.1214/14-AOAS782SUPP;.r

@mdenker added the label "enhancement (Editing an existing module, improving something)" on Mar 14, 2018
@mdenker (Member) commented Mar 14, 2018

It seems the failing unit test -- which is related to reproducing published results -- fails mainly because it depends on the particular realization of the generated spike trains (with fixed CPs at 150, 180, and 500 s). Although the random seed is fixed, the spike trains seem to differ on Travis vs. when running locally. In other words, the test data operates in a borderline regime, where the method will not always correctly detect all CPs in the data.

Thus, it may be a good idea to save the generated spike data explicitly and load it in the unit test.

In addition, one could remodel the current unit test such that the differences between the rates are more pronounced, so that the method is very likely to detect them.
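A minimal sketch of the first suggestion, using plain NumPy and hypothetical helper names (save_spike_times / load_spike_times are not part of Elephant): generate the spike data once, store it, and have the unit test load the stored file instead of regenerating it.

```python
import numpy as np

# Hypothetical helpers (not part of Elephant): persist the generated
# spike times once, then load them in the unit test, so the test no
# longer depends on the RNG producing identical realizations on
# Travis vs. locally.

def save_spike_times(spike_times, path):
    """Store spike times (assumed in seconds) as a .npy file."""
    np.save(path, np.asarray(spike_times, dtype=float))

def load_spike_times(path):
    """Load previously saved spike times."""
    return np.load(path)
```

The test would then call load_spike_times on a file committed to the repository, making it fully deterministic across platforms.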

@mdenker (Member) commented Mar 14, 2018

In terms of file names -- would it be better to name the module change_point_detection? I think it's a more telling name, and would allow similar methods to be bundled under this name.

Independent of the module name decided upon, please name the corresponding unit test file test_XXX.py, where XXX is the module name.

@Junji110 (Contributor) commented Mar 14, 2018

The unit test script (test_change_point_detection.py) imports elephant.multiple_filter_test and this raises an error. Please correct this line to the new module name.
Also, the example code in the docstring of change_point_detection.py imports multiple_filter_test. This also needs to be corrected.

@EmanueleLucrezia (Contributor, Author)

The limit processes are not properly generated. From p. 10 of the article:
'Note that all limit processes (L_h,t)_t are derived from the same Brownian motion in order to ensure comparability...'
I generated a different Brownian motion for every window h instead of always using the same one. This leads to a slightly higher threshold Q.
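A schematic sketch of the fix (not the actual implementation; the exact functional of W applied here only mirrors the paper's construction loosely): simulate one Brownian path, and derive the limit process for every window h from that same path.

```python
import numpy as np

rng = np.random.default_rng(0)

# One Brownian path, simulated ONCE on a grid of step dt.
dt, T = 0.01, 100.0
n = int(T / dt)
W = np.concatenate([[0.0], np.cumsum(rng.normal(0.0, np.sqrt(dt), n))])

def limit_process(W, h, dt):
    """Schematic L_{h,t}: forward minus backward increment of the SAME
    Brownian path W, scaled so the variance does not depend on h.
    (Illustrative form only; the paper defines the exact functional.)"""
    k = int(h / dt)
    t = np.arange(k, len(W) - k)
    return (W[t + k] - 2.0 * W[t] + W[t - k]) / np.sqrt(2.0 * h)

# All windows h reuse the same W -- this is what ensures the
# comparability across windows required by the paper.
processes = {h: limit_process(W, h, dt) for h in (5.0, 10.0, 20.0)}
```

The key point is that W is generated outside any loop over windows; only the derived functional depends on h.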

…brownian motion outside the for loop across windows
…en changed from 8, 13, 18, 16.5 Hz to 4, 13, 36, 16.5 Hz and the seed is not fixed anymore
@coveralls (Collaborator) commented Mar 15, 2018

Coverage Status

Coverage increased (+1.2%) to 86.996% when pulling ea175dd on INM-6:feature/multifiltertest into 08d6ff1 on NeuralEnsemble:master.


>>> import quantities as pq
>>> import neo
>>> from elephant.multiple_filter_test import multiple_filter_test
Review comment (Contributor):

the module name (elephant.multiple_filter_test) needs to be corrected

variances of the limit process corresponding to `h`. This will be
used to normalize the `filter_process` in order to give every
maximum the same impact on the global statistic.

Review comment (Contributor):

Explanation for test_quantile is missing, and the order of the arguments is not consistent with that of the definition of the function.

Returns:
--------
cps : list of list
one list for each `h`, containing the points detected with the
Review comment (Contributor):

What is 'h'?

# print("detected point {0}".format(cp), "with filter {0}".format(h))
# before repeating the procedure, the h-neighbourhood of the detected
# 'cp' is cut out, because rate changes within it are explained by this cp
differences[np.where(
Review comment (Contributor):

This line is too complicated. Can be rewritten as:

            mask_fore = time_index > cp_index - int((h / dt_temp).simplified)
            mask_back = time_index < cp_index + int((h / dt_temp).simplified)
            differences[mask_fore & mask_back] = 0

# check that the neighbourhood of the detected cp does not contain
# cps detected with other windows
neighbourhood_free = True
if i == 0:
Review comment (Contributor):

This if-case is not necessary: when i == 0, the for-loop in the else branch does not iterate at all.
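A minimal illustration of the reviewer's point, with illustrative names that are not the module's actual code: a for-loop over an empty sequence never executes its body, so the flag keeps its initial value and the i == 0 branch is redundant.

```python
# Hypothetical simplification: the flag is initialized to True, and a
# loop over range(0) is simply skipped, so no special case is needed.
def neighbourhood_free_flag(i, clashes):
    neighbourhood_free = True
    for j in range(i):          # range(0) is empty -> body never runs
        if clashes[j]:
            neighbourhood_free = False
    return neighbourhood_free
```

Calling neighbourhood_free_flag(0, []) returns True without any special-casing of i == 0.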

except ValueError:
raise ValueError("dt must be a time quantity")
t_fin_m = t_fin_sec.magnitude

Review comment (Contributor):

t_in_sec and t_fin_sec are not actually used.
You should rather define them as t_in_sec = t_in.rescale(u).magnitude and t_fin_sec = t_fin.rescale(u).magnitude, and use them instead of t_in_m and t_fin_m.
This kind of confusing usage of rescale appears in other functions as well.
Please correct those too, so that the treatment of units is consistent throughout the code.
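A small sketch of the pattern being requested, keeping the variable names t_in / t_fin / u from the snippet above (the numeric values are made up): rescale every time quantity to one common unit first, then take the magnitude.

```python
import quantities as pq

# Made-up example values; the point is the pattern, not the numbers.
t_in = 500.0 * pq.ms
t_fin = 2.0 * pq.s
u = pq.s  # one common unit used consistently throughout the function

# rescale to the common unit BEFORE stripping units with .magnitude
t_in_sec = t_in.rescale(u).magnitude    # 0.5
t_fin_sec = t_fin.rescale(u).magnitude  # 2.0
```

Doing the rescale in one place avoids mixing magnitudes taken in different units later in the function.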

matrix_normalized.append((matrix[i] - mean) / np.sqrt(var))
null_mean.append(mean)
null_var.append(var)
matrix_normalized = np.asanyarray(matrix_normalized)
Review comment (Contributor):

This part of the code is unnecessarily complicated and redundant. The same result can be obtained by:

null_mean = maxima_matrix.mean(axis=0)
null_var = maxima_matrix.var(axis=0)
matrix_normalized = (maxima_matrix - null_mean) / np.sqrt(null_var)
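On a toy matrix (rows assumed here to index bootstrap realizations and columns the windows h, which is an assumption about the layout), the suggested vectorized form z-scores each column in one step:

```python
import numpy as np

# Toy maxima_matrix: rows = simulated realizations, columns = windows h
# (layout assumed for illustration only).
maxima_matrix = np.array([[1.0, 4.0],
                          [3.0, 8.0]])

null_mean = maxima_matrix.mean(axis=0)   # per-column mean: [2., 6.]
null_var = maxima_matrix.var(axis=0)     # per-column variance: [1., 4.]
matrix_normalized = (maxima_matrix - null_mean) / np.sqrt(null_var)
# every column of matrix_normalized now has mean 0 and unit variance
```

NumPy broadcasting subtracts the per-column means row by row, so no explicit loop or list-appending is needed.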

@alperyeg (Contributor)

@EmanueleLucrezia In the docstrings I appended an 's' to the section headers, e.g. Parameter -> Parameters. Otherwise, the documentation may not render these sections.

@mdenker (Member) commented Mar 26, 2018

Comments by Messer:

  • It is particularly important to derive the different limit processes from the same underlying Brownian motion. The idea is that there is a single spike train on the side of the 'real data' that results in one single Brownian motion on the side of the 'limit theory'. Then all limit processes L are derived from the same Brownian motion, as this corresponds to all filtered derivative processes G being derived from the same spike train. (If you had different Brownian motions for different windows, this would mean something like observing different spike trains in the same analysis, which does not make sense.)

  • The example in the original R code is indeed chosen in a 'borderline regime', as you call it. The idea of that scenario was to show the necessity of different windows, i.e., besides large rate changes (for which small windows are sufficient), also choosing a small rate increase for which a large window in particular is necessary. Indeed, this results in the problem that the smaller rate increase is sometimes not detected (by chance). But of course feel free to increase the rate changes in your examples to ensure 'robustness' of their detection.

  • I also quickly scrolled through your code and by chance saw that you declared $\alpha$ integer valued. I guess it should be floating point, as it theoretically lies in the interval (0,1).

@mdenker added this to the Release 0.5 milestone on Mar 27, 2018
@alperyeg (Contributor)

@mdenker, see below our responses to Messer's comments.

It is particularly important to derive the different limit processes from the same underlying Brownian motion (...)

This is addressed by commit 882d9ba

(...) alpha integer valued. I guess it should be floating point, as it theoretically lies in the interval (0,1)

With my latest commit, alpha is now a floating-point number, and the documentation is updated accordingly.

(...) But of course feel free to increase the rate changes in your examples to ensure 'robustness' of their detection.

This is done with commit fd4b2da

@mdenker merged commit 1595c70 into NeuralEnsemble:master on Apr 3, 2018
@Moritz-Alexander-Kern Moritz-Alexander-Kern deleted the feature/multifiltertest branch March 31, 2022 08:25