MAINT: save data and rebuild model #49

eigenfoo · 2018-07-18T15:15:22Z

I forgot to save the data as part of the trace. This does that and also allows us to rebuild_model.

aseyboldt · 2018-07-18T15:39:36Z

bayesalpha/author_model.py

@@ -233,6 +239,12 @@ def fit_authors(data,
    trace.attrs['model-version'] = get_versions()['version']
    trace.attrs['model-type'] = AUTHOR_MODEL_TYPE

+    if save_data:
+        d = data.set_index(['meta_user_id',


These dimension names should match the ones used in the model (line 59)

That way xarray knows that they are the same dimension, and doesn't store the coordinates twice.
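To illustrate the point about shared dimension names, here is a minimal sketch. The variable names `author_mean` and `author_sd` and the author IDs are made up for the example and are not taken from the model. Two variables declared on the same named dimension share a single coordinate:

```python
import numpy as np
import xarray as xr

# Hypothetical author IDs, for illustration only.
authors = ["u1", "u2", "u3"]

# Both variables are declared on the dimension "meta_user_id",
# so the Dataset holds one coordinate for that dimension.
ds = xr.Dataset(
    {
        "author_mean": (("meta_user_id",), np.zeros(len(authors))),
        "author_sd": (("meta_user_id",), np.ones(len(authors))),
    },
    coords={"meta_user_id": authors},
)

print(list(ds.coords))
print(ds.sizes["meta_user_id"])
```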

Ah, so then:

author -> meta_user_id
algo -> meta_algorithm_id
backtest -> meta_code_id

Right?

aseyboldt · 2018-07-18T15:48:13Z

thanks

eigenfoo · 2018-07-18T16:23:59Z

@aseyboldt latest Jenkins build failed because of the same dim names

E           ValueError: conflicting MultiIndex level name(s):
E           'meta_code_id' (dim_0), (meta_code_id)
E           'meta_user_id' (dim_0), (meta_user_id)
E           'meta_algorithm_id' (dim_0), (meta_algorithm_id)

eigenfoo · 2018-07-18T16:47:46Z

I think the easiest solution to this would be to change the dim names for _data to be meta_code_id__, meta_user_id__, etc. The underscores can be added or deleted when saving and loading the data.

I'm not sure if this is the most elegant solution though, since the user may want to access the data by itself, and the underscores would make that a pain.

@aseyboldt what do you think?
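For reference, the add-on-save, strip-on-load idea could look roughly like the sketch below. The helper names `mangle` and `unmangle` are hypothetical and do not exist in bayesalpha:

```python
SUFFIX = "__"  # suffix proposed in the comment above


def mangle(names):
    """Append the suffix to each level name before saving."""
    return [name + SUFFIX for name in names]


def unmangle(names):
    """Strip the suffix again after loading; leave other names untouched."""
    return [
        name[: -len(SUFFIX)] if name.endswith(SUFFIX) else name
        for name in names
    ]


levels = ["meta_user_id", "meta_algorithm_id", "meta_code_id"]
saved = mangle(levels)
restored = unmangle(saved)
```

The round trip restores the original names, but anyone reading the stored data directly would still see the suffixed ones, which is the drawback raised above.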

aseyboldt · 2018-07-18T16:49:26Z

It should be possible to fix that. We somehow need to tell xarray to reuse those coords. Give me a couple of minutes.

aseyboldt · 2018-07-18T17:26:19Z

This seems to be more difficult than I thought: pydata/xarray#1603

The dimension meta_user_id in _data is not the same as the other meta_user_id. The first one has repeated entries, so its length equals len(data), while the second has length num_authors. So you could use new names like data_meta_user_id. This isn't all that nice, but I think it makes some sense.
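The length mismatch can be seen in a small sketch. The toy data frame and the variable names `per_author` and `per_row` are assumptions for illustration. Note that the error raised here is xarray's check on conflicting dimension sizes, which is not the same message as the MultiIndex error from the Jenkins build; it only illustrates why one name cannot serve both purposes:

```python
import numpy as np
import pandas as pd
import xarray as xr

# Hypothetical toy data: one row per backtest, so author IDs repeat.
data = pd.DataFrame({
    "meta_user_id": ["u1", "u1", "u2", "u3", "u3"],
    "meta_code_id": ["c1", "c2", "c3", "c4", "c5"],
})
n_rows = len(data)                             # one entry per row
n_authors = data["meta_user_id"].nunique()     # one entry per author

# Using the same dimension name for both lengths is rejected.
try:
    xr.Dataset({
        "per_author": (("meta_user_id",), np.zeros(n_authors)),
        "per_row": (("meta_user_id",), np.zeros(n_rows)),
    })
    conflict = False
except ValueError:
    conflict = True

# A distinct name for the row-level dimension avoids the clash.
ok = xr.Dataset({
    "per_author": (("meta_user_id",), np.zeros(n_authors)),
    "per_row": (("data_meta_user_id",), np.zeros(n_rows)),
})
```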

eigenfoo · 2018-07-18T17:30:05Z

data storage is necromancy

aseyboldt · 2018-07-18T17:44:23Z

I created pydata/xarray#2299.

I think we should just add a

d.index.names = ['data_meta_user_id', 'data_meta_...']

just before the assignment to the xarray dataset.
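A minimal sketch of that rename in pandas follows. The full list of index columns is an assumption based on the three names discussed earlier in this thread, the `data_` prefix is applied to all of them by analogy with data_meta_user_id, and the `sharpe` column is made up:

```python
import pandas as pd

# Hypothetical toy data; the real data frame has more columns.
data = pd.DataFrame({
    "meta_user_id": ["u1", "u1", "u2"],
    "meta_algorithm_id": ["a1", "a2", "a3"],
    "meta_code_id": ["c1", "c2", "c3"],
    "sharpe": [0.5, 1.2, -0.3],  # made-up value column
})

# Assumed index columns, taken from the names mentioned in the thread.
index_cols = ["meta_user_id", "meta_algorithm_id", "meta_code_id"]
d = data.set_index(index_cols)

# Rename the MultiIndex levels so they no longer collide with the
# model's dimension names.
d.index.names = ["data_" + name for name in index_cols]

print(list(d.index.names))
```

Only the level names change; the index values and the data columns stay as they were.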

MAINT: save data and rebuild model

168cfb1

eigenfoo requested a review from aseyboldt July 18, 2018 15:29

aseyboldt reviewed Jul 18, 2018

MAINT: rename dims

761de76

aseyboldt merged commit 48bf273 into master Jul 18, 2018

aseyboldt deleted the save_data branch July 18, 2018 15:48

eigenfoo mentioned this pull request Jul 18, 2018

BUG: rename axes #51

Merged
