Generalizing dataloader and loading multiple species #88
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hi all,
I wanted to start this pull request as a discussion about some extensions to CREsted that I have considered/need, and have built out a preliminary version of in my fork. I haven't written tests and certainly the changes I have here break other things in CREsted I haven't checked.
I'm working on building models that train on data across species, as well as have additional information like gene expression vectors passed to the model.
Therefore what I have altered in my fork includes the following:
I'm still working out some bugs, but figured I shouldn't go any further until I had reached out to see what you have in the works here, and if this code is useful to you and is the type of thing you would consider merging when it is mature. Otherwise I'll continue developing this as independent extensions for CREsted.
Thanks so much for developing this great package!
Matthew
P.S. The low-level usage for the extended classes I've written looks like so: