added json-ld contexts to api response #95

sherwoodf · 2024-04-05T16:22:22Z

After this change requesting any of the following objects:

Biosample
Collection
ImageAcquisition
Image
Specimen
Study
StudyFileReference
will return JSON with an @context field containing a URL that points to a file in this GitHub repo. The content type will still be 'application/json', regardless of what gets passed in.
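
As a rough illustration of the response shape described above, the sketch below uses a plain dataclass rather than the project's actual models. The `CONTEXT_BASE_URL` value, the `StudyResponse` class, and its fields are hypothetical placeholders; only the presence of an `@context` key holding a URL comes from the description.

```python
import json
from dataclasses import asdict, dataclass

# Hypothetical placeholder; the real URL points to a context file in the GitHub repo.
CONTEXT_BASE_URL = "https://example.org/contexts"


@dataclass
class StudyResponse:
    """Illustrative stand-in for an API model; not the project's real class."""

    uuid: str
    title: str
    context: str = f"{CONTEXT_BASE_URL}/StudyContext.jsonld"

    def to_json(self) -> str:
        body = asdict(self)
        # JSON-LD expects the key "@context", which is not a valid Python
        # attribute name, so it is renamed at serialization time.
        body["@context"] = body.pop("context")
        return json.dumps(body)


payload = json.loads(StudyResponse(uuid="1234", title="Example study").to_json())
```

Here `payload` carries the ordinary fields plus an `@context` URL, and the body is still plain JSON, which matches the unchanged 'application/json' content type.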

List of changes:

Created JSON-LD context objects, with some mapping to defined concepts in external ontologies where relevant. These include ID mappings for objects that should point back to the appropriate BIA API endpoints, and a generic vocabulary for all undefined terms.
Updated models to automatically add a context object with a link to the GitHub repository path where these files will exist.
Added a test that attempts to parse the JSON-LD using the contexts, in a similar manner to how I would expect users to parse the JSON. There is a slight difference: users should not need to manually insert the JSON-LD context (the link should just work), but changes can't be tested while pointing to the link.
Changed tests that rely on the models to have a placeholder value for the context.
Changed the imports in the tests to avoid from X import *.
Added some type checking and a comment to the util method I created for checking that unordered lists of dicts are equivalent.
Updated the env template with a few extra pointers on how to set up testing.
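
The testing approach in the list above (substituting the context document for the URL before parsing) might be sketched as follows. This uses only the standard library and a deliberately simplified term expansion; it is not a conformant JSON-LD processor, and a real test would more likely rely on a JSON-LD or RDF library. The context contents, URLs, field names, and function names are all hypothetical.

```python
from typing import Any, Dict

# Hypothetical context document, standing in for a file in the repository.
LOCAL_CONTEXT: Dict[str, Any] = {
    "@vocab": "https://example.org/vocab/",    # generic vocabulary for undefined terms
    "title": "http://purl.org/dc/terms/title",  # illustrative external mapping
    "uuid": "@id",                              # ID mapping
}


def inline_context(doc: Dict[str, Any], context: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of doc whose @context URL is replaced by the context itself."""
    return {**doc, "@context": context}


def expand_terms(doc: Dict[str, Any]) -> Dict[str, Any]:
    """Map each key to an IRI or keyword: explicit mappings first, then @vocab."""
    context = doc["@context"]
    vocab = context.get("@vocab", "")
    expanded = {}
    for key, value in doc.items():
        if key == "@context":
            continue
        expanded[context.get(key, vocab + key)] = value
    return expanded


response = {
    "@context": "https://example.org/contexts/StudyContext.jsonld",
    "uuid": "1234",
    "title": "Example",
    "file_count": 3,
}
expanded = expand_terms(inline_context(response, LOCAL_CONTEXT))
```

Inlining the context keeps the test independent of whatever is currently published at the URL, which is the limitation the description mentions.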

ctr26 · 2024-04-08T13:44:11Z

This is an unfinished thought, but I think I should note it here anyway.

I'd prefer if the contexts string field(s) were constructed by a class. Either through composition or mixin, though I think my preference so far would be a composition context class that has a repr that returns a string. A mixin would need a post_init function which might clash with other future inheritors.

My reason is that repeating the same string multiple times is error-prone in future development, where a developer might change one field correctly but miss another. Moreover, there might be cases where you want the URL to change (maybe to a file:/// for local development or something), and then that should be changed in the .env file (eventually, not today). It also has the advantage of making future version bumps smaller PRs.
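
A minimal sketch of the composition idea, assuming a single base URL that can be overridden from the environment. The class name, the environment variable, the default URL, and the file-naming pattern are all hypothetical and do not come from the project.

```python
import os
from typing import Optional

# Hypothetical variable name and default; neither appears in the project.
ENV_VAR = "JSONLD_CONTEXT_BASE_URL"
DEFAULT_BASE_URL = "https://example.org/contexts"


class ContextUrl:
    """Builds the @context URL for one model type from a single base URL."""

    def __init__(self, model_name: str, base_url: Optional[str] = None) -> None:
        self.model_name = model_name
        # An explicit argument wins, then the environment, then the default.
        chosen = base_url or os.environ.get(ENV_VAR, DEFAULT_BASE_URL)
        self.base_url = chosen.rstrip("/")

    def __str__(self) -> str:
        return f"{self.base_url}/{self.model_name}Context.jsonld"

    # The suggestion is a repr that returns the string, so reuse __str__.
    __repr__ = __str__


local_study = ContextUrl("Study", base_url="file:///tmp/contexts/")
```

With something like this, a model would hold a `ContextUrl` instead of a repeated literal, so changing the base location touches one place.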

Also, using the raw GitHub path is better than the tree one. For pure file serving, a common pattern I've seen is using the GitHub.io path. This is what is done for helm file serving, but in this case I don't see any meaningful advantage.
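
For reference, the difference between the two URL forms can be sketched as a small conversion helper. The function name and the owner/repository in the usage line are hypothetical, and the helper assumes the ref (branch, tag, or commit) contains no slashes.

```python
from urllib.parse import urlparse


def to_raw_url(github_url: str) -> str:
    """Convert a github.com blob/tree URL to its raw.githubusercontent.com form.

    Assumes the ref contains no slashes; the host is not validated.
    """
    parts = urlparse(github_url).path.strip("/").split("/")
    if len(parts) < 5 or parts[2] not in ("blob", "tree"):
        raise ValueError(f"not a GitHub file URL: {github_url}")
    owner, repo, _, ref, *path = parts
    return f"https://raw.githubusercontent.com/{owner}/{repo}/{ref}/{'/'.join(path)}"


raw = to_raw_url(
    "https://github.com/example-org/example-repo/tree/main/contexts/Study.jsonld"
)
```

The raw form serves the file contents directly, whereas the tree or blob form returns an HTML page, which a JSON-LD processor cannot use as a context.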

Thoughts? I'd be interested in knowing if my above points are moot because of the nature of the context.

sherwoodf · 2024-04-08T14:19:20Z

I'd prefer if the contexts string field(s) were constructed by a class. Either through composition or mixin, though I think my preference so far would be a composition context class that has a repr that returns a string. A mixin would need a post_init function which might clash with other future inheritors.

My reason is that repeating the same string multiple times is error-prone in future development, where a developer might change one field correctly but miss another. Moreover, there might be cases where you want the URL to change (maybe to a file:/// for local development or something), and then that should be changed in the .env file (eventually, not today). It also has the advantage of making future version bumps smaller PRs.

Thoughts? I'd be interested in knowing if my above points are moot because of the nature of the context.

Definitely not moot. I had initially created a base dictionary holding the content used in all contexts (vocab, title, etc.), which the more specific contexts then built on, and I used those to create the JSON directly in the objects. When I switched to the JSON files I removed that 'hierarchy'.
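
The earlier 'hierarchy' described here might have looked roughly like the sketch below: shared entries in one base dictionary, merged into each model-specific context. This is a reconstruction under assumptions, not the removed code; the IRIs, field names, and the `make_context` helper are hypothetical.

```python
from typing import Any, Dict

# Hypothetical shared entries; the comment names vocab and title as examples.
BASE_CONTEXT: Dict[str, Any] = {
    "@vocab": "https://example.org/vocab/",
    "title": "http://purl.org/dc/terms/title",
}


def make_context(**specific: Any) -> Dict[str, Any]:
    """Return a new context: shared base entries plus model-specific ones."""
    return {"@context": {**BASE_CONTEXT, **specific}}


# Illustrative model-specific contexts built on the shared base.
STUDY_CONTEXT = make_context(uuid="@id")
IMAGE_CONTEXT = make_context(uuid="@id", study_uuid={"@type": "@id"})
```

Each call builds a fresh dictionary, so a model-specific entry never leaks back into `BASE_CONTEXT`.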

While I agree that the current system would be prone to errors with changes, I don't think making a better one now is worth the development time. It feels too early to try to create the 'correct' system, as I'm not confident that we know what result we're aiming for. All the values should currently be considered placeholders: most of the important ones don't have an existing definition, and where I have created a mapping, I'd bet against changes in one file needing to be reflected in all the others. I'd feel more comfortable coming back to it in ~4 months' time to create a system that reflects the data-modelling decisions of the working group.

I view this change more as getting the API to return valid JSON-LD, so there is valid RDF to play with, but nowhere near the 'correct' RDF that conforms to an ontology of our API concepts (which still needs defining).

ctr26 · 2024-04-08T14:52:02Z

Yep, valid reasons.

If the structure is being actively developed and is prone to significant change anyway, then taking on some technical debt now is your decision.

Thanks for the clear and thoughtful response

liviuba

Looks good, please check suggestions

liviuba

LGTM!

added json-ld contexts to api response

5350a36

sherwoodf requested review from liviuba and ctr26 April 5, 2024 16:22

liviuba reviewed Apr 8, 2024

api/src/models/persistence.py (outdated, resolved)

Don't ignore project-local .vscode (useful for defaults)

4241459

Add suggested extensions

liviuba reviewed Apr 8, 2024

api/src/tests/test_rdf.py (resolved)

Add @context as a URI in tests, to keep consistent with the field

e428e11

liviuba and others added 2 commits April 8, 2024 16:52

Update readme

5f86d28

updated context to use pydantic's Url class, rather than string

aa4e017

sherwoodf requested a review from liviuba April 10, 2024 09:01

liviuba requested changes Apr 10, 2024

api/src/models/persistence.py (outdated, resolved)

api/src/tests/util.py (outdated, resolved)

fixed import in tests, updated model hierarchy

492607f

sherwoodf requested a review from liviuba April 10, 2024 10:00

updated client to handle json-ld context field

10517be

liviuba approved these changes Apr 10, 2024

sherwoodf removed the request for review from ctr26 April 10, 2024 14:55

sherwoodf merged commit 39b06fb into main Apr 10, 2024
0 of 7 checks passed

sherwoodf deleted the jsonld branch July 4, 2024 15:20
