Commons Alliance Components

This repository will try to explain the components that take part in the Team Calcium NIH Data Commons Pilot (and beyond).

Note, if you are viewing this on github, the images may be cached, please visit:

https://david4096.github.io/data-platforms/

For more background read the Data Biosphere post.

Visit the DataBiosphere github organization.

Prototype

The various components coordinate to create a platform useful for data analysis.

Digital Object Catalog

Provides clients and services access to resources available in object stores. Digital objects can be files and the catalog itself maintains a registry of locations to find the files, as well as minimal metadata.

GUID Resolver

Allows globally unique identifiers to be "resolved" to digital objects. For more information please refer to Identifier Interoperability.

Namespace Service

Identifiers can be given different namespaces or "prefixes". The namespace service allows commons members to easily manage GUIDs across projects and domains. For more information please refer to Identifier Interoperability.

Access Control

To guarantee authority and authenticity of requests, some access control services are provided. These services will at least be able to identify a client and delegate authority to the access control system of choice.

Analytical Engine

Software which can orchestrate and execute computational tasks in heterogeneous computing environments.

Tool Repository

A resource which contains templates of reusable computational tasks that can be directed at new data, and then executed by the Analytical Engine.

Workspaces

Clients accessing a commons infrastructure should be able to manage data for secondary and tertiary data analysis.

Indexing and Search

Data in commons infrastructure should be findable using Search mechanisms. Indexing makes data available for search.

Ontology

A controlled vocabulary informs indexers and or querying applications to make metadata usable.

Metadata Indexer

Metadata made available by a platform is indexed into a store. Indexers allow data to be made findable using a structured document scheme.

Metadata Querying

Once metadata have been indexed into a platform, these indices are made available by services that allow queries to be formed against the metadata.

Portal

Commons infrastructure should provide interfaces to make data easily findable. Once data has been found in a portal, it can be added to a workspace.

Application

Applications combine a variety of Commons components to carry out specific tasks.