radiantearth · cholmes · Aug 30, 2018 · Aug 15, 2018 · Aug 20, 2018 · Aug 21, 2018
diff --git a/dataset-spec/README.md b/dataset-spec/README.md
@@ -0,0 +1,62 @@
+# Dataset Spec for STAC
+
+## Introduction
+
+One topic of interest has been the search of datasets*, instead of within a dataset, i.e. in (sub-)catalogs, items and assets. [STAC](https://github.com/radiantearth/stac-spec) is focused on search within a dataset, but it includes some simple constructs to catalog datasets. This could be an independent spec that STAC uses, and others can also independently use, to describe datasets in a lightweight way.
+
+*\* There is no standardized name for the concept we are describing here. Others called it: dataset series (ISO 19115), collection (CNES, NASA), dataset (JAXA), dataset series (ESA), product (JAXA).*
+
+## Core
+
+| Element         | Type                                  | Name                            | Description                                                  |
+| --------------- | ------------------------------------- | ------------------------------- | ------------------------------------------------------------ |
+| id              | string                                | Dataset ID (required)           | Identifier for the dataset that is unique across the provider. MUST follow the pattern ` ^[A-Za-z0-9_\-\/]+$ `. TODO: Allow slash? |
+| title           | string                                | Title                           | A short descriptive one-line title for the dataset.          |
+| description     | string                                | Description (required)          | Detailed multi-line description to fully explain the entity. [CommonMark 0.28](http://commonmark.org/) syntax MAY be used for rich text representation. |
+| keywords        | [string]                              | Keywords                        | List of keywords describing the dataset.                     |
+| version         | string                                | Dataset Version                 | Version of the dataset. [Semantic Versioning (SemVer)](https://semver.org/) SHOULD be followed. |
+| license         | string                                | Dataset License Name (required) | Dataset's license(s) as a [SPDX License identifier or expression](https://spdx.org/licenses/) or `proprietary` if the license is not on the SPDX license list. See `license_url` for more information. |
+| license_url     | string                                | Dataset License URL             | Dataset's license URL SHOULD be specified if `license` is set to `proprietary`. |
+| provider        | [Provider Object]                     | Data Provider                   | The organizations that created the content of the dataset.   |
+| host            | Host Object                           | Storage Provider                | The organization that hosts the dataset.                     |
+| spatial_extent  | [GeoJSON Object](http://geojson.org/) | Spatial extent (required)       | The spatial extent covered by the dataset as [GeoJSON](http://geojson.org/) object. |
+| temporal_extent | string                                | Temporal extent (required)      | Temporal extent covered by the dataset. Date/time intervals MUST be formatted according to ISO 8601. ToDo: Support open date ranges |
+| links           | [Link Object]                         | Links (required)                | A list of references to other documents, see Link Object for further documentation. TODO: Remove if catalog is revised and links are specified on the catalog level. |
+
+### Provider Object
+
+| Element | Type   | Name                  | Description                                     |
+| ------- | ------ | --------------------- | ----------------------------------------------- |
+| name    | string | Organization name     | The name of the organization or the individual. |
+| url     | string | Organization homepage | Homepage of the provider.                       |
+
+###  Host Object
+
+| Element        | Type    | Name                  | Description                                                  |
+| -------------- | ------- | --------------------- | ------------------------------------------------------------ |
+| description    | string  | Description           | Detailed description to explain the hosting details. [CommonMark 0.28](http://commonmark.org/) syntax MAY be used for rich text representation. |
+| scheme         | string  | Scheme (required)     | Values: S3, GCS, URL, OTHER                                  |
+| id             | string  | Identifier (required) | Host-specific identifier such as an URL or asset id.         |
+| region         | string  | Region                | Provider specific region where the data is stored.           |
+| requester_pays | boolean | Requester pays        | `true` if requester pays, `false` if host pays. Defaults to `false`. |
+
+**Note:** The idea of storage profiles is currently [discussed](https://github.com/radiantearth/stac-spec/issues/148). Therefore, scheme, id and region may be removed from the final spec.
+
+### Link Object
+
+| Element | Type   | Name                | Description                                                  |
+| ------- | ------ | ------------------- | ------------------------------------------------------------ |
+| href    | string | Link (required)     | The actual link in the format of an URL. Relative and absolute links are both allowed. |
+| rel     | string | Relation (required) | Relationship between the current document and the linked document. |
+| type    | string | MIME-type           | MIME-type of the referenced entity.                          |
+| title   | string | Title               | Human-readable title for the link.                           |
+
+## Extensions
+
+Related extensions to be used with the dataset spec:
+
+* [EO extension](../extensions/stac-eo-spec.md)
+  Please note that some fields such as `eo:sun_elevation ` or `eo:sun_azimuth` are only meaningful on the item level and MUST not be used in datasets.
+* [Dimensions extension](../extensions/dimension) (currently in review, see [PR #164](https://github.com/radiantearth/stac-spec/pull/164))
+* [Scientific extension](../extensions/scientific) (currently in review, see [PR #186](https://github.com/radiantearth/stac-spec/pull/186))
+* Provenance extension  (planned, see [issue #179](https://github.com/radiantearth/stac-spec/issues/179))
diff --git a/dataset-spec/json-schema/dataset.json b/dataset-spec/json-schema/dataset.json
@@ -0,0 +1,136 @@
+{
+  "$schema": "http://json-schema.org/draft-06/schema#",
+  "id": "dataset.json#",
+  "title": "Dataset Item",
+  "description": "This object represents the dataset in a SpatioTemporal Asset Catalog.",
+  "type": "object",
+  "required": [
+    "id",
+    "description",
+    "license",
+    "spatial_extent",
+    "temporal_extent",
+    "links"
+  ],
+  "properties": {
+    "id": {
+      "title": "Provider ID",
+      "type": "string",
+      "pattern": "^[A-Za-z0-9_\\-\/]+$"
+    },
+    "title": {
+      "title": "Title",
+      "type": "string"
+    },
+    "description": {
+      "title": "Description",
+      "type": "string"
+    },
+    "keywords": {
+      "title": "Keywords",
+      "type": "array",
+      "items": {
+        "type": "string"
+      }
+    },
+    "license": {
+      "title": "License Name",
+      "type": "string"
+    },
+    "license_url": {
+      "title": "License URL",
+      "type": "string",
+      "format": "url"
+    },
+    "provider": {
+      "type": "array",
+      "items": {
+        "properties": {
+          "name": {
+            "title": "Organization Name",
+            "type": "string"
+          },
+          "url": {
+            "title": "Organization homepage",
+            "type": "string",
+            "format": "url"
+          }
+        }
+      }
+    },
+    "host": {
+      "required": [
+        "id",
+        "scheme"
+      ],
+      "properties": {
+        "id": {
+          "title": "Identifirer",
+          "type": "string"
+        },
+        "scheme": {
+          "title": "Scheme",
+          "type": "string",
+          "enum": [
+            "S3",
+            "GCS",
+            "URL",
+            "OTHER"
+          ]
+        },
+        "description": {
+          "title": "Description",
+          "type": "string"
+        },
+        "region": {
+          "title": "Region",
+          "type": "string"
+        },
+        "requester_pays": {
+          "title": "Requester Pays",
+          "type": "boolean",
+          "default": false
+        }
+      }
+    },
+    "version": {
+      "title": "Version",
+      "type": "string"
+    },
+    "temporal_extent": {
+      "title": "Temporal extent",
+      "type": "string"
+    },
+    "spatial_extent": {
+      "type": "object"
+    },
+    "links": {
+      "type": "array",
+      "items": {
+        "type": "object",
+        "required": [
+          "href",
+          "rel"
+        ],
+        "properties": {
+          "href": {
+            "title": "Link",
+            "type": "string"
+          },
+          "rel": {
+            "title": "Relation",
+            "type": "string"
+          },
+          "type": {
+            "title": "type",
+            "type": "string"
+          },
+          "title": {
+            "title": "Title",
+            "type": "string"
+          }
+        }
+      }
+    }
+  }
+}
diff --git a/extensions/dimension/README.md b/extensions/dimension/README.md
@@ -0,0 +1,19 @@
+# STAC Dimensions Extension Spec
+
+This document explains the fields of the STAC Dimensions Extension (dim) to a STAC `Dataset`. Data can have different dimensions (= axes), e.g. in meteorology. The properties of these dimensions can be defined with this extension.
+
+## Dimensions Extension Description
+
+This is the field that extends the `Dataset` object:
+
+| Element          | Type                 | Name                      | Description                                                  |
+| ---------------- | -------------------- | ------------------------- | ------------------------------------------------------------ |
+| dim:dimensions          | [Dimension Object] | Dimensions               | Dimensions of the data. If the dimensions have an order, the order SHOULD be reflected in the order of the array. |
+
+### Dimension Object
+
+| Element | Type             | Name                | Description                                                  |
+| ------- | ---------------- | ------------------- | ------------------------------------------------------------ |
+| label   | string           | Label (required)    | Human-readable label for the dimension.                      |
+| unit    | string           | Unit of Measurement | Unit of measurement, preferably SI. ToDo: Any standard to express this, e.g. [UDUNITS](https://www.unidata.ucar.edu/software/udunits/) or this [dict](https://www.unc.edu/~rowlett/units/)? |
+| extent  | [number\|string] | Data Extent         | Specifies the extent of the data, i.e. the lower bound as the first element and the upper bound as the second element of the array. |
diff --git a/extensions/dimension/example.json b/extensions/dimension/example.json
@@ -0,0 +1,23 @@
+{
+  "dim:dimensions": [
+    {
+      "label": "Longitude",
+      "unit": "°",
+      "extent": [-180, 180]
+    },
+    {
+      "label": "Latitude",
+      "unit": "°",
+      "extent": [-90, 90]
+    },
+    {
+      "label": "Temperature",
+      "unit": "°C",
+      "extent": [-20, 60]
+    },
+    {
+      "label": "Date",
+      "extent": ["2018-01-01T00:00:00Z", "2018-01-31T23:59:59Z"]
+    }
+  ]
+}
diff --git a/extensions/dimension/schema.json b/extensions/dimension/schema.json
@@ -0,0 +1,36 @@
+{
+  "$schema": "http://json-schema.org/draft-07/schema#",
+  "type": "object",
+  "title": "STAC Dimensions Extension Spec",
+  "properties": {
+    "dim:dimensions": {
+      "type": "array",
+      "title": "Dimensions",
+      "items": {
+        "type": "object",
+        "required": [
+          "label"
+        ],
+        "properties": {
+          "label": {
+            "type": "string",
+            "title": "Label"
+          },
+          "unit": {
+            "type": "string",
+            "title": "Unit of Measurement"
+          },
+          "extent": {
+            "type": "array",
+            "title": "Data Extent",
+            "minItems": 2,
+            "maxItems": 2,
+            "items": {
+              "type": ["number", "string"]
+            }
+          }
+        }
+      }
+    }
+  }
+}
diff --git a/extensions/stac-collection-spec.md b/extensions/stac-collection-spec.md
@@ -4,11 +4,11 @@ A group of STAC `Item` objects from a single source can share a lot of common me
 
 ## Collection Extension Description
 
-| element             | type info                 | name                    | description                                                                                 | 
-|----------------------|---------------------------|-------------------------|---------------------------------------------------------------------------------------------| 
-| c:id | string | Collection ID | Machine readable ID for the collection
-| c:name | string (optional) | Collection Name | A name given to the Collection, used for display
-| c:description | string (optional) | Collection Description | A human readable description of the collection. [CommonMark 0.28](http://commonmark.org/) syntax MAY be used for rich text representation.
+| element       | type info         | name                   | description                                      |
+| ------------- | ----------------- | ---------------------- | ------------------------------------------------ |
+| c:id          | string            | Collection ID          | Machine readable ID for the collection           |
+| c:name        | string (optional) | Collection Name        | A name given to the Collection, used for display |
+| c:description | string (optional) | Collection Description | A human readable description of the collection. [CommonMark 0.28](http://commonmark.org/) syntax MAY be used for rich text representation. |
 
 A `Collection` does not have many specific fields, as it may contain any fields that are in the core spec as well as any other extension. This provides maximum flexibility to data providers, as some the set of common metadata fields can vary between different types of data. For instance, Landsat and Sentinel data always has a eo:off_nadir value of 0, because those satellites are always pointed downward (i.e., nadir), while satellite that can be pointed will have varying eo:off_nadir values.