Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Shapefile masking #5470

Merged
merged 58 commits into from
Feb 13, 2024
Merged
Show file tree
Hide file tree
Changes from 44 commits
Commits
Show all changes
58 commits
Select commit Hold shift + click to select a range
3b15cd2
Working draft of shapefile masking
acchamber Sep 1, 2023
2ef0d15
Version of shapefile masking with tests and ready for preliminary review
acchamber Sep 5, 2023
6d36d05
Updated tests with proper paths and skip_tests decorator
acchamber Sep 6, 2023
befb1e5
Merge branch 'main' into shapefile_masking
acchamber Sep 7, 2023
6c2ef62
Merge branch 'main' into shapefile_masking
acchamber Oct 2, 2023
7b28fcd
Merge branch 'main' into shapefile_masking
acchamber Nov 8, 2023
6decedf
fixed some paths and removed broken code
acchamber Nov 8, 2023
d3b91d1
Merge branch 'SciTools:main' into shapefile_masking
acchamber Nov 9, 2023
d168f24
Added more tests and split into integration and unit tests. Testing w…
acchamber Nov 9, 2023
4947ffb
Merge branch 'shapefile_masking' of https://github.com/acchamber/iris…
acchamber Nov 9, 2023
1d05f63
responces to comments on utils.py for shapefile masking
acchamber Nov 20, 2023
ca363cc
tests actually pass now
acchamber Nov 20, 2023
23f3640
Moved tests to correct locations and strted changes on _shapefiles.py
acchamber Nov 20, 2023
57af617
some changes to _shapefiles to match review
acchamber Nov 20, 2023
5ba0ebc
added setUp cases to tests
acchamber Nov 20, 2023
cce3f9b
moved test names to lower_case and added acknoledgment
acchamber Nov 20, 2023
3ec7cc3
removed seperate guess_bounds function
acchamber Nov 20, 2023
7baab21
updated structure to properly call coord names/coords when optimal
acchamber Nov 22, 2023
c0aa728
sphnix improvements to docstring
acchamber Nov 22, 2023
44fe0cd
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Nov 22, 2023
28fcd44
commited dask map_blocks approach and some test improvements
acchamber Nov 23, 2023
87cc28e
replaced bounds rebasing via modulus with vectorized version
acchamber Nov 23, 2023
92d869f
Dask chunk control and some docstrings
acchamber Nov 24, 2023
4b38611
Merge branch 'shapefile_masking' of https://github.com/acchamber/iris…
acchamber Nov 27, 2023
befceeb
reverted behaviour of modulus function to ASCEND and switcher argumen…
acchamber Nov 27, 2023
e391e43
edied tests to work with flipped argument order
acchamber Nov 29, 2023
8b0e869
Improved optimisation by reading shapely docs properly and just using…
acchamber Jan 10, 2024
194aabf
Docstring updates and a 4d integration test
acchamber Feb 6, 2024
9521c2e
Merge branch 'main' into shapefile_masking
trexfeathers Feb 6, 2024
e89634b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 6, 2024
162bd45
Update lib/iris/_shapefiles.py
acchamber Feb 6, 2024
07ab745
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 6, 2024
e8a23b7
improving readability from martin
acchamber Feb 6, 2024
ea5be9c
Merge branch 'shapefile_masking' of https://github.com/acchamber/iris…
acchamber Feb 6, 2024
64278ca
removed dask.delayed call
acchamber Feb 6, 2024
a46d7c9
Update lib/iris/_shapefiles.py
acchamber Feb 6, 2024
eca46c0
Update lib/iris/_shapefiles.py
acchamber Feb 6, 2024
061b76b
Update lib/iris/util.py
acchamber Feb 6, 2024
cc6016b
Added warning for possible mismatch of mask/cube coords
acchamber Feb 7, 2024
5e7d799
test for new warning
acchamber Feb 7, 2024
440f049
added test
acchamber Feb 7, 2024
3c13e1b
Update lib/iris/_shapefiles.py
acchamber Feb 7, 2024
1e1d711
Added licenses
acchamber Feb 7, 2024
22e2cf7
Merge branch 'shapefile_masking' of https://github.com/acchamber/iris…
acchamber Feb 7, 2024
9cb96b9
fixed doctest failures in example
acchamber Feb 7, 2024
deb5ff9
Improved test coverage
acchamber Feb 7, 2024
6eaa36b
fixed doctest
acchamber Feb 7, 2024
3b7384f
doctest again
acchamber Feb 7, 2024
a0aec74
Docstring tidy up.
trexfeathers Feb 7, 2024
e13e757
Merge pull request #1 from trexfeathers/docstring_tidy
acchamber Feb 7, 2024
b76dabd
fixed prime meridian bug
acchamber Feb 9, 2024
ca380ed
Update lib/iris/_shapefiles.py
acchamber Feb 9, 2024
0518a40
Merge branch 'main' into shapefile_masking
acchamber Feb 9, 2024
404474f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 9, 2024
b924795
Added first draft of user guide page
acchamber Feb 12, 2024
4ab753a
Add What's New entry.
trexfeathers Feb 13, 2024
84a8212
Merge pull request #2 from trexfeathers/shapefile_whatsnew
acchamber Feb 13, 2024
bf3c720
Merge branch 'SciTools:main' into shapefile_masking
acchamber Feb 13, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
225 changes: 225 additions & 0 deletions lib/iris/_shapefiles.py
trexfeathers marked this conversation as resolved.
Show resolved Hide resolved
Original file line number Diff line number Diff line change
@@ -0,0 +1,225 @@
# Copyright Iris contributors
#
# This file is part of Iris and is released under the BSD license.
# See LICENSE in the root of the repository for full licensing details.

# Much of this code is originally based off the ASCEND library, developed in
# the Met Office by Chris Kent, Emilie Vanvyve, David Bentley, Joana Mendes
# many thanks to them. Converted to iris by Alex Chamberlain-Clay


from itertools import product
import warnings

import numpy as np
import shapely
import shapely.errors
import shapely.geometry as sgeom
import shapely.ops

from iris.exceptions import IrisDefaultingWarning, IrisUserWarning


def create_shapefile_mask(
geometry,
cube,
minimum_weight=0.0,
):
"""Make a mask for a cube from a shape.

Get the mask of the intersection between the
given shapely geometry and cube with x/y DimCoords.
Can take a minimum weight and evaluate area overlaps instead


Parameters
----------
geometry : A :class:`shapely.Geometry` object

cube : A :class:`iris.cube.Cube`
trexfeathers marked this conversation as resolved.
Show resolved Hide resolved
which has 1d x and y coordinates
minimum_weight : A float between 0 and 1 determining what % of a cell
a shape must cover for the cell to remain unmasked.
eg: 0.1 means that at least 10% of the shape overlaps the cell
to be unmasked
Requires geometry to be a Polygon or MultiPolygon
Defaults to 0.0 (eg only test intersection)

Returns
-------
A :class:`np.array` of the shape of the x & y coordinates of the cube, with points to mask equal to True

"""
from iris.cube import Cube, CubeList

try:
msg = "Geometry is not a valid Shapely object"
if shapely.is_valid(geometry) is False:
raise TypeError(msg)
except Exception:
raise TypeError(msg)
if not isinstance(cube, Cube):
if isinstance(cube, CubeList):
msg = "Received CubeList object rather than Cube - \
to mask a CubeList iterate over each Cube"
raise TypeError(msg)
else:
msg = "Received non-Cube object where a Cube is expected"
raise TypeError(msg)
if minimum_weight > 0.0 and isinstance(
geometry,
(
sgeom.Point,
sgeom.LineString,
sgeom.LinearRing,
sgeom.MultiPoint,
sgeom.MultiLineString,
),
):
minimum_weight = 0.0
warnings.warn(
"""Shape is of invalid type for minimum weight masking,
must use a Polygon rather than Line shape.\n
Masking based off intersection instead. """,
category=IrisDefaultingWarning,
)

# prepare 2D cube
y_name, x_name = _cube_primary_xy_coord_names(cube)
trexfeathers marked this conversation as resolved.
Show resolved Hide resolved
cube_2d = cube.slices([y_name, x_name]).next()
for coord in cube_2d.dim_coords:
if not coord.has_bounds():
coord.guess_bounds()
trans_geo = _transform_coord_system(geometry, cube_2d)

y_coord, x_coord = [cube_2d.coord(n) for n in (y_name, x_name)]
x_bounds = _get_mod_rebased_coord_bounds(x_coord)
y_bounds = _get_mod_rebased_coord_bounds(y_coord)
# prepare array for dark
box_template = [
sgeom.box(x[0], y[0], x[1], y[1]) for x, y in product(x_bounds, y_bounds)
]
# shapely can do lazy evaluation of intersections if it's given a list of grid box shapes
# delayed lets us do it in parallel
intersect_template = shapely.intersects(trans_geo, box_template)
# we want areas not under shapefile to be True (to mask)
intersect_template = np.invert(intersect_template)
# now calc area overlaps if doing weights and adjust mask
if minimum_weight > 0.0:
intersections = np.array(box_template)[~intersect_template]
intersect_template[~intersect_template] = [
trans_geo.intersection(box).area / box.area <= minimum_weight
for box in intersections
]
mask_template = np.reshape(intersect_template, cube_2d.shape[::-1]).T
return mask_template


def _transform_coord_system(geometry, cube, geometry_system=None):
"""Project the shape onto another coordinate system.

Parameters
----------
geometry: A :class:`shapely.Geometry` object

cube: A :class:`iris.cube.Cube` with the coord_system to
be projected to and a x coordinate

geometry_system: A :class:`iris.coord_systems` object describing
the coord_system of the shapefile. Defaults to None,
which is treated as GeogCS

Returns
-------
A transformed copy of the provided :class:`shapely.Geometry`
"""
y_name, x_name = _cube_primary_xy_coord_names(cube)
import iris.analysis.cartography

DEFAULT_CS = iris.coord_systems.GeogCS(
iris.analysis.cartography.DEFAULT_SPHERICAL_EARTH_RADIUS
)
target_system = cube.coord_system()
if not target_system:
warnings.warn(
"Cube has no coord_system; using default GeogCS lat/lon",
category=IrisDefaultingWarning,
)
target_system = DEFAULT_CS
if geometry_system is None:
geometry_system = DEFAULT_CS
target_proj = target_system.as_cartopy_projection()
source_proj = geometry_system.as_cartopy_projection()

trans_geometry = target_proj.project_geometry(geometry, source_proj)
# A default coord system in iris can be either -180 to 180 or 0 to 360
if target_system == DEFAULT_CS and cube.coord(x_name).points[-1] > 180:
trans_geometry = shapely.transform(trans_geometry, _trans_func)
trexfeathers marked this conversation as resolved.
Show resolved Hide resolved
if target_system != DEFAULT_CS and cube.coord(x_name).points[-1] > 180:
# this may lead to incorrect masking or not depending on projection type so warn user
warnings.warn(
"""Cube has x-coordinates over 180E and a non-standard projection time.\n
This may lead to incorrect masking. \n
If the result is not as expected, you might want to transform the x coordinate points to -180 to 180 """,
category=IrisUserWarning,
)
return trans_geometry


def _trans_func(geometry):
"""Pocket function for transforming the x coord of a geometry from -180 to 180 to 0-360."""
for point in geometry:
if point[0] < 0:
point[0] = 360 - np.abs(point[0])
return geometry


def _cube_primary_xy_coord_names(cube):
"""Return the primary latitude and longitude coordinate names, or long names, from a cube.

Arguments:
cube (:class:`iris.cube.Cube`): An Iris cube

Returns
-------
The names of the primary latitude and longitude coordinates
"""
latc = (
cube.coords(axis="y", dim_coords=True)[0]
if cube.coords(axis="y", dim_coords=True)
else -1
)
lonc = (
cube.coords(axis="x", dim_coords=True)[0]
if cube.coords(axis="x", dim_coords=True)
else -1
)

if -1 in (latc, lonc):
msg = "Error retrieving 1d xy coordinates in cube: {!r}"
raise ValueError(msg.format(cube))

latitude = latc.name()
longitude = lonc.name()
return latitude, longitude


def _get_mod_rebased_coord_bounds(coord):
"""Take in a coord and returns a array of the bounds of that coord rebased to the modulus.

Arguments:
coord (:class:`iris.coords.Coord`): An Iris coordinate
with a modulus

Returns
-------
A 1d Numpy array of [start,end] pairs for bounds of the coord
"""
modulus = coord.units.modulus
# Force realisation (rather than core_bounds) - more efficient for the
# repeated indexing happening downstream.
result = np.array(coord.bounds)
if modulus:
result[result < 0.0] = (np.abs(result[result < 0.0]) % modulus) * -1
result[np.isclose(result, modulus, 1e-10)] = 0.0
return result
108 changes: 108 additions & 0 deletions lib/iris/tests/integration/test_mask_cube_from_shapefile.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,108 @@
# Copyright Iris contributors
#
# This file is part of Iris and is released under the BSD license.
# See LICENSE in the root of the repository for full licensing details.

import math

import cartopy.io.shapereader as shpreader
import numpy as np

import iris
import iris.tests as tests
from iris.util import mask_cube_from_shapefile


@tests.skip_data
class TestCubeMasking(tests.IrisTest):
"""integration tests of mask_cube_from_shapefile
using different projections in iris_test_data -
values are the KGO calculated using ASCEND.
"""

def setUp(self):
ne_countries = shpreader.natural_earth(
resolution="10m", category="cultural", name="admin_0_countries"
)
self.reader = shpreader.Reader(ne_countries)

def test_global_proj_russia(self):
path = tests.get_data_path(
["NetCDF", "global", "xyt", "SMALL_hires_wind_u_for_ipcc4.nc"]
)
test_global = iris.load_cube(path)
ne_russia = [
country.geometry
for country in self.reader.records()
if "Russia" in country.attributes["NAME_LONG"]
][0]
masked_test = mask_cube_from_shapefile(test_global, ne_russia)
print(np.sum(masked_test.data))
assert math.isclose(
np.sum(masked_test.data), 76845.37, rel_tol=0.001
), "Global data with Russia mask failed test"

def test_rotated_pole_proj_germany(self):
path = tests.get_data_path(
["NetCDF", "rotated", "xy", "rotPole_landAreaFraction.nc"]
)
test_rotated = iris.load_cube(path)
ne_germany = [
country.geometry
for country in self.reader.records()
if "Germany" in country.attributes["NAME_LONG"]
][0]
masked_test = mask_cube_from_shapefile(test_rotated, ne_germany)
assert math.isclose(
np.sum(masked_test.data), 179.46872, rel_tol=0.001
), "rotated europe data with German mask failed test"

def test_transverse_mercator_proj_uk(self):
path = tests.get_data_path(
["NetCDF", "transverse_mercator", "tmean_1910_1910.nc"]
)
test_transverse = iris.load_cube(path)
ne_uk = [
country.geometry
for country in self.reader.records()
if "United Kingdom" in country.attributes["NAME_LONG"]
][0]
masked_test = mask_cube_from_shapefile(test_transverse, ne_uk)
assert math.isclose(
np.sum(masked_test.data), 90740.25, rel_tol=0.001
), "transverse mercator UK data with UK mask failed test"

def test_rotated_pole_proj_germany_weighted_area(self):
path = tests.get_data_path(
["NetCDF", "rotated", "xy", "rotPole_landAreaFraction.nc"]
)
test_rotated = iris.load_cube(path)
ne_germany = [
country.geometry
for country in self.reader.records()
if "Germany" in country.attributes["NAME_LONG"]
][0]
masked_test = mask_cube_from_shapefile(
test_rotated, ne_germany, minimum_weight=0.9
)
assert math.isclose(
np.sum(masked_test.data), 125.60199, rel_tol=0.001
), "rotated europe data with 0.9 weight germany mask failed test"

def test_4d_global_proj_brazil(self):
path = tests.get_data_path(["NetCDF", "global", "xyz_t", "GEMS_CO2_Apr2006.nc"])
test_4d_brazil = iris.load_cube(path, "Carbon Dioxide")
ne_brazil = [
country.geometry
for country in self.reader.records()
if "Brazil" in country.attributes["NAME_LONG"]
][0]
masked_test = mask_cube_from_shapefile(
test_4d_brazil,
ne_brazil,
)
print(np.sum(masked_test.data))
# breakpoint()
assert math.isclose(
np.sum(masked_test.data), 18616921.2, rel_tol=0.001
), "4d data with brazil mask failed test"
Loading
Loading