Skip to content

A package to train machine learning models on geospatial data, mainly for weather and climate. Used to run ArchesWeather and ArchesWeatherGen

License

Notifications You must be signed in to change notification settings

INRIA/geoarches

Repository files navigation

geoarches

geoarches is a machine learning package for training, running and evaluating ML models on weather and climate data, developed by Guillaume Couairon and Renu Singh in the ARCHES team at INRIA (Paris, France).

geoarches's building blocks can be easily integrated into research ML pipelines. It can also be used to run the ArchesWeather and ArchesWeatherGen weather models.

geoarches is based on pytorch, pytorch-lightning and hydra for configuration. After the package is installed, you can use its modules in python code, but you can also call the main training and evaluating scripts of geoarches.

To develop your own models or make modifications to the existing ones, the intended usage is to write configurations files and pytorch lightning classes in your own working directory. Hydra will then discover your custom configs folder, and you can point to your custom classes from your custom config files.

Link to a recent presentation of geoarches.

Code Overview

geoarches is meant to jumpstart your ML pipeline with building blocks for data handling, model training, and evaluation. This is an effort to share engineering tools and research knowledge across projects.

Data:

  • download/: scripts that parallelize downloads and show how to use chunking to speed up read access.
  • dataloaders/: PyTorch datasets that read netcdf data and prepare tensors to feed into model training.

Model training:

  • backbones/: network architecture that can be plugged into lightning modules.
  • lightning_modules/: wrapper around backbone modules to handle loss computation, optimizer, etc for training and inference (agnostic to backbone but specific to ML task).

Evaluation:

  • metrics/: tested suite of iterative metrics (memory efficient) for deterministic and generative models.
  • evaluation/: scripts for running metrics over model predictions and plotting.

Pipeline:

  • main_hydra.py: script to run training or inference with hydra configuration.
  • documentation/: quickstart code for training and inference from a notebook.

Installation

Install poetry

We use poetry for package dependencies. Use pipx to install poetry:

brew install pipx
pipx install poetry

Environment

Create an environment or activate the environment you are already using.

conda create --name weather python=3.11
conda activate weather

Move into the git repo and install dependencies:

cd geoarches
poetry install

Poetry, by default, installs the geoarches package in editable mode. Editable mode allows you to make changes to the geoarches code locally, and these changes will automatically be reflected in your code that depends on it.

Useful directories

We recommend making the following symlinks in the codebase folder:

ln -s /path/to/data/ data             # Store data for training and evaluation.
ln -s /path/to/models/ modelstore     # Store model checkpoints and model hydra configs.
ln -s /path/to/evaluation/ evalstore  # Store intermediate model outputs for computing metrics.
ln -s /path/to/wandb/ wandblogs       # Store Wandb logs.

If you want to store models and data in your working directory, or can also simply create regular folders.

Downloading ArchesWeather and ArchesWeatherGen

Use following the script to download the 4 deterministic models (archesweather-m-...) and generative model (archesweathergen).

src="https://huggingface.co/gcouairon/ArchesWeather/resolve/main"
MODELS="archesweather-m-seed0 archesweather-m-seed1 archesweather-m-skip-seed0 archesweather-m-skip-seed1 archesweathergen"
for MOD in $MODELS; do
    mkdir -p modelstore/$MOD/checkpoints
    wget -O modelstore/$MOD/checkpoints/checkpoint.ckpt $src/${MOD}_checkpoint.ckpt
    wget -O modelstore/$MOD/config.yaml $src/${MOD}_config.yaml 
done

You can follow instructions in docs/archesweather/run.ipynb to load the models and run inference with them. See docs/archesweather/train.md to run training.

Downloading ERA5 statistics

To compute brier score on ERA5 (needed to instantiate ArchesWeather models for inference or training), you will need to download ERA5 quantiles:

src="https://huggingface.co/gcouairon/ArchesWeather/resolve/main"
wget -O geoarches/stats/era5-quantiles-2016_2022.nc $src/era5-quantiles-2016_2022.nc

Using geoarches modules in python

Your directory structure after following installation should look like this:

├── geoarches
│   ├── geoarches
│   │   ├── ...
└── your_own_project
    ├── ...

The recommended way to use the package is to depend on the package inside your own working directory, by importing them in your project code e.g.

from geoarches.dataloaders.era5 import Era5Forecast
ds = Era5Foreacast(path='data/era5_240/full',
                   load_prev=True,
                   norm_scheme='pangu')

Making edits directly in the geoarches package will make updates more difficult, but if you prefer this option, you can create a development branch so as to rebase it on future updates of geoarches. (See Contributing section).

Running models with geoarches

Using hydra configuration

We use Hydra to easily configure training experiments. main_hydra.py is pointed to the configs/ folder which tells geoarches which dataloader, lightning module, backbone, and their arguments to run. You can also override arguments by CLI (see below for useful arguments). Please read Hydra documentation for more information.

Train models with CLI

To train model named default_run, you can run

python -m geoarches.main_hydra \
module=archesweather \ # Uses module/archesweather.yaml
dataloader=era5 \ # Uses dataloader/era5.yaml
++name=default_run \ # Name of run, used for Wandb logging and checkpoint dir

This will start a training for the deterministic model ArchesWeather on ERA5 data.

The model config will be saved to modelstore/default_run/config.yaml and the model checkpoints will be saved to modelstore/default_run/checkpoints.

Useful training options are

python -m geoarches.main_hydra \
++log=True \ # log metrics on weights and biases (See Wandb section below.)
++seed=0 \ # set global seed
++cluster.gpus=4 \ # number of gpus used for distributed training
++batch_size=1 \ # batch size per gpu
++max_steps=300000 \ # maximum number of steps for training, but it's good to leave this at 300k for era5 trainings
++save_step_frequency=50000 \ # if you need to save checkpoints at a higher frequency

Run on SLURM

To run on a SLURM cluster, you can create a configs/cluster folder inside your working directory and put a custom_slurm.yaml configuration file in it with custom arguments. Then you call tell geoarches to use this configuration file with

python -m geoarches.submit --config-dir configs cluster=custom_slurm

Log experiments to Wandb

Find your API key under User settings in your account (https://docs.wandb.ai/support/find_api_key/) and set the Wandb environment variable in your ~/.bashrc.

export WANDB_API_KEY="..."

Then tell geoaches to log to Wandb.

python -m geoarches.main_hydra \
++log=True \ # log metrics on weights and biases
++cluster.wandb_mode=offline \ # online allows machine with internet connection to log directly to wandb. Otherwise offline mode logs locally and requires a separate step to sync with wandb.

Evaluate models with CLI

To run evaluation of a model (e.g. ArchesWeather) on the test set (2020), you can run

MODEL=archesweather-m
python -m geoarches.main_hydra ++mode=test ++name=$MODEL

It will automatically load the config file in modelstore/$MODEL and load the latest checkpoint from modelstore/$MODEL/checkpoints. It will then run the metrics relevant for the loaded model (deterministic metrics for deterministic models and similarly for generative models)

Warning: if the provided model does not exist, it will not throw an error.

Useful options for testing:

python -m geoarches.main_hydra ++mode=test ++name=$MODEL \
++ckpt_filename_match=100000 \ # substring that should be present in checkpoint file name, e.g. here for loading the checkpoint at step 100000
++limit_test_batches=0.1 \ # run test on only a fraction of test set for debugging
++module.module.rollout_iterations=10 \ # autoregressive rollout horizon, in which case the line below is also needed
++dataloader.test_args.multistep=10 \ # allow the dataloader to load trajectories of size 10

++dataloader.test_args.

For testing the generative models, you can also use the following options:

++module.inference.num_steps=25 \ # num diffusion steps in generation
++module.inference.num_members=50 \ # num members in ensemble
++module.inference.rollout_iterations=10 \ # number of auto-regressive steps, 10 days by default.

Implementing your own models

  1. Create a configs/ folder in your own project folder for your hydra configuration files. In this folder, you can put your own configs, e.g. by copying config files from geoarches and modifying them. Please note the config files should be put in the appropriate folder (configs/cluster/, configs/dataloader/ or configs/module/). You will need a base configs/config.yaml. See geoarches for an example.

  2. Add new lightning modules or architectures in your working directory (we recommend putting lightning modules in a lightning_modules folder, and pytorch-only backbone architectures in a backbones folder). To tell hydra to use these modules, you can create a module config file custom_forecast.yaml under configs/module/ and point to your new backbone and module classes:

module:
  _target_: lightning_modules.custom_module.CustomLightningModule
  ...

backbone:
  _target_: backbones.custom_backbone.CustomBackbone
  ...

You can of course mix and match your custom modules and backbones with the ones in geoarches.

  1. Training models only requires one to tell hydra to use your configs folder with
python -m geoarches.main_hydra --config-dir configs

Compute model outputs and metrics separately

You can compute model outputs and metrics separately. In that case, you first run evaluation as following:

python -m geoarches.main_hydra ++mode=test ++name=$MODEL \
++module.inference.save_test_outputs=False \

Then, to compute metrics, you can run evaluation/eval_multistep.py which reads in inference output from xarray files, computes specified metrics, and dumps metrics to output_dir. Example:

python -m geoarches.evaluation.eval_multistep \
    --pred_path evalstore/modelx_predictions/ \
    --output_dir evalstore/modelx_predictions/ \
    --groundtruth_path data/era5_240/full/  \
    --multistep 10 \
    --metrics era5_ensemble_metrics --num_workers 2

Before running, you need to make sure the metrics are registered in evaluation/metric_registry.py using register_metric(). You can find examples in the file. Example:

register_metric(
    "era5_ensemble_metrics",
    Era5EnsembleMetrics,
    save_memory=True,
)

Metrics are registered with a name, class, and any arguments.

Plot (WIP)

You can plot metrics for several models using the script plot.py. Just specify where the computed metrics are stored (either .nc or .pt files). Example:

python -m geoarches.evaluation.plot --output_dir plots/ \ --metric_paths /evalstore/modelx/...nc /evalstore/modely/...nc --model_names_for_legend ModelX ModelY \ --metrics rankhist --rankhist_prediction_timedeltas 1 7 \ --figsize 10 4 --vars Z500 Q700 T850 U850 V850

Contributing to geoarches

If you want to contribute to geoarches, please see the Contributing section.

External resources

Many thanks to the authors of WeatherLearn for adapting the Pangu-Weather pseudocode to pytorch. The code for our model is mostly based on their codebase.

WeatherBench

WeatherLearn

About

A package to train machine learning models on geospatial data, mainly for weather and climate. Used to run ArchesWeather and ArchesWeatherGen

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published