Comments on computational efficiency for GPBoost

Estimation/training of random effects models, and of Gaussian process (GP) models in particular, can be computationally demanding for large data sets (not just in GPBoost). Below, we list some strategies for computationally efficient inference.

Approximations for Gaussian processes

In brief, try the following (see the sketch after this list):

  • GPModel(..., gp_approx = "vecchia") for Gaussian process models
  • GPModel(..., gp_approx = "vecchia", matrix_inversion_method = "iterative") for Gaussian processes with non-Gaussian likelihoods (e.g., classification)

In more detail:

  • The GPBoost library implements Vecchia approximations for Gaussian processes. To activate this, set gp_approx = "vecchia". The parameter num_neighbors controls a trade-off between runtime and accuracy: the smaller num_neighbors, the faster the code runs, but the less accurate the approximation. See here for more information on the methodological background.

    • For non-Gaussian likelihoods (e.g., classification), iterative methods (instead of the Cholesky decomposition) can additionally speed up computations with a Vecchia approximation: GPModel(..., gp_approx = "vecchia", matrix_inversion_method = "iterative").
    • An additional speed-up for matrix_inversion_method = "iterative" can sometimes be obtained by setting the parameter cg_max_num_it to a lower value, say, 100 or 20 (default = 1000). This can be done by calling the set_optim_params function prior to running the GPBoost algorithm or by setting it in the params argument when training a GPModel; see also the sketch after this list. This option is particularly recommended for the GPBoost algorithm.
  • The GPBoost library also supports other GP approximations; see here (gp_approx) for a list of currently supported approximations.
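
As an illustration of the Vecchia plus iterative-methods option for non-Gaussian data, the following minimal Python sketch rests on some assumptions: the binary response, the coordinates, and the "bernoulli_logit" likelihood name are placeholders for illustration, and cg_max_num_it is passed via set_optim_params as described above:

  import numpy as np
  import gpboost as gpb

  # Made-up binary classification data
  n = 10000
  coords = np.random.uniform(size=(n, 2))
  y = np.random.binomial(1, 0.5, size=n)

  # Vecchia approximation combined with iterative methods for a non-Gaussian likelihood
  gp_model = gpb.GPModel(gp_coords=coords, cov_function="exponential",
                         likelihood="bernoulli_logit", gp_approx="vecchia",
                         num_neighbors=20, matrix_inversion_method="iterative")
  # Optionally cap the number of conjugate gradient iterations for additional speed-up
  gp_model.set_optim_params(params={"cg_max_num_it": 100})
  gp_model.fit(y=y)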

Adjusting the way hyperparameters (of random effects / GP models) are chosen

  • Setting the parameter train_gp_model_cov_pars to false in the function gpb.train can make training with the GPBoost algorithm faster since the covariance parameters are then not estimated. In this case, you can consider them as tuning parameters that can be chosen using, e.g., cross-validation. As a middle ground, you can also increase the convergence tolerance for the hyperparameter estimation; the hyperparameters are then estimated less accurately but faster. Examples (see also the sketch at the end of this section):

    • gpb.train(..., train_gp_model_cov_pars=FALSE) (R) / gpb.train(..., train_gp_model_cov_pars=False) (Python)
    • gp_model$set_optim_params(params=list(delta_rel_conv=1e-3)) (R) / gp_model.set_optim_params(params={"delta_rel_conv": 1e-3}) (Python)
  • To get a better understanding of the progress of the hyperparameter optimization, set the option trace to true in the gp_model. Examples:

    • gp_model$set_optim_params(params=list(trace=TRUE)) (R) / gp_model.set_optim_params(params={"trace": True}) (Python)
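
Putting these options together for the GPBoost algorithm, here is a minimal Python sketch. The feature matrix, grouping variable, response, and boosting parameters are made-up placeholders; gpb.Dataset, gpb.train, and set_optim_params are from the GPBoost Python package:

  import numpy as np
  import gpboost as gpb

  # Made-up data: features, a grouping variable for random effects, and a response
  n = 10000
  X = np.random.normal(size=(n, 5))
  group = np.random.randint(0, 500, size=n)
  y = np.random.normal(size=n)

  gp_model = gpb.GPModel(group_data=group, likelihood="gaussian")
  # Middle-ground option: looser convergence tolerance for hyperparameter
  # estimation, plus progress output via 'trace'
  gp_model.set_optim_params(params={"delta_rel_conv": 1e-3, "trace": True})

  data_train = gpb.Dataset(X, y)
  params = {"learning_rate": 0.05, "max_depth": 5}
  bst = gpb.train(params=params, train_set=data_train, gp_model=gp_model,
                  num_boost_round=100)

  # Alternative: do not estimate covariance parameters during boosting and treat
  # them as tuning parameters chosen, e.g., via cross-validation:
  # bst = gpb.train(params=params, train_set=data_train, gp_model=gp_model,
  #                 num_boost_round=100, train_gp_model_cov_pars=False)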