Skip to content

An R package for creating publication-ready forest plots.

License

Notifications You must be signed in to change notification settings

maxiancheng/forester

 
 

Repository files navigation

Forester

The goal of forester is to make it easy for you to create a publication-quality forest plot with as much or as little information displayed on either side as you require.

Installation

This package is currently early in development, and must be installed from this github repo.

devtools::install_github("rdboyes/forester")

Basic Usage

Suppose we wish to replicate the following figure published in the NEJM [1]:

Forester simply requires the left side of the table (in this case, three columns with Subgroups and counts for each of two groups) and vectors which contain the point estimates and confidence intervals.

library(forester)

table <- readxl::read_excel(system.file("extdata", "example_figure_data.xlsx", package = "forester"))

# indent the subgroup if there is a number in the placebo column
table$Subgroup <- ifelse(is.na(table$Placebo), 
                         table$Subgroup,
                         paste0("   ", table$Subgroup))

# remove indent of the first row
table$Subgroup[1] <- "All Patients"

# use forester to create the table with forest plot
forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           xlim = c(-100, 25),
           file_path = here::here("man/figures/forester_plot.png"))

Forester handles the alignment of the graph and the table automatically, so figures with fewer rows or columns should work by simply passing a smaller data frame to the function:

forester(left_side_data = table[1:12,1:3],
           estimate = table$Estimate[1:12],
           ci_low = table$`CI low`[1:12],
           ci_high = table$`CI high`[1:12],
           display = FALSE,
           file_path = here::here("man/figures/fewer_rows.png"))

forester(left_side_data = table[,1],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           file_path = here::here("man/figures/fewer_cols.png"))

Display Options

Using the default options will result in a png image being written to your temporary folder and opened with your default program for viewing images. If you don’t want to open the file, specify display = FALSE. ggplot2::ggsave is used to save the file, so any format supported there will work in theory (though not necessarily in practice). Use the render_as option to change the output file format - for example, render_as = "pdf" will render the plot in pdf form. If you want to display the plot inline in an rmarkdown document, use render_as = "rmarkdown (this feature is still experimental). You can alternatively generate the plots as images and then use normal markdown syntax to include them in your document (i.e. ![]()). Be sure to change the file_path option to save the plot somewhere permanent if you are using this option or would otherwise like your plots to be saved.

Font Families

Pass any font to font_family to control the output font. Tested options include "mono", "sans", and "serif". Any font will work, but alignment may not be perfect for untested fonts. Use the provided options nudge_x and nudge_y to correct alignment issues.

library(extrafont)
#> Registering fonts with R

loadfonts(device = "win")
windowsFonts("Fira Sans" = windowsFont("Fira Sans"))

forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           nudge_y = -.3,
           file_path = here::here("man/figures/forester_plot_fira.png"),
           font_family = "Fira Sans")

Right Side Data

The option estimate_precision can be used to change the number of decimals displayed in the right hand side table. estimate_col_name allows you to change the name of the default right side column. If you require more control, pass a dataframe to right_side_data to override the right side completely.

forester(left_side_data = table[,1],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           estimate_precision = 3,
           estimate_col_name = "Estimate (95% CI)",
           file_path = here::here("man/figures/more_precise.png"))

Column Justification

Column justification can be controlled either with a single number (e.g. the default justify = 0 for all columns left justified) or with a vector of length equal to the number of columns in left_side_data, plus one for the estimate column. If the vector form is specified, each number controls the justification of one column. Right justification is justify = 1, centering is justify = 0.5.

forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           justify = c(0,1,1,1),
           estimate_col_name = "Estimate (95% CI)",
           file_path = here::here("man/figures/right_just.png"))

Plot Width

Change ggplot_width from its default value of 30 to adjust the relative amount of space that the figure takes up. nudge_x can be used to correct alignment issues if this option causes any.

forester(left_side_data = table[,1],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           ggplot_width = 40,
           nudge_x = .5,
           file_path = here::here("man/figures/more_plot_width.png"))

Limits, Breaks, and the Null Line

Limits and breaks are set automatically, but the defaults can be overwritten using xlim (use a vector of length 2, i.e. c(low, high)) and xbreaks (use a vector indicating the breaks). null_line_at defaults to 0, but can be set to any value. This would be most commonly used to set the null_line_at = 1.0 for relative measures. x_scale_linear defaults to TRUE but can be set to FALSE if a logarithmic scale is required. If a confidence interval extends outside the range set by xlim, it will automatically be indicated using an arrow.

forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           file_path = here::here("man/figures/limit_breaks.png"),
           font_family = "sans",
           null_line_at = -50,
           xlim = c(-100, -25),
           xbreaks = c(-100, -75, -50, -25))

Table Colour Options

As pointed out here, the default colour scheme looks a bit retro. Personally, I like the default stripe colour “#eff3f2”, but you can change the stripe colour to anything you want :)

forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           file_path = here::here("man/figures/stripe_colour.png"),
           font_family = "sans",
           stripe_colour = "#ff0000"
           )

Adding Arrows

Arrows can be added below the plot area (with arrows = TRUE) and labelled (with arrow_labels = c("Left Label", "Right Label")).

forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           file_path = here::here("man/figures/forester_plot_arrows.png"),
           font_family = "sans",
           null_line_at = 0,
           xlim = c(-100, 25),
           xbreaks = c(-100, -75, -50, -25, 0, 25),
           arrows = TRUE, 
           arrow_labels = c("Inclisiran Better", "Placebo Better"))

Point Size and Shape

The size and shape of the points can be controlled using vectors of values passed to point_sizes and point_shapes.

shapes <- rep(16, times = 30)

shapes[1] <- 17

sizes <- rep(3.25, times = 30)

sizes[30] <- 5

forester(left_side_data = table[,1:3],
           estimate = table$Estimate,
           ci_low = table$`CI low`,
           ci_high = table$`CI high`,
           display = FALSE,
           file_path = here::here("man/figures/size_shape.png"),
           font_family = "sans",
           null_line_at = 0,
           xlim = c(-100, 25),
           xbreaks = c(-100, -75, -50, -25, 0, 25),
           arrows = TRUE, 
           arrow_labels = c("Inclisiran Better", "Placebo Better"),
           point_sizes = sizes,
           point_shapes = shapes)

Adding Additional ggplot Objects

Custom ggplot objects can be passed to the forester function using the parameter add_plot. To align the plot with the rows of the table, the vertical center of the bottom row is at y = 0, and each row is one unit tall on the y axis. add_plot_width can be set to customize the width of the plot (units are relative to the width of the table).

library(ggplot2)
#> Warning: package 'ggplot2' was built under R version 4.3.2
library(tibble)

ex_plot <- ggplot(tibble(x = rep(1:7, each = 15), y = rep(0:14, times = 7)), aes(x = x, y = y)) +
  geom_point()

forester(left_side_data = table[1:15,1:3],
           estimate = table$Estimate[1:15],
           ci_low = table$`CI low`[1:15],
           ci_high = table$`CI high`[1:15],
           display = FALSE,
           add_plot = ex_plot,
           file_path = here::here("man/figures/add_dots.png"))

Citation Information

If you’d like to cite forester, please use:

citation("forester")
#> To cite forester in publications use:
#> 
#>   Boyes, Randy (2021). Forester: An R package for creating
#>   publication-ready forest plots. R package version 0.3.0. Available
#>   at: https://github.com/rdboyes/forester
#> 
#> A BibTeX entry for LaTeX users is
#> 
#>   @Manual{,
#>     title = {Forester: An R package for creating publication-ready forest plots.},
#>     author = {Randy Boyes},
#>     year = {2021},
#>     note = {R package version 0.3.0},
#>     url = {https://github.com/rdboyes/forester},
#>   }

Papers Using forester

  1. Damtew, Yohannes Tefera, et al. “Effects of high temperatures and heatwaves on dengue fever: a systematic review and meta-analysis.” Ebiomedicine 91 (2023).

  1. Petersen, Maria Skaalum, et al. “Clinical characteristics of the Omicron variant-results from a Nationwide Symptoms Survey in the Faroe Islands.” International Journal of Infectious Diseases 122 (2022): 636-643.

  1. Petersen, Maria Skaalum, et al. “Clinical characteristics of the Omicron variant-results from a Nationwide Symptoms Survey in the Faroe Islands.” International Journal of Infectious Diseases 122 (2022): 636-643.

4. Xie, Quin Yuhui, et al. “Immune responses to gut bacteria associated with time to diagnosis and clinical response to T cell–directed therapy for type 1 diabetes prevention.” Science Translational Medicine 15.719 (2023): eadh0353.

5. Suarez-Zdunek, Moises Alberto, et al. “High incidence of subclinical peripheral artery disease in people with HIV.” AIDS 36.10 (2022): 1355-1362.

References

  1. Ray, K. K., Wright, R. S., Kallend, D., Koenig, W., Leiter, L. A., Raal, F. J., Bisch, J. A., Richardson, T., Jaros, M., Wijngaard, P. L. J., Kastelein, J. J. P., & ORION-10 and ORION-11 Investigators. (2020). Two Phase 3 Trials of Inclisiran in Patients with Elevated LDL Cholesterol. The New England Journal of Medicine, 382(16), 1507–1519.

About

An R package for creating publication-ready forest plots.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 100.0%