| Type: | Package | 
| Title: | Build Species Distribution Modeling using 'caret' | 
| Version: | 1.1.4 | 
| Maintainer: | Luíz Fernando Esser <luizesser@gmail.com> | 
| Description: | Use machine learning algorithms and advanced geographic information system tools to build Species Distribution Modeling in a extensible and modern fashion. | 
| License: | MIT + file LICENSE | 
| BugReports: | https://github.com/luizesser/caretSDM/issues | 
| Imports: | caret, checkmate, cli, CoordinateCleaner, data.table, dismo, dplyr, fs, furrr, future, ggplot2, ggspatial, glue, gtools, httr, lwgeom, mapview, methods, parallelly, pdp, pROC, progressr, purrr, raster, rgbif, Rtsne, sf, stars, stats, stringdist, stringr, terra, tidyr, usdm, utils | 
| Encoding: | UTF-8 | 
| LazyData: | true | 
| RoxygenNote: | 7.3.2 | 
| Suggests: | e1071, kknn, mda, naivebayes, nnet, here, tibble, withr, roxyglobals, covr, knitr, rmarkdown, testthat (≥ 3.0.0) | 
| Config/testthat/edition: | 3 | 
| Depends: | R (≥ 4.2.0) | 
| URL: | https://luizesser.github.io/caretSDM/ | 
| Config/Needs/website: | rmarkdown | 
| Config/roxyglobals/filename: | generated-globals.R | 
| Config/roxyglobals/unique: | FALSE | 
| NeedsCompilation: | no | 
| Packaged: | 2025-08-29 12:43:38 UTC; luizesser | 
| Author: | Dayani Bailly | 
| Repository: | CRAN | 
| Date/Publication: | 2025-08-29 13:50:02 UTC | 
caretSDM: Build Species Distribution Modeling using 'caret'
Description
 
Use machine learning algorithms and advanced geographic information system tools to build Species Distribution Modeling in a extensible and modern fashion.
Author(s)
Maintainer: Luíz Fernando Esser luizesser@gmail.com (ORCID) [copyright holder]
Authors:
- Dayani Bailly (ORCID) 
- Edivando Couto (ORCID) 
- José Hilário Delconte Ferreira (ORCID) 
- Marcos R. Lima (ORCID) 
- Reginaldo Ré (ORCID) 
- Valéria Batista (ORCID) 
See Also
Useful links:
Retrieve Species data from GBIF
Description
This function is a wrapper to get records from GBIF using rgbif and return a
data.frame ready to be used in caretSDM.
Usage
GBIF_data(s, file = NULL, as_df = FALSE, ...)
Arguments
| s | 
 | 
| file | 
 | 
| as_df | Should the output be a  | 
| ... | Arguments to pass on  | 
Value
A data.frame with species occurrences data, or an occurrences object if
as_df = FALSE.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
References
https://www.gbif.org
Examples
# Select species names:
s <- c("Araucaria angustifolia", "Salminus brasiliensis")
# Run function:
oc <- GBIF_data(s)
Download WorldClim v.2.1 bioclimatic data
Description
This function allows to download data from WorldClim v.2.1 (https://www.worldclim.org/data/index.html) considering multiple GCMs, time periods and SSPs.
Usage
WorldClim_data(path = NULL,
               period = "current",
               variable = "bioc",
               year = "2090",
               gcm = "mi",
               ssp = "585",
               resolution = 10)
Arguments
| path | Directory path to save downloads. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| period | Can be "current" or "future". | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| variable | Allows to specify which variables you want to retrieve Possible entries are: "tmax","tmin","prec" and/or "bioc". | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| year | Specify the year you want to retrieve data. Possible entries are: "2030", "2050", "2070" and/or "2090". You can use a vector to provide more than one entry. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| gcm | GCMs to be considered in future scenarios. You can use a vector to provide more than one entry. 
 | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| ssp | SSPs for future data. Possible entries are: "126", "245", "370" and/or "585". You can use a vector to provide more than one entry. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| resolution | You can select one resolution from the following alternatives: 10, 5, 2.5 OR 30. | 
Details
This function will create a folder entitled "input_data/WorldClim_data_current" or "input_data/WorldClim_data_future". All the data downloaded will be stored in this folder. Note that, despite being possible to retrieve a lot of data at once, it is not recommended to do so, since the data is very heavy.
Value
If data is not downloaded, the function downloads the data and has no return value. If
the data is downloaded, it imports the data as a stack.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
References
https://www.worldclim.org/data/index.html
Examples
# download data from multiple periods:
year <- c("2050", "2090")
WorldClim_data(period = "future",
               variable = "bioc",
               year = year,
               gcm = "mi",
               ssp = "126",
               resolution = 10)
# download data from one specific period
WorldClim_data(period = "future",
               variable = "bioc",
               year = "2070",
               gcm = "mi",
               ssp = "585",
               resolution = 10)
Add predictors to sdm_area
Description
This function includes new predictors to the sdm_area object.
Usage
add_predictors(sa, pred, variables_selected = NULL, gdal = TRUE)
get_predictors(i)
Arguments
| sa | A  | 
| pred | 
 | 
| variables_selected | 
 | 
| gdal | Boolean. Force the use or not of GDAL when available. See details. | 
| i | 
 | 
Details
add_predictors returns a sdm_area object with a grid built upon the x parameter.
There are two ways to make the grid and resample the variables in sdm_area: with and
without gdal. As standard, if gdal is available in you machine it will be used (gdal = TRUE),
otherwise sf/stars will be used.
Value
For add_predictors the same input sdm_area object is returned including the
pred data binded to the previous grid.
get_predictors retrieves the grid from the i object.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) and Reginaldo Ré. https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 25000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc)
# Retrieve predictors data:
get_predictors(sa)
Add scenarios to sdm_area
Description
This function includes scenarios in the sdm_area object.
Usage
add_scenarios(sa, scen = NULL, scenarios_names = NULL, pred_as_scen = TRUE,
                     variables_selected = NULL, stationary = NULL, crop_area = NULL)
set_scenarios_names(i, scenarios_names = NULL)
scenarios_names(i)
get_scenarios_data(i)
select_scenarios(i, scenarios_names = NULL)
Arguments
| sa | A  | 
| scen | 
 | 
| scenarios_names | Character vector with names of scenarios. | 
| pred_as_scen | Logical. If  | 
| variables_selected | Character vector with variables names in  | 
| stationary | Names of variables from  | 
| crop_area | A  | 
| i | A  | 
Details
The function add_scenarios adds scenarios to the sdm_area or input_sdm
object. If scen has variables that are not present as predictors the function will use
only variables present in both objects. stationary variables are those that don't change
through the scenarios. It is useful for hidrological variables in fish habitat modeling, for
example (see examples below). When adding multiple scenarios in multiple runs, the function will
always add a new "current" scenario. To avoid that, set pred_as_scen = FALSE.
Value
add_scenarios returns the input sdm_area or input_sdm object with a
new slot called scenarios with scen data as a list, where each slot of the
list holds a scenario and each scenario is a sf object.
set_scenarios_names sets new names for scenarios in sdm_area/input_sdm
object.
scenarios_names returns scenarios' names.
get_scenarios_data retrieves scenarios data as a list of sf objects.
select_scenarios selects scenarios from sdm_area/input_sdm object.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc)
# Include scenarios:
sa <- add_scenarios(sa, scen[1:2]) |> select_predictors(c("bio1", "bio12"))
# Set scenarios names:
sa <- set_scenarios_names(sa, scenarios_names = c("future_1", "future_2",
                                                  "current"))
scenarios_names(sa)
# Get scenarios data:
scenarios_grid <- get_scenarios_data(sa)
scenarios_grid
# Select scenarios:
sa <- select_scenarios(sa, scenarios_names = c("future_1"))
# Setting stationary variables in scenarios:
sa <- sdm_area(rivs[c(1:200),], cell_size = 100000, crs = 6933, lines_as_sdm_area = TRUE) |>
  add_predictors(bioc) |>
  add_scenarios(scen, stationary = c("LENGTH_KM", "DIST_DN_KM"))
Caret Algorithms
Description
A data.frame with characteristics of each algorithm available in caretSDM. Each
column is a different characteristics. This can be helpful for more experienced modelers select
algorithms. See the source for a selection method using this data.
Usage
algorithms
Format
## 'algorithms'
A data.frame with 230 rows and 60 columns:
- X
- Algorithms names 
- Further columns
- Algorithms attributes 
Source
<https://topepo.github.io/caret/models-clustered-by-tag-similarity.html>
Bioclimatic Variables
Description
A stars object with bioclimatic variables (bio1, bio4 and bio12) for the Parana state in Brazil.
Data obtained from WorldClim 2.1 at 10 arc-min resolution.
Usage
bioc
Format
## 'bioc'
A stars with 1 attribute and 3 bands:
- bio1
- Annual Mean Temperature 
- bio4
- Temperature Seasonality 
- bio12
- Annual Precipitation 
Source
<https://www.worldclim.org/>
Create buffer around occurrences
Description
Create buffer around records in occ_data to be used as study area
Usage
buffer_sdm(occ_data, size = NULL, crs = NULL)
Arguments
| occ_data | A  | 
| size | 
 | 
| crs | 
 | 
Value
A sf buffer around occ_data records.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
study_area <- buffer_sdm(occ, size=50000, crs=6933)
plot(study_area)
Presence data cleaning routine
Description
Data cleaning wrapper using CoordinateCleaner package.
Usage
data_clean(occ, pred = NULL,
           species = NA, lon = NA, lat = NA,
           capitals = TRUE,
           centroids = TRUE,
           duplicated = TRUE,
           identical = TRUE,
           institutions = TRUE,
           invalid = TRUE,
           terrestrial = TRUE,
           independent_test = TRUE)
Arguments
| occ | A  | 
| pred | A  | 
| species | A  | 
| lon | A  | 
| lat | A  | 
| capitals | Boolean to turn on/off the exclusion from countries capitals coordinates (see  | 
| centroids | Boolean to turn on/off the exclusion from countries centroids coordinates (see  | 
| duplicated | Boolean to turn on/off the exclusion from duplicated records (see  | 
| identical | Boolean to turn on/off the exclusion from records with identical lat/long values (see  | 
| institutions | Boolean to turn on/off the exclusion from biodiversity institutions coordinates (see  | 
| invalid | Boolean to turn on/off the exclusion from invalid coordinates (see  | 
| terrestrial | Boolean to turn on/off the exclusion from coordinates falling on sea (see  | 
| independent_test | Boolean. If  | 
Details
If the user does not used GBIF_data function to obtain species records, the function may
have problems to find which column from the presences table has species, longitude and latitude
information. In this regard, we implemented the parameters species, lon and
lat so the use can explicitly inform which columns should be used. If they remain as NA
(standard) the function will try to guess which columns are the correct one.
Value
A occurrences_sdm object or input_sdm with cleaned presence data.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
GBIF_data occurrences_sdm sdm_area input_sdm
predictors
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 50000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Clean coordinates (terrestrial is set to false to make the run quicker):
i <- data_clean(i, terrestrial = FALSE)
Ensemble GCMs into one scenario
Description
An ensembling method to group different GCMs into one SSP scenario
Usage
gcms_ensembles(i, gcms = NULL)
Arguments
| i | A  | 
| gcms | GCM codes in  | 
Value
A input_sdm object with grouped GCMs.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
GBIF_data occurrences_sdm sdm_area input_sdm
predictors
Examples
# Create sdm_area object:
set.seed(1)
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc)
# Include scenarios:
sa <- add_scenarios(sa, scen) |> select_predictors(c("bio1", "bio12"))
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method="random", n_set = 2)
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "boot",
                                number = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i,
               algo = c("naive_bayes"),
               ctrl=ctrl_sdm,
               variables_selected = c("bio1", "bio12")) |>
  suppressWarnings()
# Predict models:
i  <- predict_sdm(i, th=0.8)
#' # Ensemble GCMs:
i <- gcms_ensembles(i, gcms = c("ca", "mi"))
i
input_sdm
Description
This function creates a new input_sdm object.
Usage
input_sdm(...)
add_input_sdm(i1, i2)
Arguments
| ... | Data to be used in SDMs. Can be a  | 
| i1 | A  | 
| i2 | A  | 
Details
If sdm_area is used, it can include predictors and scenarios. In this case,
input_sdm will detect and include as scenarios and predictors in the
input_sdm output. Objects can be included in any order, since the function will work by
detecting their classes.
The returned object is used throughout the whole workflow to apply functions.
Value
A input_sdm object containing:
| grid | 
 | 
| bbox | Four corners for the bounding box (class  | 
| cell_size | 
 | 
| epsg | 
 | 
| predictors | 
 | 
Author(s)
Luiz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 50000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa, scen)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
is_class functions to check caretSDM data classes.
Description
This functions returns a boolean to check caretSDM object classes.
Usage
is_input_sdm(x)
is_sdm_area(x)
is_occurrences(x)
is_models(x)
is_predictions(x)
Arguments
| x | Object to be tested. | 
Value
Boolean.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 25000, crs = 6933)
is_sdm_area(sa)
is_input_sdm(sa)
Join Area
Description
Join cell_id data from sdm_area to a occurrences
Usage
join_area(occ, pred)
Arguments
| occ | A  | 
| pred | A  | 
Details
This function is key in this SDM workflow. It attaches cell_id values to occ, deletes
records outside pred and allows the use of pseudoabsences. This function also tests if
CRS from both occ and pred are equal, otherwise the CRS of pred is used to
convert occ.
Value
A occurrences object with cell_id to each record.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
occurrences_sdm sdm_area input_sdm
pseudoabsences
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 50000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa, scen)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
Araucaria angustifolia occurrence data
Description
A data.frame object with Araucaria angustifolia occurrence data obtained from GBIF and
filtered with Parana state sf.
Usage
occ
Format
## 'occ'
A data.frame with 420 rows and 3 columns (EPSG:6933):
- species
- Species name 
- decimalLongitude
- Longitude in meters 
- decimalLatitude
- Latitude in meters 
Source
<https://www.gbif.org>
Occurrences Managing
Description
This function creates and manage occurrences objects.
Usage
occurrences_sdm(x,
                independent_test = NULL,
                p = 0.1,
                crs = NULL,
                independent_test_crs = NULL,
                ...)
n_records(i)
species_names(i)
get_coords(i)
occurrences_as_df(i)
add_occurrences(oc1, oc2)
Arguments
| x | A  | 
| independent_test | Boolean. If  | 
| p | Numeric. Fraction of data to be used as independent test. Standard is 0.1. | 
| crs | Numeric. CRS of  | 
| independent_test_crs | Numeric. CRS of  | 
| ... | A vector with column names addressing the columns with species names, longitude and
latitude, respectively, in  | 
| i | 
 | 
| oc1 | A  | 
| oc2 | A  | 
Details
x must have three columns: species, decimalLongitude and decimalLatitude. When sf
it is only necessary a species column.
n_records return the number of presence records to each species.
species_names return the species names.
get_coords return a data.frame with coordinates of species records.
add_occurrences return a occurrences. This function sums two occurrences objects.
It can also sum a occurrences object with a data.frame object.
occurrences_as_df returns a data.frame with species names and coordinates.
Value
A occurrences object.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933)
Paraná State
Description
A sf object with a polygon for the Paraná state in Brazil. This is a subset of the
brazilian map provided by official government agency (IBGE)
Usage
parana
Format
## 'parana'
A sf with 1 row and 5 columns:
- GID0
- State code 
- CODIGOIB1
- State's phone code 
- NOMEUF2
- Name of the state 
- SIGLAUF3
- Abbreviation of the state's name 
- geom
- Geometry column of the - sf
Source
<https://www.ibge.gov.br/geociencias/cartas-e-mapas/bases-cartograficas-continuas/15759-brasil.html>
Predictors as PCA-axes
Description
Transform predictors data into PCA-axes.
Usage
pca_predictors(i, cumulative_proportion = 0.99)
pca_summary(i)
get_pca_model(i)
Arguments
| i | A  | 
| cumulative_proportion | A  | 
Details
pca_predictors Transform predictors data into PCA-axes. If the user wants to use PCA-axes
as future scenarios, then scenarios should be added after the PCA transformation (see examples).
pca_summary Returns the summary of prcomp function. See ?stats::prcomp.
get_pca_model Returns the model built to calculate PCA-axes.
Value
input_sdm object with variables from both predictors and scenarios
transformed in PCA-axes.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
vif_predictors sdm_area add_scenarios add_predictors
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 50000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# PCA transformation:
i <- pca_predictors(i)
Model Response to Variables
Description
Obtain the Partial Dependence Plots (PDP) to each variable.
Usage
pdp_sdm(i, spp = NULL, algo = NULL, variables_selected = NULL, mean.only = FALSE)
get_pdp_sdm(i, spp = NULL, algo = NULL, variables_selected = NULL)
Arguments
| i | A  | 
| spp | A  | 
| algo | A  | 
| variables_selected | A  | 
| mean.only | Boolean. Should only the mean curve be plotted or a curve to each run should be included? Standard is FALSE. | 
Value
A plot (for pdp_sdm) or a data.frame (for get_pdp_sdm) with PDP values.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method="random", n_set=3)
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "repeatedcv",
                                number = 2,
                                repeats = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i, algo = c("naive_bayes"), ctrl=ctrl_sdm)
# PDP plots:
pdp_sdm(i)
get_pdp_sdm(i)
S3 Methods for plot and mapview
Description
This function creates different plots depending on the input.
Usage
plot_occurrences(i, spp_name = NULL, pa = TRUE)
plot_grid(i)
plot_predictors(i, variables_selected = NULL)
plot_scenarios(i, variables_selected = NULL, scenario = NULL)
plot_predictions(
  i,
  spp_name = NULL,
  scenario = NULL,
  id = NULL,
  ensemble = TRUE,
  ensemble_type = "mean_occ_prob"
)
mapview_grid(i)
mapview_occurrences(i, spp_name = NULL, pa = TRUE)
mapview_predictors(i, variables_selected = NULL)
mapview_scenarios(i, variables_selected = NULL, scenario = NULL)
mapview_predictions(
  i,
  spp_name = NULL,
  scenario = NULL,
  id = NULL,
  ensemble = TRUE,
  ensemble_type = "mean_occ_prob"
)
plot_background(i, variables_selected = NULL)
plot_niche(
  i,
  spp_name = NULL,
  variables_selected = NULL,
  scenario = NULL,
  id = NULL,
  ensemble = TRUE,
  ensemble_type = "mean_occ_prob",
  raster = FALSE
)
Arguments
| i | Object to be plotted. Can be a  | 
| spp_name | A character with species to be plotted. If NULL, the first species is plotted. | 
| pa | Boolean. Should pseudoabsences be plotted together? (not implemented yet.) | 
| variables_selected | A character vector with names of variables to be plotted. | 
| scenario | description | 
| id | The id of models to be plotted (only used when  | 
| ensemble | Boolean. Should the ensemble be plotted (TRUE)? Otherwise a prediction will be plotted | 
| ensemble_type | Character of the type of ensemble to be plotted. One of: "mean_occ_prob", "wmean_AUC" or "committee_avg" | 
| raster | Should the niche be extrapolated to a raster covering all possibe values in the environmental space? | 
Details
We implemented a bestiary of plots to help visualizing the process and results. If you are not familiar with mapview, consider using it to better visualize maps.
Value
The plot or mapview desired.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Predict SDM models in new data
Description
This function projects SDM models to new scenarios
Usage
predict_sdm(m,
            scen = NULL,
            metric = "ROC",
            th = 0.9,
            tp = "prob",
            ensembles = TRUE,
            file = NULL,
            add.current = TRUE)
get_predictions(i)
get_ensembles(i)
add_predictions(p1, p2)
Arguments
| m | A  | 
| scen | A  | 
| metric | A character containing the metric in which the  | 
| th | Thresholds for metrics. Can be numeric or a function. | 
| tp | Type of output to be retrieved. See details. | 
| ensembles | Boolean. Should ensembles be calculated? If  | 
| file | File to sabe predictions. | 
| add.current | If current scenario is not available, predictors will be used as the current scenario. | 
| i | A  | 
| p1 | A  | 
| p2 | A  | 
Details
tp is a parameter to be passed on caret to retrieve either the probabilities of classes
(tp="prob") or the raw output (tp="raw"), which could vary depending on the algorithm used, but
usually would be on of the classes (factor vector with presences and pseudoabsences).
When ensembles is set to TRUE, three ensembles are currently implemented.
mean_occ_prob is the mean occurrence probability, which is a simple mean of predictions,
wmean_AUC is the same mean_occ_prob, but weighted by AUC, and committee_avg is the committee
average, as known as majority rule, where predictions are binarized and then a mean is obtained.
get_predictions returns the list of all predictions to all scenarios, all species,
all algorithms and all repetitions. Useful for those who wish to implement their own ensemble
methods.
get_ensembles returns a matrix of data.frames, where each column is a
scenario and each row is a species.
scenarios_names returns the scenarios names in a sdm_area or input_sdm
object.
get_scenarios_data returns the data from scenarios in a sdm_area or
input_sdm object.
Value
A input_sdm or a predictions object.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
sdm_area input_sdm mean_validation_metrics
Examples
# Create sdm_area object:
set.seed(1)
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method="random", n_set=2)
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "boot",
                                number = 1,
                                repeats = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i, algo = c("naive_bayes"), ctrl=ctrl_sdm) |>
  suppressWarnings()
# Predict models:
i  <- predict_sdm(i, th = 0.8)
i
Predictors Names Managing
Description
This function manage predictors names in sdm_area objects.
Usage
predictors(x)
## S3 method for class 'sdm_area'
predictors(x)
## S3 method for class 'input_sdm'
predictors(x)
set_predictor_names(x, new_names)
## S3 method for class 'input_sdm'
set_predictor_names(x, new_names)
## S3 method for class 'sdm_area'
set_predictor_names(x, new_names)
get_predictor_names(x)
## S3 method for class 'sdm_area'
get_predictor_names(x)
## S3 method for class 'input_sdm'
get_predictor_names(x)
test_variables_names(sa, scen)
set_variables_names(s1 = NULL, s2 = NULL, new_names = NULL)
Arguments
| x | A  | 
| new_names | A  | 
| sa | A  | 
| scen | A  | 
| s1 | A  | 
| s2 | A  | 
Details
This functions is available so users can modify predictors names to better represent them. Use
carefully to avoid giving wrong names to the predictors. Useful to make sure the predictors names
are equal the names in scenarios.
test_variables_names Tests if variables in a stars object (scen argument)
matches the given sdm_area object (sa argument).
set_variables_names will set s1 object variables names as the s2 object
variables names OR assign new names to it.
Value
predictors and get_predictor_names return a character vector with
predictors names.
test_variables_names returns a logical informing if all variables are equal in both
objects (TRUE) or not (FALSE).
set_variables_names returns the s1 object with new names provided by s2 or
new_names.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 50000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc)
# Check predictors' names:
get_predictor_names(sa)
Print method for input_sdm
Description
Print method for input_sdm
Usage
## S3 method for class 'input_sdm'
print(x, ...)
Arguments
| x | input_sdm object | 
| ... | passed to other methods | 
Value
Concatenate structured characters to showcase what is stored in the object.
Print method for models
Description
Print method for models
Usage
## S3 method for class 'models'
print(x, ...)
Arguments
| x | models object | 
| ... | passed to other methods | 
Value
Concatenate structured characters to showcase what is stored in the object.
Print method for occurrences
Description
Print method for occurrences
Usage
## S3 method for class 'occurrences'
print(x, ...)
Arguments
| x | occurrences object | 
| ... | passed to other methods | 
Value
Concatenate structured characters to showcase what is stored in the object.
Print method for predictions
Description
Print method for predictions
Usage
## S3 method for class 'predictions'
print(x, ...)
Arguments
| x | predictions object | 
| ... | passed to other methods | 
Value
Concatenate structured characters to showcase what is stored in the object.
Obtain Pseudoabsences
Description
This function obtains pseudoabsences given a set of predictors.
Usage
pseudoabsences(occ,
               pred = NULL,
               method = "random",
               n_set = 10,
               n_pa = NULL,
               variables_selected = NULL,
               th = 0)
n_pseudoabsences(i)
pseudoabsence_method(i)
pseudoabsence_data(i)
Arguments
| occ | A  | 
| pred | A  | 
| method | Method to create pseudoabsences. One of: "random", "bioclim" or "mahal.dist". | 
| n_set | 
 | 
| n_pa | 
 | 
| variables_selected | A vector with variables names to be used while building pseudoabsences. Only used when method is not "random". | 
| th | 
 | 
| i | A  | 
Details
pseudoabsences is used in the SDM workflow to obtain pseudoabsences, a step necessary for
most of the algorithms to run. We implemented three methods so far: "random", which is
self-explanatory, "bioclim" and "mahal.dist". The two last are built with the idea
that pseudoabsences should be environmentally different from presences. Thus, we implemented
two presence-only methods to infer the distribution of the species. "bioclim" uses an
envelope approach (bioclimatic envelope), while "mahal.dist" uses a distance approach
(mahalanobis distance). th parameter enters here as a threshold to binarize those results.
Pseudoabsences are retrieved outside the projected distribution of the species.
n_pseudoabsences returns the number of pseudoabsences obtained per species.
pseudoabsence_method returns the method used to obtain pseudoabsences.
pseudoabsence_data returns a list of species names. Each species name will have a
lists with pseudoabsences data from class sf.
Value
A occurrences_sdm or input_sdm object with pseudoabsence data.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
link{input_sdm} sdm_area occurrences_sdm
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 25000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method="random")
Hydrologic Variables
Description
A sf LINESTRING object with hydrologic variables (LENGTH_KM and DIST_DN_KM) for the Paraná
state in Brazil. Data obtained from HydroSHEDS for river flows >= 10m3/s.
Usage
rivs
Format
## 'rivs'
A sf with 1031 attributes and 2 fiels:
- LENGTH_KM
- Length of the river reach segment, in kilometers. 
- DIST_DN_KM
- Distance from the reach outlet, i.e., the most downstream pixel of the reach, to the final downstream location along the river network, in kilometers. This downstream location is either the pour point into the ocean or an endorheic sink. 
Source
<https://www.hydrosheds.org/>
Salminus brasiliensis occurrence data
Description
A data.frame object with Salminus brasiliensis occurrence data obtained from GBIF and
filtered with Parana state sf.
Usage
salm
Format
## 'salm'
A data.frame with 46 rows and 3 columns (EPSG:6933):
- species
- Species name 
- decimalLongitude
- Longitude in meters 
- decimalLatitude
- Latitude in meters 
Source
<https://www.gbif.org>
Bioclimatic Variables
Description
A stars object with bioclimatic variables (bio1, bio4 and bio12) and four future scenarios for
the Parana state in Brazil. Data from MIROC6 GCM from WorldClim 2.1 at 10 arc-min resolution.
Usage
scen
Format
## 'scen'
A stars with 4 attribute and 3 bands:
- ca_ssp245_2090
- Intermediate scenario for the year 2090 and GCM CanESM5 
- ca_ssp585_2090
- Extreme scenario for the year 2090 and GCM CanESM5 
- mi_ssp245_2090
- Intermediate scenario for the year 2090 and GCM MIROC6 
- mi_ssp585_2090
- Extreme scenario for the year 2090 and GCM MIROC6 
- bio1
- Annual Mean Temperature 
- bio4
- Temperature Seasonality 
- bio12
- Annual Precipitation 
Source
<https://www.worldclim.org/>
Bioclimatic Variables
Description
A stars object with bioclimatic variables (bio1, bio4 and bio12) and four future scenarios for
the Rio Grande do Sul state in Brazil. Data from MIROC6 GCM from WorldClim 2.1 at 10 arc-min
resolution.
Usage
scen_rs
Format
## 'scen_rs'
A stars with 5 attribute and 3 bands:
- current
- Current scenario with the average values for the years 1970-2000 
- ca_ssp245_2090
- Intermediate scenario for the year 2090 and GCM CanESM5 
- ca_ssp585_2090
- Extreme scenario for the year 2090 and GCM CanESM5 
- mi_ssp245_2090
- Intermediate scenario for the year 2090 and GCM MIROC6 
- mi_ssp585_2090
- Extreme scenario for the year 2090 and GCM MIROC6 
- bio1
- Annual Mean Temperature 
- bio4
- Temperature Seasonality 
- bio12
- Annual Precipitation 
Source
<https://www.worldclim.org/>
Create a sdm_area object
Description
This function creates a new sdm_area object.
Usage
sdm_area(x, cell_size = NULL, crs = NULL, variables_selected = NULL,
                gdal = TRUE, crop_by = NULL, lines_as_sdm_area = FALSE)
get_sdm_area(i)
add_sdm_area(sa1, sa2)
Arguments
| x | A shape or a raster. Usually a shape from  | 
| cell_size | 
 | 
| crs | 
 | 
| variables_selected | A  | 
| gdal | Boolean. Force the use or not of GDAL when available. See details. | 
| crop_by | A shape from  | 
| lines_as_sdm_area | Boolean. If  | 
| i | A  | 
| sa1 | A  | 
| sa2 | A  | 
Details
The function returns a sdm_area object with a grid built upon the x parameter.
There are two ways to make the grid and resample the variables in sdm_area: with and
without gdal. As standard, if gdal is available in you machine it will be used
(gdal = TRUE),
otherwise sf/stars will be used.
get_sdm_area will return the grid built by sdm_area.
add_sdm_area will sum two sdm_area objects. As geoprocessing in caretSDM is
performed using sf objects, add_sdm_area simply applies a rbind in the two
different areas.
Value
A sdm_area object containing:
| grid | 
 | 
| cell_size | 
 | 
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) and Reginaldo Ré. https://luizfesser.wordpress.com
See Also
WorldClim_data parana input_sdm, add_predictors
Examples
# Create sdm_area object:
sa_area <- sdm_area(parana, cell_size = 50000, crs = 6933)
# Create sdm_area using a subset of rivs (lines):
sa_rivers <- sdm_area(rivs[c(1:100),], cell_size = 100000, crs = 6933, lines_as_sdm_area = TRUE)
sdm_as_X functions to transform caretSDM data into other classes.
Description
This functions transform data from a caretSDM object to be used in other packages.
Usage
sdm_as_stars(x,
             what = NULL,
             spp = NULL,
             scen = NULL,
             id = NULL,
             ens = NULL)
sdm_as_raster(x, what = NULL, spp = NULL, scen = NULL, id = NULL, ens = NULL)
sdm_as_terra(x, what = NULL, spp = NULL, scen = NULL, id = NULL, ens = NULL)
Arguments
| x | A  | 
| what | Sometimes multiple data inside  | 
| spp | 
 | 
| scen | 
 | 
| id | 
 | 
| ens | 
 | 
Value
The output is the desired class.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method = "random", n_set = 2)
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "boot",
                                number = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i, algo = c("naive_bayes"), ctrl = ctrl_sdm) |>
  suppressWarnings()
# Predict models:
i  <- predict_sdm(i, th = 0.8)
# Transform in stars:
sdm_as_stars(i)
Tidyverse methods for caretSDM objects
Description
Set of functions to facilitate the use of caretSDM through tidyverse grammatics.
Usage
select_predictors(x, ...)
## S3 method for class 'sdm_area'
select(.data, ...)
## S3 method for class 'input_sdm'
select(.data, ...)
## S3 method for class 'sdm_area'
mutate(.data, ...)
## S3 method for class 'input_sdm'
mutate(.data, ...)
## S3 method for class 'sdm_area'
filter(.data, ..., .by, .preserve)
## S3 method for class 'input_sdm'
filter(.data, ..., .by, .preserve)
## S3 method for class 'occurrences'
filter(.data, ..., .by, .preserve)
filter_species(x, spp = NULL, ...)
Arguments
| x | 
 | 
| ... | 
 | 
| .data | Data to pass to tidyr function. | 
| .by | See ?dplyr::filter. | 
| .preserve | See ?dplyr::filter. | 
| spp | Species to be filtered. | 
Value
The transformed sdm_area/input_sdm object.
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 25000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
Calculates performance across resamples
Description
This function is used in caret::trainControl(summaryFunction=summary_sdm) to calculate
performance metrics across resamples.
Usage
summary_sdm(data, lev = NULL, model = NULL, custom_fun=NULL)
summary_sdm_presence_only(data, lev, threshold)
validate_on_independent_data(model, data_independent, obs_col_name)
Arguments
| data | A  | 
| lev | A  | 
| model | Models names taken from  | 
| custom_fun | A custom function to be applied in models (not yet implemented). | 
| threshold | Threshold for presence-only models. | 
| data_independent | independent data.frame to calculate metrics. | 
| obs_col_name | The name of the column with observed values. | 
Details
See ?caret::defaultSummary for more details and options to pass on
caret::trainControl.
Value
A input_sdm or a predictions object.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method = "random")
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "repeatedcv",
                                number = 2,
                                repeats = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i, algo = c("naive_bayes"), ctrl=ctrl_sdm) |>
suppressWarnings()
Train SDM models
Description
This function is a wrapper to fit models in caret using caretSDM data.
Usage
train_sdm(occ,
          pred = NULL,
          algo,
          ctrl = NULL,
          variables_selected = NULL,
          parallel = FALSE,
          ...)
get_tune_length(i)
algorithms_used(i)
get_models(i)
get_validation_metrics(i)
mean_validation_metrics(i)
add_models(m1, m2)
Arguments
| occ | A  | 
| pred | A  | 
| algo | A  | 
| ctrl | A  | 
| variables_selected | A  | 
| parallel | Should a paralelization method be used (not yet implemented)? | 
| ... | Additional arguments to be passed to  | 
| i | A  | 
| m1 | A  | 
| m2 | A  | 
Details
The object algorithms has a table comparing algorithms available. If the function
detects that the necessary packages are not available it will ask for installation. This will
happen just in the first time you use the algorithm.
get_tune_length return the length used in grid-search for tunning.
algorithms_used return the names of the algorithms used in the modeling process.
get_models returns a list with trained models (class train) to each species.
get_validation_metrics return a list with a data.frame to each species
with complete values for ROC, Sensitivity, Specificity, with their respectives Standard
Deviations (SD) and TSS to each of the algorithms and pseudoabsence datasets used.
mean_validation_metrics return a list with a tibble to each species
summarizing values for ROC, Sensitivity, Specificity and TSS to each of the algorithms used.
Value
A models or a input_sdm object.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method = "random")
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "repeatedcv",
                                number = 2,
                                repeats = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i, algo = c("naive_bayes"), ctrl = ctrl_sdm) |>
suppressWarnings()
tSNE
Description
This function calculates tSNE with presences and pseudoabsences data and returns a list of plots.
Usage
tsne_sdm(occ, pred = NULL, variables_selected = NULL)
Arguments
| occ | A  | 
| pred | A  | 
| variables_selected | Variable to be used in t-SNE. It can also be 'vif', if previously calculated. | 
Value
A list of plots, where each plot is a tSNE for a given pseudoabsence dataset.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
Ensemble of Small Models (ESM) in caretSDM
Description
This functions set parameters to run a ESM when running train_sdm.
Usage
use_esm(x, spp = NULL, n_records = 20)
Arguments
| x | A  | 
| spp | A vector of species names containing the species which the ESM must be applied. Standard is NULL. | 
| n_records | Numeric. Number of species records to apply the ESM. Standard is 20. | 
Details
We supply two different ways to apply the ESM. If species names are provided, then ESM will be
applied only in given species. If a number of species records is provided, then ESM will be
applied in every species with number of records bellow the given threshold. As standard,
use_esm will be apply to every species with less then 20 records.
Value
A input_sdm or occurrences object with ESM parameters.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Use MEM:
i <- use_esm(i, n_records = 999)
MacroEcological Models (MEM) in caretSDM
Description
This functions sums all species records into one. Should be used before the data cleaning routine.
Usage
use_mem(x, add = TRUE, name = "MEM")
Arguments
| x | A  | 
| add | Logical. Should the new MEM records be added to the pool ( | 
| name | How should the new records be named? Standard is "MEM". | 
Value
A input_sdm or occurrences object with MEM data.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 25000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Use MEM:
i <- use_mem(i)
Calculation of variable importance for models
Description
This function retrieves variable importance as a function of ROC curves to each predictor.
Usage
varImp_sdm(m, id = NULL, ...)
Arguments
| m | A  | 
| id | Vector of model ids to filter varImp calculation. | 
| ... | Parameters passing to caret::varImp(). | 
Value
A data.frame with variable importance data.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 100000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# Pseudoabsence generation:
i <- pseudoabsences(i, method = "random")
# Custom trainControl:
ctrl_sdm <- caret::trainControl(method = "repeatedcv",
                                number = 2,
                                repeats = 1,
                                classProbs = TRUE,
                                returnResamp = "all",
                                summaryFunction = summary_sdm,
                                savePredictions = "all")
# Train models:
i <- train_sdm(i, algo = c("naive_bayes"), ctrl = ctrl_sdm) |>
suppressWarnings()
# Variable importance:
varImp_sdm(i)
Calculate VIF
Description
Apply Variance Inflation Factor (VIF) calculation.
Usage
vif_predictors(pred, area = "all", th = 0.5, maxobservations = 5000, variables_selected =
NULL)
vif_summary(i)
selected_variables(i)
Arguments
| pred | A  | 
| area | Character. Which area should be used in vif selection? Standard is  | 
| th | Threshold to be applied in VIF routine. See ?usdm::vifcor. | 
| maxobservations | Max observations to use to calculate the VIF. | 
| variables_selected | If there is a subset of predictors that should be used in this
function, it can be informed using this parameter. If set to  | 
| i | A  | 
Details
vif_predictors is a wrapper function to run usdm::vifcor in caretSDM.
Value
A input_sdm or predictors object with VIF data.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com
See Also
Examples
# Create sdm_area object:
sa <- sdm_area(parana, cell_size = 25000, crs = 6933)
# Include predictors:
sa <- add_predictors(sa, bioc) |> select_predictors(c("bio1", "bio4", "bio12"))
# Include scenarios:
sa <- add_scenarios(sa, scen)
# Create occurrences:
oc <- occurrences_sdm(occ, crs = 6933) |> join_area(sa)
# Create input_sdm:
i <- input_sdm(oc, sa)
# VIF calculation:
i <- vif_predictors(i)
i
# Retrieve information about vif:
vif_summary(i)
selected_variables(i)
Write caretSDM data
Description
This function exports caretSDM data.
Usage
write_ensembles(x, path = NULL, ext = ".tif", centroid = FALSE)
write_predictions(x, path = NULL, ext = ".tif", centroid = FALSE)
write_predictors(x, path = NULL, ext = ".tif", centroid = FALSE)
write_models(x, path = NULL)
write_gpkg(x, file_path, file_name)
## S3 method for class 'sdm_area'
write_gpkg(x, file_path, file_name)
write_occurrences(x, path = NULL, grid = FALSE, ...)
write_pseudoabsences(x, path = NULL, ext = ".csv", centroid = FALSE)
write_grid(x, path = NULL, centroid = FALSE)
write_validation_metrics(x, path = NULL)
Arguments
| x | Object to be written. Can be of class  | 
| path | A path with filename and the proper extension (see details) or the directory to save files in. | 
| ext | How it should be saved? | 
| centroid | Should coordinates for the centroids of each cell be included? Standard is FALSE. | 
| file_path | A path to save the  | 
| file_name | The name of the  | 
| grid | Boolean. Return a grid. | 
| ... | Arguments to pass to  | 
Details
ext can be set accordingly to the desired output. Possible values are .tif and .asc for
rasters, .csv for for a spreadsheet, but also  one of: c("bna", "csv", "e00", "gdb", "geojson",
"gml", "gmt", "gpkg", "gps", "gtm", "gxt", "jml", "map", "mdb", "nc", "ods", "osm", "pbf", "shp",
"sqlite", "vdv", "xls", "xlsx").
path ideally should only provide the folder. We recommend using:
results/what_are_you_writting. So for writting ensembles users are advised to run:
path = "results/ensembles"
Value
No return value, called for side effects.
Author(s)
Luíz Fernando Esser (luizesser@gmail.com) https://luizfesser.wordpress.com