Here we apply the penetrance package to simulated families where the data-generating penetrance function is known and based on existing penetrance estimates.
The data-generating distribution of the age-specific penetrances is based on existing penetrance estimates for Colorectal cancer in carriers of any pathogenic variant in MLH1 from the PanelPRO Database.
The families were simulated using the PedUtils Rpackage.
<- test_fam2 dat
Then we run the estimation using the default settings.
# Set the random seed
set.seed(2024)
# Set the prior
<- list(
prior_params asymptote = list(g1 = 1, g2 = 1),
threshold = list(min = 5, max = 30),
median = list(m1 = 2, m2 = 2),
first_quartile = list(q1 = 6, q2 = 3)
)
# Set the allele frequency for MLH1 based on PanelPRO Database
<- 0.0004453125
prevMLH1
# We use the default baseline (non-carrier) penetrance
print(baseline_data_default)
# We run the estimation procedure with one chain and 20k iterations
<- penetrance(
out_sim pedigree = dat, twins = NULL, n_chains = 1, n_iter_per_chain = 20000,
ncores = 2, baseline_data = baseline_data_default , prev = prevMLH1,
prior_params = prior_params, burn_in = 0.1, median_max = TRUE,
ageImputation = FALSE, removeProband = FALSE
)
Lee G, Liang JW, Zhang Q, Huang T, Choirat C, Parmigiani G, Braun D. Multi-syndrome, multi-gene risk modeling for individuals with a family history of cancer with the novel R package PanelPRO. Elife. 2021;10:e68699. doi:10.7554/eLife.6869