PrInDT: Prediction and Interpretation in Decision Trees for
Classification and Regression
Optimization of conditional inference trees from the package 'party'
    for classification and regression.
    For optimization, the model space is searched for the best tree on the full sample by 
    means of repeated subsampling. Restrictions are allowed so that only trees are accepted 
    which do not include pre-specified uninterpretable split results (cf. Weihs & Buschfeld, 2021a).
    The function PrInDT() represents the basic resampling loop for 2-class classification (cf. Weihs 
    & Buschfeld, 2021a). The function RePrInDT() (repeated PrInDT()) allows for repeated  
    applications of PrInDT() for different percentages of the observations of the large and the 
    small classes (cf. Weihs & Buschfeld, 2021c). The function NesPrInDT() (nested PrInDT()) 
    allows for an extra layer of subsampling for a specific factor variable (cf. Weihs & Buschfeld, 
    2021b). The functions PrInDTMulev() and PrInDTMulab() deal with multilevel and multilabel 
    classification. In addition to these PrInDT() variants for classification, the function 
    PrInDTreg() has been developed for regression problems. Finally, the function PostPrInDT() 
    allows for a posterior analysis of the distribution of a specified variable in the terminal 
    nodes of a given tree.
    In version 2, additionally structured sampling is implemented in functions PrInDTCstruc() and 
    PrInDTRstruc(). In these functions, repeated measurements data can be analyzed, too. 
    Moreover, multilabel 2-stage versions of classification and regression trees are 
    implemented in functions C2SPrInDT() and R2SPrInDT() as well as interdependent 
    multilabel models in functions SimCPrInDT() and SimRPrInDT(). Finally, for mixtures of 
    classification and regression models functions Mix2SPrInDT() and SimMixPrInDT() are 
    implemented. Most of these extensions of PrInDT are described in Buschfeld & 
    Weihs (2025Fc).
    References:
    – Buschfeld, S., Weihs, C. (2025Fc) "Optimizing decision trees for the analysis of World Englishes 
    and sociolinguistic data", Cambridge Elements.
    – Weihs, C., Buschfeld, S. (2021a) "Combining Prediction and Interpretation in 
    Decision Trees (PrInDT) - a Linguistic Example" <doi:10.48550/arXiv.2103.02336>;
    – Weihs, C., Buschfeld, S. (2021b) "NesPrInDT: Nested undersampling in PrInDT" 
    <doi:10.48550/arXiv.2103.14931>;
    – Weihs, C., Buschfeld, S. (2021c) "Repeated undersampling in PrInDT (RePrInDT): Variation 
    in undersampling and prediction, and ranking of predictors in ensembles" <doi:10.48550/arXiv.2108.05129>.
| Version: | 2.0.2 | 
| Depends: | R (≥ 2.10) | 
| Imports: | graphics, MASS, party, splitstackshape, stats, stringr, utils, gdata | 
| Published: | 2025-09-11 | 
| DOI: | 10.32614/CRAN.package.PrInDT | 
| Author: | Claus Weihs [aut, cre],
  Sarah Buschfeld [aut],
  Niklas Nitsch [ctb] | 
| Maintainer: | Claus Weihs  <claus.weihs at tu-dortmund.de> | 
| License: | GPL-2 | 
| NeedsCompilation: | no | 
| CRAN checks: | PrInDT results | 
Documentation:
Downloads:
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=PrInDT
to link to this page.