Used in textrecipes::step_tokenize_sentencepiece() and
textrecipes::step_tokenize_bpe().
Usage
vocabulary_size(range = c(1000L, 32000L), trans = NULL)
Arguments
- range
 A two-element vector holding the defaults for the smallest and largest possible values, respectively. If a transformation is specified, these values should be in the transformed units.
- trans
 A trans object from the scales package, such as scales::transform_log10() or scales::transform_reciprocal(). If not provided, the default is used which matches the units used in range. If no transformation, NULL.
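The following is a minimal sketch of how the two arguments interact, assuming the dials and scales packages are installed (scales 1.3.0 or later for transform_log10()). The object names vocab and vocab_log are illustrative only, and the log10 range of c(3, 4) is an assumed example rather than a recommended setting.

```r
library(dials)

# Default parameter: range is given in the original (untransformed) units.
vocab <- vocabulary_size()
range_get(vocab)

# Draw a small sequence of candidate vocabulary sizes across the range.
value_seq(vocab, n = 5)

# With a transformation, `range` must be given in transformed units.
# Here 3 and 4 are log10 values, i.e. roughly 1,000 to 10,000 tokens.
vocab_log <- vocabulary_size(
  range = c(3, 4),
  trans = scales::transform_log10()
)

# By default, value_seq() reports values back in the original units.
value_seq(vocab_log, n = 5)
```

Specifying the range on a log scale spaces candidate values more densely at the small end of the range, which may be useful when small vocabularies are expected to matter more during tuning.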
