# -------------------------------------------- # CITATION file created with {cffr} R package # See also: https://docs.ropensci.org/cffr/ # -------------------------------------------- cff-version: 1.2.0 message: 'To cite package "tok" in publications use:' type: software license: MIT title: 'tok: Fast Text Tokenization' version: 0.2.2 doi: 10.32614/CRAN.package.tok abstract: Interfaces with the 'Hugging Face' tokenizers library to provide implementations of today's most used tokenizers such as the 'Byte-Pair Encoding' algorithm . It's extremely fast for both training new vocabularies and tokenizing texts. authors: - family-names: Kalinowski given-names: Tomasz email: tomasz@posit.co - family-names: Falbel given-names: Daniel email: dfalbel@gmail.com repository: https://cranhaven.r-universe.dev repository-code: https://github.com/mlverse/tok commit: b4a2c37fdb54bbe4ab87ce8e16cb644ead04e7ca url: https://github.com/mlverse/tok date-released: '2026-05-15' contact: - family-names: Kalinowski given-names: Tomasz email: tomasz@posit.co