Kitoken. Tokenize Everything!

Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more.

Tokenize text for Llama, Gemini, GPT-4, Mistral and many others. Ready for production on the web, on hardware and in the cloud.
Drag any supported file into the page to import it.