bpe
Here are 10 public repositories matching this topic...
Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and WordPiece tokenization in JavaScript, Python and Rust.
-
Updated
Mar 20, 2025 - Rust
Text tokenization service (Rust, Axum) (2025)
-
Updated
Mar 22, 2025 - Rust
[Rust] Unofficial implementation of "SuperBPE: Space Travel for Language Models" in Rust
-
Updated
Apr 14, 2025 - Rust
This crate is a rust porting of Andrej Karpathy implementation of Byte Pair Encoding (BPE) algorithm available here https://github.com/karpathy/minbpe
-
Updated
Feb 19, 2024 - Rust
Improve this page
Add a description, image, and links to the bpe topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the bpe topic, visit your repo's landing page and select "manage topics."