Custom Tokenizer

1. Learn vocabulary


      

2. Encode text


      

3. Decode token IDs