Python code to build your BPE - Tokenizer from scratch (w/ HuggingFace)