Byte Pair Encoding Tokenization in NLP