SDS 626: Subword Tokenization with Byte-Pair Encoding — with @JonKrohnLearns